Moc05g31880 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc05g31880
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Locationchr5: 23871010 .. 23873025 (-)
RNA-Seq ExpressionMoc05g31880
SyntenyMoc05g31880
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGGAATGCGAAGCCCACAACTCTTGTTCAAATGAATCAAATCTCAATTCCCGCCGGCGCCGTTATTCCATGGGCTCTGCAAGCGATCCGCCGCGCCGACGGGATGAACTACGCCGCCTATGGCCGCCTTATTCAGCACTGCGCCGACCGCCGCTTCCTCCGCCTCGGTAAGCAGCTTCACGCCCGTCTTGTTCTACTTTCCGTCACTCCCGATAACTTCCTCGGATCGAAGCTCATCGCGTTCTACTCAAAATCCGGCAGCCTTAGAGATGCCTACAATGTGTTCGGTAACATTTCTCATAAGAACATTTTCTCCTGGAATGCTCTGTTTATCAGCTACACTCTTCACAATATGCACTCCGATATGCTTAAGCTGTTTTCGTCTTTGGTTAATTCAAATGCGATGGATGTTAAGCCTGATAAGTTCACGATCACTTGTGTTTTGAAAGCGTTGGCTTCGTCGTTTACTGATTCGATTTTGGCTAAGGAAGTTCATTGTTTCGTTCTTCGACGAGGACTTGAGTCTGATATTTTTGTTGTCAACGCTTTGGTTACTTATTACTCGAGGTGTGAGGAGGTGGTTTTAGCGAGAATTGTGTTTGGTAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCGATGGTGGCTGGGTTCTCTCAGGGTGGGTTCTATGAAGAGTGCAAAGAACTGTTCAAAGAGATGTTGAGTTCAGTGGAGCTGAAGCCTAATGCATTAACCGCAGTCAGCGTTTTGCAAGCTTGTGCTCAGTCAAATGATCTCATTTTCGGCATGGAAGTTCATAGATTCGTCAACGAAAGTCAGATTGAAATGGATGTTTCGTTGTGCAATGCTGTTATTGGATTGTATGCCAAGTGTGGTAGCTTGGATTATGCTCGGGAGTTATTCGAAGAAATGCCTGAGAAGGATGAGGTCACCTATGGCTCAATGATATCGGGCTACATGGTCCATGGTTCTGTTAACCAAGCCATGGATCTTTTCCAAGAGCTAAAAAAACCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTCTGGTTCAGAATAATCAACAAGATGGAGTTCTAGATATATTTCGAGCAATGCAGTCCCATGGTTGCAGACCAAATGCTGTGACACTTGCGAGTGTTCTTCCCGTTTTCTCACATTTTTCAACCTTAAAAGGTGGGAAAGAAATTCATGCTTATGCTGTTAGAAACGGTTACAATGGGAATATTTATGTTGCTACTGCTATCATTGATTCTTATGCTAAGTCTGGTTACCTCCACGGGGCATGGCAAGTTTTCGATTTAGTAAAAGGTAGGAGTCTAATCATTTGGACGGCAATAATTTCAGCATATGCTGCACATGGTGATGCTAACGTGGCCCTTAGTCTTTTTTATGAGATGTTGAGAAATGGGATTCAGCCAGACCCGGTAACCTTTACATCGGTATTGGTTGCTTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAACATCATGTTACCAGAGTATGGGATTCAACCATTAGTCGAGCATTATGCTTGCATGGTAGGAGTTCTTAGTCGAGCGGGAAAGCTCTCTGATGCTGTTGATTTTATTTCTAAAATGCCAATTGAACCCAGTGCAAAAGTTTGGGGTGCTCTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTATGTTTTTGATCGTCTGCTTGAGATCGAACCTGAAAATACAGGTACCTACATCATCATGGCTAATTTATATTCACAATCTGGAAGATGGAAAGAAGCTGACAAGGTTAGGGATTTGATGAAGGAAGTTGGACTGAGGAAGATCCCGGGAAGTAGCTGGATAGAAACGAGCGGAGGGTTGCATAGTTTCGTAGCTAGAGATACTTCAAATGACAGTACCCCAGAGATTTATGAAATGTTAGAAGGTTTACTTGGATTGATGAAAGAAGAAGGATACATTCTGCAAAATGAGATAGATGAGGACTGTGGCAGTGGTTAG

mRNA sequence

ATGAGGAATGCGAAGCCCACAACTCTTGTTCAAATGAATCAAATCTCAATTCCCGCCGGCGCCGTTATTCCATGGGCTCTGCAAGCGATCCGCCGCGCCGACGGGATGAACTACGCCGCCTATGGCCGCCTTATTCAGCACTGCGCCGACCGCCGCTTCCTCCGCCTCGGTAAGCAGCTTCACGCCCGTCTTGTTCTACTTTCCGTCACTCCCGATAACTTCCTCGGATCGAAGCTCATCGCGTTCTACTCAAAATCCGGCAGCCTTAGAGATGCCTACAATGTGTTCGGTAACATTTCTCATAAGAACATTTTCTCCTGGAATGCTCTGTTTATCAGCTACACTCTTCACAATATGCACTCCGATATGCTTAAGCTGTTTTCGTCTTTGGTTAATTCAAATGCGATGGATGTTAAGCCTGATAAGTTCACGATCACTTGTGTTTTGAAAGCGTTGGCTTCGTCGTTTACTGATTCGATTTTGGCTAAGGAAGTTCATTGTTTCGTTCTTCGACGAGGACTTGAGTCTGATATTTTTGTTGTCAACGCTTTGGTTACTTATTACTCGAGGTGTGAGGAGGTGGTTTTAGCGAGAATTGTGTTTGGTAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCGATGGTGGCTGGGTTCTCTCAGGGTGGGTTCTATGAAGAGTGCAAAGAACTGTTCAAAGAGATGTTGAGTTCAGTGGAGCTGAAGCCTAATGCATTAACCGCAGTCAGCGTTTTGCAAGCTTGTGCTCAGTCAAATGATCTCATTTTCGGCATGGAAGTTCATAGATTCGTCAACGAAAGTCAGATTGAAATGGATGTTTCGTTGTGCAATGCTGTTATTGGATTGTATGCCAAGTGTGGTAGCTTGGATTATGCTCGGGAGTTATTCGAAGAAATGCCTGAGAAGGATGAGGTCACCTATGGCTCAATGATATCGGGCTACATGGTCCATGGTTCTGTTAACCAAGCCATGGATCTTTTCCAAGAGCTAAAAAAACCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTCTGGTTCAGAATAATCAACAAGATGGAGTTCTAGATATATTTCGAGCAATGCAGTCCCATGGTTGCAGACCAAATGCTGTGACACTTGCGAGTGTTCTTCCCGTTTTCTCACATTTTTCAACCTTAAAAGGTGGGAAAGAAATTCATGCTTATGCTGTTAGAAACGGTTACAATGGGAATATTTATGTTGCTACTGCTATCATTGATTCTTATGCTAAGTCTGGTTACCTCCACGGGGCATGGCAAGTTTTCGATTTAGTAAAAGGTAGGAGTCTAATCATTTGGACGGCAATAATTTCAGCATATGCTGCACATGGTGATGCTAACGTGGCCCTTAGTCTTTTTTATGAGATGTTGAGAAATGGGATTCAGCCAGACCCGGTAACCTTTACATCGGTATTGGTTGCTTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAACATCATGTTACCAGAGTATGGGATTCAACCATTAGTCGAGCATTATGCTTGCATGGTAGGAGTTCTTAGTCGAGCGGGAAAGCTCTCTGATGCTGTTGATTTTATTTCTAAAATGCCAATTGAACCCAGTGCAAAAGTTTGGGGTGCTCTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTATGTTTTTGATCGTCTGCTTGAGATCGAACCTGAAAATACAGGTACCTACATCATCATGGCTAATTTATATTCACAATCTGGAAGATGGAAAGAAGCTGACAAGGTTAGGGATTTGATGAAGGAAGTTGGACTGAGGAAGATCCCGGGAAGTAGCTGGATAGAAACGAGCGGAGGGTTGCATAGTTTCGTAGCTAGAGATACTTCAAATGACAGTACCCCAGAGATTTATGAAATGTTAGAAGGTTTACTTGGATTGATGAAAGAAGAAGGATACATTCTGCAAAATGAGATAGATGAGGACTGTGGCAGTGGTTAG

Coding sequence (CDS)

ATGAGGAATGCGAAGCCCACAACTCTTGTTCAAATGAATCAAATCTCAATTCCCGCCGGCGCCGTTATTCCATGGGCTCTGCAAGCGATCCGCCGCGCCGACGGGATGAACTACGCCGCCTATGGCCGCCTTATTCAGCACTGCGCCGACCGCCGCTTCCTCCGCCTCGGTAAGCAGCTTCACGCCCGTCTTGTTCTACTTTCCGTCACTCCCGATAACTTCCTCGGATCGAAGCTCATCGCGTTCTACTCAAAATCCGGCAGCCTTAGAGATGCCTACAATGTGTTCGGTAACATTTCTCATAAGAACATTTTCTCCTGGAATGCTCTGTTTATCAGCTACACTCTTCACAATATGCACTCCGATATGCTTAAGCTGTTTTCGTCTTTGGTTAATTCAAATGCGATGGATGTTAAGCCTGATAAGTTCACGATCACTTGTGTTTTGAAAGCGTTGGCTTCGTCGTTTACTGATTCGATTTTGGCTAAGGAAGTTCATTGTTTCGTTCTTCGACGAGGACTTGAGTCTGATATTTTTGTTGTCAACGCTTTGGTTACTTATTACTCGAGGTGTGAGGAGGTGGTTTTAGCGAGAATTGTGTTTGGTAGAATGCCTGAGAGAGATATAGTGTCTTGGAATGCGATGGTGGCTGGGTTCTCTCAGGGTGGGTTCTATGAAGAGTGCAAAGAACTGTTCAAAGAGATGTTGAGTTCAGTGGAGCTGAAGCCTAATGCATTAACCGCAGTCAGCGTTTTGCAAGCTTGTGCTCAGTCAAATGATCTCATTTTCGGCATGGAAGTTCATAGATTCGTCAACGAAAGTCAGATTGAAATGGATGTTTCGTTGTGCAATGCTGTTATTGGATTGTATGCCAAGTGTGGTAGCTTGGATTATGCTCGGGAGTTATTCGAAGAAATGCCTGAGAAGGATGAGGTCACCTATGGCTCAATGATATCGGGCTACATGGTCCATGGTTCTGTTAACCAAGCCATGGATCTTTTCCAAGAGCTAAAAAAACCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTCTGGTTCAGAATAATCAACAAGATGGAGTTCTAGATATATTTCGAGCAATGCAGTCCCATGGTTGCAGACCAAATGCTGTGACACTTGCGAGTGTTCTTCCCGTTTTCTCACATTTTTCAACCTTAAAAGGTGGGAAAGAAATTCATGCTTATGCTGTTAGAAACGGTTACAATGGGAATATTTATGTTGCTACTGCTATCATTGATTCTTATGCTAAGTCTGGTTACCTCCACGGGGCATGGCAAGTTTTCGATTTAGTAAAAGGTAGGAGTCTAATCATTTGGACGGCAATAATTTCAGCATATGCTGCACATGGTGATGCTAACGTGGCCCTTAGTCTTTTTTATGAGATGTTGAGAAATGGGATTCAGCCAGACCCGGTAACCTTTACATCGGTATTGGTTGCTTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAACATCATGTTACCAGAGTATGGGATTCAACCATTAGTCGAGCATTATGCTTGCATGGTAGGAGTTCTTAGTCGAGCGGGAAAGCTCTCTGATGCTGTTGATTTTATTTCTAAAATGCCAATTGAACCCAGTGCAAAAGTTTGGGGTGCTCTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTATGTTTTTGATCGTCTGCTTGAGATCGAACCTGAAAATACAGGTACCTACATCATCATGGCTAATTTATATTCACAATCTGGAAGATGGAAAGAAGCTGACAAGGTTAGGGATTTGATGAAGGAAGTTGGACTGAGGAAGATCCCGGGAAGTAGCTGGATAGAAACGAGCGGAGGGTTGCATAGTTTCGTAGCTAGAGATACTTCAAATGACAGTACCCCAGAGATTTATGAAATGTTAGAAGGTTTACTTGGATTGATGAAAGAAGAAGGATACATTCTGCAAAATGAGATAGATGAGGACTGTGGCAGTGGTTAG

Protein sequence

MRNAKPTTLVQMNQISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG
Homology
BLAST of Moc05g31880 vs. NCBI nr
Match: XP_022145703.1 (pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia])

HSP 1 Score: 1326.2 bits (3431), Expect = 0.0e+00
Identity = 660/660 (100.00%), Postives = 660/660 (100.00%), Query Frame = 0

Query: 12  MNQISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTP 71
           MNQISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTP
Sbjct: 1   MNQISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTP 60

Query: 72  DNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLV 131
           DNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLV
Sbjct: 61  DNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLV 120

Query: 132 NSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRC 191
           NSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRC
Sbjct: 121 NSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRC 180

Query: 192 EEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSV 251
           EEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSV
Sbjct: 181 EEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSV 240

Query: 252 LQACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDE 311
           LQACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDE
Sbjct: 241 LQACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDE 300

Query: 312 VTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSH 371
           VTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSH
Sbjct: 301 VTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSH 360

Query: 372 GCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGA 431
           GCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGA
Sbjct: 361 GCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGA 420

Query: 432 WQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHS 491
           WQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHS
Sbjct: 421 WQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHS 480

Query: 492 GELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGA 551
           GELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGA
Sbjct: 481 GELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGA 540

Query: 552 LLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGL 611
           LLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGL
Sbjct: 541 LLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGL 600

Query: 612 RKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG 671
           RKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG
Sbjct: 601 RKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG 660

BLAST of Moc05g31880 vs. NCBI nr
Match: XP_038905794.1 (pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905795.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905796.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905797.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida])

HSP 1 Score: 1177.2 bits (3044), Expect = 0.0e+00
Identity = 574/655 (87.63%), Postives = 616/655 (94.05%), Query Frame = 0

Query: 17  IPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLG 76
           +PA   + WALQA+RR D MNY AYGRLIQHC D+ F+RLGKQLHARLVL SV PDNFLG
Sbjct: 11  VPATVCLSWALQALRRTDEMNYGAYGRLIQHCTDQLFVRLGKQLHARLVLSSVAPDNFLG 70

Query: 77  SKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAM 136
           SKLIAFYSKSGSLRDAYNVFGNISHKNIF+WNALFISYTLHNMH DML+LFSSLVNSN+ 
Sbjct: 71  SKLIAFYSKSGSLRDAYNVFGNISHKNIFTWNALFISYTLHNMHIDMLRLFSSLVNSNST 130

Query: 137 DVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVL 196
           DVKPDKFTITCVLKALAS F++S+LAKEVHCF+LRR LE DIFVVNAL+T+YSRC+E+VL
Sbjct: 131 DVKPDKFTITCVLKALASLFSNSVLAKEVHCFILRRELEFDIFVVNALITFYSRCDELVL 190

Query: 197 ARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACA 256
           ARIVF RMPE+DIVSWNAMVAG+SQGGFYEECKELFK MLSSVELKPNALT VSVLQACA
Sbjct: 191 ARIVFDRMPEKDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALTTVSVLQACA 250

Query: 257 QSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGS 316
           QSNDLIFGMEVHRFV+ESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMP+KDEVTYGS
Sbjct: 251 QSNDLIFGMEVHRFVSESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPKKDEVTYGS 310

Query: 317 MISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPN 376
           MISGYMV+G VNQAMDLF+EL++P LSTWNAVISGLVQNNQQD VLDIFRAMQSHGCRPN
Sbjct: 311 MISGYMVYGFVNQAMDLFRELERPVLSTWNAVISGLVQNNQQDEVLDIFRAMQSHGCRPN 370

Query: 377 AVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFD 436
            VTLASVLP+FSHFST+KGGKEIHAYA+R  Y+GNIYVAT II+SYAKSGYLHGA QVFD
Sbjct: 371 TVTLASVLPIFSHFSTIKGGKEIHAYAIRKAYDGNIYVATGIINSYAKSGYLHGARQVFD 430

Query: 437 LVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDE 496
            +KGRSLIIWTAIISAYAAHGDANVALSLFYEML NGIQPDPVTFTSVLVACAHSGELDE
Sbjct: 431 QLKGRSLIIWTAIISAYAAHGDANVALSLFYEMLANGIQPDPVTFTSVLVACAHSGELDE 490

Query: 497 AWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGA 556
           AWKIFN++LP+YGIQP VEHYACMVGVLSRAGKLSDAV+FISKMP EP+AKVWGALLNGA
Sbjct: 491 AWKIFNVLLPKYGIQPPVEHYACMVGVLSRAGKLSDAVEFISKMPFEPTAKVWGALLNGA 550

Query: 557 SVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPG 616
           SVAGDVELGKYVFDRL EIEPENTG Y+IMANLYSQ GRWKEADKVRDLMKEVGL+KIPG
Sbjct: 551 SVAGDVELGKYVFDRLFEIEPENTGNYVIMANLYSQFGRWKEADKVRDLMKEVGLKKIPG 610

Query: 617 SSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG 672
           +SWIET GGL SF+ARDTSN+ TPEIY MLEGLLGLMKEEG ILQ+EID+DCGSG
Sbjct: 611 NSWIETRGGLQSFIARDTSNNRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSG 665

BLAST of Moc05g31880 vs. NCBI nr
Match: KAG6580575.1 (ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1157.9 bits (2994), Expect = 0.0e+00
Identity = 572/649 (88.14%), Postives = 606/649 (93.37%), Query Frame = 0

Query: 23   IPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAF 82
            I  ALQ IRR+DGMNY AYGRLIQHC D+RF RLGKQLHARLVL SV PDNFLGSKLIA 
Sbjct: 716  IDGALQLIRRSDGMNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIAL 775

Query: 83   YSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDK 142
            YSKSGSLRDAYNVF +ISHKNIFSWNALFISYTLHNMH+DMLKLFSSLVN N+ DVKPDK
Sbjct: 776  YSKSGSLRDAYNVFDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDK 835

Query: 143  FTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFG 202
            FT+TCVLKALAS FT+SILAKEVHCFVLRRGLESDIFVVNAL+T+YSRC+E+VLARI+F 
Sbjct: 836  FTVTCVLKALASLFTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFD 895

Query: 203  RMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLI 262
            R PERDIVSWNAMVAG+SQGGFYE+CKELFK ML S E KPNALTAVSVLQACAQSNDLI
Sbjct: 896  RTPERDIVSWNAMVAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLI 955

Query: 263  FGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYM 322
            FGMEVH+FVNES IEMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYM
Sbjct: 956  FGMEVHKFVNESGIEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYM 1015

Query: 323  VHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAVTLAS 382
            VHG VNQAMDLF+EL++PALSTWNAVISGLVQNNQQDGV+DIFRAMQ HGCRPN VTLAS
Sbjct: 1016 VHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLAS 1075

Query: 383  VLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRS 442
            VLP+FSHFSTLKGGKEIHAYAVRN Y+GNIYVATAIIDSYAKSGYLHGA QVFD  K RS
Sbjct: 1076 VLPIFSHFSTLKGGKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLHGARQVFDQSKRRS 1135

Query: 443  LIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFN 502
            LIIWTAIISAYAAHGDAN  LSLFYEML NGI+PDPVTFTSVLVACAHSGELDEAWKIFN
Sbjct: 1136 LIIWTAIISAYAAHGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELDEAWKIFN 1195

Query: 503  IMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDV 562
            ++LPE+GIQPLVEHYACMVGVLSRAGKLSDAV+FISKMPIEP+AKVWGALLNGASVAGDV
Sbjct: 1196 VLLPEFGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDV 1255

Query: 563  ELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIET 622
            ELGKYVFDRLL+IEPENTG YIIMANLYSQ GRWKEAD+VRDLMKEVGL+KIPG+SWIET
Sbjct: 1256 ELGKYVFDRLLDIEPENTGNYIIMANLYSQFGRWKEADRVRDLMKEVGLKKIPGNSWIET 1315

Query: 623  SGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG 672
             GGL SFVARDTSND TPEIY  LEGL+ LMKEEG I Q+EID+DCGSG
Sbjct: 1316 RGGLQSFVARDTSNDRTPEIYGTLEGLVRLMKEEGLIQQHEIDDDCGSG 1364

BLAST of Moc05g31880 vs. NCBI nr
Match: KAG7017327.1 (ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1154.8 bits (2986), Expect = 0.0e+00
Identity = 571/649 (87.98%), Postives = 605/649 (93.22%), Query Frame = 0

Query: 23   IPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAF 82
            I  ALQ IRR+DGMNY AYGRLIQHC D+RF RLGKQLHARLVL SV PDNFLGSKLIA 
Sbjct: 735  IDGALQLIRRSDGMNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIAL 794

Query: 83   YSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDK 142
            YSKSGSLRDAYNVF +ISHKNIFSWNALFISYTLHNMH+DMLKLFSSLVN N+ DVKPDK
Sbjct: 795  YSKSGSLRDAYNVFDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDK 854

Query: 143  FTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFG 202
            FT+TCVLKALAS FT+SILAKEVHCFVLRRGLESDIFVVNAL+T+YSRC+E+VLARI+F 
Sbjct: 855  FTVTCVLKALASLFTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFD 914

Query: 203  RMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLI 262
            R PERDIVSWNAMVAG+SQGGFYE+CKELFK ML S E KPNALTAVSVLQACAQSNDLI
Sbjct: 915  RTPERDIVSWNAMVAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLI 974

Query: 263  FGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYM 322
            FGMEVH+FVNES IEMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYM
Sbjct: 975  FGMEVHKFVNESGIEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYM 1034

Query: 323  VHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAVTLAS 382
            VHG VNQAMDLF+EL++PALSTWNAVISGLVQNNQQDGV+DIFRAMQ HGCRPN VTLAS
Sbjct: 1035 VHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLAS 1094

Query: 383  VLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRS 442
            VLP+FSHFSTLKGGKEIHAYAVRN Y+GNIYVATAIIDSYAKSGYL GA QVFD  K RS
Sbjct: 1095 VLPIFSHFSTLKGGKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQSKRRS 1154

Query: 443  LIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFN 502
            LIIWTAIISAYAAHGDAN  LSLFYEML NGI+PDPVTFTSVLVACAHSGELDEAWKIFN
Sbjct: 1155 LIIWTAIISAYAAHGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELDEAWKIFN 1214

Query: 503  IMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDV 562
            ++LPE+GIQPLVEHYACMVGVLSRAGKLSDAV+FISKMPIEP+AKVWGALLNGASVAGDV
Sbjct: 1215 VLLPEFGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDV 1274

Query: 563  ELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIET 622
            ELGKYVFDRLL+IEPENTG YIIMANLYSQ GRWKEAD+VRDLMKEVGL+KIPG+SWIET
Sbjct: 1275 ELGKYVFDRLLDIEPENTGNYIIMANLYSQFGRWKEADRVRDLMKEVGLKKIPGNSWIET 1334

Query: 623  SGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG 672
             GGL SFVARDTSND TPEIY  LEGL+ LMKEEG I Q+EID+DCGSG
Sbjct: 1335 RGGLQSFVARDTSNDRTPEIYGTLEGLVRLMKEEGLIQQHEIDDDCGSG 1383

BLAST of Moc05g31880 vs. NCBI nr
Match: XP_022934145.1 (pentatricopeptide repeat-containing protein At2g37310 [Cucurbita moschata] >XP_022934146.1 pentatricopeptide repeat-containing protein At2g37310 [Cucurbita moschata] >XP_022934147.1 pentatricopeptide repeat-containing protein At2g37310 [Cucurbita moschata] >XP_022934148.1 pentatricopeptide repeat-containing protein At2g37310 [Cucurbita moschata])

HSP 1 Score: 1134.0 bits (2932), Expect = 0.0e+00
Identity = 559/635 (88.03%), Postives = 592/635 (93.23%), Query Frame = 0

Query: 36  MNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNV 95
           MNY AYGRLIQHC D+RF RLGKQLHARLVL SV PDNFLGSKLIA YSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 60

Query: 96  FGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASS 155
           F +ISHKNIFSWNALFISYTLHNMH+DMLKLFSSLVN N+ DVKPDKFT+TCVLKALAS 
Sbjct: 61  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNVNSTDVKPDKFTVTCVLKALASL 120

Query: 156 FTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAM 215
           FT+SILAKEVHCFVLRRGLESDIFVVNAL+T+YSRC+E+ LARI+F R PERDIVSWNAM
Sbjct: 121 FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELALARIMFDRTPERDIVSWNAM 180

Query: 216 VAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 275
           VAG+SQGGFYE+CKELFK ML S E KPNALTAVSVLQACA SNDLIFGMEVH+FVNES 
Sbjct: 181 VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAHSNDLIFGMEVHKFVNESG 240

Query: 276 IEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQ 335
           IEMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYMVHG VNQAMDLF+
Sbjct: 241 IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 300

Query: 336 ELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAVTLASVLPVFSHFSTLKG 395
           EL++PALSTWNAVISGLVQNNQQDGV+DIFRAMQ HGCRPN VTLASVLP+FSHFSTLKG
Sbjct: 301 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 360

Query: 396 GKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIISAYAA 455
           GKEIHAYAVRN Y+GNIYVATAIIDSYAKSGYL GA QVFD +K RSLIIWTAIISAYAA
Sbjct: 361 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQLKRRSLIIWTAIISAYAA 420

Query: 456 HGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQPLVE 515
           HGDAN  LSLFYEML NGI+PDPVTFTSVLVACAHSGELDEAWKIFN++LPE+GIQPLVE
Sbjct: 421 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELDEAWKIFNVLLPEFGIQPLVE 480

Query: 516 HYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDRLLEI 575
           HYACMVGVLSRAGKLSDAV+FISKMPIEP+AKVWGALLNGASVAGDVELGKYVFDRLL+I
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 540

Query: 576 EPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSFVARDTS 635
           EPENTG YIIMANLYSQ GRWKEAD VRDLMKEVGL+KIPG+SWIET  GL SFVARDTS
Sbjct: 541 EPENTGNYIIMANLYSQFGRWKEADNVRDLMKEVGLKKIPGNSWIETREGLQSFVARDTS 600

Query: 636 NDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGS 671
           ND TPEIY  LEGL+GLMKEEG I Q+EID+DCGS
Sbjct: 601 NDRTPEIYGTLEGLVGLMKEEGLIQQHEIDDDCGS 635

BLAST of Moc05g31880 vs. ExPASy Swiss-Prot
Match: Q9ZUT5 (Pentatricopeptide repeat-containing protein At2g37310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E49 PE=2 SV=1)

HSP 1 Score: 745.0 bits (1922), Expect = 7.6e-214
Identity = 369/652 (56.60%), Postives = 483/652 (74.08%), Query Frame = 0

Query: 20  GAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKL 79
           G  I  ALQ +     ++  AYG LIQH    R      QLHAR+V+ S+ PDNFL SKL
Sbjct: 4   GFEIQRALQGLLNKAAVDGGAYGHLIQHFTRHRLPLHVLQLHARIVVFSIKPDNFLASKL 63

Query: 80  IAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMD-- 139
           I+FY++    R A +VF  I+ +N FS+NAL I+YT   M+ D   LF S + S+     
Sbjct: 64  ISFYTRQDRFRQALHVFDEITVRNAFSYNALLIAYTSREMYFDAFSLFLSWIGSSCYSSD 123

Query: 140 -VKPDKFTITCVLKALA--SSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEV 199
             +PD  +I+CVLKAL+    F    LA++VH FV+R G +SD+FV N ++TYY++C+ +
Sbjct: 124 AARPDSISISCVLKALSGCDDFWLGSLARQVHGFVIRGGFDSDVFVGNGMITYYTKCDNI 183

Query: 200 VLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQA 259
             AR VF  M ERD+VSWN+M++G+SQ G +E+CK+++K ML+  + KPN +T +SV QA
Sbjct: 184 ESARKVFDEMSERDVVSWNSMISGYSQSGSFEDCKKMYKAMLACSDFKPNGVTVISVFQA 243

Query: 260 CAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTY 319
           C QS+DLIFG+EVH+ + E+ I+MD+SLCNAVIG YAKCGSLDYAR LF+EM EKD VTY
Sbjct: 244 CGQSSDLIFGLEVHKKMIENHIQMDLSLCNAVIGFYAKCGSLDYARALFDEMSEKDSVTY 303

Query: 320 GSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCR 379
           G++ISGYM HG V +AM LF E++   LSTWNA+ISGL+QNN  + V++ FR M   G R
Sbjct: 304 GAIISGYMAHGLVKEAMALFSEMESIGLSTWNAMISGLMQNNHHEEVINSFREMIRCGSR 363

Query: 380 PNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQV 439
           PN VTL+S+LP  ++ S LKGGKEIHA+A+RNG + NIYV T+IID+YAK G+L GA +V
Sbjct: 364 PNTVTLSSLLPSLTYSSNLKGGKEIHAFAIRNGADNNIYVTTSIIDNYAKLGFLLGAQRV 423

Query: 440 FDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGEL 499
           FD  K RSLI WTAII+AYA HGD++ A SLF +M   G +PD VT T+VL A AHSG+ 
Sbjct: 424 FDNCKDRSLIAWTAIITAYAVHGDSDSACSLFDQMQCLGTKPDDVTLTAVLSAFAHSGDS 483

Query: 500 DEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLN 559
           D A  IF+ ML +Y I+P VEHYACMV VLSRAGKLSDA++FISKMPI+P AKVWGALLN
Sbjct: 484 DMAQHIFDSMLTKYDIEPGVEHYACMVSVLSRAGKLSDAMEFISKMPIDPIAKVWGALLN 543

Query: 560 GASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKI 619
           GASV GD+E+ ++  DRL E+EPENTG Y IMANLY+Q+GRW+EA+ VR+ MK +GL+KI
Sbjct: 544 GASVLGDLEIARFACDRLFEMEPENTGNYTIMANLYTQAGRWEEAEMVRNKMKRIGLKKI 603

Query: 620 PGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDE 667
           PG+SWIET  GL SF+A+D+S + + E+YE++EGL+  M ++ YI + E+DE
Sbjct: 604 PGTSWIETEKGLRSFIAKDSSCERSKEMYEIIEGLVESMSDKEYIRKQELDE 655

BLAST of Moc05g31880 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 3.1e-114
Identity = 218/618 (35.28%), Postives = 356/618 (57.61%), Query Frame = 0

Query: 44  LIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAF--YSKSGSLRDAYNVFGNISH 103
           LI+ C   R L   KQ H  ++      D +  SKL A    S   SL  A  VF  I  
Sbjct: 36  LIERCVSLRQL---KQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPK 95

Query: 104 KNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSIL 163
            N F+WN L  +Y   +    +L +++ L   +     P+K+T   ++KA A+  +   L
Sbjct: 96  PNSFAWNTLIRAYA--SGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKA-AAEVSSLSL 155

Query: 164 AKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAMVAGFSQ 223
            + +H   ++  + SD+FV N+L+  Y  C ++  A  VF  + E+D+VSWN+M+ GF Q
Sbjct: 156 GQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 215

Query: 224 GGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQIEMDVS 283
            G  ++  ELFK+M  S ++K + +T V VL ACA+  +L FG +V  ++ E+++ ++++
Sbjct: 216 KGSPDKALELFKKM-ESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLT 275

Query: 284 LCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQELKKPA 343
           L NA++ +Y KCGS++ A+ LF+ M EKD VT+ +M+ GY +      A ++   + +  
Sbjct: 276 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 335

Query: 344 LSTWNAVISGLVQNNQQDGVLDIFRAMQ-SHGCRPNAVTLASVLPVFSHFSTLKGGKEIH 403
           +  WNA+IS   QN + +  L +F  +Q     + N +TL S L   +    L+ G+ IH
Sbjct: 336 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 395

Query: 404 AYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIISAYAAHGDAN 463
           +Y  ++G   N +V +A+I  Y+K G L  + +VF+ V+ R + +W+A+I   A HG  N
Sbjct: 396 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 455

Query: 464 VALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQPLVEHYACM 523
            A+ +FY+M    ++P+ VTFT+V  AC+H+G +DEA  +F+ M   YGI P  +HYAC+
Sbjct: 456 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 515

Query: 524 VGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDRLLEIEPENT 583
           V VL R+G L  AV FI  MPI PS  VWGALL    +  ++ L +    RLLE+EP N 
Sbjct: 516 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 575

Query: 584 GTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSFVARDTSNDSTP 643
           G +++++N+Y++ G+W+   ++R  M+  GL+K PG S IE  G +H F++ D ++  + 
Sbjct: 576 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 635

Query: 644 EIYEMLEGLLGLMKEEGY 659
           ++Y  L  ++  +K  GY
Sbjct: 636 KVYGKLHEVMEKLKSNGY 646

BLAST of Moc05g31880 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 407.1 bits (1045), Expect = 3.8e-112
Identity = 232/687 (33.77%), Postives = 358/687 (52.11%), Query Frame = 0

Query: 44  LIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNVFGNISHKN 103
           ++Q CAD + L+ GK++   +       D+ LGSKL   Y+  G L++A  VF  +  + 
Sbjct: 100 VLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEK 159

Query: 104 IFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSILAK 163
              WN L          S  + LF  +++S    V+ D +T +CV K+  SS       +
Sbjct: 160 ALFWNILMNELAKSGDFSGSIGLFKKMMSSG---VEMDSYTFSCVSKSF-SSLRSVHGGE 219

Query: 164 EVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAMVAGFSQGG 223
           ++H F+L+ G      V N+LV +Y + + V  AR VF  M ERD++SWN+++ G+   G
Sbjct: 220 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 279

Query: 224 FYEECKELFKEML-SSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQIEMDVSL 283
             E+   +F +ML S +E+  +  T VSV   CA S  +  G  VH    ++    +   
Sbjct: 280 LAEKGLSVFVQMLVSGIEI--DLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 339

Query: 284 CNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQELKKPAL 343
           CN ++ +Y+KCG LD A+ +F EM ++  V+Y SMI+GY   G   +A+ LF+E+++  +
Sbjct: 340 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 399

Query: 344 S----------------------------------------------------------- 403
           S                                                           
Sbjct: 400 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 459

Query: 404 -----------TWNAVISGLVQNNQQDGVLDIFR-AMQSHGCRPNAVTLASVLPVFSHFS 463
                      +WN +I G  +N   +  L +F   ++     P+  T+A VLP  +  S
Sbjct: 460 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 519

Query: 464 TLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIIS 523
               G+EIH Y +RNGY  + +VA +++D YAK G L  A  +FD +  + L+ WT +I+
Sbjct: 520 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 579

Query: 524 AYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQ 583
            Y  HG    A++LF +M + GI+ D ++F S+L AC+HSG +DE W+ FNIM  E  I+
Sbjct: 580 GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIE 639

Query: 584 PLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDR 643
           P VEHYAC+V +L+R G L  A  FI  MPI P A +WGALL G  +  DV+L + V ++
Sbjct: 640 PTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEK 699

Query: 644 LLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSFVA 659
           + E+EPENTG Y++MAN+Y+++ +W++  ++R  + + GLRK PG SWIE  G ++ FVA
Sbjct: 700 VFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVA 759

BLAST of Moc05g31880 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 3.0e-109
Identity = 216/642 (33.64%), Postives = 354/642 (55.14%), Query Frame = 0

Query: 53  FLRLGKQLHARLVLLSV-TPDNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALF 112
           + + G  LHAR +   +     F  + +++ YSK G +      F  +  ++  SW  + 
Sbjct: 59  YSKTGYALHARKLFDEMPLRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMI 118

Query: 113 ISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLR 172
           + Y     +   +++   +V      ++P +FT+T VL ++A++       K+VH F+++
Sbjct: 119 VGYKNIGQYHKAIRVMGDMVKEG---IEPTQFTLTNVLASVAATRCME-TGKKVHSFIVK 178

Query: 173 RGLESDIFVVNALVTYYSRCEEVVLARIVFGR---------------------------- 232
            GL  ++ V N+L+  Y++C + ++A+ VF R                            
Sbjct: 179 LGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQ 238

Query: 233 ---MPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSND 292
              M ERDIV+WN+M++GF+Q G+     ++F +ML    L P+  T  SVL ACA    
Sbjct: 239 FEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEK 298

Query: 293 LIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYG--SMI 352
           L  G ++H  +  +  ++   + NA+I +Y++CG ++ AR L E+   KD    G  +++
Sbjct: 299 LCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALL 358

Query: 353 SGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAV 412
            GY+  G +NQA ++F  LK   +  W A+I G  Q+      +++FR+M   G RPN+ 
Sbjct: 359 DGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSY 418

Query: 413 TLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLV 472
           TLA++L V S  ++L  GK+IH  AV++G   ++ V+ A+I  YAK+G +  A + FDL+
Sbjct: 419 TLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLI 478

Query: 473 K-GRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEA 532
           +  R  + WT++I A A HG A  AL LF  ML  G++PD +T+  V  AC H+G +++ 
Sbjct: 479 RCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQG 538

Query: 533 WKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGAS 592
            + F++M     I P + HYACMV +  RAG L +A +FI KMPIEP    WG+LL+   
Sbjct: 539 RQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACR 598

Query: 593 VAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGS 652
           V  +++LGK   +RLL +EPEN+G Y  +ANLYS  G+W+EA K+R  MK+  ++K  G 
Sbjct: 599 VHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGF 658

Query: 653 SWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYI 660
           SWIE    +H F   D ++    EIY  ++ +   +K+ GY+
Sbjct: 659 SWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYV 696

BLAST of Moc05g31880 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 4.7e-107
Identity = 222/656 (33.84%), Postives = 349/656 (53.20%), Query Frame = 0

Query: 48  CADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSW 107
           C +   +R G+  HA  ++     + F+G+ L+A YS+  SL DA  VF  +S  ++ SW
Sbjct: 137 CGEISSVRCGESAHALSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSW 196

Query: 108 NALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHC 167
           N++  SY         L++FS +  +N    +PD  T+  VL   AS  T S L K++HC
Sbjct: 197 NSIIESYAKLGKPKVALEMFSRM--TNEFGCRPDNITLVNVLPPCASLGTHS-LGKQLHC 256

Query: 168 FVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEE 227
           F +   +  ++FV N LV  Y++C  +  A  VF  M  +D+VSWNAMVAG+SQ G +E+
Sbjct: 257 FAVTSEMIQNMFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFED 316

Query: 228 CKELFKEM----------------------------------LSSVELKPNALTAVSVLQ 287
              LF++M                                  + S  +KPN +T +SVL 
Sbjct: 317 AVRLFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLS 376

Query: 288 ACAQSNDLIFGMEVHRFVNESQIEM-------DVSLCNAVIGLYAKCGSLDYARELFEEM 347
            CA    L+ G E+H +  +  I++       +  + N +I +YAKC  +D AR +F+ +
Sbjct: 377 GCASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSL 436

Query: 348 --PEKDEVTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDI 407
              E+D VT+  MI GY  HG  N+A++L  E+ +    T                    
Sbjct: 437 SPKERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQT-------------------- 496

Query: 408 FRAMQSHGCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNG-NIYVATAIIDSYA 467
                    RPNA T++  L   +  + L+ GK+IHAYA+RN  N   ++V+  +ID YA
Sbjct: 497 ---------RPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAVPLFVSNCLIDMYA 556

Query: 468 KSGYLHGAWQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTS 527
           K G +  A  VFD +  ++ + WT++++ Y  HG    AL +F EM R G + D VT   
Sbjct: 557 KCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMRRIGFKLDGVTLLV 616

Query: 528 VLVACAHSGELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIE 587
           VL AC+HSG +D+  + FN M   +G+ P  EHYAC+V +L RAG+L+ A+  I +MP+E
Sbjct: 617 VLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRLNAALRLIEEMPME 676

Query: 588 PSAKVWGALLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVR 647
           P   VW A L+   + G VELG+Y  +++ E+   + G+Y +++NLY+ +GRWK+  ++R
Sbjct: 677 PPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLYANAGRWKDVTRIR 736

Query: 648 DLMKEVGLRKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYI 660
            LM+  G++K PG SW+E   G  +F   D ++    EIY++L   +  +K+ GY+
Sbjct: 737 SLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKTHPHAKEIYQVLLDHMQRIKDIGYV 760

BLAST of Moc05g31880 vs. ExPASy TrEMBL
Match: A0A6J1CWN9 (pentatricopeptide repeat-containing protein At2g37310 OS=Momordica charantia OX=3673 GN=LOC111015095 PE=4 SV=1)

HSP 1 Score: 1326.2 bits (3431), Expect = 0.0e+00
Identity = 660/660 (100.00%), Postives = 660/660 (100.00%), Query Frame = 0

Query: 12  MNQISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTP 71
           MNQISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTP
Sbjct: 1   MNQISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTP 60

Query: 72  DNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLV 131
           DNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLV
Sbjct: 61  DNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLV 120

Query: 132 NSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRC 191
           NSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRC
Sbjct: 121 NSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRC 180

Query: 192 EEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSV 251
           EEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSV
Sbjct: 181 EEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSV 240

Query: 252 LQACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDE 311
           LQACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDE
Sbjct: 241 LQACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDE 300

Query: 312 VTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSH 371
           VTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSH
Sbjct: 301 VTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSH 360

Query: 372 GCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGA 431
           GCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGA
Sbjct: 361 GCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGA 420

Query: 432 WQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHS 491
           WQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHS
Sbjct: 421 WQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHS 480

Query: 492 GELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGA 551
           GELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGA
Sbjct: 481 GELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGA 540

Query: 552 LLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGL 611
           LLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGL
Sbjct: 541 LLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGL 600

Query: 612 RKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG 671
           RKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG
Sbjct: 601 RKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG 660

BLAST of Moc05g31880 vs. ExPASy TrEMBL
Match: A0A6J1F110 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita moschata OX=3662 GN=LOC111441405 PE=4 SV=1)

HSP 1 Score: 1134.0 bits (2932), Expect = 0.0e+00
Identity = 559/635 (88.03%), Postives = 592/635 (93.23%), Query Frame = 0

Query: 36  MNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNV 95
           MNY AYGRLIQHC D+RF RLGKQLHARLVL SV PDNFLGSKLIA YSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 60

Query: 96  FGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASS 155
           F +ISHKNIFSWNALFISYTLHNMH+DMLKLFSSLVN N+ DVKPDKFT+TCVLKALAS 
Sbjct: 61  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNVNSTDVKPDKFTVTCVLKALASL 120

Query: 156 FTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAM 215
           FT+SILAKEVHCFVLRRGLESDIFVVNAL+T+YSRC+E+ LARI+F R PERDIVSWNAM
Sbjct: 121 FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELALARIMFDRTPERDIVSWNAM 180

Query: 216 VAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 275
           VAG+SQGGFYE+CKELFK ML S E KPNALTAVSVLQACA SNDLIFGMEVH+FVNES 
Sbjct: 181 VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAHSNDLIFGMEVHKFVNESG 240

Query: 276 IEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQ 335
           IEMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYMVHG VNQAMDLF+
Sbjct: 241 IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 300

Query: 336 ELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAVTLASVLPVFSHFSTLKG 395
           EL++PALSTWNAVISGLVQNNQQDGV+DIFRAMQ HGCRPN VTLASVLP+FSHFSTLKG
Sbjct: 301 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 360

Query: 396 GKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIISAYAA 455
           GKEIHAYAVRN Y+GNIYVATAIIDSYAKSGYL GA QVFD +K RSLIIWTAIISAYAA
Sbjct: 361 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQLKRRSLIIWTAIISAYAA 420

Query: 456 HGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQPLVE 515
           HGDAN  LSLFYEML NGI+PDPVTFTSVLVACAHSGELDEAWKIFN++LPE+GIQPLVE
Sbjct: 421 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELDEAWKIFNVLLPEFGIQPLVE 480

Query: 516 HYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDRLLEI 575
           HYACMVGVLSRAGKLSDAV+FISKMPIEP+AKVWGALLNGASVAGDVELGKYVFDRLL+I
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 540

Query: 576 EPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSFVARDTS 635
           EPENTG YIIMANLYSQ GRWKEAD VRDLMKEVGL+KIPG+SWIET  GL SFVARDTS
Sbjct: 541 EPENTGNYIIMANLYSQFGRWKEADNVRDLMKEVGLKKIPGNSWIETREGLQSFVARDTS 600

Query: 636 NDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGS 671
           ND TPEIY  LEGL+GLMKEEG I Q+EID+DCGS
Sbjct: 601 NDRTPEIYGTLEGLVGLMKEEGLIQQHEIDDDCGS 635

BLAST of Moc05g31880 vs. ExPASy TrEMBL
Match: A0A6J1J0S5 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita maxima OX=3661 GN=LOC111482423 PE=4 SV=1)

HSP 1 Score: 1127.9 bits (2916), Expect = 0.0e+00
Identity = 559/636 (87.89%), Postives = 593/636 (93.24%), Query Frame = 0

Query: 36  MNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNV 95
           MNY AYGRLIQHC D+RF RLGKQLHARLVL SV PDNFLGSKLIA YSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 60

Query: 96  FGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASS 155
           F +ISHKNIFSWNALFISYTLHNMH+DMLKLFSSLVN N+ DVKPDKFT+TCVLKALAS 
Sbjct: 61  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDKFTVTCVLKALASL 120

Query: 156 FTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAM 215
           FT+SILAKEVHCFVLRRGLESDIFVVNAL+T+YSRC+E+VLARI+F R PERDIVSWNAM
Sbjct: 121 FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFHRTPERDIVSWNAM 180

Query: 216 VAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 275
           VAG+SQGGFYE+CKELFK ML S E KPNALTAVSVLQACAQSNDLIFGMEVH+FVNES 
Sbjct: 181 VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLIFGMEVHKFVNESG 240

Query: 276 IEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQ 335
           IEMDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYGSMISGYMVHG VNQAMDLF+
Sbjct: 241 IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 300

Query: 336 ELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAVTLASVLPVFSHFSTLKG 395
           EL++PALSTWNAVISGLVQNNQQDGV+DIFRAMQ HGCRPN VTLASVLP+FSHFSTLKG
Sbjct: 301 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 360

Query: 396 GKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIISAYAA 455
           GKEIHAYAVRN Y+GNIYVATAIIDSYAKSGYL GA QVFD  K RSLIIWTAIISAYAA
Sbjct: 361 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQSKRRSLIIWTAIISAYAA 420

Query: 456 HGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQPLVE 515
           HGDAN  LSLFYEML NGI+PDPVTFTSVLVACAHSGEL+EAWKIFN++LPE+GIQPLVE
Sbjct: 421 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEFGIQPLVE 480

Query: 516 HYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDRLLEI 575
           HYACMVGVLSRAGKLSDAV+FISKMPIEP+AKVWGALLNGASVAGDVELGKYVFDRLL+I
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 540

Query: 576 EPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSFVARDTS 635
           EPENTG YIIMANLYSQ G WKEAD VRDLMKEVGL+KIPG+SWIET GGL SFVARDTS
Sbjct: 541 EPENTGNYIIMANLYSQFGWWKEADHVRDLMKEVGLKKIPGNSWIETRGGLQSFVARDTS 600

Query: 636 NDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGSG 672
           ND TPEIY  LEGL+GLMK EG I Q+EID++CGSG
Sbjct: 601 NDRTPEIYGTLEGLVGLMK-EGLIQQHEIDDECGSG 635

BLAST of Moc05g31880 vs. ExPASy TrEMBL
Match: A0A5A7TRM4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G002030 PE=4 SV=1)

HSP 1 Score: 1038.1 bits (2683), Expect = 1.6e-299
Identity = 505/594 (85.02%), Postives = 549/594 (92.42%), Query Frame = 0

Query: 36  MNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNV 95
           MNY AYGRLIQHC D  F R+GKQLHARLVL SV PDNFLGSKLI+FYSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60

Query: 96  FGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASS 155
           FG I  KNIFSWNALFISYTLHNMH+D+LKLF SLVNSN+ DVKPD+FT+TCVLKALAS 
Sbjct: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120

Query: 156 FTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAM 215
           F++S+LAKEVHCF+LRR LESDIFVVNAL+T+YSRC+E+VLARI+F RMPERDIVSWNAM
Sbjct: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180

Query: 216 VAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 275
           +AG+SQGG YE+CKELF+ M SS+E+KPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ
Sbjct: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 240

Query: 276 IEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQ 335
           I+MDVSL NAVIGLYAKCGSLDYARELFEEMPEKD +TY SMISGYMVHG VNQAMDLF+
Sbjct: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300

Query: 336 ELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAVTLASVLPVFSHFSTLKG 395
           EL++P L TWNAVISGLVQNN+QDG LDIFRAMQSHGCRPN VTLAS+LP+FSHFSTLKG
Sbjct: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360

Query: 396 GKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIISAYAA 455
           GKEIH YA+RN Y+GNI+VATAIIDSYAK GYL GA QVFD +KGRSLI WT+IISAYA 
Sbjct: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420

Query: 456 HGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQPLVE 515
           HGDANVALSLFYEML  GIQPD VTFTSVL ACAHSGELDEAWKIFNI+LP+YGIQPLVE
Sbjct: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480

Query: 516 HYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDRLLEI 575
           HYACMVGVLSRAGKLSDAV+FISKMP+EP+AKVWGALLNGASVAGDVELGKYVFDRL EI
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540

Query: 576 EPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSF 630
           EP NTG Y+IMANLYSQSGRWKEAD +RDLMKEV L+KIPG+SWIET GGL SF
Sbjct: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 594

BLAST of Moc05g31880 vs. ExPASy TrEMBL
Match: A0A1S4DUQ6 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucumis melo OX=3656 GN=LOC107990300 PE=4 SV=1)

HSP 1 Score: 1038.1 bits (2683), Expect = 1.6e-299
Identity = 505/594 (85.02%), Postives = 549/594 (92.42%), Query Frame = 0

Query: 36  MNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNV 95
           MNY AYGRLIQHC D  F R+GKQLHARLVL SV PDNFLGSKLI+FYSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDHLFFRVGKQLHARLVLSSVAPDNFLGSKLISFYSKSGSLRDAYNV 60

Query: 96  FGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASS 155
           FG I  KNIFSWNALFISYTLHNMH+D+LKLF SLVNSN+ DVKPD+FT+TCVLKALAS 
Sbjct: 61  FGKIPRKNIFSWNALFISYTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASL 120

Query: 156 FTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAM 215
           F++S+LAKEVHCF+LRR LESDIFVVNAL+T+YSRC+E+VLARI+F RMPERDIVSWNAM
Sbjct: 121 FSNSVLAKEVHCFILRRELESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAM 180

Query: 216 VAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 275
           +AG+SQGG YE+CKELF+ M SS+E+KPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ
Sbjct: 181 LAGYSQGGSYEKCKELFRVMSSSLEVKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 240

Query: 276 IEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQ 335
           I+MDVSL NAVIGLYAKCGSLDYARELFEEMPEKD +TY SMISGYMVHG VNQAMDLF+
Sbjct: 241 IKMDVSLWNAVIGLYAKCGSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFR 300

Query: 336 ELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAVTLASVLPVFSHFSTLKG 395
           EL++P L TWNAVISGLVQNN+QDG LDIFRAMQSHGCRPN VTLAS+LP+FSHFSTLKG
Sbjct: 301 ELERPRLPTWNAVISGLVQNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKG 360

Query: 396 GKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIISAYAA 455
           GKEIH YA+RN Y+GNI+VATAIIDSYAK GYL GA QVFD +KGRSLI WT+IISAYA 
Sbjct: 361 GKEIHGYAIRNTYDGNIFVATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAV 420

Query: 456 HGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQPLVE 515
           HGDANVALSLFYEML  GIQPD VTFTSVL ACAHSGELDEAWKIFNI+LP+YGIQPLVE
Sbjct: 421 HGDANVALSLFYEMLTYGIQPDQVTFTSVLAACAHSGELDEAWKIFNILLPDYGIQPLVE 480

Query: 516 HYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDRLLEI 575
           HYACMVGVLSRAGKLSDAV+FISKMP+EP+AKVWGALLNGASVAGDVELGKYVFDRL EI
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPLEPNAKVWGALLNGASVAGDVELGKYVFDRLFEI 540

Query: 576 EPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSF 630
           EP NTG Y+IMANLYSQSGRWKEAD +RDLMKEV L+KIPG+SWIET GGL SF
Sbjct: 541 EPGNTGNYVIMANLYSQSGRWKEADTIRDLMKEVRLKKIPGNSWIETRGGLQSF 594

BLAST of Moc05g31880 vs. TAIR 10
Match: AT2G37310.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 745.0 bits (1922), Expect = 5.4e-215
Identity = 369/652 (56.60%), Postives = 483/652 (74.08%), Query Frame = 0

Query: 20  GAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKL 79
           G  I  ALQ +     ++  AYG LIQH    R      QLHAR+V+ S+ PDNFL SKL
Sbjct: 4   GFEIQRALQGLLNKAAVDGGAYGHLIQHFTRHRLPLHVLQLHARIVVFSIKPDNFLASKL 63

Query: 80  IAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMD-- 139
           I+FY++    R A +VF  I+ +N FS+NAL I+YT   M+ D   LF S + S+     
Sbjct: 64  ISFYTRQDRFRQALHVFDEITVRNAFSYNALLIAYTSREMYFDAFSLFLSWIGSSCYSSD 123

Query: 140 -VKPDKFTITCVLKALA--SSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEEV 199
             +PD  +I+CVLKAL+    F    LA++VH FV+R G +SD+FV N ++TYY++C+ +
Sbjct: 124 AARPDSISISCVLKALSGCDDFWLGSLARQVHGFVIRGGFDSDVFVGNGMITYYTKCDNI 183

Query: 200 VLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQA 259
             AR VF  M ERD+VSWN+M++G+SQ G +E+CK+++K ML+  + KPN +T +SV QA
Sbjct: 184 ESARKVFDEMSERDVVSWNSMISGYSQSGSFEDCKKMYKAMLACSDFKPNGVTVISVFQA 243

Query: 260 CAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTY 319
           C QS+DLIFG+EVH+ + E+ I+MD+SLCNAVIG YAKCGSLDYAR LF+EM EKD VTY
Sbjct: 244 CGQSSDLIFGLEVHKKMIENHIQMDLSLCNAVIGFYAKCGSLDYARALFDEMSEKDSVTY 303

Query: 320 GSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCR 379
           G++ISGYM HG V +AM LF E++   LSTWNA+ISGL+QNN  + V++ FR M   G R
Sbjct: 304 GAIISGYMAHGLVKEAMALFSEMESIGLSTWNAMISGLMQNNHHEEVINSFREMIRCGSR 363

Query: 380 PNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQV 439
           PN VTL+S+LP  ++ S LKGGKEIHA+A+RNG + NIYV T+IID+YAK G+L GA +V
Sbjct: 364 PNTVTLSSLLPSLTYSSNLKGGKEIHAFAIRNGADNNIYVTTSIIDNYAKLGFLLGAQRV 423

Query: 440 FDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGEL 499
           FD  K RSLI WTAII+AYA HGD++ A SLF +M   G +PD VT T+VL A AHSG+ 
Sbjct: 424 FDNCKDRSLIAWTAIITAYAVHGDSDSACSLFDQMQCLGTKPDDVTLTAVLSAFAHSGDS 483

Query: 500 DEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLN 559
           D A  IF+ ML +Y I+P VEHYACMV VLSRAGKLSDA++FISKMPI+P AKVWGALLN
Sbjct: 484 DMAQHIFDSMLTKYDIEPGVEHYACMVSVLSRAGKLSDAMEFISKMPIDPIAKVWGALLN 543

Query: 560 GASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKI 619
           GASV GD+E+ ++  DRL E+EPENTG Y IMANLY+Q+GRW+EA+ VR+ MK +GL+KI
Sbjct: 544 GASVLGDLEIARFACDRLFEMEPENTGNYTIMANLYTQAGRWEEAEMVRNKMKRIGLKKI 603

Query: 620 PGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDE 667
           PG+SWIET  GL SF+A+D+S + + E+YE++EGL+  M ++ YI + E+DE
Sbjct: 604 PGTSWIETEKGLRSFIAKDSSCERSKEMYEIIEGLVESMSDKEYIRKQELDE 655

BLAST of Moc05g31880 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 414.1 bits (1063), Expect = 2.2e-115
Identity = 218/618 (35.28%), Postives = 356/618 (57.61%), Query Frame = 0

Query: 44  LIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAF--YSKSGSLRDAYNVFGNISH 103
           LI+ C   R L   KQ H  ++      D +  SKL A    S   SL  A  VF  I  
Sbjct: 36  LIERCVSLRQL---KQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPK 95

Query: 104 KNIFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSIL 163
            N F+WN L  +Y   +    +L +++ L   +     P+K+T   ++KA A+  +   L
Sbjct: 96  PNSFAWNTLIRAYA--SGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKA-AAEVSSLSL 155

Query: 164 AKEVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAMVAGFSQ 223
            + +H   ++  + SD+FV N+L+  Y  C ++  A  VF  + E+D+VSWN+M+ GF Q
Sbjct: 156 GQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 215

Query: 224 GGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQIEMDVS 283
            G  ++  ELFK+M  S ++K + +T V VL ACA+  +L FG +V  ++ E+++ ++++
Sbjct: 216 KGSPDKALELFKKM-ESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLT 275

Query: 284 LCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQELKKPA 343
           L NA++ +Y KCGS++ A+ LF+ M EKD VT+ +M+ GY +      A ++   + +  
Sbjct: 276 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 335

Query: 344 LSTWNAVISGLVQNNQQDGVLDIFRAMQ-SHGCRPNAVTLASVLPVFSHFSTLKGGKEIH 403
           +  WNA+IS   QN + +  L +F  +Q     + N +TL S L   +    L+ G+ IH
Sbjct: 336 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 395

Query: 404 AYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIISAYAAHGDAN 463
           +Y  ++G   N +V +A+I  Y+K G L  + +VF+ V+ R + +W+A+I   A HG  N
Sbjct: 396 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 455

Query: 464 VALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQPLVEHYACM 523
            A+ +FY+M    ++P+ VTFT+V  AC+H+G +DEA  +F+ M   YGI P  +HYAC+
Sbjct: 456 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 515

Query: 524 VGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDRLLEIEPENT 583
           V VL R+G L  AV FI  MPI PS  VWGALL    +  ++ L +    RLLE+EP N 
Sbjct: 516 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 575

Query: 584 GTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSFVARDTSNDSTP 643
           G +++++N+Y++ G+W+   ++R  M+  GL+K PG S IE  G +H F++ D ++  + 
Sbjct: 576 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 635

Query: 644 EIYEMLEGLLGLMKEEGY 659
           ++Y  L  ++  +K  GY
Sbjct: 636 KVYGKLHEVMEKLKSNGY 646

BLAST of Moc05g31880 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 407.1 bits (1045), Expect = 2.7e-113
Identity = 232/687 (33.77%), Postives = 358/687 (52.11%), Query Frame = 0

Query: 44  LIQHCADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNVFGNISHKN 103
           ++Q CAD + L+ GK++   +       D+ LGSKL   Y+  G L++A  VF  +  + 
Sbjct: 100 VLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEK 159

Query: 104 IFSWNALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSILAK 163
              WN L          S  + LF  +++S    V+ D +T +CV K+  SS       +
Sbjct: 160 ALFWNILMNELAKSGDFSGSIGLFKKMMSSG---VEMDSYTFSCVSKSF-SSLRSVHGGE 219

Query: 164 EVHCFVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAMVAGFSQGG 223
           ++H F+L+ G      V N+LV +Y + + V  AR VF  M ERD++SWN+++ G+   G
Sbjct: 220 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 279

Query: 224 FYEECKELFKEML-SSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQIEMDVSL 283
             E+   +F +ML S +E+  +  T VSV   CA S  +  G  VH    ++    +   
Sbjct: 280 LAEKGLSVFVQMLVSGIEI--DLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 339

Query: 284 CNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGSMISGYMVHGSVNQAMDLFQELKKPAL 343
           CN ++ +Y+KCG LD A+ +F EM ++  V+Y SMI+GY   G   +A+ LF+E+++  +
Sbjct: 340 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 399

Query: 344 S----------------------------------------------------------- 403
           S                                                           
Sbjct: 400 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 459

Query: 404 -----------TWNAVISGLVQNNQQDGVLDIFR-AMQSHGCRPNAVTLASVLPVFSHFS 463
                      +WN +I G  +N   +  L +F   ++     P+  T+A VLP  +  S
Sbjct: 460 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 519

Query: 464 TLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLVKGRSLIIWTAIIS 523
               G+EIH Y +RNGY  + +VA +++D YAK G L  A  +FD +  + L+ WT +I+
Sbjct: 520 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 579

Query: 524 AYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEAWKIFNIMLPEYGIQ 583
            Y  HG    A++LF +M + GI+ D ++F S+L AC+HSG +DE W+ FNIM  E  I+
Sbjct: 580 GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIE 639

Query: 584 PLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGASVAGDVELGKYVFDR 643
           P VEHYAC+V +L+R G L  A  FI  MPI P A +WGALL G  +  DV+L + V ++
Sbjct: 640 PTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEK 699

Query: 644 LLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGSSWIETSGGLHSFVA 659
           + E+EPENTG Y++MAN+Y+++ +W++  ++R  + + GLRK PG SWIE  G ++ FVA
Sbjct: 700 VFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVA 759

BLAST of Moc05g31880 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 397.5 bits (1020), Expect = 2.1e-110
Identity = 216/642 (33.64%), Postives = 354/642 (55.14%), Query Frame = 0

Query: 53  FLRLGKQLHARLVLLSV-TPDNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALF 112
           + + G  LHAR +   +     F  + +++ YSK G +      F  +  ++  SW  + 
Sbjct: 59  YSKTGYALHARKLFDEMPLRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMI 118

Query: 113 ISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLR 172
           + Y     +   +++   +V      ++P +FT+T VL ++A++       K+VH F+++
Sbjct: 119 VGYKNIGQYHKAIRVMGDMVKEG---IEPTQFTLTNVLASVAATRCME-TGKKVHSFIVK 178

Query: 173 RGLESDIFVVNALVTYYSRCEEVVLARIVFGR---------------------------- 232
            GL  ++ V N+L+  Y++C + ++A+ VF R                            
Sbjct: 179 LGLRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQ 238

Query: 233 ---MPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQACAQSND 292
              M ERDIV+WN+M++GF+Q G+     ++F +ML    L P+  T  SVL ACA    
Sbjct: 239 FEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEK 298

Query: 293 LIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYG--SMI 352
           L  G ++H  +  +  ++   + NA+I +Y++CG ++ AR L E+   KD    G  +++
Sbjct: 299 LCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALL 358

Query: 353 SGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGCRPNAV 412
            GY+  G +NQA ++F  LK   +  W A+I G  Q+      +++FR+M   G RPN+ 
Sbjct: 359 DGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSY 418

Query: 413 TLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQVFDLV 472
           TLA++L V S  ++L  GK+IH  AV++G   ++ V+ A+I  YAK+G +  A + FDL+
Sbjct: 419 TLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLI 478

Query: 473 K-GRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGELDEA 532
           +  R  + WT++I A A HG A  AL LF  ML  G++PD +T+  V  AC H+G +++ 
Sbjct: 479 RCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQG 538

Query: 533 WKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALLNGAS 592
            + F++M     I P + HYACMV +  RAG L +A +FI KMPIEP    WG+LL+   
Sbjct: 539 RQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACR 598

Query: 593 VAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRKIPGS 652
           V  +++LGK   +RLL +EPEN+G Y  +ANLYS  G+W+EA K+R  MK+  ++K  G 
Sbjct: 599 VHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGF 658

Query: 653 SWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYI 660
           SWIE    +H F   D ++    EIY  ++ +   +K+ GY+
Sbjct: 659 SWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYV 696

BLAST of Moc05g31880 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 390.2 bits (1001), Expect = 3.4e-108
Identity = 222/656 (33.84%), Postives = 349/656 (53.20%), Query Frame = 0

Query: 48  CADRRFLRLGKQLHARLVLLSVTPDNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSW 107
           C +   +R G+  HA  ++     + F+G+ L+A YS+  SL DA  VF  +S  ++ SW
Sbjct: 137 CGEISSVRCGESAHALSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSW 196

Query: 108 NALFISYTLHNMHSDMLKLFSSLVNSNAMDVKPDKFTITCVLKALASSFTDSILAKEVHC 167
           N++  SY         L++FS +  +N    +PD  T+  VL   AS  T S L K++HC
Sbjct: 197 NSIIESYAKLGKPKVALEMFSRM--TNEFGCRPDNITLVNVLPPCASLGTHS-LGKQLHC 256

Query: 168 FVLRRGLESDIFVVNALVTYYSRCEEVVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEE 227
           F +   +  ++FV N LV  Y++C  +  A  VF  M  +D+VSWNAMVAG+SQ G +E+
Sbjct: 257 FAVTSEMIQNMFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFED 316

Query: 228 CKELFKEM----------------------------------LSSVELKPNALTAVSVLQ 287
              LF++M                                  + S  +KPN +T +SVL 
Sbjct: 317 AVRLFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLS 376

Query: 288 ACAQSNDLIFGMEVHRFVNESQIEM-------DVSLCNAVIGLYAKCGSLDYARELFEEM 347
            CA    L+ G E+H +  +  I++       +  + N +I +YAKC  +D AR +F+ +
Sbjct: 377 GCASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSL 436

Query: 348 --PEKDEVTYGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDI 407
              E+D VT+  MI GY  HG  N+A++L  E+ +    T                    
Sbjct: 437 SPKERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQT-------------------- 496

Query: 408 FRAMQSHGCRPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNG-NIYVATAIIDSYA 467
                    RPNA T++  L   +  + L+ GK+IHAYA+RN  N   ++V+  +ID YA
Sbjct: 497 ---------RPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAVPLFVSNCLIDMYA 556

Query: 468 KSGYLHGAWQVFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTS 527
           K G +  A  VFD +  ++ + WT++++ Y  HG    AL +F EM R G + D VT   
Sbjct: 557 KCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMRRIGFKLDGVTLLV 616

Query: 528 VLVACAHSGELDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIE 587
           VL AC+HSG +D+  + FN M   +G+ P  EHYAC+V +L RAG+L+ A+  I +MP+E
Sbjct: 617 VLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRLNAALRLIEEMPME 676

Query: 588 PSAKVWGALLNGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVR 647
           P   VW A L+   + G VELG+Y  +++ E+   + G+Y +++NLY+ +GRWK+  ++R
Sbjct: 677 PPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLYANAGRWKDVTRIR 736

Query: 648 DLMKEVGLRKIPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYI 660
            LM+  G++K PG SW+E   G  +F   D ++    EIY++L   +  +K+ GY+
Sbjct: 737 SLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKTHPHAKEIYQVLLDHMQRIKDIGYV 760

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022145703.10.0e+00100.00pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia][more]
XP_038905794.10.0e+0087.63pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_03... [more]
KAG6580575.10.0e+0088.14ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. soror... [more]
KAG7017327.10.0e+0087.98ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyr... [more]
XP_022934145.10.0e+0088.03pentatricopeptide repeat-containing protein At2g37310 [Cucurbita moschata] >XP_0... [more]
Match NameE-valueIdentityDescription
Q9ZUT57.6e-21456.60Pentatricopeptide repeat-containing protein At2g37310 OS=Arabidopsis thaliana OX... [more]
O823803.1e-11435.28Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9SN393.8e-11233.77Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9SHZ83.0e-10933.64Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
Q9LFL54.7e-10733.84Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1CWN90.0e+00100.00pentatricopeptide repeat-containing protein At2g37310 OS=Momordica charantia OX=... [more]
A0A6J1F1100.0e+0088.03pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita moschata OX=3... [more]
A0A6J1J0S50.0e+0087.89pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita maxima OX=366... [more]
A0A5A7TRM41.6e-29985.02Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S4DUQ61.6e-29985.02pentatricopeptide repeat-containing protein At2g37310 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT2G37310.15.4e-21556.60Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.12.2e-11535.28Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G18750.12.7e-11333.77Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G22070.12.1e-11033.64pentatricopeptide (PPR) repeat-containing protein [more]
AT5G16860.13.4e-10833.84Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 497..613
e-value: 2.7E-11
score: 45.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 26..159
e-value: 1.5E-12
score: 49.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 160..260
e-value: 3.2E-19
score: 71.0
coord: 261..339
e-value: 1.6E-18
score: 68.7
coord: 396..495
e-value: 1.3E-21
score: 78.7
coord: 340..395
e-value: 2.1E-9
score: 39.0
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 454..605
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 105..132
e-value: 0.65
score: 10.4
coord: 416..441
e-value: 0.038
score: 14.2
coord: 210..237
e-value: 3.0E-7
score: 30.2
coord: 283..309
e-value: 7.4E-5
score: 22.7
coord: 312..339
e-value: 1.7E-5
score: 24.8
coord: 582..610
e-value: 0.0014
score: 18.7
coord: 517..540
e-value: 0.65
score: 10.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 343..383
e-value: 1.7E-7
score: 31.3
coord: 444..489
e-value: 3.9E-11
score: 43.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 210..244
e-value: 2.2E-6
score: 25.4
coord: 312..339
e-value: 2.1E-4
score: 19.2
coord: 282..309
e-value: 3.9E-4
score: 18.4
coord: 582..611
e-value: 0.0014
score: 16.6
coord: 445..477
e-value: 1.5E-7
score: 29.1
coord: 344..377
e-value: 8.8E-7
score: 26.7
coord: 479..512
e-value: 2.3E-4
score: 19.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 442..476
score: 11.39981
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 579..613
score: 9.876189
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 208..238
score: 10.413293
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 341..375
score: 10.972319
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 477..512
score: 10.807899
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 279..313
score: 10.029647
NoneNo IPR availablePANTHERPTHR47925:SF76BNAA04G21330D PROTEINcoord: 33..666
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 33..666

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc05g31880.1Moc05g31880.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding