Moc08g38960 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc08g38960
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Pentatricopeptide repeat-containing protein
Location: chr8: 29383888 .. 29386161 (-)
RNA-Seq Expression: Moc08g38960
Synteny: Moc08g38960
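The Location field gives 1-based inclusive coordinates on the minus strand. The sketch below shows how the feature length follows from those coordinates and how a minus-strand feature relates to the reference strand. The function names are illustrative only, and the short reference-strand slice is a hypothetical input constructed from the last 12 bases of the gene sequence listed on this page; it was not read from the chr8 assembly.

```python
# Coordinates copied from the Location field (1-based, inclusive).
START = 29383888
END = 29386161
STRAND = "-"


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string (A/C/G/T/N, either case)."""
    table = str.maketrans("ACGTNacgtn", "TGCANtgcan")
    return seq.translate(table)[::-1]


length = feature_length(START, END)
print(length)           # 2274
print(length % 3 == 0)  # True: consistent with an intron-free coding sequence

# Hypothetical reference-strand slice (assumption, built from the
# sequence on this page). For a "-" strand feature the sense sequence
# is the reverse complement of the reference strand.
if STRAND == "-":
    print(reverse_complement("TCACCAATAATC"))  # GATTATTGGTGA
```

The length of 2274 bp is divisible by three, which fits the listing on this page, where the gene, mRNA, and CDS sequences are identical.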
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGATGGATGAATCTGGGCAGTCGGTGCTTCTCTTCTGCTGCTTTTCTGAAACTTTCCCGTTCTGTTTCTCAAGTTTCGTTGCCCCAAAAACTCGTATCATTCAACTTGTCTCAGCATCAGCTGTTCAAGTCATGTTGCTACCACTCTTCCAATGATTCGTTGGCTTATACCCTTCACGCCAAGATGGTCAAGAATGGTTCTATTTTGGATTCAGGAAAGTTCGTTTTGAGTTCTTACGTGAAATCTGAGAAATTAGACGATGCACAGAAAGTGTTCGACGAAATGCCCAACAGGGATGTTCTCACTTGGACGGTCCTTATATCGGGTTTTGCTCGAGTAAGATGTTCTGAAATGGCATTGAGACTCTTTAGAGAGATGCTGGTTGAAGGTGTTAGTCCAAATCATTTTACTTTGTCTAGTGTTCTTAAACTTTGCTCTAGAGTAGGTGATGTTCAAATGGGTAAGGGGATTCATGGATGGATGCTCAGAAGTGGGGTCAACTTAGATGTTGTCCTGGAGAATTCTGTGCTTGATTTGTATGCAAAGTTTGATGCATTTGAATATGCCAAAAAGTTATTTGATTCAATGAGAGAAAAAAGTACTGCCACGTACAATATAATGCTTGGTGTGTATGTCCGTAGTTGTGATGTTAACAAATCTCTTGAATTATTCAGAAAATTGCCCTGCAGAGATACCGCGAGTTGGAATACGATTATATGTGGGCTAATGCAAGGTGGACATCTGAAAACAGCATTGGAGCTACTCTATGAGATGGTGGAGAACGAACCTGAGTTTAACAAAGTTACTTCCTCCATAGCTTTAAGTGTAGTTTCTTCTTTATTGATTATGGCGCTAGGGAGACAAGTACATGGCCGAATTGTTAGGTTTGGTTTTCATAATGATGGATTTGTGAAGAGTTCTCTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAGGCATCGGTGATATATAATCAAATGCCTTCAGATTTTGTGAGTAAACAAGGTTCCAACATTGTATGTAGCGACATGATGACAGAAATTGTATCACGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGGAAGTATGAAGATGCCTTCAAAACTTTCATTTCTATGATGCGTGAACGGGCTCTGATGGACAGATTTACCATTGCAAGCATTGTATCAGCTTGTTCTAATGCTGCTGTTTTAGAGCTTGGTCGTCAAGTCCATGCTTATATTCAGAAAATTGGGGAAAGGCTCGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCCAAAGGTGGGAGTTTGGATTGTGCCCGTCAAACTTTTGAGCAAATGACCTATGCAAATTTTGTGATATGGACTTCCATGATTGCTGGATGTGCTTTGCACGGGCAAGGTAAGGAAGCCATTAGATTGTTTGAACAGATGAGATCTGAAGGAATCATACCAAATGAGGTTACTTTCGTAGGAGTTTTAACAGCTTGCAGTCATGCAGGTCTGCTTAAAGACGGTCGTCGATATTTTAACATGATGAAAGATGTTTATGCAATCAAGCCTAAAGTCGAGCATTTCACTTGTATGGTAGATCTTTACGGTCGAGCTGGATGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAACGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTATCATCTTGTCGGCTTTACAAGGACATCGAAATGGGAAATTGGGTTTCTGAAAAATTGCTTGGGCTCGAACCACAAGATGAAGGGCCTTATGTTTTACTATCAAACATGTGCTCCAGCAATCAAAAATGGGAAGAAGCTTCCAGAACAAGAAGATCTATGCAACGCAGAGGGATTAACAAAACACCTGGCCAATCTTGGATTCATGTGAAAAATCAAGTCCACTCCTTTGTTGCGGGAGACCGATCGCACCCTCAACATGCTCAGATATATGCATATCTGGACAAGCTAATTGGAAGATTGAAGGAAATTGGGTATTT
GTATGATGTAAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTAGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGCATTATCAGTTTGGCTTCTGGCATTCCAATCCGAATCATGAAGAACCTTCGAGTATGCGCTGATTGTCATAACTTTATGAAACTAACATCTCAGCTTTTAGGCAGGGAGATAATTGTTCGAGATATTCATCGTTTCCATCATTTTAACTCCGGTCGCTGCTCTTGTGGCGATTATTGGTGA

mRNA sequence

ATGAGATGGATGAATCTGGGCAGTCGGTGCTTCTCTTCTGCTGCTTTTCTGAAACTTTCCCGTTCTGTTTCTCAAGTTTCGTTGCCCCAAAAACTCGTATCATTCAACTTGTCTCAGCATCAGCTGTTCAAGTCATGTTGCTACCACTCTTCCAATGATTCGTTGGCTTATACCCTTCACGCCAAGATGGTCAAGAATGGTTCTATTTTGGATTCAGGAAAGTTCGTTTTGAGTTCTTACGTGAAATCTGAGAAATTAGACGATGCACAGAAAGTGTTCGACGAAATGCCCAACAGGGATGTTCTCACTTGGACGGTCCTTATATCGGGTTTTGCTCGAGTAAGATGTTCTGAAATGGCATTGAGACTCTTTAGAGAGATGCTGGTTGAAGGTGTTAGTCCAAATCATTTTACTTTGTCTAGTGTTCTTAAACTTTGCTCTAGAGTAGGTGATGTTCAAATGGGTAAGGGGATTCATGGATGGATGCTCAGAAGTGGGGTCAACTTAGATGTTGTCCTGGAGAATTCTGTGCTTGATTTGTATGCAAAGTTTGATGCATTTGAATATGCCAAAAAGTTATTTGATTCAATGAGAGAAAAAAGTACTGCCACGTACAATATAATGCTTGGTGTGTATGTCCGTAGTTGTGATGTTAACAAATCTCTTGAATTATTCAGAAAATTGCCCTGCAGAGATACCGCGAGTTGGAATACGATTATATGTGGGCTAATGCAAGGTGGACATCTGAAAACAGCATTGGAGCTACTCTATGAGATGGTGGAGAACGAACCTGAGTTTAACAAAGTTACTTCCTCCATAGCTTTAAGTGTAGTTTCTTCTTTATTGATTATGGCGCTAGGGAGACAAGTACATGGCCGAATTGTTAGGTTTGGTTTTCATAATGATGGATTTGTGAAGAGTTCTCTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAGGCATCGGTGATATATAATCAAATGCCTTCAGATTTTGTGAGTAAACAAGGTTCCAACATTGTATGTAGCGACATGATGACAGAAATTGTATCACGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGGAAGTATGAAGATGCCTTCAAAACTTTCATTTCTATGATGCGTGAACGGGCTCTGATGGACAGATTTACCATTGCAAGCATTGTATCAGCTTGTTCTAATGCTGCTGTTTTAGAGCTTGGTCGTCAAGTCCATGCTTATATTCAGAAAATTGGGGAAAGGCTCGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCCAAAGGTGGGAGTTTGGATTGTGCCCGTCAAACTTTTGAGCAAATGACCTATGCAAATTTTGTGATATGGACTTCCATGATTGCTGGATGTGCTTTGCACGGGCAAGGTAAGGAAGCCATTAGATTGTTTGAACAGATGAGATCTGAAGGAATCATACCAAATGAGGTTACTTTCGTAGGAGTTTTAACAGCTTGCAGTCATGCAGGTCTGCTTAAAGACGGTCGTCGATATTTTAACATGATGAAAGATGTTTATGCAATCAAGCCTAAAGTCGAGCATTTCACTTGTATGGTAGATCTTTACGGTCGAGCTGGATGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAACGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTATCATCTTGTCGGCTTTACAAGGACATCGAAATGGGAAATTGGGTTTCTGAAAAATTGCTTGGGCTCGAACCACAAGATGAAGGGCCTTATGTTTTACTATCAAACATGTGCTCCAGCAATCAAAAATGGGAAGAAGCTTCCAGAACAAGAAGATCTATGCAACGCAGAGGGATTAACAAAACACCTGGCCAATCTTGGATTCATGTGAAAAATCAAGTCCACTCCTTTGTTGCGGGAGACCGATCGCACCCTCAACATGCTCAGATATATGCATATCTGGACAAGCTAATTGGAAGATTGAAGGAAATTGGGTATTT
GTATGATGTAAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTAGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGCATTATCAGTTTGGCTTCTGGCATTCCAATCCGAATCATGAAGAACCTTCGAGTATGCGCTGATTGTCATAACTTTATGAAACTAACATCTCAGCTTTTAGGCAGGGAGATAATTGTTCGAGATATTCATCGTTTCCATCATTTTAACTCCGGTCGCTGCTCTTGTGGCGATTATTGGTGA

Coding sequence (CDS)

ATGAGATGGATGAATCTGGGCAGTCGGTGCTTCTCTTCTGCTGCTTTTCTGAAACTTTCCCGTTCTGTTTCTCAAGTTTCGTTGCCCCAAAAACTCGTATCATTCAACTTGTCTCAGCATCAGCTGTTCAAGTCATGTTGCTACCACTCTTCCAATGATTCGTTGGCTTATACCCTTCACGCCAAGATGGTCAAGAATGGTTCTATTTTGGATTCAGGAAAGTTCGTTTTGAGTTCTTACGTGAAATCTGAGAAATTAGACGATGCACAGAAAGTGTTCGACGAAATGCCCAACAGGGATGTTCTCACTTGGACGGTCCTTATATCGGGTTTTGCTCGAGTAAGATGTTCTGAAATGGCATTGAGACTCTTTAGAGAGATGCTGGTTGAAGGTGTTAGTCCAAATCATTTTACTTTGTCTAGTGTTCTTAAACTTTGCTCTAGAGTAGGTGATGTTCAAATGGGTAAGGGGATTCATGGATGGATGCTCAGAAGTGGGGTCAACTTAGATGTTGTCCTGGAGAATTCTGTGCTTGATTTGTATGCAAAGTTTGATGCATTTGAATATGCCAAAAAGTTATTTGATTCAATGAGAGAAAAAAGTACTGCCACGTACAATATAATGCTTGGTGTGTATGTCCGTAGTTGTGATGTTAACAAATCTCTTGAATTATTCAGAAAATTGCCCTGCAGAGATACCGCGAGTTGGAATACGATTATATGTGGGCTAATGCAAGGTGGACATCTGAAAACAGCATTGGAGCTACTCTATGAGATGGTGGAGAACGAACCTGAGTTTAACAAAGTTACTTCCTCCATAGCTTTAAGTGTAGTTTCTTCTTTATTGATTATGGCGCTAGGGAGACAAGTACATGGCCGAATTGTTAGGTTTGGTTTTCATAATGATGGATTTGTGAAGAGTTCTCTGATAAATATGTACATTAAATGTGGAAATTTGGAAAAGGCATCGGTGATATATAATCAAATGCCTTCAGATTTTGTGAGTAAACAAGGTTCCAACATTGTATGTAGCGACATGATGACAGAAATTGTATCACGGAGCTCAATGGTGTCTGGATATGTTCGAAATGGGAAGTATGAAGATGCCTTCAAAACTTTCATTTCTATGATGCGTGAACGGGCTCTGATGGACAGATTTACCATTGCAAGCATTGTATCAGCTTGTTCTAATGCTGCTGTTTTAGAGCTTGGTCGTCAAGTCCATGCTTATATTCAGAAAATTGGGGAAAGGCTCGATGCTCACTTGGCTTCCTCCTTGATTGACATGTACGCCAAAGGTGGGAGTTTGGATTGTGCCCGTCAAACTTTTGAGCAAATGACCTATGCAAATTTTGTGATATGGACTTCCATGATTGCTGGATGTGCTTTGCACGGGCAAGGTAAGGAAGCCATTAGATTGTTTGAACAGATGAGATCTGAAGGAATCATACCAAATGAGGTTACTTTCGTAGGAGTTTTAACAGCTTGCAGTCATGCAGGTCTGCTTAAAGACGGTCGTCGATATTTTAACATGATGAAAGATGTTTATGCAATCAAGCCTAAAGTCGAGCATTTCACTTGTATGGTAGATCTTTACGGTCGAGCTGGATGCTTGAATGAAGTCAAAGAGTTCATCTATGAGAACGATTTATCACACCTTAGTGCAGTTTGGAAGGCATTCCTATCATCTTGTCGGCTTTACAAGGACATCGAAATGGGAAATTGGGTTTCTGAAAAATTGCTTGGGCTCGAACCACAAGATGAAGGGCCTTATGTTTTACTATCAAACATGTGCTCCAGCAATCAAAAATGGGAAGAAGCTTCCAGAACAAGAAGATCTATGCAACGCAGAGGGATTAACAAAACACCTGGCCAATCTTGGATTCATGTGAAAAATCAAGTCCACTCCTTTGTTGCGGGAGACCGATCGCACCCTCAACATGCTCAGATATATGCATATCTGGACAAGCTAATTGGAAGATTGAAGGAAATTGGGTATTT
GTATGATGTAAAATTGGTGATGCAGGATGTAGAAGAAGAACAGGGTGAAGTGCTTCTTAGTTGGCATAGTGAAAAGCTTGCAGTTGCTTATGGCATTATCAGTTTGGCTTCTGGCATTCCAATCCGAATCATGAAGAACCTTCGAGTATGCGCTGATTGTCATAACTTTATGAAACTAACATCTCAGCTTTTAGGCAGGGAGATAATTGTTCGAGATATTCATCGTTTCCATCATTTTAACTCCGGTCGCTGCTCTTGTGGCGATTATTGGTGA

Protein sequence

MRWMNLGSRCFSSAAFLKLSRSVSQVSLPQKLVSFNLSQHQLFKSCCYHSSNDSLAYTLHAKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFARVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW
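The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; the `translate` function and the codon table construction are illustrative and are not part of the database's own tooling. The two test fragments are the first 21 and the last 15 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0.

    Unknown codons become 'X'. A trailing partial codon is ignored.
    If to_stop is True, translation ends before the first stop codon.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the CDS: matches the start of the protein sequence.
print(translate("ATGAGATGGATGAATCTGGGC"))  # MRWMNLG

# Last 15 bases of the CDS, including the TGA stop codon.
print(translate("GGCGATTATTGGTGA"))        # GDYW
```

Passing the full CDS string (with the line break inside the sequence removed) to `translate` should reproduce the protein sequence above, under the assumption that the standard code applies.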
Homology
BLAST of Moc08g38960 vs. NCBI nr
Match: XP_038889548.1 (putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispida] >XP_038889549.1 putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispida])

HSP 1 Score: 1339.3 bits (3465), Expect = 0.0e+00
Identity = 657/758 (86.68%), Positives = 704/758 (92.88%), Query Frame = 0

Query: 1   MRWMNLGSRCFSSAAFLKLSRSVSQVSLPQKLVSFNLSQHQLFKSCCYHSSNDSLAYTLH 60
           MR MNL S CF++ AFLKL   + QV++ QK++SFNLS+HQLFKSCCYH+SNDSL  TLH
Sbjct: 1   MRLMNLSSCCFAT-AFLKLPHPICQVTMAQKIISFNLSEHQLFKSCCYHTSNDSLVNTLH 60

Query: 61  AKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFARVRCSEMA 120
           AKMVKNGSIL+SGKFVLSSYVKSEKL+DAQK+FDEMP+RDVLTWTVLISGF+R+ CSEMA
Sbjct: 61  AKMVKNGSILESGKFVLSSYVKSEKLNDAQKLFDEMPSRDVLTWTVLISGFSRINCSEMA 120

Query: 121 LRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLENSVLDL 180
           L+LFR+MLVEGV PNHFTLS+VLKLCSRVGD+QMGKGIHGW+LR+GVNLDVVLENS+LDL
Sbjct: 121 LQLFRKMLVEGVCPNHFTLSTVLKLCSRVGDMQMGKGIHGWILRNGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTASWNTII 240
           YAKFD F  AKKLFDSMREKSTATYNIMLGVYVRSCDVNKSL+LFR +PCR+TASWNTII
Sbjct: 181 YAKFDDFYCAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLDLFRNMPCRNTASWNTII 240

Query: 241 CGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIVRFGFH 300
           CGLMQGGHL  ALELLYEMVENEPEFNKVTSSIALSVV+SLLI+ LGRQVHGRI+R G H
Sbjct: 241 CGLMQGGHLNAALELLYEMVENEPEFNKVTSSIALSVVASLLIIELGRQVHGRIIRCGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSSMVSGY 360
           NDGFVKSSLINMYIKCGNLEKASVIY+QMPS FV+KQ SNIVCSDMMTEIVSRSSMVSGY
Sbjct: 301 NDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFVTKQDSNIVCSDMMTEIVSRSSMVSGY 360

Query: 361 VRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYIQKIGERLDA 420
           + NGKYE+AFKT +SM+RER LMD+FTIAS+VSACSNA VLELGRQ+H YIQK GE+LDA
Sbjct: 361 IWNGKYENAFKTVVSMVRERVLMDKFTIASVVSACSNAGVLELGRQIHGYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCARQTFEQMT-YANFVIWTSMIAGCALHGQGKEAIRLFEQMRS 480
           HLASSLIDMYAKGGSLDCA + FEQ T Y N V+WTSMIAG ALHGQGKEAIRLFE+MR 
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFEQTTNYLNVVLWTSMIAGYALHGQGKEAIRLFERMRY 480

Query: 481 EGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLN 540
           EGIIPNEVTFVGVLTACSHAGLL+ GR YFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLN
Sbjct: 481 EGIIPNEVTFVGVLTACSHAGLLEHGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLSSCRLYK++EMGNWVSEKL  LE QDEG YVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYKNLEMGNWVSEKLFSLEQQDEGSYVLLSNMCS 600

Query: 601 SNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIG 660
            +QKWEEASRTRRSMQ RGINKTPGQSWIHVKNQVHSFVAGD+SHPQH QIY YLDKLIG
Sbjct: 601 GSQKWEEASRTRRSMQHRGINKTPGQSWIHVKNQVHSFVAGDQSHPQHVQIYEYLDKLIG 660

Query: 661 RLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIRIMKNLRVCAD 720
           RLKEIGYLYDVKLVMQDVEEEQGEVLL WHSEKLA+AYGIISL S IPIRIMKNLRVC D
Sbjct: 661 RLKEIGYLYDVKLVMQDVEEEQGEVLLGWHSEKLALAYGIISLGSAIPIRIMKNLRVCTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           CHNFMKLTSQLLGREIIVRDIHRFH FNSG CSCGDYW
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIHRFHRFNSGHCSCGDYW 757
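The percentages in each HSP summary line are the matched counts divided by the alignment length. The sketch below recomputes them from the summary line of the alignment above. The `parse_hsp_stats` helper is hypothetical and written only for this illustration; it assumes the plain-text layout used on this page and is not a general BLAST report parser.

```python
import re

# Summary line of the first HSP, as shown for XP_038889548.1.
HSP_LINE = (
    "Identity = 657/758 (86.68%), Positives = 704/758 (92.88%), "
    "Query Frame = 0"
)

# "Posi?tives" also matches the "Postives" spelling seen in some exports.
STAT_PATTERN = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(line: str) -> dict:
    """Return percent identity and percent positives from an HSP line."""
    stats = {}
    for label, matched, total in STAT_PATTERN.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = round(100 * int(matched) / int(total), 2)
    return stats


print(parse_hsp_stats(HSP_LINE))
# {'identity': 86.68, 'positives': 92.88}
```

Identity counts exact residue matches, while positives also counts conservative substitutions (the `+` symbols in the middle line of each alignment block), so the positives percentage is never lower than the identity percentage.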

BLAST of Moc08g38960 vs. NCBI nr
Match: KAG6586149.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1335.9 bits (3456), Expect = 0.0e+00
Identity = 653/757 (86.26%), Positives = 701/757 (92.60%), Query Frame = 0

Query: 1   MRWMNLGSRCFSSAAFLKLSRSVSQVSLPQKLVSFNLSQHQLFKSCCYHSSNDSLAYTLH 60
           MRWMN  S  F+S AFLKL+ SVSQV + QK++ FNLS+HQLFKSC YHSSND  + TLH
Sbjct: 1   MRWMNPCSGGFASTAFLKLTHSVSQVFMAQKIIPFNLSEHQLFKSCRYHSSNDDSSNTLH 60

Query: 61  AKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFARVRCSEMA 120
           AKMVKNGSIL  GK V+SSYVKSEKLDDAQKVFDEMP+RDVL+WTVLISGFARV CSE A
Sbjct: 61  AKMVKNGSILYLGKLVMSSYVKSEKLDDAQKVFDEMPHRDVLSWTVLISGFARVNCSERA 120

Query: 121 LRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLENSVLDL 180
           L+LFREMLVEGV PNHFTLS VLKLCSRVGD+QMGKGIHGW+LRSGVNLDVVLENS+LDL
Sbjct: 121 LQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGWILRSGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTASWNTII 240
           Y KFDAF+YA KLFDSMREKSTA+YNIMLGVYVRSCDVNKSL+LFR LPCRDTASWNTII
Sbjct: 181 YTKFDAFDYATKLFDSMREKSTASYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240

Query: 241 CGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIVRFGFH 300
           CGLMQGG+L  A+ELLYEMV+NEPEFN+VTSSIALSVVSSLLI+ LGRQVHGRI RFG H
Sbjct: 241 CGLMQGGYLNIAMELLYEMVKNEPEFNEVTSSIALSVVSSLLIIELGRQVHGRIFRFGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSSMVSGY 360
           NDGFV SSLINMYIKCGNLEKASVIY+QMPS+F  ++ SNIVCS+ MTEIVSRSS+VSGY
Sbjct: 301 NDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKRRDSNIVCSNTMTEIVSRSSIVSGY 360

Query: 361 VRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYIQKIGERLDA 420
           V+NGKYED+F+TF+SM+RERA+MDRFTIASI+SACSNA VLELGRQ+HAYIQK GE+LDA
Sbjct: 361 VQNGKYEDSFQTFVSMVRERAVMDRFTIASIISACSNAGVLELGRQIHAYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKEAIRLFEQMRSE 480
           HLASS+IDMYAKGGSLDCA Q FEQ TY N V WTSMI GCALHGQGKEAIRLFEQMR E
Sbjct: 421 HLASSMIDMYAKGGSLDCAHQVFEQTTYLNVVTWTSMITGCALHGQGKEAIRLFEQMRYE 480

Query: 481 GIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLNE 540
           GIIPNEVTF+GVLTACSHAGLL +GR YFNMMKDVYAI+PKVEHFTCMVD+YGRAGCLNE
Sbjct: 481 GIIPNEVTFIGVLTACSHAGLLDEGRLYFNMMKDVYAIEPKVEHFTCMVDVYGRAGCLNE 540

Query: 541 VKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLSNMCSS 600
           VKEFIY+NDLSH SAVWKAFLSSCRLYKDIEMGNWVSEKL  LEP+DEGPYVLLSNMCSS
Sbjct: 541 VKEFIYQNDLSHHSAVWKAFLSSCRLYKDIEMGNWVSEKLFKLEPRDEGPYVLLSNMCSS 600

Query: 601 NQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGR 660
           NQKWEEAS+TRRSMQ RGI+KTPGQSWIHVKNQVHSF+AGDRSH QHAQIYAYLDKLIGR
Sbjct: 601 NQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFIAGDRSHLQHAQIYAYLDKLIGR 660

Query: 661 LKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIRIMKNLRVCADC 720
           LKEIGY  DVKLVMQDVEEEQGEVLL WHSEKLAVAYGII+LASGIPIRIMKNLRVC DC
Sbjct: 661 LKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIINLASGIPIRIMKNLRVCTDC 720

Query: 721 HNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           HNFMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 721 HNFMKLTSQLLDREIIVRDIHRFHHFNSGHCSCGDYW 757

BLAST of Moc08g38960 vs. NCBI nr
Match: KAG7020981.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1333.5 bits (3450), Expect = 0.0e+00
Identity = 653/757 (86.26%), Positives = 701/757 (92.60%), Query Frame = 0

Query: 1   MRWMNLGSRCFSSAAFLKLSRSVSQVSLPQKLVSFNLSQHQLFKSCCYHSSNDSLAYTLH 60
           MRWMN  S  F+S AFLKL+ SVSQVS+ QK++ FNLS+HQLFKSC YHSSND  + TLH
Sbjct: 1   MRWMNPCSGGFASTAFLKLTHSVSQVSMAQKIIPFNLSEHQLFKSCRYHSSNDDSSNTLH 60

Query: 61  AKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFARVRCSEMA 120
           AKMVKNGSIL  GK V+SSYVKSEKLDDAQKVFDEMP+RDVL+WTVLISGFARV CSE A
Sbjct: 61  AKMVKNGSILYLGKLVMSSYVKSEKLDDAQKVFDEMPHRDVLSWTVLISGFARVNCSERA 120

Query: 121 LRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLENSVLDL 180
           L+LFREMLVEGV PNHFTLS VLKLCSRVGD+QMGKGIHGW+LRSGVNLDVVLENS+LDL
Sbjct: 121 LQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGWILRSGVNLDVVLENSMLDL 180

Query: 181 YAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTASWNTII 240
           Y KFDAF+YA KLFDSMREKSTA+YNIMLGVYVRSCDVNKSL+LFR LPCRDTASWNTII
Sbjct: 181 YTKFDAFDYATKLFDSMREKSTASYNIMLGVYVRSCDVNKSLDLFRNLPCRDTASWNTII 240

Query: 241 CGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIVRFGFH 300
           CGLMQGG+L  A+ELLYEMV+NEPEFN+VTSSIALSVVSSLLI+ LGRQVHGRI RFG H
Sbjct: 241 CGLMQGGYLNIAMELLYEMVKNEPEFNEVTSSIALSVVSSLLIIELGRQVHGRIFRFGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSSMVSGY 360
           NDGFV SSLINMYIKCGNLEKASVIY+QMPS+F  ++ SNIVCS+ MTEIVSRSS+VSGY
Sbjct: 301 NDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKRRDSNIVCSNTMTEIVSRSSIVSGY 360

Query: 361 VRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYIQKIGERLDA 420
           V+NGKYED+F+TF+SM+RERA+MDRFTIASI+SACSNA VLELGRQ+HAYIQK GE+LDA
Sbjct: 361 VQNGKYEDSFQTFVSMVRERAVMDRFTIASIISACSNAGVLELGRQIHAYIQKTGEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKEAIRLFEQMRSE 480
           HLASS+IDMYAKGGSLDCA Q FEQ TY N V WTSMI GCALHGQGKEAIRLFEQMR E
Sbjct: 421 HLASSMIDMYAKGGSLDCAHQVFEQTTYLNVVTWTSMITGCALHGQGKEAIRLFEQMRYE 480

Query: 481 GIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLNE 540
           GIIPNEVTF+GVLTACSHAGLL +GR YFNMMKDVYAI+PKVEHFTCMVD+YGRAG LNE
Sbjct: 481 GIIPNEVTFIGVLTACSHAGLLDEGRLYFNMMKDVYAIEPKVEHFTCMVDVYGRAGRLNE 540

Query: 541 VKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLSNMCSS 600
           VKEFIY+NDLSH SAVWKAFLSSCRLYKDIEMGNWVSEKL  LEP+DEGPYVLLSNMCSS
Sbjct: 541 VKEFIYQNDLSHHSAVWKAFLSSCRLYKDIEMGNWVSEKLFKLEPRDEGPYVLLSNMCSS 600

Query: 601 NQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGR 660
           NQKWEEAS+TRRSMQ RGI+KTPGQSWIHVKNQVHSF+AGDRSH QHAQIYAYLDKLIGR
Sbjct: 601 NQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFIAGDRSHLQHAQIYAYLDKLIGR 660

Query: 661 LKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIRIMKNLRVCADC 720
           LKEIGY  DVKLVMQDVEEEQGEVLL WHSEKLAVAYGII+LASGIPIRIMKNLRVC DC
Sbjct: 661 LKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIINLASGIPIRIMKNLRVCTDC 720

Query: 721 HNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           HNFMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 721 HNFMKLTSQLLDREIIVRDIHRFHHFNSGHCSCGDYW 757

BLAST of Moc08g38960 vs. NCBI nr
Match: KAG7029890.1 (putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1303.1 bits (3371), Expect = 0.0e+00
Identity = 635/746 (85.12%), Positives = 687/746 (92.09%), Query Frame = 0

Query: 12   SSAAFLKLSRSVSQVSLPQKLVSFNLSQHQLFKSCCYHSSNDSLAYTLHAKMVKNGSILD 71
            +S AFLKL RSVSQV++ QK++ FN S H LF+SC +HSSNDSL  TLHAKMVKNGSI +
Sbjct: 294  ASTAFLKLFRSVSQVTMAQKIIPFNFSAHHLFESCSFHSSNDSLPNTLHAKMVKNGSIFE 353

Query: 72   SGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFARVRCSEMALRLFREMLVEG 131
            S KF+LSSYVKSEKL+DA+KVFDEMP+RDVLTWTVLISGFARV CSEMAL+LFREMLVEG
Sbjct: 354  SRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVLISGFARVNCSEMALQLFREMLVEG 413

Query: 132  VSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLENSVLDLYAKFDAFEYAK 191
            V PN FTLS+VLKLCSRVGDV+MGKGIHGW+LRSG++LDVVLENS+LDLYAKFD F+Y  
Sbjct: 414  VCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGISLDVVLENSMLDLYAKFDEFDYVT 473

Query: 192  KLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTASWNTIICGLMQGGHLKT 251
            KLFDSMREKSTATYNI+LGV+VRS DVNKSL+LFR LPCRDTA+WNT+ICGLMQGG+L  
Sbjct: 474  KLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRNLPCRDTATWNTVICGLMQGGYLNE 533

Query: 252  ALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIVRFGFHNDGFVKSSLIN 311
            ALELLYEMVENEPEFNKVTSSIALSVVSSLL+  LGRQVHGRIVR GFHNDGFVKSSLIN
Sbjct: 534  ALELLYEMVENEPEFNKVTSSIALSVVSSLLVSELGRQVHGRIVRCGFHNDGFVKSSLIN 593

Query: 312  MYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSSMVSGYVRNGKYEDAFK 371
            MYIKCGNLEKAS IY+QMPS F  +Q  +IVCSD MTEIVSRSSMVSGYVRNG YEDAFK
Sbjct: 594  MYIKCGNLEKASAIYSQMPSGFAKRQDFDIVCSDAMTEIVSRSSMVSGYVRNGNYEDAFK 653

Query: 372  TFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYIQKIGERLDAHLASSLIDMYA 431
            TF+SM+RER LMD+FTIAS+VSACSNA V ELGRQ+HAYIQK GE+LDAHL SSLIDMYA
Sbjct: 654  TFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQIHAYIQKTGEQLDAHLTSSLIDMYA 713

Query: 432  KGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKEAIRLFEQMRSEGIIPNEVTFVG 491
            KGGSLDCARQ FEQ TY N VIWTSMI GCALHGQGKEAIRLFE+MR EG+IPNEVTF+G
Sbjct: 714  KGGSLDCARQIFEQTTYLNVVIWTSMITGCALHGQGKEAIRLFEKMRYEGMIPNEVTFIG 773

Query: 492  VLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLNEVKEFIYENDLS 551
            VL ACSHAGLL+DGR YFNMMKDVYAIKPKVEHFTCMVDLYGRAG LNEVK+FIYEND+S
Sbjct: 774  VLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTCMVDLYGRAGRLNEVKKFIYENDIS 833

Query: 552  HLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLSNMCSSNQKWEEASRTR 611
            HL+AVWKAFLSSC+LYKDIEMGNWVSE+L  LEP DEGPYVLLSNMCSSN+KWEEA RTR
Sbjct: 834  HLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLDEGPYVLLSNMCSSNKKWEEAFRTR 893

Query: 612  RSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLYDVK 671
            RSMQ RGI+KTPGQSWIHVKN+VHSFVAGDRSHPQHAQIY YLDKLIGRLKEIGYL+DVK
Sbjct: 894  RSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQHAQIYEYLDKLIGRLKEIGYLFDVK 953

Query: 672  LVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCHNFMKLTSQLL 731
            LVMQDVEEEQGEVLL WHSEKLA+AYG+ISL S IPIRIMKNLR+C DCHNFMKLTSQLL
Sbjct: 954  LVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIPIRIMKNLRICTDCHNFMKLTSQLL 1013

Query: 732  GREIIVRDIHRFHHFNSGRCSCGDYW 758
             REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 1014 CREIIVRDIHRFHHFNSGHCSCGDYW 1038

BLAST of Moc08g38960 vs. NCBI nr
Match: XP_022965499.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 [Cucurbita maxima])

HSP 1 Score: 1284.2 bits (3322), Expect = 0.0e+00
Identity = 628/716 (87.71%), Positives = 670/716 (93.58%), Query Frame = 0

Query: 42   LFKSCCYHSSNDSLAYTLHAKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDV 101
            LFKSCCYH+SN + A TLHAKMVKNGSIL  GKF++SS+VKSE+LDDAQKVFDEMP+RDV
Sbjct: 299  LFKSCCYHASNGASADTLHAKMVKNGSILYLGKFIMSSHVKSERLDDAQKVFDEMPHRDV 358

Query: 102  LTWTVLISGFARVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGW 161
            L+WTVLISGFARV CSEMAL+LFREMLVEGV PNHFTLS VLKLCSRVGD+QMGKGIHGW
Sbjct: 359  LSWTVLISGFARVNCSEMALQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGW 418

Query: 162  MLRSGVNLDVVLENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKS 221
            +LRSGVNLDVVL NS+LDLYAKFDAF+YAK+LFDSM+EKSTATYNIMLGVYVRSCDVNKS
Sbjct: 419  ILRSGVNLDVVLGNSMLDLYAKFDAFDYAKQLFDSMKEKSTATYNIMLGVYVRSCDVNKS 478

Query: 222  LELFRKLPCRDTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSL 281
            L+LFR LPCRD ASWNTIICGLMQGG+L TA+ELLYEMV+NEPEFNKVTSSIALSVVSSL
Sbjct: 479  LDLFRNLPCRDAASWNTIICGLMQGGYLNTAMELLYEMVKNEPEFNKVTSSIALSVVSSL 538

Query: 282  LIMALGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNI 341
            LI+ LGRQVHGRI RFGFHNDGFV SSLINMYIKCGNLEKASVIY+QMPS+F  K+ SNI
Sbjct: 539  LIIDLGRQVHGRIFRFGFHNDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKKRDSNI 598

Query: 342  VCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVL 401
            VCS+ MTEIVSRSS+VSGYV+NGKYED+FKTF+SM+RERA+MDRFTIASI+SACSNA VL
Sbjct: 599  VCSNTMTEIVSRSSIVSGYVQNGKYEDSFKTFVSMIRERAVMDRFTIASIISACSNAGVL 658

Query: 402  ELGRQVHAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGC 461
            ELGRQ+HAYIQK GE+LDAHLASSLIDMYAKGGSLDCA Q F Q TY N V WTSMI GC
Sbjct: 659  ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAYQIFVQTTYLNVVTWTSMITGC 718

Query: 462  ALHGQGKEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPK 521
            ALHGQGKEAIRLFEQMR EGIIPNEVTF+GVL ACSHAGLL +GR YFNMMKDVYAI+PK
Sbjct: 719  ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLIACSHAGLLDEGRLYFNMMKDVYAIEPK 778

Query: 522  VEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLL 581
            VEHFTCMVDLYGRAG LNEVKEFIY+N+LSH SAVWKAFLSSCRLYKDI+MGNWVSEKL 
Sbjct: 779  VEHFTCMVDLYGRAGRLNEVKEFIYQNNLSHHSAVWKAFLSSCRLYKDIKMGNWVSEKLF 838

Query: 582  GLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGD 641
             LEP+DEGPYVLLSNMCSSNQKWEEAS+TRRSMQ RGI+KTPGQSWIHVKNQVHSFVAGD
Sbjct: 839  KLEPRDEGPYVLLSNMCSSNQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFVAGD 898

Query: 642  RSHPQHAQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIIS 701
            RSH QHAQIYAYLDKLIGRLKEIGY  DVKLVMQDVEEEQGEVLL WHSEKLAV YGIIS
Sbjct: 899  RSHLQHAQIYAYLDKLIGRLKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVTYGIIS 958

Query: 702  LASGIPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
            LASGIPIRIMKNLRVC DCHNFMKLTSQLL REIIVRDIHRFHHF SGRCSCGDYW
Sbjct: 959  LASGIPIRIMKNLRVCTDCHNFMKLTSQLLDREIIVRDIHRFHHFISGRCSCGDYW 1014

BLAST of Moc08g38960 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 2.2e-145
Identity = 272/723 (37.62%), Positives = 434/723 (60.03%), Query Frame = 0

Query: 57  YTLHAKMVKNGSILD---SGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFAR 116
           Y LHA+ + +   L    S   VLS+Y K   +D   + FD++P RD ++WT +I G+  
Sbjct: 64  YALHARKLFDEMPLRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKN 123

Query: 117 VRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVL 176
           +     A+R+  +M+ EG+ P  FTL++VL   +    ++ GK +H ++++ G+  +V +
Sbjct: 124 IGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSV 183

Query: 177 ENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDT 236
            NS+L++YAK      AK +FD M  +  +++N M+ ++++   ++ ++  F ++  RD 
Sbjct: 184 SNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDI 243

Query: 237 ASWNTIICGLMQGGHLKTALELLYEMVENE-PEFNKVTSSIALSVVSSLLIMALGRQVHG 296
            +WN++I G  Q G+   AL++  +M+ +     ++ T +  LS  ++L  + +G+Q+H 
Sbjct: 244 VTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHS 303

Query: 297 RIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVS----------------- 356
            IV  GF   G V ++LI+MY +CG +E A  +  Q  +  +                  
Sbjct: 304 HIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDM 363

Query: 357 KQGSNIVCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSAC 416
            Q  NI  S    ++V+ ++M+ GY ++G Y +A   F SM+      + +T+A+++S  
Sbjct: 364 NQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVA 423

Query: 417 SNAAVLELGRQVHAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQM-TYANFVIW 476
           S+ A L  G+Q+H    K GE     ++++LI MYAK G++  A + F+ +    + V W
Sbjct: 424 SSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSW 483

Query: 477 TSMIAGCALHGQGKEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKD 536
           TSMI   A HG  +EA+ LFE M  EG+ P+ +T+VGV +AC+HAGL+  GR+YF+MMKD
Sbjct: 484 TSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKD 543

Query: 537 VYAIKPKVEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGN 596
           V  I P + H+ CMVDL+GRAG L E +EFI +  +      W + LS+CR++K+I++G 
Sbjct: 544 VDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGK 603

Query: 597 WVSEKLLGLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQV 656
             +E+LL LEP++ G Y  L+N+ S+  KWEEA++ R+SM+   + K  G SWI VK++V
Sbjct: 604 VAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKV 663

Query: 657 HSFVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLA 716
           H F   D +HP+  +IY  + K+   +K++GY+ D   V+ D+EEE  E +L  HSEKLA
Sbjct: 664 HVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLA 723

Query: 717 VAYGIISLASGIPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCG 758
           +A+G+IS      +RIMKNLRVC DCH  +K  S+L+GREIIVRD  RFHHF  G CSC 
Sbjct: 724 IAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSCR 783

BLAST of Moc08g38960 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 510.0 bits (1312), Expect = 4.6e-143
Identity = 260/710 (36.62%), Positives = 418/710 (58.87%), Query Frame = 0

Query: 54  SLAYTLHAKMVKNGSIL-DSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFA 113
           S A  LHA+ ++  S+   S   V+S Y   + L +A  +F  + +  VL W  +I  F 
Sbjct: 22  SQAKQLHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFT 81

Query: 114 RVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVV 173
                  AL  F EM   G  P+H    SVLK C+ + D++ G+ +HG+++R G++ D+ 
Sbjct: 82  DQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLY 141

Query: 174 LENSVLDLYAK---FDAFEYAKKLFDSM--REKSTATYNIMLGVYVRSCDVNKSLELFRK 233
             N+++++YAK     +      +FD M  R  ++   ++     +    ++    +F  
Sbjct: 142 TGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEV 201

Query: 234 LPCRDTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALG 293
           +P +D  S+NTII G  Q G  + AL ++ EM   + + +  T S  L + S  + +  G
Sbjct: 202 MPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKG 261

Query: 294 RQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMM 353
           +++HG ++R G  +D ++ SSL++MY K   +E +  ++            S + C D  
Sbjct: 262 KEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVF------------SRLYCRDG- 321

Query: 354 TEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQV 413
              +S +S+V+GYV+NG+Y +A + F  M+  +        +S++ AC++ A L LG+Q+
Sbjct: 322 ---ISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQL 381

Query: 414 HAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQG 473
           H Y+ + G   +  +AS+L+DMY+K G++  AR+ F++M   + V WT++I G ALHG G
Sbjct: 382 HGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHG 441

Query: 474 KEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTC 533
            EA+ LFE+M+ +G+ PN+V FV VLTACSH GL+ +   YFN M  VY +  ++EH+  
Sbjct: 442 HEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAA 501

Query: 534 MVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQD 593
           + DL GRAG L E   FI +  +    +VW   LSSC ++K++E+   V+EK+  ++ ++
Sbjct: 502 VADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSEN 561

Query: 594 EGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQH 653
            G YVL+ NM +SN +W+E ++ R  M+++G+ K P  SWI +KN+ H FV+GDRSHP  
Sbjct: 562 MGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSM 621

Query: 654 AQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIP 713
            +I  +L  ++ ++++ GY+ D   V+ DV+EE    LL  HSE+LAVA+GII+   G  
Sbjct: 622 DKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTT 681

Query: 714 IRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           IR+ KN+R+C DCH  +K  S++  REIIVRD  RFHHFN G CSCGDYW
Sbjct: 682 IRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of Moc08g38960 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 2.2e-132
Identity = 247/708 (34.89%), Positives = 402/708 (56.78%), Query Frame = 0

Query: 59  LHAKMVKNGSILDSGKFVLSSYVK-------SEKLDDAQKVFDEMPNRDVLTWTVLISGF 118
           +HA+M+K G  L +  + LS  ++        E L  A  VF  +   ++L W  +  G 
Sbjct: 52  IHAQMIKIG--LHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGH 111

Query: 119 ARVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDV 178
           A       AL+L+  M+  G+ PN +T   VLK C++    + G+ IHG +L+ G +LD+
Sbjct: 112 ALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDL 171

Query: 179 VLENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCR 238
            +  S++ +Y +    E A K+FD    +   +Y  ++  Y     +  + +LF ++P +
Sbjct: 172 YVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVK 231

Query: 239 DTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVH 298
           D  SWN +I G  + G+ K ALEL  +M++     ++ T    +S  +    + LGRQVH
Sbjct: 232 DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVH 291

Query: 299 GRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIV 358
             I   GF ++  + ++LI++Y KCG LE A  ++ ++P                  +++
Sbjct: 292 LWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLP----------------YKDVI 351

Query: 359 SRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYI 418
           S ++++ GY     Y++A   F  M+R     +  T+ SI+ AC++   +++GR +H YI
Sbjct: 352 SWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI 411

Query: 419 QK--IGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKE 478
            K   G    + L +SLIDMYAK G ++ A Q F  + + +   W +MI G A+HG+   
Sbjct: 412 DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADA 471

Query: 479 AIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMV 538
           +  LF +MR  GI P+++TFVG+L+ACSH+G+L  GR  F  M   Y + PK+EH+ CM+
Sbjct: 472 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMI 531

Query: 539 DLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEG 598
           DL G +G   E +E I   ++     +W + L +C+++ ++E+G   +E L+ +EP++ G
Sbjct: 532 DLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPG 591

Query: 599 PYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQ 658
            YVLLSN+ +S  +W E ++TR  +  +G+ K PG S I + + VH F+ GD+ HP++ +
Sbjct: 592 SYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNRE 651

Query: 659 IYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIR 718
           IY  L+++   L++ G++ D   V+Q++EEE  E  L  HSEKLA+A+G+IS   G  + 
Sbjct: 652 IYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLT 711

Query: 719 IMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           I+KNLRVC +CH   KL S++  REII RD  RFHHF  G CSC DYW
Sbjct: 712 IVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Moc08g38960 vs. ExPASy Swiss-Prot
Match: Q5G1T1 (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 2.8e-132
Identity = 261/696 (37.50%), Positives = 390/696 (56.03%), Query Frame = 0

Query: 73  GKFVLSSYVKSE-KLDDAQKVFDEMPNRDVLTWTVLISGFARVRCSEMALRLFREMLVEG 132
           G  ++  +VK E   ++A KVFD+M   +V+TWT++I+   ++     A+R F +M++ G
Sbjct: 205 GCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG 264

Query: 133 VSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLENSVLDLYAKFDA---FE 192
              + FTLSSV   C+ + ++ +GK +H W +RSG+  DV  E S++D+YAK  A    +
Sbjct: 265 FESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDV--ECSLVDMYAKCSADGSVD 324

Query: 193 YAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTASWNTIICGLMQGGH 252
             +K+FD M + S                                 SW  +I G M+  +
Sbjct: 325 DCRKVFDRMEDHS-------------------------------VMSWTALITGYMKNCN 384

Query: 253 LKT-ALELLYEMV-ENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIVRFGFHNDGFVK 312
           L T A+ L  EM+ +   E N  T S A     +L    +G+QV G+  + G  ++  V 
Sbjct: 385 LATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVA 444

Query: 313 SSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSSMVSGYVRNGKY 372
           +S+I+M++K   +E A   +  +                    +VS ++ + G  RN  +
Sbjct: 445 NSVISMFVKSDRMEDAQRAFESLSE----------------KNLVSYNTFLDGTCRNLNF 504

Query: 373 EDAFKTFISMMRERAL-MDRFTIASIVSACSNAAVLELGRQVHAYIQKIGERLDAHLASS 432
           E AFK  +S + ER L +  FT AS++S  +N   +  G Q+H+ + K+G   +  + ++
Sbjct: 505 EQAFK-LLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNA 564

Query: 433 LIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKEAIRLFEQMRSEGIIPN 492
           LI MY+K GS+D A + F  M   N + WTSMI G A HG     +  F QM  EG+ PN
Sbjct: 565 LISMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPN 624

Query: 493 EVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLNEVKEFI 552
           EVT+V +L+ACSH GL+ +G R+FN M + + IKPK+EH+ CMVDL  RAG L +  EFI
Sbjct: 625 EVTYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFI 684

Query: 553 YENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLSNMCSSNQKWE 612
                     VW+ FL +CR++ + E+G   + K+L L+P +   Y+ LSN+ +   KWE
Sbjct: 685 NTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWE 744

Query: 613 EASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGRLKEIG 672
           E++  RR M+ R + K  G SWI V +++H F  GD +HP   QIY  LD+LI  +K  G
Sbjct: 745 ESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCG 804

Query: 673 YLYDVKLVMQDVEEEQGEV----LLSWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCH 732
           Y+ D  LV+  +EEE  E     LL  HSEK+AVA+G+IS +   P+R+ KNLRVC DCH
Sbjct: 805 YVPDTDLVLHKLEEENDEAEKERLLYQHSEKIAVAFGLISTSKSRPVRVFKNLRVCGDCH 850

Query: 733 NFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           N MK  S + GREI++RD++RFHHF  G+CSC DYW
Sbjct: 865 NAMKYISTVSGREIVLRDLNRFHHFKDGKCSCNDYW 850

BLAST of Moc08g38960 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 474.2 bits (1219), Expect = 2.8e-132
Identity = 246/702 (35.04%), Positives = 397/702 (56.55%), Query Frame = 0

Query: 56   AYTLHAKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFARVR 115
            AYT       N  I  +   +L+ Y K   ++ A   F E    +V+ W V++  +  + 
Sbjct: 413  AYTTKLGFASNNKIEGA---LLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLD 472

Query: 116  CSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLEN 175
                + R+FR+M +E + PN +T  S+LK C R+GD+++G+ IH  ++++   L+  + +
Sbjct: 473  DLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCS 532

Query: 176  SVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTAS 235
             ++D+YAK    + A              ++I++                 +   +D  S
Sbjct: 533  VLIDMYAKLGKLDTA--------------WDILI-----------------RFAGKDVVS 592

Query: 236  WNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIV 295
            W T+I G  Q      AL    +M++     ++V  + A+S  + L  +  G+Q+H +  
Sbjct: 593  WTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQAC 652

Query: 296  RFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSS 355
              GF +D   +++L+ +Y +CG +E++ + + Q      ++ G NI          + ++
Sbjct: 653  VSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQ------TEAGDNI----------AWNA 712

Query: 356  MVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYIQKIG 415
            +VSG+ ++G  E+A + F+ M RE    + FT  S V A S  A ++ G+QVHA I K G
Sbjct: 713  LVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTG 772

Query: 416  ERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKEAIRLFE 475
               +  + ++LI MYAK GS+  A + F +++  N V W ++I   + HG G EA+  F+
Sbjct: 773  YDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFD 832

Query: 476  QMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRA 535
            QM    + PN VT VGVL+ACSH GL+  G  YF  M   Y + PK EH+ C+VD+  RA
Sbjct: 833  QMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRA 892

Query: 536  GCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLS 595
            G L+  KEFI E  +   + VW+  LS+C ++K++E+G + +  LL LEP+D   YVLLS
Sbjct: 893  GLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLS 952

Query: 596  NMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLD 655
            N+ + ++KW+    TR+ M+ +G+ K PGQSWI VKN +HSF  GD++HP   +I+ Y  
Sbjct: 953  NLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQ 1012

Query: 656  KLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIRIMKNLR 715
             L  R  EIGY+ D   ++ +++ EQ + ++  HSEKLA+++G++SL + +PI +MKNLR
Sbjct: 1013 DLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLR 1064

Query: 716  VCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
            VC DCH ++K  S++  REIIVRD +RFHHF  G CSC DYW
Sbjct: 1073 VCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1064

BLAST of Moc08g38960 vs. ExPASy TrEMBL
Match: A0A0A0LKI4 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G074230 PE=3 SV=1)

HSP 1 Score: 1289.6 bits (3336), Expect = 0.0e+00
Identity = 633/758 (83.51%), Positives = 684/758 (90.24%), Query Frame = 0

Query: 1   MRWMNLGSRCFSSAAFLKLSRSVSQVSLPQKLVSFNLSQHQLFKSCCYHSSNDSLAYTLH 60
           MRWMNL S CF S AFLKLS S+SQ ++  K++SFNLS+H LFKS  YH+SN   + TLH
Sbjct: 1   MRWMNLSSSCFPSPAFLKLSHSISQGTMTHKIISFNLSEHHLFKSFSYHTSNHFSSNTLH 60

Query: 61  AKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFARVRCSEMA 120
           AKMVK GSI  SGKFVL+SYVKSEKL+DAQK+FDEMPNRDVLTWT LISGF+RV  S MA
Sbjct: 61  AKMVKIGSIFVSGKFVLTSYVKSEKLNDAQKLFDEMPNRDVLTWTALISGFSRVNSSGMA 120

Query: 121 LRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLENSVLDL 180
           L+LFREMLVEGVSPNHFTLS+VLKLCS+VGDV+MGKGIHGW+LR+GV LDVVLENS+LDL
Sbjct: 121 LQLFREMLVEGVSPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNGVKLDVVLENSMLDL 180

Query: 181 YAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTASWNTII 240
           YAKFD F YA+KL+DSMREKST T NI+LGVYVRSCDVNKSL LFR LPCR+ ASWNTII
Sbjct: 181 YAKFDEFVYARKLYDSMREKSTDTDNIILGVYVRSCDVNKSLHLFRNLPCRNAASWNTII 240

Query: 241 CGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIVRFGFH 300
           CGLMQGG+L  ALELLYEMVENE EFN  TSSIALSVVSSLLI+ LGRQVHGRIVR G H
Sbjct: 241 CGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVVSSLLILELGRQVHGRIVRCGLH 300

Query: 301 NDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSSMVSGY 360
           NDGFVKS+LINMYIKCGNLEKASVIY+++PS F +KQ SNIVCSD MTEIVSRSSMV GY
Sbjct: 301 NDGFVKSALINMYIKCGNLEKASVIYSRLPSGFATKQSSNIVCSDTMTEIVSRSSMVYGY 360

Query: 361 VRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYIQKIGERLDA 420
           VRNGKYEDAFKTF+SM+RER LMD+FTIA++VSACSNA VLELGRQVH +I K  E+LDA
Sbjct: 361 VRNGKYEDAFKTFVSMVRERVLMDKFTIANVVSACSNAGVLELGRQVHGFIHKTVEQLDA 420

Query: 421 HLASSLIDMYAKGGSLDCARQTFEQMT-YANFVIWTSMIAGCALHGQGKEAIRLFEQMRS 480
           HLASSLIDMYAKGGSLDCA + F+QMT Y N VIWTSMI GCALHG GKEAIRLFEQMR 
Sbjct: 421 HLASSLIDMYAKGGSLDCAHRIFDQMTNYLNVVIWTSMIVGCALHGHGKEAIRLFEQMRY 480

Query: 481 EGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLN 540
           EGIIPNEVTF+GVLTACSHAGLL+DG  YFNMMKDVYAIKPKVEH+TCMVDLYGRAG LN
Sbjct: 481 EGIIPNEVTFIGVLTACSHAGLLEDGHLYFNMMKDVYAIKPKVEHYTCMVDLYGRAGLLN 540

Query: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLSNMCS 600
           EVKEFIYENDLSHLSAVWKAFLSSCRLY+D+EMG WVSEKL  L+PQDEG YVLLSNMCS
Sbjct: 541 EVKEFIYENDLSHLSAVWKAFLSSCRLYRDLEMGKWVSEKLFRLKPQDEGSYVLLSNMCS 600

Query: 601 SNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIG 660
            +QKWEEASR RRSMQ  GINKTPGQSWIH+KNQVHSFVAGD+SHPQHAQIY YLDKLIG
Sbjct: 601 GSQKWEEASRARRSMQHSGINKTPGQSWIHLKNQVHSFVAGDQSHPQHAQIYEYLDKLIG 660

Query: 661 RLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIRIMKNLRVCAD 720
           RLKEIGYL+DVKLVMQDVEEEQGEVLL WHSEKLAVAYGIISL S IPIRIMKNLR+C D
Sbjct: 661 RLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSAIPIRIMKNLRICTD 720

Query: 721 CHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           CHNFMKLTSQLLGREIIVRDI+RFHHFNSG CSCGDYW
Sbjct: 721 CHNFMKLTSQLLGREIIVRDIYRFHHFNSGHCSCGDYW 758

BLAST of Moc08g38960 vs. ExPASy TrEMBL
Match: A0A6J1HR62 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111465385 PE=3 SV=1)

HSP 1 Score: 1284.2 bits (3322), Expect = 0.0e+00
Identity = 628/716 (87.71%), Positives = 670/716 (93.58%), Query Frame = 0

Query: 42   LFKSCCYHSSNDSLAYTLHAKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDV 101
            LFKSCCYH+SN + A TLHAKMVKNGSIL  GKF++SS+VKSE+LDDAQKVFDEMP+RDV
Sbjct: 299  LFKSCCYHASNGASADTLHAKMVKNGSILYLGKFIMSSHVKSERLDDAQKVFDEMPHRDV 358

Query: 102  LTWTVLISGFARVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGW 161
            L+WTVLISGFARV CSEMAL+LFREMLVEGV PNHFTLS VLKLCSRVGD+QMGKGIHGW
Sbjct: 359  LSWTVLISGFARVNCSEMALQLFREMLVEGVCPNHFTLSCVLKLCSRVGDLQMGKGIHGW 418

Query: 162  MLRSGVNLDVVLENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKS 221
            +LRSGVNLDVVL NS+LDLYAKFDAF+YAK+LFDSM+EKSTATYNIMLGVYVRSCDVNKS
Sbjct: 419  ILRSGVNLDVVLGNSMLDLYAKFDAFDYAKQLFDSMKEKSTATYNIMLGVYVRSCDVNKS 478

Query: 222  LELFRKLPCRDTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSL 281
            L+LFR LPCRD ASWNTIICGLMQGG+L TA+ELLYEMV+NEPEFNKVTSSIALSVVSSL
Sbjct: 479  LDLFRNLPCRDAASWNTIICGLMQGGYLNTAMELLYEMVKNEPEFNKVTSSIALSVVSSL 538

Query: 282  LIMALGRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNI 341
            LI+ LGRQVHGRI RFGFHNDGFV SSLINMYIKCGNLEKASVIY+QMPS+F  K+ SNI
Sbjct: 539  LIIDLGRQVHGRIFRFGFHNDGFVNSSLINMYIKCGNLEKASVIYSQMPSNFGKKRDSNI 598

Query: 342  VCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVL 401
            VCS+ MTEIVSRSS+VSGYV+NGKYED+FKTF+SM+RERA+MDRFTIASI+SACSNA VL
Sbjct: 599  VCSNTMTEIVSRSSIVSGYVQNGKYEDSFKTFVSMIRERAVMDRFTIASIISACSNAGVL 658

Query: 402  ELGRQVHAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGC 461
            ELGRQ+HAYIQK GE+LDAHLASSLIDMYAKGGSLDCA Q F Q TY N V WTSMI GC
Sbjct: 659  ELGRQIHAYIQKTGEQLDAHLASSLIDMYAKGGSLDCAYQIFVQTTYLNVVTWTSMITGC 718

Query: 462  ALHGQGKEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPK 521
            ALHGQGKEAIRLFEQMR EGIIPNEVTF+GVL ACSHAGLL +GR YFNMMKDVYAI+PK
Sbjct: 719  ALHGQGKEAIRLFEQMRYEGIIPNEVTFIGVLIACSHAGLLDEGRLYFNMMKDVYAIEPK 778

Query: 522  VEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLL 581
            VEHFTCMVDLYGRAG LNEVKEFIY+N+LSH SAVWKAFLSSCRLYKDI+MGNWVSEKL 
Sbjct: 779  VEHFTCMVDLYGRAGRLNEVKEFIYQNNLSHHSAVWKAFLSSCRLYKDIKMGNWVSEKLF 838

Query: 582  GLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGD 641
             LEP+DEGPYVLLSNMCSSNQKWEEAS+TRRSMQ RGI+KTPGQSWIHVKNQVHSFVAGD
Sbjct: 839  KLEPRDEGPYVLLSNMCSSNQKWEEASKTRRSMQHRGISKTPGQSWIHVKNQVHSFVAGD 898

Query: 642  RSHPQHAQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIIS 701
            RSH QHAQIYAYLDKLIGRLKEIGY  DVKLVMQDVEEEQGEVLL WHSEKLAV YGIIS
Sbjct: 899  RSHLQHAQIYAYLDKLIGRLKEIGYSCDVKLVMQDVEEEQGEVLLGWHSEKLAVTYGIIS 958

Query: 702  LASGIPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
            LASGIPIRIMKNLRVC DCHNFMKLTSQLL REIIVRDIHRFHHF SGRCSCGDYW
Sbjct: 959  LASGIPIRIMKNLRVCTDCHNFMKLTSQLLDREIIVRDIHRFHHFISGRCSCGDYW 1014

BLAST of Moc08g38960 vs. ExPASy TrEMBL
Match: A0A6J1EPP7 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita moschata OX=3662 GN=LOC111436248 PE=3 SV=1)

HSP 1 Score: 1271.1 bits (3288), Expect = 0.0e+00
Identity = 621/710 (87.46%), Positives = 662/710 (93.24%), Query Frame = 0

Query: 48   YHSSNDSLAYTLHAKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVL 107
            +HSSNDSL  TLHAKMVKNGSI +S KF+LSSYVKSEKL+DA+KVFDEMP+RDVLTWTVL
Sbjct: 306  FHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVL 365

Query: 108  ISGFARVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGV 167
            ISGFARV CSEMAL+LFREMLVEGV PN FTLS+VLKLCSRVGDV+MGKGIHGW+LRSGV
Sbjct: 366  ISGFARVNCSEMALQLFREMLVEGVCPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGV 425

Query: 168  NLDVVLENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRK 227
            +LDVVLENS+LDLYAKFD F+Y  KLFDSMREKSTATYNI+LGV+VRS DVNKSL+LFR 
Sbjct: 426  SLDVVLENSMLDLYAKFDEFDYVTKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRN 485

Query: 228  LPCRDTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALG 287
            LPCRDTASWNT+ICGLMQGG+L  ALELLYEMVENEPEFNKVTSSIALSVVSSLLI+ LG
Sbjct: 486  LPCRDTASWNTVICGLMQGGYLNEALELLYEMVENEPEFNKVTSSIALSVVSSLLIIELG 545

Query: 288  RQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMM 347
            RQVHGRIVR G HNDGFVKSSLINMYIKCGNLEKASVIY+QMPS F +KQ  NIVCSD M
Sbjct: 546  RQVHGRIVRCGLHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFATKQDFNIVCSDTM 605

Query: 348  TEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQV 407
            TEIVSRSSMVSGYVRNGKYEDAFKTF+SM+RER LMD+FTIAS+VSACSNA V ELGRQ+
Sbjct: 606  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQI 665

Query: 408  HAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQG 467
            HAYIQK GE+LDAHL SSLIDMYAKGGSLDCARQ FEQ TY N VIWTSMI GCALHGQG
Sbjct: 666  HAYIQKTGEQLDAHLTSSLIDMYAKGGSLDCARQIFEQTTYLNVVIWTSMITGCALHGQG 725

Query: 468  KEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTC 527
            KEAIRLFE+MR EG+IPNEVTF+GVL ACSHAGLL+DGR YFNMMKDVYAIKPKVEHFTC
Sbjct: 726  KEAIRLFEKMRYEGMIPNEVTFIGVLAACSHAGLLEDGRLYFNMMKDVYAIKPKVEHFTC 785

Query: 528  MVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQD 587
            MVDLYGRAG LNEVK+FIYENDLSHL+AVWKAFLSSC+LYKDIEMGNWVSE+L  LEP D
Sbjct: 786  MVDLYGRAGHLNEVKKFIYENDLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLD 845

Query: 588  EGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQH 647
            EGPYVLLSNMCSSNQKWEEA RTRRSMQ RGI+KTPGQSWIHVKN+VHSFVAGDRSHPQH
Sbjct: 846  EGPYVLLSNMCSSNQKWEEAFRTRRSMQHRGISKTPGQSWIHVKNRVHSFVAGDRSHPQH 905

Query: 648  AQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIP 707
            AQIY YLDKLIGRLKEIGYL+DVKLVMQDVEEEQGEVLL WHSEKLA+AYG+ISL S IP
Sbjct: 906  AQIYEYLDKLIGRLKEIGYLFDVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLGSSIP 965

Query: 708  IRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
            IRIMKNLR+C DCHNFMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 966  IRIMKNLRICTDCHNFMKLTSQLLCREIIVRDIHRFHHFNSGHCSCGDYW 1014

BLAST of Moc08g38960 vs. ExPASy TrEMBL
Match: A0A6J1KA70 (putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxima OX=3661 GN=LOC111492492 PE=3 SV=1)

HSP 1 Score: 1263.8 bits (3269), Expect = 0.0e+00
Identity = 618/710 (87.04%), Positives = 661/710 (93.10%), Query Frame = 0

Query: 48   YHSSNDSLAYTLHAKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVL 107
            YHSSNDSL  TLHAKMVKNGSI +S KF+LSSYVKSEKL+DA+KVFDEMP+RDVLTWTVL
Sbjct: 306  YHSSNDSLPNTLHAKMVKNGSIFESRKFILSSYVKSEKLNDARKVFDEMPSRDVLTWTVL 365

Query: 108  ISGFARVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGV 167
            ISGFARV CSEMAL+LFREMLVEGV PN FTLS+VLKLCSRVGDV+MGKGIHGW+LRSGV
Sbjct: 366  ISGFARVNCSEMALQLFREMLVEGVYPNPFTLSTVLKLCSRVGDVKMGKGIHGWILRSGV 425

Query: 168  NLDVVLENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRK 227
            +LDVVLENS+LDLYAKFD F+Y KKLFDSMREKSTATYNI+LGV+VRS DVNKSL+LFR 
Sbjct: 426  SLDVVLENSMLDLYAKFDEFDYVKKLFDSMREKSTATYNILLGVHVRS-DVNKSLDLFRN 485

Query: 228  LPCRDTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALG 287
            LPCRDTASWNT+ICGLMQGG+L  ALELLYEMVEN+PEFNKVTSSIALSVVSSLLI+ LG
Sbjct: 486  LPCRDTASWNTVICGLMQGGYLNEALELLYEMVENQPEFNKVTSSIALSVVSSLLIIELG 545

Query: 288  RQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMM 347
            RQVHGRI+R GFHNDGFVKSSLINMYIKCGNLEKASVIY+QMPS F  KQ  +IV SD M
Sbjct: 546  RQVHGRILRCGFHNDGFVKSSLINMYIKCGNLEKASVIYSQMPSGFGKKQDFDIVYSDTM 605

Query: 348  TEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQV 407
            TEIVSRSSMVSGYVRNGKYEDAFKTF+SM+RER LMD+FTIAS+VSACSNA V ELGRQ+
Sbjct: 606  TEIVSRSSMVSGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACSNAGVFELGRQI 665

Query: 408  HAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQG 467
            HAYIQK GE+LDAHL SSLIDMYAKGGSLDCARQ FEQMTY N VIWTSMI GCALHGQG
Sbjct: 666  HAYIQKTGEQLDAHLTSSLIDMYAKGGSLDCARQIFEQMTYLNVVIWTSMITGCALHGQG 725

Query: 468  KEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTC 527
            KEAIRLFE+MR EG+IPNEVTF+GVL ACSHAGL++DGR YFNMMKDVYAIKPKVEHFTC
Sbjct: 726  KEAIRLFEKMRYEGMIPNEVTFIGVLAACSHAGLIEDGRLYFNMMKDVYAIKPKVEHFTC 785

Query: 528  MVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQD 587
            MVDLYGRAG LNEVK+FIYENDLSHL+AVWKAFLSSC+LYKDIEMGNWVSE+L  LEP D
Sbjct: 786  MVDLYGRAGRLNEVKKFIYENDLSHLNAVWKAFLSSCQLYKDIEMGNWVSERLFRLEPLD 845

Query: 588  EGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQH 647
            EGPY+LLSNMCSSNQKWEEA RTRR MQ RGI+KTPGQSWIHVKNQVHSFVAGDRSHPQH
Sbjct: 846  EGPYILLSNMCSSNQKWEEAFRTRRFMQHRGISKTPGQSWIHVKNQVHSFVAGDRSHPQH 905

Query: 648  AQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIP 707
            AQIY YLD LIGRLKEIGYL+DVKLVMQDVEEEQGEVLL WHSEKLA+AYG+ISL S IP
Sbjct: 906  AQIYEYLDNLIGRLKEIGYLFDVKLVMQDVEEEQGEVLLGWHSEKLAIAYGLISLDSAIP 965

Query: 708  IRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
            IRIMKNLR+C DCHNFMKLTSQLL REIIVRDIHRFHHFNSG CSCGDYW
Sbjct: 966  IRIMKNLRMCTDCHNFMKLTSQLLCREIIVRDIHRFHHFNSGHCSCGDYW 1014

BLAST of Moc08g38960 vs. ExPASy TrEMBL
Match: A0A1S3B4E3 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucumis melo OX=3656 GN=LOC103485889 PE=3 SV=1)

HSP 1 Score: 1229.9 bits (3181), Expect = 0.0e+00
Identity = 600/712 (84.27%), Positives = 647/712 (90.87%), Query Frame = 0

Query: 47   CYHSSNDSLAYTLHAKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTV 106
            CYH+SN   + TLHAKMVK GSI++SGKFVL+SYVKS+KL+DAQK+FDEMPNRDVLTWT 
Sbjct: 301  CYHTSNSFSSNTLHAKMVKIGSIIESGKFVLTSYVKSKKLNDAQKLFDEMPNRDVLTWTA 360

Query: 107  LISGFARVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSG 166
            +ISGF+RV CS MAL+LFREMLVEGV PNHFTLS+VLKLCS+VGDV+MGKGIHGW+LR+G
Sbjct: 361  IISGFSRVNCSGMALQLFREMLVEGVCPNHFTLSTVLKLCSKVGDVRMGKGIHGWILRNG 420

Query: 167  VNLDVVLENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFR 226
            V LDVVLENS+LDLYAKFD F YA+KL+DSM EKST T NI+LGVYVRSCDVNKSL LFR
Sbjct: 421  VKLDVVLENSLLDLYAKFDEFVYARKLYDSMGEKSTDTDNIILGVYVRSCDVNKSLHLFR 480

Query: 227  KLPCRDTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMAL 286
             LPCR+ ASWNTIICGLMQGG+L  ALELLYEMVENE EFN  TSSIALSV SSLLI+ L
Sbjct: 481  NLPCRNAASWNTIICGLMQGGYLNAALELLYEMVENESEFNNFTSSIALSVASSLLILEL 540

Query: 287  GRQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDM 346
            GRQVHGRIVR G HNDGFVKS+LINMYIKCGNLEKASVIY+Q+PS F +KQGSNIVCSD 
Sbjct: 541  GRQVHGRIVRCGLHNDGFVKSALINMYIKCGNLEKASVIYSQLPSGFATKQGSNIVCSDT 600

Query: 347  MTEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQ 406
            MTEIVSRSSMV GYVRNGKYEDAFKTF+SM+RER LMD+FTIAS+VSAC+NA VLELGRQ
Sbjct: 601  MTEIVSRSSMVYGYVRNGKYEDAFKTFVSMVRERVLMDKFTIASVVSACANAGVLELGRQ 660

Query: 407  VHAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQMT-YANFVIWTSMIAGCALHG 466
            VH +IQK  E+LDAHLASSLIDMYAKGGSLDCA + F+QMT Y N VIWTSMI GC+LHG
Sbjct: 661  VHGFIQKSVEQLDAHLASSLIDMYAKGGSLDCAHRIFDQMTYYLNVVIWTSMIVGCSLHG 720

Query: 467  QGKEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHF 526
             GKEAIRLFEQMR EGIIPNEVTF+GVLTACSHAGLL+DG  YFNMMKDVYAIKPKVEH+
Sbjct: 721  HGKEAIRLFEQMRYEGIIPNEVTFIGVLTACSHAGLLEDGLLYFNMMKDVYAIKPKVEHY 780

Query: 527  TCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEP 586
            TCMVDLYGRAG LNEVKEFIYENDLSHLS VWKAFLSSC LY+D+EMG WVSEKL  LEP
Sbjct: 781  TCMVDLYGRAGLLNEVKEFIYENDLSHLSVVWKAFLSSCLLYRDLEMGKWVSEKLFRLEP 840

Query: 587  QDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHP 646
            QDEG YVLLSNMCS +QKW+EASR R SMQ  GINKTPGQSWIH+KNQVHSFVAGDRSHP
Sbjct: 841  QDEGSYVLLSNMCSGSQKWQEASRARSSMQHSGINKTPGQSWIHLKNQVHSFVAGDRSHP 900

Query: 647  QHAQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASG 706
            QHAQIY YLDKLIGRLKEIGYL+DVKLVMQDVEEEQGEVLL WHSEKLAVAYGIISL S 
Sbjct: 901  QHAQIYEYLDKLIGRLKEIGYLHDVKLVMQDVEEEQGEVLLGWHSEKLAVAYGIISLGSA 960

Query: 707  IPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
            IPIRIMKNLR+C DCHNFMKLTSQLLGREIIVRDI RFHHFNSG CSCGDYW
Sbjct: 961  IPIRIMKNLRICTDCHNFMKLTSQLLGREIIVRDICRFHHFNSGHCSCGDYW 1012

BLAST of Moc08g38960 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 517.7 bits (1332), Expect = 1.6e-146
Identity = 272/723 (37.62%), Positives = 434/723 (60.03%), Query Frame = 0

Query: 57  YTLHAKMVKNGSILD---SGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFAR 116
           Y LHA+ + +   L    S   VLS+Y K   +D   + FD++P RD ++WT +I G+  
Sbjct: 64  YALHARKLFDEMPLRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKN 123

Query: 117 VRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVL 176
           +     A+R+  +M+ EG+ P  FTL++VL   +    ++ GK +H ++++ G+  +V +
Sbjct: 124 IGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSV 183

Query: 177 ENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDT 236
            NS+L++YAK      AK +FD M  +  +++N M+ ++++   ++ ++  F ++  RD 
Sbjct: 184 SNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDI 243

Query: 237 ASWNTIICGLMQGGHLKTALELLYEMVENE-PEFNKVTSSIALSVVSSLLIMALGRQVHG 296
            +WN++I G  Q G+   AL++  +M+ +     ++ T +  LS  ++L  + +G+Q+H 
Sbjct: 244 VTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHS 303

Query: 297 RIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVS----------------- 356
            IV  GF   G V ++LI+MY +CG +E A  +  Q  +  +                  
Sbjct: 304 HIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDM 363

Query: 357 KQGSNIVCSDMMTEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSAC 416
            Q  NI  S    ++V+ ++M+ GY ++G Y +A   F SM+      + +T+A+++S  
Sbjct: 364 NQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVA 423

Query: 417 SNAAVLELGRQVHAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQM-TYANFVIW 476
           S+ A L  G+Q+H    K GE     ++++LI MYAK G++  A + F+ +    + V W
Sbjct: 424 SSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSW 483

Query: 477 TSMIAGCALHGQGKEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKD 536
           TSMI   A HG  +EA+ LFE M  EG+ P+ +T+VGV +AC+HAGL+  GR+YF+MMKD
Sbjct: 484 TSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKD 543

Query: 537 VYAIKPKVEHFTCMVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGN 596
           V  I P + H+ CMVDL+GRAG L E +EFI +  +      W + LS+CR++K+I++G 
Sbjct: 544 VDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGK 603

Query: 597 WVSEKLLGLEPQDEGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQV 656
             +E+LL LEP++ G Y  L+N+ S+  KWEEA++ R+SM+   + K  G SWI VK++V
Sbjct: 604 VAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKV 663

Query: 657 HSFVAGDRSHPQHAQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLA 716
           H F   D +HP+  +IY  + K+   +K++GY+ D   V+ D+EEE  E +L  HSEKLA
Sbjct: 664 HVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKLA 723

Query: 717 VAYGIISLASGIPIRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCG 758
           +A+G+IS      +RIMKNLRVC DCH  +K  S+L+GREIIVRD  RFHHF  G CSC 
Sbjct: 724 IAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSCR 783

BLAST of Moc08g38960 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 510.0 bits (1312), Expect = 3.3e-144
Identity = 260/710 (36.62%), Positives = 418/710 (58.87%), Query Frame = 0

Query: 54  SLAYTLHAKMVKNGSIL-DSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFA 113
           S A  LHA+ ++  S+   S   V+S Y   + L +A  +F  + +  VL W  +I  F 
Sbjct: 22  SQAKQLHAQFIRTQSLSHTSASIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFT 81

Query: 114 RVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVV 173
                  AL  F EM   G  P+H    SVLK C+ + D++ G+ +HG+++R G++ D+ 
Sbjct: 82  DQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLY 141

Query: 174 LENSVLDLYAK---FDAFEYAKKLFDSM--REKSTATYNIMLGVYVRSCDVNKSLELFRK 233
             N+++++YAK     +      +FD M  R  ++   ++     +    ++    +F  
Sbjct: 142 TGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEV 201

Query: 234 LPCRDTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALG 293
           +P +D  S+NTII G  Q G  + AL ++ EM   + + +  T S  L + S  + +  G
Sbjct: 202 MPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKG 261

Query: 294 RQVHGRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMM 353
           +++HG ++R G  +D ++ SSL++MY K   +E +  ++            S + C D  
Sbjct: 262 KEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVF------------SRLYCRDG- 321

Query: 354 TEIVSRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQV 413
              +S +S+V+GYV+NG+Y +A + F  M+  +        +S++ AC++ A L LG+Q+
Sbjct: 322 ---ISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQL 381

Query: 414 HAYIQKIGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQG 473
           H Y+ + G   +  +AS+L+DMY+K G++  AR+ F++M   + V WT++I G ALHG G
Sbjct: 382 HGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHG 441

Query: 474 KEAIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTC 533
            EA+ LFE+M+ +G+ PN+V FV VLTACSH GL+ +   YFN M  VY +  ++EH+  
Sbjct: 442 HEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAA 501

Query: 534 MVDLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQD 593
           + DL GRAG L E   FI +  +    +VW   LSSC ++K++E+   V+EK+  ++ ++
Sbjct: 502 VADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSEN 561

Query: 594 EGPYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQH 653
            G YVL+ NM +SN +W+E ++ R  M+++G+ K P  SWI +KN+ H FV+GDRSHP  
Sbjct: 562 MGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSM 621

Query: 654 AQIYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIP 713
            +I  +L  ++ ++++ GY+ D   V+ DV+EE    LL  HSE+LAVA+GII+   G  
Sbjct: 622 DKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTT 681

Query: 714 IRIMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           IR+ KN+R+C DCH  +K  S++  REIIVRD  RFHHFN G CSCGDYW
Sbjct: 682 IRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of Moc08g38960 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 474.6 bits (1220), Expect = 1.5e-133
Identity = 247/708 (34.89%), Positives = 402/708 (56.78%), Query Frame = 0

Query: 59  LHAKMVKNGSILDSGKFVLSSYVK-------SEKLDDAQKVFDEMPNRDVLTWTVLISGF 118
           +HA+M+K G  L +  + LS  ++        E L  A  VF  +   ++L W  +  G 
Sbjct: 52  IHAQMIKIG--LHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGH 111

Query: 119 ARVRCSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDV 178
           A       AL+L+  M+  G+ PN +T   VLK C++    + G+ IHG +L+ G +LD+
Sbjct: 112 ALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDL 171

Query: 179 VLENSVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCR 238
            +  S++ +Y +    E A K+FD    +   +Y  ++  Y     +  + +LF ++P +
Sbjct: 172 YVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVK 231

Query: 239 DTASWNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVH 298
           D  SWN +I G  + G+ K ALEL  +M++     ++ T    +S  +    + LGRQVH
Sbjct: 232 DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVH 291

Query: 299 GRIVRFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIV 358
             I   GF ++  + ++LI++Y KCG LE A  ++ ++P                  +++
Sbjct: 292 LWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLP----------------YKDVI 351

Query: 359 SRSSMVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYI 418
           S ++++ GY     Y++A   F  M+R     +  T+ SI+ AC++   +++GR +H YI
Sbjct: 352 SWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI 411

Query: 419 QK--IGERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKE 478
            K   G    + L +SLIDMYAK G ++ A Q F  + + +   W +MI G A+HG+   
Sbjct: 412 DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADA 471

Query: 479 AIRLFEQMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMV 538
           +  LF +MR  GI P+++TFVG+L+ACSH+G+L  GR  F  M   Y + PK+EH+ CM+
Sbjct: 472 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMI 531

Query: 539 DLYGRAGCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEG 598
           DL G +G   E +E I   ++     +W + L +C+++ ++E+G   +E L+ +EP++ G
Sbjct: 532 DLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPG 591

Query: 599 PYVLLSNMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQ 658
            YVLLSN+ +S  +W E ++TR  +  +G+ K PG S I + + VH F+ GD+ HP++ +
Sbjct: 592 SYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNRE 651

Query: 659 IYAYLDKLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIR 718
           IY  L+++   L++ G++ D   V+Q++EEE  E  L  HSEKLA+A+G+IS   G  + 
Sbjct: 652 IYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLT 711

Query: 719 IMKNLRVCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           I+KNLRVC +CH   KL S++  REII RD  RFHHF  G CSC DYW
Sbjct: 712 IVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Moc08g38960 vs. TAIR 10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 474.2 bits (1219), Expect = 2.0e-133
Identity = 261/696 (37.50%), Positives = 390/696 (56.03%), Query Frame = 0

Query: 73  GKFVLSSYVKSE-KLDDAQKVFDEMPNRDVLTWTVLISGFARVRCSEMALRLFREMLVEG 132
           G  ++  +VK E   ++A KVFD+M   +V+TWT++I+   ++     A+R F +M++ G
Sbjct: 205 GCSLIDMFVKGENSFENAYKVFDKMSELNVVTWTLMITRCMQMGFPREAIRFFLDMVLSG 264

Query: 133 VSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLENSVLDLYAKFDA---FE 192
              + FTLSSV   C+ + ++ +GK +H W +RSG+  DV  E S++D+YAK  A    +
Sbjct: 265 FESDKFTLSSVFSACAELENLSLGKQLHSWAIRSGLVDDV--ECSLVDMYAKCSADGSVD 324

Query: 193 YAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTASWNTIICGLMQGGH 252
             +K+FD M + S                                 SW  +I G M+  +
Sbjct: 325 DCRKVFDRMEDHS-------------------------------VMSWTALITGYMKNCN 384

Query: 253 LKT-ALELLYEMV-ENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIVRFGFHNDGFVK 312
           L T A+ L  EM+ +   E N  T S A     +L    +G+QV G+  + G  ++  V 
Sbjct: 385 LATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSSVA 444

Query: 313 SSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSSMVSGYVRNGKY 372
           +S+I+M++K   +E A   +  +                    +VS ++ + G  RN  +
Sbjct: 445 NSVISMFVKSDRMEDAQRAFESLSE----------------KNLVSYNTFLDGTCRNLNF 504

Query: 373 EDAFKTFISMMRERAL-MDRFTIASIVSACSNAAVLELGRQVHAYIQKIGERLDAHLASS 432
           E AFK  +S + ER L +  FT AS++S  +N   +  G Q+H+ + K+G   +  + ++
Sbjct: 505 EQAFK-LLSEITERELGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNA 564

Query: 433 LIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKEAIRLFEQMRSEGIIPN 492
           LI MY+K GS+D A + F  M   N + WTSMI G A HG     +  F QM  EG+ PN
Sbjct: 565 LISMYSKCGSIDTASRVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPN 624

Query: 493 EVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRAGCLNEVKEFI 552
           EVT+V +L+ACSH GL+ +G R+FN M + + IKPK+EH+ CMVDL  RAG L +  EFI
Sbjct: 625 EVTYVAILSACSHVGLVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFI 684

Query: 553 YENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLSNMCSSNQKWE 612
                     VW+ FL +CR++ + E+G   + K+L L+P +   Y+ LSN+ +   KWE
Sbjct: 685 NTMPFQADVLVWRTFLGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWE 744

Query: 613 EASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLDKLIGRLKEIG 672
           E++  RR M+ R + K  G SWI V +++H F  GD +HP   QIY  LD+LI  +K  G
Sbjct: 745 ESTEMRRKMKERNLVKEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCG 804

Query: 673 YLYDVKLVMQDVEEEQGEV----LLSWHSEKLAVAYGIISLASGIPIRIMKNLRVCADCH 732
           Y+ D  LV+  +EEE  E     LL  HSEK+AVA+G+IS +   P+R+ KNLRVC DCH
Sbjct: 805 YVPDTDLVLHKLEEENDEAEKERLLYQHSEKIAVAFGLISTSKSRPVRVFKNLRVCGDCH 850

Query: 733 NFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
           N MK  S + GREI++RD++RFHHF  G+CSC DYW
Sbjct: 865 NAMKYISTVSGREIVLRDLNRFHHFKDGKCSCNDYW 850
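
The percentages on an HSP summary line are the matched-column counts divided by the alignment length (for the AT3G49170.1 hit above, 261/696 and 390/696). A minimal Python sketch for recomputing them from such a line follows; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches an HSP summary line as printed on this page. The pattern also
# accepts the "Postives" spelling seen in some report exports.
STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Recompute identity and positives percentages from the raw counts."""
    m = STAT_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "identity": round(100 * ident / ident_len, 2),
        "positives": round(100 * pos / pos_len, 2),
    }


if __name__ == "__main__":
    line = "Identity = 261/696 (37.50%), Positives = 390/696 (56.03%), Query Frame = 0"
    print(hsp_percentages(line))
```

Comparing the recomputed values with the percentages printed in parentheses is a quick way to catch a summary line that was damaged during copying.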

BLAST of Moc08g38960 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 474.2 bits (1219), Expect = 2.0e-133
Identity = 246/702 (35.04%), Positives = 397/702 (56.55%), Query Frame = 0

Query: 56   AYTLHAKMVKNGSILDSGKFVLSSYVKSEKLDDAQKVFDEMPNRDVLTWTVLISGFARVR 115
            AYT       N  I  +   +L+ Y K   ++ A   F E    +V+ W V++  +  + 
Sbjct: 413  AYTTKLGFASNNKIEGA---LLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLD 472

Query: 116  CSEMALRLFREMLVEGVSPNHFTLSSVLKLCSRVGDVQMGKGIHGWMLRSGVNLDVVLEN 175
                + R+FR+M +E + PN +T  S+LK C R+GD+++G+ IH  ++++   L+  + +
Sbjct: 473  DLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCS 532

Query: 176  SVLDLYAKFDAFEYAKKLFDSMREKSTATYNIMLGVYVRSCDVNKSLELFRKLPCRDTAS 235
             ++D+YAK    + A              ++I++                 +   +D  S
Sbjct: 533  VLIDMYAKLGKLDTA--------------WDILI-----------------RFAGKDVVS 592

Query: 236  WNTIICGLMQGGHLKTALELLYEMVENEPEFNKVTSSIALSVVSSLLIMALGRQVHGRIV 295
            W T+I G  Q      AL    +M++     ++V  + A+S  + L  +  G+Q+H +  
Sbjct: 593  WTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQAC 652

Query: 296  RFGFHNDGFVKSSLINMYIKCGNLEKASVIYNQMPSDFVSKQGSNIVCSDMMTEIVSRSS 355
              GF +D   +++L+ +Y +CG +E++ + + Q      ++ G NI          + ++
Sbjct: 653  VSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQ------TEAGDNI----------AWNA 712

Query: 356  MVSGYVRNGKYEDAFKTFISMMRERALMDRFTIASIVSACSNAAVLELGRQVHAYIQKIG 415
            +VSG+ ++G  E+A + F+ M RE    + FT  S V A S  A ++ G+QVHA I K G
Sbjct: 713  LVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTG 772

Query: 416  ERLDAHLASSLIDMYAKGGSLDCARQTFEQMTYANFVIWTSMIAGCALHGQGKEAIRLFE 475
               +  + ++LI MYAK GS+  A + F +++  N V W ++I   + HG G EA+  F+
Sbjct: 773  YDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFD 832

Query: 476  QMRSEGIIPNEVTFVGVLTACSHAGLLKDGRRYFNMMKDVYAIKPKVEHFTCMVDLYGRA 535
            QM    + PN VT VGVL+ACSH GL+  G  YF  M   Y + PK EH+ C+VD+  RA
Sbjct: 833  QMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRA 892

Query: 536  GCLNEVKEFIYENDLSHLSAVWKAFLSSCRLYKDIEMGNWVSEKLLGLEPQDEGPYVLLS 595
            G L+  KEFI E  +   + VW+  LS+C ++K++E+G + +  LL LEP+D   YVLLS
Sbjct: 893  GLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLS 952

Query: 596  NMCSSNQKWEEASRTRRSMQRRGINKTPGQSWIHVKNQVHSFVAGDRSHPQHAQIYAYLD 655
            N+ + ++KW+    TR+ M+ +G+ K PGQSWI VKN +HSF  GD++HP   +I+ Y  
Sbjct: 953  NLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQ 1012

Query: 656  KLIGRLKEIGYLYDVKLVMQDVEEEQGEVLLSWHSEKLAVAYGIISLASGIPIRIMKNLR 715
             L  R  EIGY+ D   ++ +++ EQ + ++  HSEKLA+++G++SL + +PI +MKNLR
Sbjct: 1013 DLTKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLR 1064

Query: 716  VCADCHNFMKLTSQLLGREIIVRDIHRFHHFNSGRCSCGDYW 758
            VC DCH ++K  S++  REIIVRD +RFHHF  G CSC DYW
Sbjct: 1073 VCNDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDYW 1064

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038889548.1	0.0e+00	86.68	putative pentatricopeptide repeat-containing protein At3g23330 [Benincasa hispid...
KAG6586149.1	0.0e+00	86.26	putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
KAG7020981.1	0.0e+00	86.26	putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
KAG7029890.1	0.0e+00	85.12	putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma sub...
XP_022965499.1	0.0e+00	87.71	LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23...
Match Name	E-value	Identity	Description
Q9SHZ8	2.2e-145	37.62	Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9LW63	4.6e-143	36.62	Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
Q9LN01	2.2e-132	34.89	Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q5G1T1	2.8e-132	37.50	Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop...
Q9SVP7	2.8e-132	35.04	Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
Match Name	E-value	Identity	Description
A0A0A0LKI4	0.0e+00	83.51	DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0742...
A0A6J1HR62	0.0e+00	87.71	LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23...
A0A6J1EPP7	0.0e+00	87.46	putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita mosc...
A0A6J1KA70	0.0e+00	87.04	putative pentatricopeptide repeat-containing protein At3g23330 OS=Cucurbita maxi...
A0A1S3B4E3	0.0e+00	84.27	LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g23...
Match Name	E-value	Identity	Description
AT2G22070.1	1.6e-146	37.62	pentatricopeptide (PPR) repeat-containing protein
AT3G23330.1	3.3e-144	36.62	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1	1.5e-133	34.89	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G49170.1	2.0e-133	37.50	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G13650.1	2.0e-133	35.04	Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
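IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 450..497; e-value: 1.4E-10; score: 41.2
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 100..146; e-value: 7.7E-11; score: 42.0
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 307..331; e-value: 0.0067; score: 16.6
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 204..228; e-value: 0.0063; score: 16.7
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 524..546; e-value: 0.73; score: 10.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 235..262; e-value: 1.4E-4; score: 21.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 424..447; e-value: 0.15; score: 12.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 175..200; e-value: 0.065; score: 13.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 352..379; e-value: 5.2E-5; score: 23.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 103..135; e-value: 7.7E-8; score: 30.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 354..379; e-value: 1.8E-4; score: 19.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 235..262; e-value: 5.8E-5; score: 21.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 204..228; e-value: 6.6E-4; score: 17.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 452..486; e-value: 1.9E-8; score: 31.9
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 450..484; score: 12.167101
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 232..266; score: 9.404853
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 100..134; score: 11.980759
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 349..384; score: 8.769097
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 623..746; e-value: 5.9E-40; score: 136.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 424..644; e-value: 1.5E-37; score: 131.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 282..423; e-value: 2.8E-15; score: 58.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 8..162; e-value: 2.7E-22; score: 81.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 174..281; e-value: 1.1E-20; score: 75.8
None	No IPR available	PANTHER	PTHR24015:SF1922	OS07G0239600 PROTEIN	coord: 33..739
None	No IPR available	PANTHER	PTHR24015	OS07G0578800 PROTEIN-RELATED	coord: 33..739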
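
Each alignment entry in the InterPro table gives a residue range as `coord: start..end`. The Python sketch below extracts those ranges and computes their lengths. It assumes the coordinates are 1-based and inclusive, which the page itself does not state, and the helper name `domain_spans` is hypothetical.

```python
import re

# Finds every "coord: start..end" fragment in a line or block of text.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(text):
    """Return (start, end, length) tuples for each coordinate range.

    Length is end - start + 1, assuming 1-based inclusive coordinates
    (an assumption; the page does not say so explicitly).
    """
    spans = []
    for m in COORD_RE.finditer(text):
        start, end = int(m.group(1)), int(m.group(2))
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    # The DYW domain entry from the table above.
    print(domain_spans("PF14432 DYW_deaminase coord: 623..746"))
```

Under that assumption the DYW domain range 623..746 covers 124 residues of the 758-residue query shown in the alignments.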

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Moc08g38960.1	Moc08g38960.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding