CaUC01G008010 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC01G008010
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Ciama_Chr01: 9021259 .. 9023094 (+)
RNA-Seq Expression: CaUC01G008010
Synteny: CaUC01G008010
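
The location field can be turned into a feature length with a few lines of code. The sketch below is illustrative only: the function names `parse_location` and `feature_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split a 'seqid: start .. end (strand)' string into its parts.

    Assumes the layout used in the overview above; other layouts raise ValueError.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


seqid, start, end, strand = parse_location("Ciama_Chr01: 9021259 .. 9023094 (+)")
length = feature_length(start, end)
print(seqid, strand, length)  # prints: Ciama_Chr01 + 1836
```

Under that assumption the gene spans 1836 nt, which equals 612 codons: 611 coding codons plus one stop codon. This is consistent with the 611-residue query length reported in the BLAST results on this page.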
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGTCACTGGCTCTCCCGACTGGTCACTCCCCCCATCAACCTTTTTCAGAAAATCCCATCTCATAACTTTCATCCCCACCTCTAATCTCGCTCTCCTTTTCTCTCTTCCCGCTTCAAATCTTCGATCCCTTCATCTAAATTCCTCCGGTTGCCCTTCCCCAATCTTAGAACAATCCTCCATCGCCTTACCCGACATCCATTTGGACTCCAATCTTCAAGATTTTCAACTTCCCTCCTTGCCTAACGTTGAAGATTTGAACGATTTCTTATGTGGGTTGTCCCAAAACCCCGGGAGCGAGGATTTGATCTATGAGTACTATGTAAAAGCGAAGGAGAGGGCAGGGTTTCGACCTGAGAAATCCACATTGCGGCATCTAGTCAGGTACTTAGTTCGATTGAAGAAGTGGGATTTGATTTTGTTAGTTTCTAGGGATTTTGTGGACTATGGTGTTTCCCCTGATAGAGATACTTGTTCTAGATTGGTTAGTAGTTGTGTTAGAGGTAGAAAATTTAAAGTTGTTAAGGCTCTGCTTGAGGTTTTTGAAAGGGATAGTGATGTTGCCACGGCTGCTTTTGAAGCTGCCATGAGAGGCTACAATAAGCTTCACATGTACAAAAGCACTATCCTGGTTTTCCAGCGGTTGAAATCTGCGAGAATTGAAGCAGATTCTGGATGCTATTGCAGGGTAATGGAAGCCTATCTTAAACTTGGGGATTTTGAGAGAGTTGTGGAGCTGTTTGATGAAGTTGAGAGTAGGATTTCGGATTTTGCGCCCTTTTCGACCAAAATTTACGGGATACTTTGCGAGTCCTTAGCGAAATCGGGGCGAGTTTTCGAGTCGCTTGAGTTCTTTAGAGATATGAAGAAGAAAGGGATTGCAGAAGACTACACCATTTACTCTGCTTTGATATGTACTTTTGCAAGCATTCAGGAAGTTAAATTAGCTGAAGATCTTTACAATGAGGCAAAAGCCAAGAAGTTGTTGAGAGACCCTGCAGTGTTTCTAAAGCTCATATTGATGTATATTCAACGAGGGTCATTAGAGAAGGCACTTGAGATTGTTGAAGTGATGAAGGACTTTAAAGTTGGAGTGTCTGACTGTATTTTCTGTGCAATTGTCAATGGTTACGCCACGAGAAGGGGGTATAATGCTGCAGTTAAGGTTTACGAGAAACTGATCGAAGACGGGTGCGAGCCAGGACAAGTGACATACGCCTCAGCAATCAATGCCTACTGCCGGGTAGGGCTGTACTCGAAAGCAGAGGACATGTTTAGAGAAATGGAGGAGAAGGGGTTTGACAAATGTGTAGTAGCTTACTCTAGCTTGATATCAATGTATGGAAAGACAGGGAGATTGAAGGATGCAATGAGGCTTTTAGCAAAGATGAAAGAAAGAGGGTGTCAACCAAATGTTTGGATTTACAATATCTTGATGGAGATGCATGGGAAGGCTAAGAATTTGAAGCAAGTTGAGAAGCTATGGAAAGAAATGAAGCGCAAAAAGATAGCACCTGATAAAGTTAGCTATACAAGTATCATAAGTGCTTATGTCAAGGCATCTGAATTCGAGACGTGCGAGCGATATTACCGCGAGTTTCGGATGAATGGGGGTGCCATCGATAAGGCAATGGCGGGAATCATGGTTGGCGTGTTCTCAAAGACGAGTCGGGTTGATGAGCTGGTGAAGCTTCTTAGGGACATGAACTTAGAAGGAACACGGTTGGATGGGAGGTTGTATAGGTCGGCATTGAACGCTTTGATTGATGCTGGGTTGCAAGTGCAAGCAAAATGGTTGCAAGATCATTATGCTGGAAAATCAGGCTATGTTTAA

mRNA sequence

ATGGCCGTCACTGGCTCTCCCGACTGGTCACTCCCCCCATCAACCTTTTTCAGAAAATCCCATCTCATAACTTTCATCCCCACCTCTAATCTCGCTCTCCTTTTCTCTCTTCCCGCTTCAAATCTTCGATCCCTTCATCTAAATTCCTCCGGTTGCCCTTCCCCAATCTTAGAACAATCCTCCATCGCCTTACCCGACATCCATTTGGACTCCAATCTTCAAGATTTTCAACTTCCCTCCTTGCCTAACGTTGAAGATTTGAACGATTTCTTATGTGGGTTGTCCCAAAACCCCGGGAGCGAGGATTTGATCTATGAGTACTATGTAAAAGCGAAGGAGAGGGCAGGGTTTCGACCTGAGAAATCCACATTGCGGCATCTAGTCAGGTACTTAGTTCGATTGAAGAAGTGGGATTTGATTTTGTTAGTTTCTAGGGATTTTGTGGACTATGGTGTTTCCCCTGATAGAGATACTTGTTCTAGATTGGTTAGTAGTTGTGTTAGAGGTAGAAAATTTAAAGTTGTTAAGGCTCTGCTTGAGGTTTTTGAAAGGGATAGTGATGTTGCCACGGCTGCTTTTGAAGCTGCCATGAGAGGCTACAATAAGCTTCACATGTACAAAAGCACTATCCTGGTTTTCCAGCGGTTGAAATCTGCGAGAATTGAAGCAGATTCTGGATGCTATTGCAGGGTAATGGAAGCCTATCTTAAACTTGGGGATTTTGAGAGAGTTGTGGAGCTGTTTGATGAAGTTGAGAGTAGGATTTCGGATTTTGCGCCCTTTTCGACCAAAATTTACGGGATACTTTGCGAGTCCTTAGCGAAATCGGGGCGAGTTTTCGAGTCGCTTGAGTTCTTTAGAGATATGAAGAAGAAAGGGATTGCAGAAGACTACACCATTTACTCTGCTTTGATATGTACTTTTGCAAGCATTCAGGAAGTTAAATTAGCTGAAGATCTTTACAATGAGGCAAAAGCCAAGAAGTTGTTGAGAGACCCTGCAGTGTTTCTAAAGCTCATATTGATGTATATTCAACGAGGGTCATTAGAGAAGGCACTTGAGATTGTTGAAGTGATGAAGGACTTTAAAGTTGGAGTGTCTGACTGTATTTTCTGTGCAATTGTCAATGGTTACGCCACGAGAAGGGGGTATAATGCTGCAGTTAAGGTTTACGAGAAACTGATCGAAGACGGGTGCGAGCCAGGACAAGTGACATACGCCTCAGCAATCAATGCCTACTGCCGGGTAGGGCTGTACTCGAAAGCAGAGGACATGTTTAGAGAAATGGAGGAGAAGGGGTTTGACAAATGTGTAGTAGCTTACTCTAGCTTGATATCAATGTATGGAAAGACAGGGAGATTGAAGGATGCAATGAGGCTTTTAGCAAAGATGAAAGAAAGAGGGTGTCAACCAAATGTTTGGATTTACAATATCTTGATGGAGATGCATGGGAAGGCTAAGAATTTGAAGCAAGTTGAGAAGCTATGGAAAGAAATGAAGCGCAAAAAGATAGCACCTGATAAAGTTAGCTATACAAGTATCATAAGTGCTTATGTCAAGGCATCTGAATTCGAGACGTGCGAGCGATATTACCGCGAGTTTCGGATGAATGGGGGTGCCATCGATAAGGCAATGGCGGGAATCATGGTTGGCGTGTTCTCAAAGACGAGTCGGGTTGATGAGCTGGTGAAGCTTCTTAGGGACATGAACTTAGAAGGAACACGGTTGGATGGGAGGTTGTATAGGTCGGCATTGAACGCTTTGATTGATGCTGGGTTGCAAGTGCAAGCAAAATGGTTGCAAGATCATTATGCTGGAAAATCAGGCTATGTTTAA

Coding sequence (CDS)

ATGGCCGTCACTGGCTCTCCCGACTGGTCACTCCCCCCATCAACCTTTTTCAGAAAATCCCATCTCATAACTTTCATCCCCACCTCTAATCTCGCTCTCCTTTTCTCTCTTCCCGCTTCAAATCTTCGATCCCTTCATCTAAATTCCTCCGGTTGCCCTTCCCCAATCTTAGAACAATCCTCCATCGCCTTACCCGACATCCATTTGGACTCCAATCTTCAAGATTTTCAACTTCCCTCCTTGCCTAACGTTGAAGATTTGAACGATTTCTTATGTGGGTTGTCCCAAAACCCCGGGAGCGAGGATTTGATCTATGAGTACTATGTAAAAGCGAAGGAGAGGGCAGGGTTTCGACCTGAGAAATCCACATTGCGGCATCTAGTCAGGTACTTAGTTCGATTGAAGAAGTGGGATTTGATTTTGTTAGTTTCTAGGGATTTTGTGGACTATGGTGTTTCCCCTGATAGAGATACTTGTTCTAGATTGGTTAGTAGTTGTGTTAGAGGTAGAAAATTTAAAGTTGTTAAGGCTCTGCTTGAGGTTTTTGAAAGGGATAGTGATGTTGCCACGGCTGCTTTTGAAGCTGCCATGAGAGGCTACAATAAGCTTCACATGTACAAAAGCACTATCCTGGTTTTCCAGCGGTTGAAATCTGCGAGAATTGAAGCAGATTCTGGATGCTATTGCAGGGTAATGGAAGCCTATCTTAAACTTGGGGATTTTGAGAGAGTTGTGGAGCTGTTTGATGAAGTTGAGAGTAGGATTTCGGATTTTGCGCCCTTTTCGACCAAAATTTACGGGATACTTTGCGAGTCCTTAGCGAAATCGGGGCGAGTTTTCGAGTCGCTTGAGTTCTTTAGAGATATGAAGAAGAAAGGGATTGCAGAAGACTACACCATTTACTCTGCTTTGATATGTACTTTTGCAAGCATTCAGGAAGTTAAATTAGCTGAAGATCTTTACAATGAGGCAAAAGCCAAGAAGTTGTTGAGAGACCCTGCAGTGTTTCTAAAGCTCATATTGATGTATATTCAACGAGGGTCATTAGAGAAGGCACTTGAGATTGTTGAAGTGATGAAGGACTTTAAAGTTGGAGTGTCTGACTGTATTTTCTGTGCAATTGTCAATGGTTACGCCACGAGAAGGGGGTATAATGCTGCAGTTAAGGTTTACGAGAAACTGATCGAAGACGGGTGCGAGCCAGGACAAGTGACATACGCCTCAGCAATCAATGCCTACTGCCGGGTAGGGCTGTACTCGAAAGCAGAGGACATGTTTAGAGAAATGGAGGAGAAGGGGTTTGACAAATGTGTAGTAGCTTACTCTAGCTTGATATCAATGTATGGAAAGACAGGGAGATTGAAGGATGCAATGAGGCTTTTAGCAAAGATGAAAGAAAGAGGGTGTCAACCAAATGTTTGGATTTACAATATCTTGATGGAGATGCATGGGAAGGCTAAGAATTTGAAGCAAGTTGAGAAGCTATGGAAAGAAATGAAGCGCAAAAAGATAGCACCTGATAAAGTTAGCTATACAAGTATCATAAGTGCTTATGTCAAGGCATCTGAATTCGAGACGTGCGAGCGATATTACCGCGAGTTTCGGATGAATGGGGGTGCCATCGATAAGGCAATGGCGGGAATCATGGTTGGCGTGTTCTCAAAGACGAGTCGGGTTGATGAGCTGGTGAAGCTTCTTAGGGACATGAACTTAGAAGGAACACGGTTGGATGGGAGGTTGTATAGGTCGGCATTGAACGCTTTGATTGATGCTGGGTTGCAAGTGCAAGCAAAATGGTTGCAAGATCATTATGCTGGAAAATCAGGCTATGTTTAA

Protein sequence

MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQSSIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPEKSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLEVFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGDFERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMKDFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGAIDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWLQDHYAGKSGYV
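
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: `translate` is a hypothetical helper, the standard nuclear code is assumed, and only the first 27 and last 21 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base outermost).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGCCGTCACTGGCTCTCCCGACTGG"  # first 9 codons of the CDS above
cds_end = "GGAAAATCAGGCTATGTTTAA"          # last 7 codons, including the TAA stop

print(translate(cds_start))  # MAVTGSPDW
print(translate(cds_end))    # GKSGYV
```

Both fragments agree with the start (MAVTGSPDW) and end (GKSGYV) of the protein sequence listed above.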
Homology
BLAST of CaUC01G008010 vs. NCBI nr
Match: XP_038874313.1 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 [Benincasa hispida] >XP_038874314.1 pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 1150.6 bits (2975), Expect = 0.0e+00
Identity = 576/611 (94.27%), Positives = 591/611 (96.73%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MAVTGSPDWSLPPST FRKSHLI FIPTSNL+ LFSLP SNLRSLHL SSGCPSPILEQS
Sbjct: 1   MAVTGSPDWSLPPSTSFRKSHLINFIPTSNLSFLFSLPTSNLRSLHLKSSGCPSPILEQS 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
           SIALPDIHLDSNLQD QLPSLP VEDLNDFLCGLSQNPGSEDLIYEYYVKAKE+AGFRPE
Sbjct: 61  SIALPDIHLDSNLQDIQLPSLPTVEDLNDFLCGLSQNPGSEDLIYEYYVKAKEKAGFRPE 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVRLKKWDLILLVSRDFVDY V PDRDTCSRLVSSCVRGRKFKVVKALLE
Sbjct: 121 KSTLRHLIRYLVRLKKWDLILLVSRDFVDYSVCPDRDTCSRLVSSCVRGRKFKVVKALLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFE+DSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGC CRVMEAYLKLGD
Sbjct: 181 VFEKDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCCCRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ERV+ELF+EVESRISDF PFSTKIYGILCESLAKSGRVFESLEFFRDM+KKGI EDYTI
Sbjct: 241 SERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALI TFASIQEVKLAEDLYNEAKAKKLLRDPA+FLKLILMYIQ+GSLEKALE+V+VMK
Sbjct: 301 YSALISTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALELVQVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+GVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAEDMF EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM
Sbjct: 421 KAEDMFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAY KASEFETCE+YY EFRMNGG 
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYAKASEFETCEQYYLEFRMNGGT 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKAMAGIMVGVFSKTSRVDELVKLLRDM LEGTRLDGRLYRSALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 611
           Q HYAGKSG+V
Sbjct: 601 QGHYAGKSGFV 611
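
The identity percentage in an HSP header is the number of identical aligned residues divided by the alignment length. The sketch below recomputes it for the hit above; `count_identities` and `percent` are hypothetical helper names, and the two strings are the first 60-column block of this alignment. Counting positives would additionally require the substitution matrix used for the search (presumably BLOSUM62, which the page does not state), so only identities are counted here.

```python
def count_identities(query: str, sbjct: str) -> int:
    """Count columns where both aligned rows carry the same residue (gaps excluded)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


def percent(matches: int, length: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * matches / length, 2)


# First alignment block (query positions 1-60) of the Benincasa hispida hit.
query_block = "MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS"
sbjct_block = "MAVTGSPDWSLPPSTSFRKSHLINFIPTSNLSFLFSLPTSNLRSLHLKSSGCPSPILEQS"

print(count_identities(query_block, sbjct_block))  # 54 of 60 columns identical
print(percent(576, 611))  # 94.27, the reported identity
print(percent(591, 611))  # 96.73, the reported positives
```

Summing the per-block identity counts over all blocks of an HSP should reproduce the numerator in its header (576 for this hit).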

BLAST of CaUC01G008010 vs. NCBI nr
Match: XP_022938691.1 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1104.0 bits (2854), Expect = 0.0e+00
Identity = 554/611 (90.67%), Positives = 577/611 (94.44%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MAVTGSPDWSLP ST FRKS L+TFIP SNLALLFSLP  NLRSLHLNSSGCPSPILE S
Sbjct: 1   MAVTGSPDWSLPSSTCFRKSRLLTFIPASNLALLFSLP--NLRSLHLNSSGCPSPILESS 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
             +LP+I  DSNLQDFQLPS  +VEDLNDFLCGL QNPG EDLIYEYYVKAKE  GFRPE
Sbjct: 61  PTSLPEIDSDSNLQDFQLPSSSSVEDLNDFLCGLPQNPGREDLIYEYYVKAKETPGFRPE 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVR K W+LILLVSRDFVDY V PDRDTCSRLVSSCVRGRKFKVV+ALLE
Sbjct: 121 KSTLRHLIRYLVRSKNWNLILLVSRDFVDYDVCPDRDTCSRLVSSCVRGRKFKVVRALLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERD DVATAAFEAAMRGYNKLHMYKSTILVFQRLKSA+IEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDRDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSAKIEADSGCYCRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ER++ELF+E+ESRISDF PFSTKIYGILC+SLAKSGRVFESLEFFRDM+KKGI EDYTI
Sbjct: 241 SERIMELFNEIESRISDFTPFSTKIYGILCKSLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAK KKLLRDPA+F KLILMYIQ+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKTKKLLRDPAMFQKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+GVSDCIFCAIVNGYATRRGYNAAV VYEKLI D CEPGQVTYA AINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYNAAVNVYEKLIRDECEPGQVTYALAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM
Sbjct: 421 KAEDVFVEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EFETCERYYREFRMNGGA
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKAAEFETCERYYREFRMNGGA 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA+AGIMVGVFSKTSRVDELVKLLRDMNLEG RLD RLYRSALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAIAGIMVGVFSKTSRVDELVKLLRDMNLEGIRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 611
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 609
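
Each HSP line reports a bit score followed by the raw alignment score in parentheses. The two are related by the Karlin-Altschul formula S' = (lambda * S - ln K) / ln 2. The sketch below checks that relationship for three hits on this page. The values lambda = 0.267 and K = 0.041 are an assumption: they are the commonly cited parameters for gapped BLOSUM62 searches with gap costs 11/1, and the page does not state which matrix or gap costs were used.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap open 11, extend 1).
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: float) -> float:
    """Convert a raw alignment score to a bit score: (lambda*S - ln K) / ln 2."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


# (raw score, bit score reported on this page)
reported = [(2975, 1150.6), (2854, 1104.0), (370, 147.1)]
for raw, bits in reported:
    print(raw, round(bit_score(raw), 1), bits)
```

With these assumed parameters the computed values agree with the reported bit scores to within rounding, which suggests, but does not prove, that the searches used the default scoring scheme.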

BLAST of CaUC01G008010 vs. NCBI nr
Match: XP_022993436.1 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 [Cucurbita maxima])

HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 553/611 (90.51%), Positives = 575/611 (94.11%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MAVTGSPDWSLP ST FRKS LITFIP SNLALLFSLP  NLRSLHLNSSGCPSPILE S
Sbjct: 1   MAVTGSPDWSLPSSTCFRKSRLITFIPASNLALLFSLP--NLRSLHLNSSGCPSPILESS 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
             +LP+I  DSNLQDFQLPS  +VEDLNDFLCGL QNPG EDLIYEYYVKAKE  GFRPE
Sbjct: 61  PTSLPEIDSDSNLQDFQLPSSSSVEDLNDFLCGLPQNPGREDLIYEYYVKAKETPGFRPE 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVRLKKW LILLVSRDFVDY V PDRDTCSRLVSSCVRGRKFKVV+ALLE
Sbjct: 121 KSTLRHLIRYLVRLKKWSLILLVSRDFVDYDVCPDRDTCSRLVSSCVRGRKFKVVRALLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERD DVATAAFEAAMRGYNKLHMY+STILVFQRLKSA+IEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDRDVATAAFEAAMRGYNKLHMYRSTILVFQRLKSAKIEADSGCYCRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ER++ELF+E+ESRISDF PFSTKIYGILC+SLAKSGRVFESLEFFRDM+KKGI EDYTI
Sbjct: 241 SERIMELFNEIESRISDFTPFSTKIYGILCKSLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAK KKLLRDPA F KLILMYIQ+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKTKKLLRDPATFQKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+G SDCIFCAIVNGYATRRGYNAAV +YEKLI D CEPGQVTYA AINAYCRVGLYS
Sbjct: 361 DFKIGASDCIFCAIVNGYATRRGYNAAVNIYEKLIRDECEPGQVTYALAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM
Sbjct: 421 KAEDVFVEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EFETCERYYREFRMNGG 
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKAAEFETCERYYREFRMNGGT 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA+AGIMVGVFSKTSRVDELVKLLRDMNLEG RLD RLYRSALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAIAGIMVGVFSKTSRVDELVKLLRDMNLEGIRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 611
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 609

BLAST of CaUC01G008010 vs. NCBI nr
Match: XP_004151188.1 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucumis sativus] >KGN49761.1 hypothetical protein Csa_017804 [Cucumis sativus])

HSP 1 Score: 1101.7 bits (2848), Expect = 0.0e+00
Identity = 552/611 (90.34%), Positives = 579/611 (94.76%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MA++ SPD S PPS  FRKSH   FI TSN +LLFSLP SNL SLHLNSSGCPSPILEQ 
Sbjct: 1   MALSPSPDCSFPPSNSFRKSH---FISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQP 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
           SIALPDIH +SNL DFQLPSLPNV+DLNDFLCGLSQNPG+EDLIY+YYVKAKE AGFRP+
Sbjct: 61  SIALPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQ 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVRLKKWDLILLVSRDFVD+GV PDRDTCS+LVSSCVRGRKFKVVK+LLE
Sbjct: 121 KSTLRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERDS VA  AFEAAMRGYNKLHM+KSTI+VFQRLKSARIEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ERV+ELF+EVESRIS   PFSTKIYGILCESLAKSGRVFESLEFFRDM+KKGIAEDYTI
Sbjct: 241 SERVMELFNEVESRISVSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPA+FLKLILMY+Q+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+GVSDCIFCAIVNGYATRRGY AAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILM
Sbjct: 421 KAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFE CE+YYREFRMNGG 
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGT 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA  GIMVGVFSKTSRVDELVKLLRDM LEGTRLD RLYR+ALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 611
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 608

BLAST of CaUC01G008010 vs. NCBI nr
Match: KAG6579160.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1100.1 bits (2844), Expect = 0.0e+00
Identity = 552/611 (90.34%), Positives = 576/611 (94.27%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MAVTGSPDWSLP ST FRKS L+TFIP SNLALLFSLP  NLRSLHLNSSGCPSPILE S
Sbjct: 1   MAVTGSPDWSLPSSTCFRKSRLLTFIPASNLALLFSLP--NLRSLHLNSSGCPSPILETS 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
             +LP+I  DSNLQDFQLPS  +VEDLNDFLCGL QNPG EDLIYEYYVKAKE  GFRPE
Sbjct: 61  PTSLPEIDSDSNLQDFQLPSSSSVEDLNDFLCGLPQNPGREDLIYEYYVKAKETPGFRPE 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVR K W+LILLVSRDFVDY V PDRDTCSRLVSSCVRGRKFKVV+ALLE
Sbjct: 121 KSTLRHLIRYLVRSKNWNLILLVSRDFVDYDVCPDRDTCSRLVSSCVRGRKFKVVRALLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERD DVATAAFEAAMRGYNKLHMYKSTILVFQRLKSA+IEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDRDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSAKIEADSGCYCRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ER++ELF+E+ESRISDF PFSTKIYGILC+SLAKSGRVFESLEFFRDM+KKGI EDYTI
Sbjct: 241 SERIMELFNEIESRISDFTPFSTKIYGILCKSLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAK KKLLRDPA+F KLILMYIQ+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKTKKLLRDPAMFQKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+GVSDCIFCAIVNGYATRRGYNAAV VYEKLI D CEPGQVTYA AINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYNAAVNVYEKLIRDECEPGQVTYALAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFD CVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM
Sbjct: 421 KAEDVFVEMEEKGFDNCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EFETCE+YYREFRMNGGA
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKAAEFETCEQYYREFRMNGGA 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA+AGIMVGVFSKTSRVDELVKLLRDMNLEG RLD RLYRSALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAIAGIMVGVFSKTSRVDELVKLLRDMNLEGIRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 611
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 609

BLAST of CaUC01G008010 vs. ExPASy Swiss-Prot
Match: Q66GP4 (Pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g13770 PE=2 SV=1)

HSP 1 Score: 590.5 bits (1521), Expect = 2.2e-167
Identity = 308/580 (53.10%), Positives = 402/580 (69.31%), Query Frame = 0

Query: 27  PTSNLALLFSLPASNLRSLHLNSSGCPSPILEQSSIALPDIHLDSNLQDFQLPSLPNVED 86
           PT  +  L   P     + H+ SS C S +LE+     P     S  +D      P   D
Sbjct: 24  PTKPIFFLSQKP----HNFHVCSSRC-SMVLEEDEKKSP-----SPKEDKWPFFEPGPND 83

Query: 87  LNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPEKSTLRHLVRYLVRLKKWDLILLVSRD 146
           LN  L    ++P +  L  E+Y KAKE +  R    T +HL+ YLV  K WDL++ V  D
Sbjct: 84  LNRVLSRFLRDPETRKLSSEFYEKAKENSELR----TTKHLISYLVSSKSWDLLVSVCED 143

Query: 147 FVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLEVFERDSDVATAAFEAAMRGYNKLHMY 206
             ++   PD  TCS L+ SC+R RKF++   LL VF  D  +A +A +AAM+G+NKL MY
Sbjct: 144 LREHKALPDGQTCSNLIRSCIRDRKFRITHCLLSVFRSDKSLAVSASDAAMKGFNKLQMY 203

Query: 207 KSTILVFQRLK-SARIEADSGCYCRVMEAYLKLGDFERVVELFDEVES-RISDFAPFSTK 266
            STI VF RLK S  +E   GCYCR+MEA+ K+G+  +VVELF E +S R+S  A  S  
Sbjct: 204 SSTIQVFDRLKQSVGVEPSPGCYCRIMEAHEKIGENHKVVELFQEFKSQRLSFLAKESGS 263

Query: 267 IYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEA 326
           IY I+C SLAKSGR FE+LE   +MK KGI E   +YS LI  FA  +EV + E L+ EA
Sbjct: 264 IYTIVCSSLAKSGRAFEALEVLEEMKDKGIPESSELYSMLIRAFAEAREVVITEKLFKEA 323

Query: 327 KAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMKDFKVGVSDCIFCAIVNGYATRRGY 386
             KKLL+DP + LK++LMY++ G++E  LE+V  M+  ++ V+DCI CAIVNG++ +RG+
Sbjct: 324 GGKKLLKDPEMCLKVVLMYVREGNMETTLEVVAAMRKAELKVTDCILCAIVNGFSKQRGF 383

Query: 387 NAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDMFREMEEKGFDKCVVAYSSL 446
             AVKVYE  +++ CE GQVTYA AINAYCR+  Y+KAE +F EM +KGFDKCVVAYS++
Sbjct: 384 AEAVKVYEWAMKEECEAGQVTYAIAINAYCRLEKYNKAEMLFDEMVKKGFDKCVVAYSNI 443

Query: 447 ISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKI 506
           + MYGKT RL DA+RL+AKMK+RGC+PN+WIYN L++MHG+A +L++ EK+WKEMKR K+
Sbjct: 444 MDMYGKTRRLSDAVRLMAKMKQRGCKPNIWIYNSLIDMHGRAMDLRRAEKIWKEMKRAKV 503

Query: 507 APDKVSYTSIISAYVKASEFETCERYYREFRMNGGAIDKAMAGIMVGVFSKTSRVDELVK 566
            PDKVSYTS+ISAY ++ E E C   Y+EFRMN G ID+AMAGIMVGVFSKTSR+DEL++
Sbjct: 504 LPDKVSYTSMISAYNRSKELERCVELYQEFRMNRGKIDRAMAGIMVGVFSKTSRIDELMR 563

Query: 567 LLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWLQDHY 605
           LL+DM +EGTRLD RLY SALNAL DAGL  Q +WLQ+ +
Sbjct: 564 LLQDMKVEGTRLDARLYSSALNALRDAGLNSQIRWLQESF 589

BLAST of CaUC01G008010 vs. ExPASy Swiss-Prot
Match: O82178 (Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana OX=3702 GN=At2g35130 PE=3 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 6.4e-34
Identity = 99/415 (23.86%), Positives = 190/415 (45.78%), Query Frame = 0

Query: 193 FEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGDFERVVELFDEVE 252
           F   +  Y +   YK    ++ +L  +R       Y  +++AY   G  ER   +  E++
Sbjct: 158 FNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMAGLIERAEVVLVEMQ 217

Query: 253 SRISDFAPFSTKIYGILCESLAK-SGRVFESLEFFRDMKKKGIAEDYTIYSALICTFASI 312
           +           +Y    E L K  G   E+++ F+ MK+         Y+ +I  +   
Sbjct: 218 NHHVSPKTIGVTVYNAYIEGLMKRKGNTEEAIDVFQRMKRDRCKPTTETYNLMINLYGKA 277

Query: 313 QEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMKDFKVGVSDCIF 372
            +  ++  LY E ++ +   +   +  L+  + + G  EKA EI E +++  +     ++
Sbjct: 278 SKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKAEEIFEQLQEDGLEPDVYVY 337

Query: 373 CAIVNGYATRRGY-NAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDMF---- 432
            A++  Y +R GY   A +++  +   GCEP + +Y   ++AY R GL+S AE +F    
Sbjct: 338 NALMESY-SRAGYPYGAAEIFSLMQHMGCEPDRASYNIMVDAYGRAGLHSDAEAVFEEMK 397

Query: 433 -------------------------------REMEEKGFDKCVVAYSSLISMYGKTGRLK 492
                                          +EM E G +      +S++++YG+ G+  
Sbjct: 398 RLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVEPDTFVLNSMLNLYGRLGQFT 457

Query: 493 DAMRLLAKMKERGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII 552
              ++LA+M+   C  ++  YNIL+ ++GKA  L+++E+L+ E+K K   PD V++TS I
Sbjct: 458 KMEKILAEMENGPCTADISTYNILINIYGKAGFLERIEELFVELKEKNFRPDVVTWTSRI 517

Query: 553 SAYVKASEFETCERYYREFRMNGGAIDKAMAGIMVGVFSKTSRVDELVKLLRDMN 571
            AY +   +  C   + E   +G A D   A +++   S   +V+++  +LR M+
Sbjct: 518 GAYSRKKLYVKCLEVFEEMIDSGCAPDGGTAKVLLSACSSEEQVEQVTSVLRTMH 571

BLAST of CaUC01G008010 vs. ExPASy Swiss-Prot
Match: Q9LW84 (Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX=3702 GN=At3g16010 PE=2 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.4e-33
Identity = 109/470 (23.19%), Positives = 211/470 (44.89%), Query Frame = 0

Query: 122 STLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVR-GRKFKVVKALLE 181
           + L  LV+ L R K     L V          P   T + ++   ++ G+  KV +   E
Sbjct: 163 AVLSELVKALGRAKMVSKALSVFYQAKGRKCKPTSSTYNSVILMLMQEGQHEKVHEVYTE 222

Query: 182 VF-ERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLG 241
           +  E D    T  + A +  Y KL    S I +F  +K   ++     Y  ++  Y K+G
Sbjct: 223 MCNEGDCFPDTITYSALISSYEKLGRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVG 282

Query: 242 DFERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYT 301
             E+ ++LF+E++   +  +P +   Y  L + L K+GRV E+  F++DM + G+  D  
Sbjct: 283 KVEKALDLFEEMKR--AGCSP-TVYTYTELIKGLGKAGRVDEAYGFYKDMLRDGLTPDVV 342

Query: 302 IYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLI-LMYIQRGSLEKALEIVEV 361
             + L+     +  V+   ++++E    +       +  +I  ++  +  + +     + 
Sbjct: 343 FLNNLMNILGKVGRVEELTNVFSEMGMWRCTPTVVSYNTVIKALFESKAHVSEVSSWFDK 402

Query: 362 MKDFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGL 421
           MK   V  S+  +  +++GY        A+ + E++ E G  P    Y S INA  +   
Sbjct: 403 MKADSVSPSEFTYSILIDGYCKTNRVEKALLLLEEMDEKGFPPCPAAYCSLINALGKAKR 462

Query: 422 YSKAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNI 481
           Y  A ++F+E++E   +     Y+ +I  +GK G+L +A+ L  +MK +G  P+V+ YN 
Sbjct: 463 YEAANELFKELKENFGNVSSRVYAVMIKHFGKCGKLSEAVDLFNEMKNQGSGPDVYAYNA 522

Query: 482 LMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNG 541
           LM    KA  + +   L ++M+      D  S+  I++ + +          +   + +G
Sbjct: 523 LMSGMVKAGMINEANSLLRKMEENGCRADINSHNIILNGFARTGVPRRAIEMFETIKHSG 582

Query: 542 GAIDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNAL 589
              D      ++G F+     +E  +++R+M  +G   D   Y S L+A+
Sbjct: 583 IKPDGVTYNTLLGCFAHAGMFEEAARMMREMKDKGFEYDAITYSSILDAV 629

BLAST of CaUC01G008010 vs. ExPASy Swiss-Prot
Match: Q8S8P6 (Pentatricopeptide repeat-containing protein At2g32630 OS=Arabidopsis thaliana OX=3702 GN=At2g32630 PE=3 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 3.2e-33
Identity = 115/433 (26.56%), Positives = 197/433 (45.50%), Query Frame = 0

Query: 125 RHLVRYLVRLKK---WDLILLVSRDFVDYGVSPDRDTCSRLVSS-CVRGRKFKVVKALLE 184
           R  + +LV  KK    DL L + R  VD GV     + + +V   C RG   K  K + E
Sbjct: 190 RSCIVFLVAAKKRRRIDLCLEIFRRMVDSGVKITVYSLTIVVEGLCRRGEVEKSKKLIKE 249

Query: 185 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 244
              +        +   +  Y K   +     V + +K   +  +   Y  +ME  +K G 
Sbjct: 250 FSVKGIKPEAYTYNTIINAYVKQRDFSGVEGVLKVMKKDGVVYNKVTYTLLMELSVKNGK 309

Query: 245 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 304
                +LFDE+  R  +       +Y  L     + G +  +   F ++ +KG++     
Sbjct: 310 MSDAEKLFDEMRERGIE---SDVHVYTSLISWNCRKGNMKRAFLLFDELTEKGLSPSSYT 369

Query: 305 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVM- 364
           Y ALI     + E+  AE L NE ++K +     VF  LI  Y ++G +++A  I +VM 
Sbjct: 370 YGALIDGVCKVGEMGAAEILMNEMQSKGVNITQVVFNTLIDGYCRKGMVDEASMIYDVME 429

Query: 365 -KDFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGL 424
            K F+  V  C    I + +   + Y+ A +   +++E G +   V+Y + I+ YC+ G 
Sbjct: 430 QKGFQADVFTC--NTIASCFNRLKRYDEAKQWLFRMMEGGVKLSTVSYTNLIDVYCKEGN 489

Query: 425 YSKAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNI 484
             +A+ +F EM  KG     + Y+ +I  Y K G++K+A +L A M+  G  P+ + Y  
Sbjct: 490 VEEAKRLFVEMSSKGVQPNAITYNVMIYAYCKQGKIKEARKLRANMEANGMDPDSYTYTS 549

Query: 485 LMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNG 544
           L+     A N+ +  +L+ EM  K +  + V+YT +IS   KA + +     Y E +  G
Sbjct: 550 LIHGECIADNVDEAMRLFSEMGLKGLDQNSVTYTVMISGLSKAGKSDEAFGLYDEMKRKG 609

Query: 545 GAIDKAMAGIMVG 552
             ID  +   ++G
Sbjct: 610 YTIDNKVYTALIG 617

BLAST of CaUC01G008010 vs. ExPASy Swiss-Prot
Match: Q0WMY5 (Pentatricopeptide repeat-containing protein At5g04810, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PPR4 PE=1 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 7.8e-32
Identity = 93/412 (22.57%), Positives = 184/412 (44.66%), Query Frame = 0

Query: 180 EVFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLG 239
           E+ E   D   A +   M GY  +   K  ++VF+RLK          Y  ++  Y K+G
Sbjct: 439 EMEEEGIDAPIAIYHTMMDGYTMVADEKKGLVVFKRLKECGFTPTVVTYGCLINLYTKVG 498

Query: 240 DFERVVELFDEVESRI--SDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAED 299
              + +E+     SR+   +    + K Y ++     K      +   F DM K+G+  D
Sbjct: 499 KISKALEV-----SRVMKEEGVKHNLKTYSMMINGFVKLKDWANAFAVFEDMVKEGMKPD 558

Query: 300 YTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVE 359
             +Y+ +I  F  +  +  A     E +  +       F+ +I  Y + G + ++LE+ +
Sbjct: 559 VILYNNIISAFCGMGNMDRAIQTVKEMQKLRHRPTTRTFMPIIHGYAKSGDMRRSLEVFD 618

Query: 360 VMKDFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVG 419
           +M+      +   F  ++NG   +R    AV++ +++   G    + TY   +  Y  VG
Sbjct: 619 MMRRCGCVPTVHTFNGLINGLVEKRQMEKAVEILDEMTLAGVSANEHTYTKIMQGYASVG 678

Query: 420 LYSKAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYN 479
              KA + F  ++ +G D  +  Y +L+    K+GR++ A+ +  +M  R    N ++YN
Sbjct: 679 DTGKAFEYFTRLQNEGLDVDIFTYEALLKACCKSGRMQSALAVTKEMSARNIPRNSFVYN 738

Query: 480 ILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMN 539
           IL++   +  ++ +   L ++MK++ + PD  +YTS ISA  KA +     +   E    
Sbjct: 739 ILIDGWARRGDVWEAADLIQQMKKEGVKPDIHTYTSFISACSKAGDMNRATQTIEEMEAL 798

Query: 540 GGAIDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALI 590
           G   +      ++  +++ S  ++ +    +M   G + D  +Y   L +L+
Sbjct: 799 GVKPNIKTYTTLIKGWARASLPEKALSCYEEMKAMGIKPDKAVYHCLLTSLL 845

BLAST of CaUC01G008010 vs. ExPASy TrEMBL
Match: A0A6J1FDV4 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111444845 PE=4 SV=1)

HSP 1 Score: 1104.0 bits (2854), Expect = 0.0e+00
Identity = 554/611 (90.67%), Positives = 577/611 (94.44%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MAVTGSPDWSLP ST FRKS L+TFIP SNLALLFSLP  NLRSLHLNSSGCPSPILE S
Sbjct: 1   MAVTGSPDWSLPSSTCFRKSRLLTFIPASNLALLFSLP--NLRSLHLNSSGCPSPILESS 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
             +LP+I  DSNLQDFQLPS  +VEDLNDFLCGL QNPG EDLIYEYYVKAKE  GFRPE
Sbjct: 61  PTSLPEIDSDSNLQDFQLPSSSSVEDLNDFLCGLPQNPGREDLIYEYYVKAKETPGFRPE 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVR K W+LILLVSRDFVDY V PDRDTCSRLVSSCVRGRKFKVV+ALLE
Sbjct: 121 KSTLRHLIRYLVRSKNWNLILLVSRDFVDYDVCPDRDTCSRLVSSCVRGRKFKVVRALLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERD DVATAAFEAAMRGYNKLHMYKSTILVFQRLKSA+IEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDRDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSAKIEADSGCYCRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ER++ELF+E+ESRISDF PFSTKIYGILC+SLAKSGRVFESLEFFRDM+KKGI EDYTI
Sbjct: 241 SERIMELFNEIESRISDFTPFSTKIYGILCKSLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAK KKLLRDPA+F KLILMYIQ+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKTKKLLRDPAMFQKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+GVSDCIFCAIVNGYATRRGYNAAV VYEKLI D CEPGQVTYA AINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYNAAVNVYEKLIRDECEPGQVTYALAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM
Sbjct: 421 KAEDVFVEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EFETCERYYREFRMNGGA
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKAAEFETCERYYREFRMNGGA 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA+AGIMVGVFSKTSRVDELVKLLRDMNLEG RLD RLYRSALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAIAGIMVGVFSKTSRVDELVKLLRDMNLEGIRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 612
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 609

BLAST of CaUC01G008010 vs. ExPASy TrEMBL
Match: A0A6J1JWB4 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489451 PE=4 SV=1)

HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 553/611 (90.51%), Positives = 575/611 (94.11%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MAVTGSPDWSLP ST FRKS LITFIP SNLALLFSLP  NLRSLHLNSSGCPSPILE S
Sbjct: 1   MAVTGSPDWSLPSSTCFRKSRLITFIPASNLALLFSLP--NLRSLHLNSSGCPSPILESS 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
             +LP+I  DSNLQDFQLPS  +VEDLNDFLCGL QNPG EDLIYEYYVKAKE  GFRPE
Sbjct: 61  PTSLPEIDSDSNLQDFQLPSSSSVEDLNDFLCGLPQNPGREDLIYEYYVKAKETPGFRPE 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVRLKKW LILLVSRDFVDY V PDRDTCSRLVSSCVRGRKFKVV+ALLE
Sbjct: 121 KSTLRHLIRYLVRLKKWSLILLVSRDFVDYDVCPDRDTCSRLVSSCVRGRKFKVVRALLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERD DVATAAFEAAMRGYNKLHMY+STILVFQRLKSA+IEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDRDVATAAFEAAMRGYNKLHMYRSTILVFQRLKSAKIEADSGCYCRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ER++ELF+E+ESRISDF PFSTKIYGILC+SLAKSGRVFESLEFFRDM+KKGI EDYTI
Sbjct: 241 SERIMELFNEIESRISDFTPFSTKIYGILCKSLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAK KKLLRDPA F KLILMYIQ+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKTKKLLRDPATFQKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+G SDCIFCAIVNGYATRRGYNAAV +YEKLI D CEPGQVTYA AINAYCRVGLYS
Sbjct: 361 DFKIGASDCIFCAIVNGYATRRGYNAAVNIYEKLIRDECEPGQVTYALAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM
Sbjct: 421 KAEDVFVEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EFETCERYYREFRMNGG 
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKAAEFETCERYYREFRMNGGT 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA+AGIMVGVFSKTSRVDELVKLLRDMNLEG RLD RLYRSALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAIAGIMVGVFSKTSRVDELVKLLRDMNLEGIRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 612
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 609

BLAST of CaUC01G008010 vs. ExPASy TrEMBL
Match: A0A0A0KPV8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G114580 PE=4 SV=1)

HSP 1 Score: 1101.7 bits (2848), Expect = 0.0e+00
Identity = 552/611 (90.34%), Positives = 579/611 (94.76%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MA++ SPD S PPS  FRKSH   FI TSN +LLFSLP SNL SLHLNSSGCPSPILEQ 
Sbjct: 1   MALSPSPDCSFPPSNSFRKSH---FISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQP 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
           SIALPDIH +SNL DFQLPSLPNV+DLNDFLCGLSQNPG+EDLIY+YYVKAKE AGFRP+
Sbjct: 61  SIALPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQ 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVRLKKWDLILLVSRDFVD+GV PDRDTCS+LVSSCVRGRKFKVVK+LLE
Sbjct: 121 KSTLRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERDS VA  AFEAAMRGYNKLHM+KSTI+VFQRLKSARIEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ERV+ELF+EVESRIS   PFSTKIYGILCESLAKSGRVFESLEFFRDM+KKGIAEDYTI
Sbjct: 241 SERVMELFNEVESRISVSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPA+FLKLILMY+Q+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+GVSDCIFCAIVNGYATRRGY AAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILM
Sbjct: 421 KAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFE CE+YYREFRMNGG 
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGT 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA  GIMVGVFSKTSRVDELVKLLRDM LEGTRLD RLYR+ALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 612
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 608

BLAST of CaUC01G008010 vs. ExPASy TrEMBL
Match: A0A1S3CPF0 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103503127 PE=4 SV=1)

HSP 1 Score: 1099.7 bits (2843), Expect = 0.0e+00
Identity = 554/611 (90.67%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MAV+ SPD S PPS  FRKSH   FIPTSN  LLFSLP SNL SLHLNSSG PSPILEQ 
Sbjct: 1   MAVSPSPDCSFPPSNSFRKSH---FIPTSNFPLLFSLPTSNLPSLHLNSSGFPSPILEQP 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
           SIALPDIH +SNL DFQLP L NVEDLNDFLCGLSQNPG+EDLIY+YYVKAKERAGFRPE
Sbjct: 61  SIALPDIHSNSNLHDFQLPPLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPE 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVRLKKWDLI LVSRDFVD+GV PDRDTCS+LVSSCVRGRKFKVVKALLE
Sbjct: 121 KSTLRHLIRYLVRLKKWDLIFLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKALLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERDSDVA  AFEAAMRGYNKLHMYKSTI+VFQRLKSARIEADSGCY RVMEAYLKLGD
Sbjct: 181 VFERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYFRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ERV+ELF+EVESRIS+  PFSTKIYGILCESLAKSGRVFESLEFFRDM+KKGIAEDYTI
Sbjct: 241 SERVMELFNEVESRISNLTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASI+EVKLAEDLYNEAKAKKLLRDPA+FLKLILMYIQ+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIREVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+GVSDCIFCAIVNGYATRRGY+AAVKVYEKLI DGCEPGQVTYASAINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIGDGCEPGQVTYASAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILM
Sbjct: 421 KAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKASEFE CE+YYREFRMNGG 
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKASEFEKCEQYYREFRMNGGT 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA+ GIMVGVFSKTSRVDELVKLLRDM LEGTRLD RLYRSALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAIGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 612
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 608

BLAST of CaUC01G008010 vs. ExPASy TrEMBL
Match: A0A5A7VR01 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold174G00670 PE=4 SV=1)

HSP 1 Score: 1095.1 bits (2831), Expect = 0.0e+00
Identity = 552/611 (90.34%), Positives = 576/611 (94.27%), Query Frame = 0

Query: 1   MAVTGSPDWSLPPSTFFRKSHLITFIPTSNLALLFSLPASNLRSLHLNSSGCPSPILEQS 60
           MAV+ SPD S PPS  FRKSH   FIPTSN  LLFSL  SNL SLHLNSSG PSPILEQ 
Sbjct: 1   MAVSPSPDCSFPPSNSFRKSH---FIPTSNFPLLFSLSTSNLPSLHLNSSGFPSPILEQP 60

Query: 61  SIALPDIHLDSNLQDFQLPSLPNVEDLNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPE 120
           SIALPDIH +SNL DFQLP L NVEDLNDFLCGLSQNPG+EDLIY+YYVKAKERAGFRPE
Sbjct: 61  SIALPDIHSNSNLHDFQLPPLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPE 120

Query: 121 KSTLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLE 180
           KSTLRHL+RYLVRLKKWDLI LVSRDFVD+GV PDRDTCS+LVSSCVRGRKFKVVKALLE
Sbjct: 121 KSTLRHLIRYLVRLKKWDLIFLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKALLE 180

Query: 181 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFERDSDVA   FEAAMRGYNKLHMYKSTI+VFQRLKSARIEADSGCY RVMEAYLKLGD
Sbjct: 181 VFERDSDVALTTFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYFRVMEAYLKLGD 240

Query: 241 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 300
            ERV+ELF+EVESRIS+  PFSTKIYGILCESLAKSGRVFESLEFFRDM+KKGIAEDYTI
Sbjct: 241 SERVMELFNEVESRISNLTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMK 360
           YSALICTFASI+EVKLAEDLYNEAKAKKLLRDPA+FLKLILMYIQ+GSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIREVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFK+GVSDCIFCAIVNGYATRRGY+AAVKVYEKLI DGCEPGQVTYASAINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIGDGCEPGQVTYASAINAYCRVGLYS 420

Query: 421 KAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILM
Sbjct: 421 KAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNGGA 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKASEFE CE+YYREFRMNGG 
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKASEFEKCEQYYREFRMNGGT 540

Query: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWL 600
           IDKA+ GIMVGVFSKTSRVDELVKLLRDM LEGTRLD RLYRSALNAL+DAGLQVQAKWL
Sbjct: 541 IDKAIGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGYV 612
           QDHYAGKSG+V
Sbjct: 601 QDHYAGKSGFV 608

BLAST of CaUC01G008010 vs. TAIR 10
Match: AT5G13770.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 590.5 bits (1521), Expect = 1.5e-168
Identity = 308/580 (53.10%), Positives = 402/580 (69.31%), Query Frame = 0

Query: 27  PTSNLALLFSLPASNLRSLHLNSSGCPSPILEQSSIALPDIHLDSNLQDFQLPSLPNVED 86
           PT  +  L   P     + H+ SS C S +LE+     P     S  +D      P   D
Sbjct: 24  PTKPIFFLSQKP----HNFHVCSSRC-SMVLEEDEKKSP-----SPKEDKWPFFEPGPND 83

Query: 87  LNDFLCGLSQNPGSEDLIYEYYVKAKERAGFRPEKSTLRHLVRYLVRLKKWDLILLVSRD 146
           LN  L    ++P +  L  E+Y KAKE +  R    T +HL+ YLV  K WDL++ V  D
Sbjct: 84  LNRVLSRFLRDPETRKLSSEFYEKAKENSELR----TTKHLISYLVSSKSWDLLVSVCED 143

Query: 147 FVDYGVSPDRDTCSRLVSSCVRGRKFKVVKALLEVFERDSDVATAAFEAAMRGYNKLHMY 206
             ++   PD  TCS L+ SC+R RKF++   LL VF  D  +A +A +AAM+G+NKL MY
Sbjct: 144 LREHKALPDGQTCSNLIRSCIRDRKFRITHCLLSVFRSDKSLAVSASDAAMKGFNKLQMY 203

Query: 207 KSTILVFQRLK-SARIEADSGCYCRVMEAYLKLGDFERVVELFDEVES-RISDFAPFSTK 266
            STI VF RLK S  +E   GCYCR+MEA+ K+G+  +VVELF E +S R+S  A  S  
Sbjct: 204 SSTIQVFDRLKQSVGVEPSPGCYCRIMEAHEKIGENHKVVELFQEFKSQRLSFLAKESGS 263

Query: 267 IYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEA 326
           IY I+C SLAKSGR FE+LE   +MK KGI E   +YS LI  FA  +EV + E L+ EA
Sbjct: 264 IYTIVCSSLAKSGRAFEALEVLEEMKDKGIPESSELYSMLIRAFAEAREVVITEKLFKEA 323

Query: 327 KAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMKDFKVGVSDCIFCAIVNGYATRRGY 386
             KKLL+DP + LK++LMY++ G++E  LE+V  M+  ++ V+DCI CAIVNG++ +RG+
Sbjct: 324 GGKKLLKDPEMCLKVVLMYVREGNMETTLEVVAAMRKAELKVTDCILCAIVNGFSKQRGF 383

Query: 387 NAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDMFREMEEKGFDKCVVAYSSL 446
             AVKVYE  +++ CE GQVTYA AINAYCR+  Y+KAE +F EM +KGFDKCVVAYS++
Sbjct: 384 AEAVKVYEWAMKEECEAGQVTYAIAINAYCRLEKYNKAEMLFDEMVKKGFDKCVVAYSNI 443

Query: 447 ISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKI 506
           + MYGKT RL DA+RL+AKMK+RGC+PN+WIYN L++MHG+A +L++ EK+WKEMKR K+
Sbjct: 444 MDMYGKTRRLSDAVRLMAKMKQRGCKPNIWIYNSLIDMHGRAMDLRRAEKIWKEMKRAKV 503

Query: 507 APDKVSYTSIISAYVKASEFETCERYYREFRMNGGAIDKAMAGIMVGVFSKTSRVDELVK 566
            PDKVSYTS+ISAY ++ E E C   Y+EFRMN G ID+AMAGIMVGVFSKTSR+DEL++
Sbjct: 504 LPDKVSYTSMISAYNRSKELERCVELYQEFRMNRGKIDRAMAGIMVGVFSKTSRIDELMR 563

Query: 567 LLRDMNLEGTRLDGRLYRSALNALIDAGLQVQAKWLQDHY 605
           LL+DM +EGTRLD RLY SALNAL DAGL  Q +WLQ+ +
Sbjct: 564 LLQDMKVEGTRLDARLYSSALNALRDAGLNSQIRWLQESF 589

BLAST of CaUC01G008010 vs. TAIR 10
Match: AT2G35130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 147.1 bits (370), Expect = 4.5e-35
Identity = 99/415 (23.86%), Positives = 190/415 (45.78%), Query Frame = 0

Query: 193 FEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGDFERVVELFDEVE 252
           F   +  Y +   YK    ++ +L  +R       Y  +++AY   G  ER   +  E++
Sbjct: 158 FNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMAGLIERAEVVLVEMQ 217

Query: 253 SRISDFAPFSTKIYGILCESLAK-SGRVFESLEFFRDMKKKGIAEDYTIYSALICTFASI 312
           +           +Y    E L K  G   E+++ F+ MK+         Y+ +I  +   
Sbjct: 218 NHHVSPKTIGVTVYNAYIEGLMKRKGNTEEAIDVFQRMKRDRCKPTTETYNLMINLYGKA 277

Query: 313 QEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMKDFKVGVSDCIF 372
            +  ++  LY E ++ +   +   +  L+  + + G  EKA EI E +++  +     ++
Sbjct: 278 SKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKAEEIFEQLQEDGLEPDVYVY 337

Query: 373 CAIVNGYATRRGY-NAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDMF---- 432
            A++  Y +R GY   A +++  +   GCEP + +Y   ++AY R GL+S AE +F    
Sbjct: 338 NALMESY-SRAGYPYGAAEIFSLMQHMGCEPDRASYNIMVDAYGRAGLHSDAEAVFEEMK 397

Query: 433 -------------------------------REMEEKGFDKCVVAYSSLISMYGKTGRLK 492
                                          +EM E G +      +S++++YG+ G+  
Sbjct: 398 RLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVEPDTFVLNSMLNLYGRLGQFT 457

Query: 493 DAMRLLAKMKERGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII 552
              ++LA+M+   C  ++  YNIL+ ++GKA  L+++E+L+ E+K K   PD V++TS I
Sbjct: 458 KMEKILAEMENGPCTADISTYNILINIYGKAGFLERIEELFVELKEKNFRPDVVTWTSRI 517

Query: 553 SAYVKASEFETCERYYREFRMNGGAIDKAMAGIMVGVFSKTSRVDELVKLLRDMN 571
            AY +   +  C   + E   +G A D   A +++   S   +V+++  +LR M+
Sbjct: 518 GAYSRKKLYVKCLEVFEEMIDSGCAPDGGTAKVLLSACSSEEQVEQVTSVLRTMH 571

BLAST of CaUC01G008010 vs. TAIR 10
Match: AT2G35130.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 147.1 bits (370), Expect = 4.5e-35
Identity = 99/415 (23.86%), Positives = 190/415 (45.78%), Query Frame = 0

Query: 193 FEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGDFERVVELFDEVE 252
           F   +  Y +   YK    ++ +L  +R       Y  +++AY   G  ER   +  E++
Sbjct: 180 FNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMAGLIERAEVVLVEMQ 239

Query: 253 SRISDFAPFSTKIYGILCESLAK-SGRVFESLEFFRDMKKKGIAEDYTIYSALICTFASI 312
           +           +Y    E L K  G   E+++ F+ MK+         Y+ +I  +   
Sbjct: 240 NHHVSPKTIGVTVYNAYIEGLMKRKGNTEEAIDVFQRMKRDRCKPTTETYNLMINLYGKA 299

Query: 313 QEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVMKDFKVGVSDCIF 372
            +  ++  LY E ++ +   +   +  L+  + + G  EKA EI E +++  +     ++
Sbjct: 300 SKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKAEEIFEQLQEDGLEPDVYVY 359

Query: 373 CAIVNGYATRRGY-NAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDMF---- 432
            A++  Y +R GY   A +++  +   GCEP + +Y   ++AY R GL+S AE +F    
Sbjct: 360 NALMESY-SRAGYPYGAAEIFSLMQHMGCEPDRASYNIMVDAYGRAGLHSDAEAVFEEMK 419

Query: 433 -------------------------------REMEEKGFDKCVVAYSSLISMYGKTGRLK 492
                                          +EM E G +      +S++++YG+ G+  
Sbjct: 420 RLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVEPDTFVLNSMLNLYGRLGQFT 479

Query: 493 DAMRLLAKMKERGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII 552
              ++LA+M+   C  ++  YNIL+ ++GKA  L+++E+L+ E+K K   PD V++TS I
Sbjct: 480 KMEKILAEMENGPCTADISTYNILINIYGKAGFLERIEELFVELKEKNFRPDVVTWTSRI 539

Query: 553 SAYVKASEFETCERYYREFRMNGGAIDKAMAGIMVGVFSKTSRVDELVKLLRDMN 571
            AY +   +  C   + E   +G A D   A +++   S   +V+++  +LR M+
Sbjct: 540 GAYSRKKLYVKCLEVFEEMIDSGCAPDGGTAKVLLSACSSEEQVEQVTSVLRTMH 593

BLAST of CaUC01G008010 vs. TAIR 10
Match: AT3G16010.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 146.0 bits (367), Expect = 1.0e-34
Identity = 109/470 (23.19%), Positives = 211/470 (44.89%), Query Frame = 0

Query: 122 STLRHLVRYLVRLKKWDLILLVSRDFVDYGVSPDRDTCSRLVSSCVR-GRKFKVVKALLE 181
           + L  LV+ L R K     L V          P   T + ++   ++ G+  KV +   E
Sbjct: 163 AVLSELVKALGRAKMVSKALSVFYQAKGRKCKPTSSTYNSVILMLMQEGQHEKVHEVYTE 222

Query: 182 VF-ERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLG 241
           +  E D    T  + A +  Y KL    S I +F  +K   ++     Y  ++  Y K+G
Sbjct: 223 MCNEGDCFPDTITYSALISSYEKLGRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVG 282

Query: 242 DFERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYT 301
             E+ ++LF+E++   +  +P +   Y  L + L K+GRV E+  F++DM + G+  D  
Sbjct: 283 KVEKALDLFEEMKR--AGCSP-TVYTYTELIKGLGKAGRVDEAYGFYKDMLRDGLTPDVV 342

Query: 302 IYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLI-LMYIQRGSLEKALEIVEV 361
             + L+     +  V+   ++++E    +       +  +I  ++  +  + +     + 
Sbjct: 343 FLNNLMNILGKVGRVEELTNVFSEMGMWRCTPTVVSYNTVIKALFESKAHVSEVSSWFDK 402

Query: 362 MKDFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGL 421
           MK   V  S+  +  +++GY        A+ + E++ E G  P    Y S INA  +   
Sbjct: 403 MKADSVSPSEFTYSILIDGYCKTNRVEKALLLLEEMDEKGFPPCPAAYCSLINALGKAKR 462

Query: 422 YSKAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNI 481
           Y  A ++F+E++E   +     Y+ +I  +GK G+L +A+ L  +MK +G  P+V+ YN 
Sbjct: 463 YEAANELFKELKENFGNVSSRVYAVMIKHFGKCGKLSEAVDLFNEMKNQGSGPDVYAYNA 522

Query: 482 LMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNG 541
           LM    KA  + +   L ++M+      D  S+  I++ + +          +   + +G
Sbjct: 523 LMSGMVKAGMINEANSLLRKMEENGCRADINSHNIILNGFARTGVPRRAIEMFETIKHSG 582

Query: 542 GAIDKAMAGIMVGVFSKTSRVDELVKLLRDMNLEGTRLDGRLYRSALNAL 589
              D      ++G F+     +E  +++R+M  +G   D   Y S L+A+
Sbjct: 583 IKPDGVTYNTLLGCFAHAGMFEEAARMMREMKDKGFEYDAITYSSILDAV 629

BLAST of CaUC01G008010 vs. TAIR 10
Match: AT2G32630.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 144.8 bits (364), Expect = 2.2e-34
Identity = 115/433 (26.56%), Positives = 197/433 (45.50%), Query Frame = 0

Query: 125 RHLVRYLVRLKK---WDLILLVSRDFVDYGVSPDRDTCSRLVSS-CVRGRKFKVVKALLE 184
           R  + +LV  KK    DL L + R  VD GV     + + +V   C RG   K  K + E
Sbjct: 190 RSCIVFLVAAKKRRRIDLCLEIFRRMVDSGVKITVYSLTIVVEGLCRRGEVEKSKKLIKE 249

Query: 185 VFERDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCYCRVMEAYLKLGD 244
              +        +   +  Y K   +     V + +K   +  +   Y  +ME  +K G 
Sbjct: 250 FSVKGIKPEAYTYNTIINAYVKQRDFSGVEGVLKVMKKDGVVYNKVTYTLLMELSVKNGK 309

Query: 245 FERVVELFDEVESRISDFAPFSTKIYGILCESLAKSGRVFESLEFFRDMKKKGIAEDYTI 304
                +LFDE+  R  +       +Y  L     + G +  +   F ++ +KG++     
Sbjct: 310 MSDAEKLFDEMRERGIE---SDVHVYTSLISWNCRKGNMKRAFLLFDELTEKGLSPSSYT 369

Query: 305 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAVFLKLILMYIQRGSLEKALEIVEVM- 364
           Y ALI     + E+  AE L NE ++K +     VF  LI  Y ++G +++A  I +VM 
Sbjct: 370 YGALIDGVCKVGEMGAAEILMNEMQSKGVNITQVVFNTLIDGYCRKGMVDEASMIYDVME 429

Query: 365 -KDFKVGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGL 424
            K F+  V  C    I + +   + Y+ A +   +++E G +   V+Y + I+ YC+ G 
Sbjct: 430 QKGFQADVFTC--NTIASCFNRLKRYDEAKQWLFRMMEGGVKLSTVSYTNLIDVYCKEGN 489

Query: 425 YSKAEDMFREMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNI 484
             +A+ +F EM  KG     + Y+ +I  Y K G++K+A +L A M+  G  P+ + Y  
Sbjct: 490 VEEAKRLFVEMSSKGVQPNAITYNVMIYAYCKQGKIKEARKLRANMEANGMDPDSYTYTS 549

Query: 485 LMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFETCERYYREFRMNG 544
           L+     A N+ +  +L+ EM  K +  + V+YT +IS   KA + +     Y E +  G
Sbjct: 550 LIHGECIADNVDEAMRLFSEMGLKGLDQNSVTYTVMISGLSKAGKSDEAFGLYDEMKRKG 609

Query: 545 GAIDKAMAGIMVG 552
             ID  +   ++G
Sbjct: 610 YTIDNKVYTALIG 617
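
The percentages in each HSP summary line follow directly from the reported counts (identical or positive residues divided by the aligned length). The Python sketch below is an illustration, not part of the database output: it parses one summary line and recomputes the percentages as a sanity check. The names `HSP_LINE`, `parse_hsp_summary`, and `recomputed_pct` are hypothetical, and the pattern assumes the summary-line layout shown in the alignments above.

```python
import re

# Matches an HSP summary line of the form
# "Identity = 554/611 (90.67%), Positives = 577/611 (94.44%), Query Frame = 0".
# "Posi?tives" tolerates a misspelled label.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and reported percentages from one HSP summary line."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    line = "Identity = 308/580 (53.10%), Positives = 402/580 (69.31%), Query Frame = 0"
    hsp = parse_hsp_summary(line)
    print(hsp["identity_pct"], recomputed_pct(hsp["identical"], hsp["aligned_length"]))
    print(hsp["positives_pct"], recomputed_pct(hsp["positives"], hsp["aligned_length"]))
```

If a recomputed value differs from the reported one by more than rounding error, the summary line or the parse is worth a second look.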

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038874313.1 | 0.0e+00 | 94.27 | pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 ...
XP_022938691.1 | 0.0e+00 | 90.67 | pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucurbita ...
XP_022993436.1 | 0.0e+00 | 90.51 | pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 ...
XP_004151188.1 | 0.0e+00 | 90.34 | pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucumis sa...
KAG6579160.1 | 0.0e+00 | 90.34 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name | E-value | Identity | Description
Q66GP4 | 2.2e-167 | 53.10 | Pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Arabidop...
O82178 | 6.4e-34 | 23.86 | Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana OX...
Q9LW84 | 1.4e-33 | 23.19 | Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX...
Q8S8P6 | 3.2e-33 | 26.56 | Pentatricopeptide repeat-containing protein At2g32630 OS=Arabidopsis thaliana OX...
Q0WMY5 | 7.8e-32 | 22.57 | Pentatricopeptide repeat-containing protein At5g04810, chloroplastic OS=Arabidop...

Match Name | E-value | Identity | Description
A0A6J1FDV4 | 0.0e+00 | 90.67 | pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Cucurbit...
A0A6J1JWB4 | 0.0e+00 | 90.51 | pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 ...
A0A0A0KPV8 | 0.0e+00 | 90.34 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G114580 PE=4 SV=1
A0A1S3CPF0 | 0.0e+00 | 90.67 | pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Cucumis ...
A0A5A7VR01 | 0.0e+00 | 90.34 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity | Description
AT5G13770.1 | 1.5e-168 | 53.10 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G35130.1 | 4.5e-35 | 23.86 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G35130.2 | 4.5e-35 | 23.86 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G16010.1 | 1.0e-34 | 23.19 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G32630.1 | 2.2e-34 | 26.56 | Pentatricopeptide repeat (PPR-like) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 361..413, e-value: 0.0019, score: 18.3; coord: 425..485, e-value: 6.0E-14, score: 51.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 509..538, e-value: 4.9E-5, score: 21.2; coord: 475..507, e-value: 2.1E-7, score: 28.7; coord: 439..473, e-value: 1.4E-9, score: 35.5; coord: 404..434, e-value: 1.7E-8, score: 32.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 271..294, e-value: 0.0045, score: 17.1; coord: 228..253, e-value: 0.04, score: 14.2; coord: 509..538, e-value: 9.4E-4, score: 19.3
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 472..506, score: 10.961357
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 402..436, score: 11.783455
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 437..471, score: 12.868624
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 262..296, score: 9.733692
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 507..541, score: 8.70333
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 367..401, score: 8.812943
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 433..501, e-value: 4.7E-20, score: 73.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 502..603, e-value: 6.0E-14, score: 53.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 77..309, e-value: 3.9E-18, score: 67.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 310..432, e-value: 2.3E-22, score: 81.8
None | No IPR available | PANTHER | PTHR47934:SF14 | OSJNBA0088A01.11 PROTEIN | coord: 1..605
None | No IPR available | PANTHER | PTHR47934 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN PET309, MITOCHONDRIAL | coord: 1..605
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 234..478

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CaUC01G008010.1 | CaUC01G008010.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009658 | chloroplast organization
molecular_function | GO:0003729 | mRNA binding
molecular_function | GO:0005515 | protein binding