Cla97C01G007680 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C01G007680
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr01: 7814509 .. 7817605 (+)
RNA-Seq Expression: Cla97C01G007680
Synteny: Cla97C01G007680
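
The location field gives a chromosome, a start, an end and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention; the page itself does not say so), the genomic span can be derived as follows. The function names parse_location and span_length are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(text):
    """Split a string such as 'Cla97Chr01: 7814509 .. 7817605 (+)' into parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: both endpoints are included in the feature.
    return end - start + 1


chrom, start, end, strand = parse_location("Cla97Chr01: 7814509 .. 7817605 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 3097 bases on the plus strand.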
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGACGAAGCCATGGTTTTCGCTACAAGGAAGATAAATTGGAGAATGAGCACCCACAATTCATTTTCGATCCGGCCACCAGTGGAGGCGGTGAGGCTTCTCTTTCTCACTTCTTCTACTAGTTCCCCATGCCAGTTTCAGTTTGATTTTTAACTTTATTTGCTTTCAGTTCTGCGATTTCTGGTGTTTTAATGGGACTTCTCTCAACTCTGATCACTCCTCATTGTCTTCTTCATCTCTTATATGCTGTTTTTTTTCCTTATTTCTTTGAGTAATGTTGATTTTTCTCCTTTTTCTTACTGGTATATTATAAGTACAGAAGCGAGGATAACTGAGTTTTCCTGTTATTTTTTGCGCATGTTTCGTATTCAATTGGGTTCACTGCAGTTGGAACACACAAAAAAACCCTTGTTTCATTCGGCTTCCACTTGTAGAACCCTAATAAAAGTTAGGTTTTCTTTTTGCTCTGCACTTATGATTGGAGCTCCCTGCTATTCGTTTGTCCATGTTAACAGAAATGGAAGGAAAAAAATTTCAAAATCTTGTATGTTTATATCTCATTGGCATGAATTTGGTGCTATCCCGTATCAAAATGAGTGAAAATTCTAAAACGTTTTAAGGGTAAATTGAAACTAACCCACACATTAGAGCCATTTTCGTGTGTGTGAGCCTTTTAATTTGTACCAATTGGAAACTTACCCATCCGGCCATCACTGTCTTGTCACCCCTCTTGTGTAAGCTTTCTTTACTATTATTCAAGGCTACTGTTTCTAAAACTACAAGAGAAGAGGAGAAAACAAAAGGATGGCTTGCTGCCTGCTGTTTTTTCCCAAGATTTTTCTGACTTGATTTGCTCTATCCTTTTCATTTATTTGAATGTTTCCCCTATTATTCATGTTTGATTTGGTGAGCTGTGTTTTTTAATTGCTTTCAGATTCCAAATAATAAAGGCTTATCTATAATGGAAAGTGTCCTCAGTTTTTAAGAAATTTTGCTCCAGTTATTCTTGTTTGATGCCATTGCAATTCTTTCTTTTTTTAACATGTATTCACATGTGGAAATCTGTGTGAGCATGCTCATATCTAAATATTTCTGGTATGTTGTGATTGTAGGTGATAATGCCAAAATTAAGAATTGCACGGTGGATTTTTCATGACATCGTTGATAGATGACCTTTGTAATTTTCACACACTGGCTTATATGAATCAAATGAGCAAAAAGACCATCCTCTCAAAAATCATTGGTAAGAAGCACCACTTATTCTCTTCTCCGTTTTTCAGTTTACCCTCCAGACTTCTAGTTTTTCAAATACACTATGATGTAGCCACCTGCAATTTTTTTCTCCAATCATATGCCAACCACAAGAATCTCACTGAAGGAAAACAGCTACATTCCTTAATGATCACTTCTGGATTTATTCATTTGCCTTCATCCATTACTAGCTTGATCAACATGTACTCTAAATGCAATCAAATGGAACAGGCTGTTTTAGTTTTCCATGATCCATATCATGAGCGTAATGTGTTTGCATACAATGCAATAATTGCTGGATTTGTTGCAAATGGGCTTGCAGCACACGGGTTTCAGTTTTATAAGAGAATGAGGTCAGTGGGTATAATGCCTGATAAATTCACTTTTCCATGTGTAGTTAGAGCTTGTTGTGAGTTCATGGAGGTTAGGAAAATTCATGGTTGTTTATTTAAAATGGGATTGGAGTTGGATGTGTTTGTTGGTAGTGCTCTTGTCAATACTTACTTGAAGGTTGATGTAGTGGAGGACGCGGAGAAAGTTTTTGAAGAGTTACCAGAGAGAGATGTTGTGCTCTGGAATGCAATGATCAATGGTTACACCCAAATTGGTCGACTCAACAAAGCGGTAGTGTTTTTTAAGAAAATGGGTGAAGAAGGGATTTCACCTTGTAGATTTACAATGACTGGCATTTTGTCTATTTTTACTCTAATGGGAGATATTAACAATGGGAGAGCTATTCATGGAATTGTAA
CAAAAATGGGTTATAGTTCATGTGTTGCAGTTTCAAATGCACTGATTGATATGTATGGGAAATGCAAGCATATTGAAGATGCTTTGATGATTTTTGATATGATAAATGGGAAGGATTTATTTTCATGGAATTCAATTATATCTGCTCATGAGCAATGTGGTGATCATGATGGTACCTTGAGACTTTTTGGCAAGATGTTAGGTTCTAGGGTTCTACCTGATGTGGTTACCATCACTGCTGTACTTCCAGCTTGTTCTCACTTGGCTGCTCTCATGCATGGCAGAGAAATTCATGGACATATGATTGTTAATGGATTGGGAAAAAATGAAAACGGTGATGATGTATTATTAAACAATGCTGTTATGGACATGTATGCGAAGTGCGGATGCATGAAAAATGCCGACATAGTATTTGATCTAATGAGCAATAAGGATGTCGCATCTTGGAACATCATGATTATGGGTTATGCAATTCATGGATATGGTAGAGAGGCATTGGATATGTTTCATCATATGTGTGAGGCCCAAATTAAACCAGATGCTATTACATTTGTTGGAGTTTTATCTGCTTGTAGTCATGCAGGCTTTGTACATCAAGGGCGCTCGTTTTTAACTCGAATGGAACTTGAATTTGGCGTGGTTCCAACTATTGAGCATTATACTTGTATAATCGATATGCTCGGTCGAGCTGGACGTATAGAGGAAGCTTATGAGCTGGCTCAAAGGATACCTTTTCAAGACAACCTTGTTTTGTGGATGGCATTATTGGGAGCGTGTCGACTTCATGGGAATGCAGAGTTGGGAAAAGTCGTTGGAGAAAAGATAACGCGACTTGAACCTAAGCATTGTGGTAGTGGTAGTTATATATTGATGTCTAACATGTATGGAGTCATAGGTCGATATGAAGAAGCATTGGAGGTTAGACGAACAATGAAGGAACAAAACATTAAGAAGACACCAGGTTGTAGCTGGATTGAACTCAAGGATGGGCTGTACGTTTTTAGCATGGGAGACAGGACACATCATGAATTAAATGCATTGATTAATTGCCTTTGTGGCATTGGATACTTTCATGATGAAGTTATGCATTCGTTTTAA

mRNA sequence

CGACGAAGCCATGGTTTTCGCTACAAGGAAGATAAATTGGAGAATGAGCACCCACAATTCATTTTCGATCCGGCCACCAGTGGAGGCGTTCTGCGATTTCTGGTGTTTTAATGGGACTTCTCTCAACTCTGATCACTCCTCATTTTTACCCTCCAGACTTCTAGTTTTTCAAATACACTATGATGTAGCCACCTGCAATTTTTTTCTCCAATCATATGCCAACCACAAGAATCTCACTGAAGGAAAACAGCTACATTCCTTAATGATCACTTCTGGATTTATTCATTTGCCTTCATCCATTACTAGCTTGATCAACATGTACTCTAAATGCAATCAAATGGAACAGGCTGTTTTAGTTTTCCATGATCCATATCATGAGCGTAATGTGTTTGCATACAATGCAATAATTGCTGGATTTGTTGCAAATGGGCTTGCAGCACACGGGTTTCAGTTTTATAAGAGAATGAGGTCAGTGGGTATAATGCCTGATAAATTCACTTTTCCATGTGTAGTTAGAGCTTGTTGTGAGTTCATGGAGGTTAGGAAAATTCATGGTTGTTTATTTAAAATGGGATTGGAGTTGGATGTGTTTGTTGGTAGTGCTCTTGTCAATACTTACTTGAAGGTTGATGTAGTGGAGGACGCGGAGAAAGTTTTTGAAGAGTTACCAGAGAGAGATGTTGTGCTCTGGAATGCAATGATCAATGGTTACACCCAAATTGGTCGACTCAACAAAGCGGTAGTGTTTTTTAAGAAAATGGGTGAAGAAGGGATTTCACCTTGTAGATTTACAATGACTGGCATTTTGTCTATTTTTACTCTAATGGGAGATATTAACAATGGGAGAGCTATTCATGGAATTGTAACAAAAATGGGTTATAGTTCATGTGTTGCAGTTTCAAATGCACTGATTGATATGTATGGGAAATGCAAGCATATTGAAGATGCTTTGATGATTTTTGATATGATAAATGGGAAGGATTTATTTTCATGGAATTCAATTATATCTGCTCATGAGCAATGTGGTGATCATGATGGTACCTTGAGACTTTTTGGCAAGATGTTAGGTTCTAGGGTTCTACCTGATGTGGTTACCATCACTGCTGTACTTCCAGCTTGTTCTCACTTGGCTGCTCTCATGCATGGCAGAGAAATTCATGGACATATGATTGTTAATGGATTGGGAAAAAATGAAAACGGTGATGATGTATTATTAAACAATGCTGTTATGGACATGTATGCGAAGTGCGGATGCATGAAAAATGCCGACATAGTATTTGATCTAATGAGCAATAAGGATGTCGCATCTTGGAACATCATGATTATGGGTTATGCAATTCATGGATATGGTAGAGAGGCATTGGATATGTTTCATCATATGTGTGAGGCCCAAATTAAACCAGATGCTATTACATTTGTTGGAGTTTTATCTGCTTGTAGTCATGCAGGCTTTGTACATCAAGGGCGCTCGTTTTTAACTCGAATGGAACTTGAATTTGGCGTGGTTCCAACTATTGAGCATTATACTTGTATAATCGATATGCTCGGTCGAGCTGGACGTATAGAGGAAGCTTATGAGCTGGCTCAAAGGATACCTTTTCAAGACAACCTTGTTTTGTGGATGGCATTATTGGGAGCGTGTCGACTTCATGGGAATGCAGAGTTGGGAAAAGTCGTTGGAGAAAAGATAACGCGACTTGAACCTAAGCATTGTGGTAGTGGTAGTTATATATTGATGTCTAACATGTATGGAGTCATAGGTCGATATGAAGAAGCATTGGAGGTTAGACGAACAATGAAGGAACAAAACATTAAGAAGACACCAGGTTGTAGCTGGATTGAACTCAAGGATGGGCTGTACGTTTTTAGCATGGGAGACAGGACACATCATGAATTAAATGCATTGATTAATTGCCTTTGTGGCATTGGATACTTTCATGATGAAGTTATGCATTCGTTTTAA

Coding sequence (CDS)

ATGGTTTTCGCTACAAGGAAGATAAATTGGAGAATGAGCACCCACAATTCATTTTCGATCCGGCCACCAGTGGAGGCGTTCTGCGATTTCTGGTGTTTTAATGGGACTTCTCTCAACTCTGATCACTCCTCATTTTTACCCTCCAGACTTCTAGTTTTTCAAATACACTATGATGTAGCCACCTGCAATTTTTTTCTCCAATCATATGCCAACCACAAGAATCTCACTGAAGGAAAACAGCTACATTCCTTAATGATCACTTCTGGATTTATTCATTTGCCTTCATCCATTACTAGCTTGATCAACATGTACTCTAAATGCAATCAAATGGAACAGGCTGTTTTAGTTTTCCATGATCCATATCATGAGCGTAATGTGTTTGCATACAATGCAATAATTGCTGGATTTGTTGCAAATGGGCTTGCAGCACACGGGTTTCAGTTTTATAAGAGAATGAGGTCAGTGGGTATAATGCCTGATAAATTCACTTTTCCATGTGTAGTTAGAGCTTGTTGTGAGTTCATGGAGGTTAGGAAAATTCATGGTTGTTTATTTAAAATGGGATTGGAGTTGGATGTGTTTGTTGGTAGTGCTCTTGTCAATACTTACTTGAAGGTTGATGTAGTGGAGGACGCGGAGAAAGTTTTTGAAGAGTTACCAGAGAGAGATGTTGTGCTCTGGAATGCAATGATCAATGGTTACACCCAAATTGGTCGACTCAACAAAGCGGTAGTGTTTTTTAAGAAAATGGGTGAAGAAGGGATTTCACCTTGTAGATTTACAATGACTGGCATTTTGTCTATTTTTACTCTAATGGGAGATATTAACAATGGGAGAGCTATTCATGGAATTGTAACAAAAATGGGTTATAGTTCATGTGTTGCAGTTTCAAATGCACTGATTGATATGTATGGGAAATGCAAGCATATTGAAGATGCTTTGATGATTTTTGATATGATAAATGGGAAGGATTTATTTTCATGGAATTCAATTATATCTGCTCATGAGCAATGTGGTGATCATGATGGTACCTTGAGACTTTTTGGCAAGATGTTAGGTTCTAGGGTTCTACCTGATGTGGTTACCATCACTGCTGTACTTCCAGCTTGTTCTCACTTGGCTGCTCTCATGCATGGCAGAGAAATTCATGGACATATGATTGTTAATGGATTGGGAAAAAATGAAAACGGTGATGATGTATTATTAAACAATGCTGTTATGGACATGTATGCGAAGTGCGGATGCATGAAAAATGCCGACATAGTATTTGATCTAATGAGCAATAAGGATGTCGCATCTTGGAACATCATGATTATGGGTTATGCAATTCATGGATATGGTAGAGAGGCATTGGATATGTTTCATCATATGTGTGAGGCCCAAATTAAACCAGATGCTATTACATTTGTTGGAGTTTTATCTGCTTGTAGTCATGCAGGCTTTGTACATCAAGGGCGCTCGTTTTTAACTCGAATGGAACTTGAATTTGGCGTGGTTCCAACTATTGAGCATTATACTTGTATAATCGATATGCTCGGTCGAGCTGGACGTATAGAGGAAGCTTATGAGCTGGCTCAAAGGATACCTTTTCAAGACAACCTTGTTTTGTGGATGGCATTATTGGGAGCGTGTCGACTTCATGGGAATGCAGAGTTGGGAAAAGTCGTTGGAGAAAAGATAACGCGACTTGAACCTAAGCATTGTGGTAGTGGTAGTTATATATTGATGTCTAACATGTATGGAGTCATAGGTCGATATGAAGAAGCATTGGAGGTTAGACGAACAATGAAGGAACAAAACATTAAGAAGACACCAGGTTGTAGCTGGATTGAACTCAAGGATGGGCTGTACGTTTTTAGCATGGGAGACAGGACACATCATGAATTAAATGCATTGATTAATTGCCTTTGTGGCATTGGATACTTTCATGATGAAGTTATGCATTCGTTTTAA
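
In this record the mRNA listing begins with a few bases that precede the start codon of the CDS listing. The sketch below locates that offset using only the first bases of each sequence as copied from the listings above (the full sequences are left out for brevity). The helper name five_prime_utr is illustrative.

```python
def five_prime_utr(mrna, cds):
    """Return the bases of `mrna` that precede the first occurrence of `cds`."""
    offset = mrna.find(cds)
    if offset < 0:
        raise ValueError("CDS not found inside mRNA")
    return mrna[:offset]


# Prefixes copied from the mRNA and CDS listings; not the full sequences.
mrna_prefix = "CGACGAAGCCATGGTTTTCGCTACAAGGAA"
cds_prefix = "ATGGTTTTCGCTACAAGGAA"

utr = five_prime_utr(mrna_prefix, cds_prefix)
print(len(utr), utr)
```

For these prefixes the leading untranslated stretch is the 10 bases CGACGAAGCC.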

Protein sequence

MVFATRKINWRMSTHNSFSIRPPVEAFCDFWCFNGTSLNSDHSSFLPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVVLWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTLRLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNAVMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDEVMHSF
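
The protein sequence is the conceptual translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1). It is checked here only against the first and last codons of the CDS listing, not the whole sequence, and translate is an illustrative helper rather than the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGTTTTCGCTACAAGGAAGATAAATTGG"))  # first 10 codons of the CDS
print(translate("CATTCGTTTTAA"))  # last 4 codons, ending in the stop codon TAA
```

The two calls give MVFATRKINW and HSF, matching the start and end of the protein sequence above.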
Homology
BLAST of Cla97C01G007680 vs. NCBI nr
Match: XP_038877905.1 (pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida] >XP_038877906.1 pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida])

HSP 1 Score: 1202.2 bits (3109), Expect = 0.0e+00
Identity = 574/605 (94.88%), Positives = 591/605 (97.69%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPSRLLVFQIHYDVATCNF LQSYANHKNLT+GKQLHSLMITSGFIHLPSSITSLINMYS
Sbjct: 28  LPSRLLVFQIHYDVATCNFVLQSYANHKNLTKGKQLHSLMITSGFIHLPSSITSLINMYS 87

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           KCNQMEQAVLVFHDPY ERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVG+MPDKFTFP
Sbjct: 88  KCNQMEQAVLVFHDPYRERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGVMPDKFTFP 147

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
           CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVD++EDAEKVFEELPERDVV
Sbjct: 148 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDMMEDAEKVFEELPERDVV 207

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMINGYTQIGRLNKAVV FK MG+EGISPCRFTMTGILSI TLM DINNGRAIHGIV
Sbjct: 208 LWNAMINGYTQIGRLNKAVVVFKNMGKEGISPCRFTMTGILSILTLMRDINNGRAIHGIV 267

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYSSCVAVSNALIDMYGKCKHIEDALMIF+MIN KDLFSWNS+ISAHEQCGDHD TL
Sbjct: 268 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFEMINEKDLFSWNSVISAHEQCGDHDSTL 327

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNA 405
           RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHG+MIVNGLGKNENGDDVLLNNA
Sbjct: 328 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNA 387

Query: 406 VMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPD 465
           VMDMYAKCGCMKNADIVFDL  NKDVASWNIMIMGYA+HGYG+EALDMFH MCEAQIKPD
Sbjct: 388 VMDMYAKCGCMKNADIVFDLTRNKDVASWNIMIMGYAMHGYGKEALDMFHLMCEAQIKPD 447

Query: 466 AITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELA 525
           A+TFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAG +EEAYELA
Sbjct: 448 AVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGHVEEAYELA 507

Query: 526 QRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGR 585
           QRIP Q+NLVLWMALLGACRLHGNAELGKVVGEKITRLEP+HCGSGSYILMS+MYGV+GR
Sbjct: 508 QRIPLQENLVLWMALLGACRLHGNAELGKVVGEKITRLEPEHCGSGSYILMSSMYGVVGR 567

Query: 586 YEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 645
           YEEALEVRRTM EQN+KKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE
Sbjct: 568 YEEALEVRRTMNEQNVKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 627

Query: 646 VMHSF 651
           VMHSF
Sbjct: 628 VMHSF 632
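
The Identity and Positives figures in each hit are counts over the aligned length, expressed as percentages (574/605 and 591/605 for the hit above). The sketch below recomputes the percentages and counts identical positions in the first 60-residue block of the alignment. The function names are hypothetical. Positives are not recounted, because that would need the substitution matrix used by the search, which this page does not state.

```python
def percent(count, aligned_length):
    """Express a match count as a percentage of the aligned length."""
    return round(100.0 * count / aligned_length, 2)


def count_identities(query, subject):
    """Count positions where two equal-length aligned strings agree."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# First alignment block of the Benincasa hispida hit (query 46-105, subject 28-87).
query = "LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS"
sbjct = "LPSRLLVFQIHYDVATCNFVLQSYANHKNLTKGKQLHSLMITSGFIHLPSSITSLINMYS"

print(percent(574, 605), percent(591, 605))
print(count_identities(query, sbjct), "of", len(query))
```

The block contains 58 identical positions out of 60; the two substitutions are F to V and E to K.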

BLAST of Cla97C01G007680 vs. NCBI nr
Match: XP_004149501.2 (pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_031740827.1 pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >KAE8648313.1 hypothetical protein Csa_004684 [Cucumis sativus])

HSP 1 Score: 1166.0 bits (3015), Expect = 0.0e+00
Identity = 553/605 (91.40%), Positives = 579/605 (95.70%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPSRLLVFQIHYDVATCN  LQSYANHKNLT+G+QLHSLM+TSGFIHLPSSITSLINMYS
Sbjct: 28  LPSRLLVFQIHYDVATCNLSLQSYANHKNLTKGRQLHSLMVTSGFIHLPSSITSLINMYS 87

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           +CNQME+AVLVF DPYHERNVFAYNAIIAGFVANGLAA GFQFYKRMRSVG+MPDKFTFP
Sbjct: 88  RCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFP 147

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
           CVVRACCEFMEVRKIHGCLFKMGLEL+VFVGSALVNTYLKVD  EDAEKVFEELPERDVV
Sbjct: 148 CVVRACCEFMEVRKIHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAEKVFEELPERDVV 207

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMINGYT+IG LNKAVV FK+MGEEGIS  RFT T ILSI T MGDINNGRAIHGIV
Sbjct: 208 LWNAMINGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIV 267

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYSSCVAVSNALIDMYGKCKH EDALMIF+MIN KDLFSWNSIISAHEQC DHDGTL
Sbjct: 268 TKMGYSSCVAVSNALIDMYGKCKHTEDALMIFEMINEKDLFSWNSIISAHEQCDDHDGTL 327

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNA 405
           RLFGKMLGSRVLPDV+TITAVLPACSHLAALMHGREIHG+MIVNGLGKNENGDDVLLNNA
Sbjct: 328 RLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNA 387

Query: 406 VMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPD 465
           +MDMYAKCGCMKNADI+FDLM NKDVASWNIMIMGYA+HGYG EALDMFH MCEAQIKPD
Sbjct: 388 IMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCEAQIKPD 447

Query: 466 AITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELA 525
            +TFVGVLSACSHAGFVHQGRSFLTRMELEFGV+PTIEHYTCIIDMLGRAG + EAY+LA
Sbjct: 448 VVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGHLGEAYDLA 507

Query: 526 QRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGR 585
           QRIP +DNL+LWMALLGACRLHGNAELG VVGEKIT+LEPKHCGSGSYILMS++YGV+GR
Sbjct: 508 QRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLEPKHCGSGSYILMSSLYGVVGR 567

Query: 586 YEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 645
           YEEALEVRRTMKEQN+KKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCG GYFHDE
Sbjct: 568 YEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGFGYFHDE 627

Query: 646 VMHSF 651
           VMHSF
Sbjct: 628 VMHSF 632

BLAST of Cla97C01G007680 vs. NCBI nr
Match: XP_008466127.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis melo])

HSP 1 Score: 1151.0 bits (2976), Expect = 0.0e+00
Identity = 546/605 (90.25%), Positives = 576/605 (95.21%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPSRLLVFQIHYDVATCN FLQSYANHKNLT+GKQLHSLM+TSGFIHLPSSITSLINMYS
Sbjct: 28  LPSRLLVFQIHYDVATCNLFLQSYANHKNLTKGKQLHSLMVTSGFIHLPSSITSLINMYS 87

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           KCNQME+AVLVFHDPY ERNVFAYNAIIAGFVANGLAA GFQFYKRMRSVG+MPDKFTFP
Sbjct: 88  KCNQMEEAVLVFHDPYRERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFP 147

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
           CVVRACCEFME+RKIHGCLFKMGLEL+VFVGSALVNTYLKVD +EDAEKVFEELPERDVV
Sbjct: 148 CVVRACCEFMEIRKIHGCLFKMGLELNVFVGSALVNTYLKVDGMEDAEKVFEELPERDVV 207

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMINGY +IG LNKAV  FKKMGEEGIS  RFT T ILS+FT MGDINNGRAIHGIV
Sbjct: 208 LWNAMINGYIKIGHLNKAVAVFKKMGEEGISLSRFTTTSILSVFTSMGDINNGRAIHGIV 267

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYSSCVAVSNALIDMYGKCKH +DAL+IF+MIN KDLFSWNSIISAHEQCGDHDGTL
Sbjct: 268 TKMGYSSCVAVSNALIDMYGKCKHNKDALLIFEMINEKDLFSWNSIISAHEQCGDHDGTL 327

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNA 405
           RLFGKML SRVLPDV+TIT VLPACSHLAALMHGREIHG+MIVNGLGKNEN DDVLLNNA
Sbjct: 328 RLFGKMLASRVLPDVITITVVLPACSHLAALMHGREIHGYMIVNGLGKNENSDDVLLNNA 387

Query: 406 VMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPD 465
           VMDMYAKCGCMKNA I+FDLM NKDVASWNIMIMGYA+HGYG EALDMFH MCEAQIKP+
Sbjct: 388 VMDMYAKCGCMKNAGIIFDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCEAQIKPN 447

Query: 466 AITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELA 525
            +TFVGVLSACSHAGFVHQGRSFLTRMELEFGV+PTIEHYTCIIDMLGRAG++ EAY+LA
Sbjct: 448 VVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGQLGEAYDLA 507

Query: 526 QRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGR 585
           QRIP QDNL+LWMALLGACRLHGNAELG VVGEKI +LEPK+CGSGSYILMS++YGV+GR
Sbjct: 508 QRIPLQDNLILWMALLGACRLHGNAELGNVVGEKIRQLEPKNCGSGSYILMSSLYGVVGR 567

Query: 586 YEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 645
           YEEALEVRRTMKEQN+KKTPGCSWIELK+GLYVFSMGDRTHHELNALINCLCG  YFHDE
Sbjct: 568 YEEALEVRRTMKEQNVKKTPGCSWIELKNGLYVFSMGDRTHHELNALINCLCGFQYFHDE 627

Query: 646 VMHSF 651
           VMHSF
Sbjct: 628 VMHSF 632

BLAST of Cla97C01G007680 vs. NCBI nr
Match: XP_022939865.1 (pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita moschata] >XP_022939866.1 pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 1145.2 bits (2961), Expect = 0.0e+00
Identity = 544/604 (90.07%), Positives = 575/604 (95.20%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPSRLLVFQ+HYDVATCNFFLQSYANHKNL EGKQLHS+MITSGF+HLPSSITSLINMYS
Sbjct: 28  LPSRLLVFQMHYDVATCNFFLQSYANHKNLPEGKQLHSVMITSGFMHLPSSITSLINMYS 87

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVAN L AHGFQFYKRMRSVG+MPDKFTFP
Sbjct: 88  KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANRLPAHGFQFYKRMRSVGVMPDKFTFP 147

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
           CVVRACCEFMEVRKIHGCLFK+GLELD+FV SALVNTYLK D++EDA+KVF+ELPERDVV
Sbjct: 148 CVVRACCEFMEVRKIHGCLFKLGLELDMFVSSALVNTYLKFDLMEDAKKVFKELPERDVV 207

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMING  +IG LNKAVV FKKMGEEGISPCRFT+TGILSIF+LMGDINNGRAIHGIV
Sbjct: 208 LWNAMINGCAKIGHLNKAVVVFKKMGEEGISPCRFTITGILSIFSLMGDINNGRAIHGIV 267

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYSSCVAVSNALIDMYGKCKHIEDAL+IF+MIN KDLFSWNSIISAHEQC DHDGTL
Sbjct: 268 TKMGYSSCVAVSNALIDMYGKCKHIEDALVIFEMINEKDLFSWNSIISAHEQCVDHDGTL 327

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNA 405
           R F KML SRVLPDV+TITAVLPACS+ AALMHGREIHG+M VNGLGKNE+GDDVLLNNA
Sbjct: 328 RFFDKMLASRVLPDVITITAVLPACSYFAALMHGREIHGYMTVNGLGKNEDGDDVLLNNA 387

Query: 406 VMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPD 465
           VMDMYAKCGC+KNA  VFD  SNKDVASWNIMIMGYA HGYG+EALDMFHHMCEAQIKPD
Sbjct: 388 VMDMYAKCGCLKNAHRVFDRTSNKDVASWNIMIMGYATHGYGQEALDMFHHMCEAQIKPD 447

Query: 466 AITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELA 525
           AITFVGVLSACSHAGF+ QGRSFL RMELEFGVVPTIEHYTCIIDMLGRAG + EAYELA
Sbjct: 448 AITFVGVLSACSHAGFLRQGRSFLARMELEFGVVPTIEHYTCIIDMLGRAGHLGEAYELA 507

Query: 526 QRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGR 585
           +RIP QDNLVLWMALLGACRLHGNA+LGKVVGEKI RLEPKHCGSGSY+LMS+MYGV+GR
Sbjct: 508 ERIPLQDNLVLWMALLGACRLHGNADLGKVVGEKIMRLEPKHCGSGSYVLMSSMYGVVGR 567

Query: 586 YEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 645
           YEEAL+VRRTMKEQN+KKTPGCSWIELKDGLYVFSMGDRTH ELNALI+CLCGIGY HDE
Sbjct: 568 YEEALQVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHPELNALIHCLCGIGYLHDE 627

Query: 646 VMHS 650
           VMHS
Sbjct: 628 VMHS 631

BLAST of Cla97C01G007680 vs. NCBI nr
Match: XP_023551696.1 (pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023551697.1 pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 542/604 (89.74%), Positives = 576/604 (95.36%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPSRLLVFQ+HYDVATCNFFLQSYANHKNL+EGK+LHS+MITSGF+HLPSSITSLINMYS
Sbjct: 28  LPSRLLVFQMHYDVATCNFFLQSYANHKNLSEGKKLHSVMITSGFMHLPSSITSLINMYS 87

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           KCN MEQAVLVFHDPYHE NVFAYNAIIAGFVAN L AHGFQFYKRMRSVG+MPDKFTFP
Sbjct: 88  KCNHMEQAVLVFHDPYHESNVFAYNAIIAGFVANRLPAHGFQFYKRMRSVGVMPDKFTFP 147

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
           CVVRACCEFMEVRKIHGCLFK+GLELD+FV SALVNTYLK D++EDA+KVF+ELPERDVV
Sbjct: 148 CVVRACCEFMEVRKIHGCLFKLGLELDMFVSSALVNTYLKFDLMEDAKKVFKELPERDVV 207

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMINGY +IG LNKAVV FKKMGEEGISPCRFT+TGILSIF+LMGDINNGRAIHGIV
Sbjct: 208 LWNAMINGYAKIGHLNKAVVVFKKMGEEGISPCRFTITGILSIFSLMGDINNGRAIHGIV 267

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYSSCVAVSNALIDMYGKCKHIEDAL+IF+MIN KDLFSWNSIISAHEQC DHDGTL
Sbjct: 268 TKMGYSSCVAVSNALIDMYGKCKHIEDALVIFEMINEKDLFSWNSIISAHEQCVDHDGTL 327

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNA 405
           R F KML SRVLPDV+TITAVLPACS+ AALMHGREIHG+M VNGLGKNE+GDDVLLNNA
Sbjct: 328 RFFDKMLASRVLPDVITITAVLPACSYFAALMHGREIHGYMTVNGLGKNEDGDDVLLNNA 387

Query: 406 VMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPD 465
           VMDMYAKCGC+KNA  VFD  SNKDVASWNIMIMGYA+HGYG+EALDMFHHMCEAQIKPD
Sbjct: 388 VMDMYAKCGCLKNAHRVFDRTSNKDVASWNIMIMGYAMHGYGQEALDMFHHMCEAQIKPD 447

Query: 466 AITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELA 525
           AITFVGVLSACSHAGF+ QGRSFL RMELEFGVVPTIEHYTCIIDMLGRAG + EAYELA
Sbjct: 448 AITFVGVLSACSHAGFLRQGRSFLARMELEFGVVPTIEHYTCIIDMLGRAGHLGEAYELA 507

Query: 526 QRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGR 585
           +RIP QDNLVLWMALLGACRLHGNA+LGKVVGEKI RLEPKHCGSGSY+LMS+MYGV+GR
Sbjct: 508 ERIPLQDNLVLWMALLGACRLHGNADLGKVVGEKIMRLEPKHCGSGSYVLMSSMYGVVGR 567

Query: 586 YEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 645
           YEEAL+VRRTMKEQN+KKTPGCSWIELKDGLYVFSMGDRTH ELNALI+CLCGIGY HDE
Sbjct: 568 YEEALQVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHPELNALIHCLCGIGYLHDE 627

Query: 646 VMHS 650
           VMHS
Sbjct: 628 VMHS 631

BLAST of Cla97C01G007680 vs. ExPASy Swiss-Prot
Match: Q9LUC2 (Pentatricopeptide repeat-containing protein At3g14730 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E31 PE=2 SV=1)

HSP 1 Score: 644.4 bits (1661), Expect = 1.4e-183
Identity = 306/581 (52.67%), Positives = 416/581 (71.60%), Query Frame = 0

Query: 56  HYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFI-HLPSSITSLINMYSKCNQMEQAV 115
           H++VATC   LQ  A  K+   G+Q+H  M+  GF+   P + TSL+NMY+KC  M +AV
Sbjct: 57  HHNVATCIATLQRCAQRKDYVSGQQIHGFMVRKGFLDDSPRAGTSLVNMYAKCGLMRRAV 116

Query: 116 LVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRA--CC 175
           LVF     ER+VF YNA+I+GFV NG      + Y+ MR+ GI+PDK+TFP +++     
Sbjct: 117 LVFGG--SERDVFGYNALISGFVVNGSPLDAMETYREMRANGILPDKYTFPSLLKGSDAM 176

Query: 176 EFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPER-DVVLWNAMI 235
           E  +V+K+HG  FK+G + D +VGS LV +Y K   VEDA+KVF+ELP+R D VLWNA++
Sbjct: 177 ELSDVKKVHGLAFKLGFDSDCYVGSGLVTSYSKFMSVEDAQKVFDELPDRDDSVLWNALV 236

Query: 236 NGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIVTKMGYS 295
           NGY+QI R   A++ F KM EEG+   R T+T +LS FT+ GDI+NGR+IHG+  K G  
Sbjct: 237 NGYSQIFRFEDALLVFSKMREEGVGVSRHTITSVLSAFTVSGDIDNGRSIHGLAVKTGSG 296

Query: 296 SCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTLRLFGKM 355
           S + VSNALIDMYGK K +E+A  IF+ ++ +DLF+WNS++  H+ CGDHDGTL LF +M
Sbjct: 297 SDIVVSNALIDMYGKSKWLEEANSIFEAMDERDLFTWNSVLCVHDYCGDHDGTLALFERM 356

Query: 356 LGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNAVMDMYA 415
           L S + PD+VT+T VLP C  LA+L  GREIHG+MIV+GL  N    +  ++N++MDMY 
Sbjct: 357 LCSGIRPDIVTLTTVLPTCGRLASLRQGREIHGYMIVSGL-LNRKSSNEFIHNSLMDMYV 416

Query: 416 KCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFVG 475
           KCG +++A +VFD M  KD ASWNIMI GY +   G  ALDMF  MC A +KPD ITFVG
Sbjct: 417 KCGDLRDARMVFDSMRVKDSASWNIMINGYGVQSCGELALDMFSCMCRAGVKPDEITFVG 476

Query: 476 VLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPFQ 535
           +L ACSH+GF+++GR+FL +ME  + ++PT +HY C+IDMLGRA ++EEAYELA   P  
Sbjct: 477 LLQACSHSGFLNEGRNFLAQMETVYNILPTSDHYACVIDMLGRADKLEEAYELAISKPIC 536

Query: 536 DNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEALE 595
           DN V+W ++L +CRLHGN +L  V G+++  LEP+HC  G Y+LMSN+Y   G+YEE L+
Sbjct: 537 DNPVVWRSILSSCRLHGNKDLALVAGKRLHELEPEHC--GGYVLMSNVYVEAGKYEEVLD 596

Query: 596 VRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNAL 633
           VR  M++QN+KKTPGCSWI LK+G++ F  G++TH E  ++
Sbjct: 597 VRDAMRQQNVKKTPGCSWIVLKNGVHTFFTGNQTHPEFKSI 632
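
The Swiss-Prot match lines carry UniProt-style key=value fields after the protein name: OS (organism), OX (NCBI taxonomy identifier), GN (gene name), PE (protein existence level) and SV (sequence version). A small sketch for splitting such a description into its fields is given below. The function name is hypothetical, and the parser assumes the protein name itself never contains one of these keys followed by an equals sign.

```python
import re

# Keys used in UniProt-style description lines.
FIELD = re.compile(r"\b(OS|OX|GN|PE|SV)=")


def parse_swissprot_description(desc):
    """Split 'Name OS=... OX=... GN=... PE=... SV=...' into a dict."""
    parts = FIELD.split(desc)
    result = {"name": parts[0].strip()}
    # After splitting on a capturing group, keys and values alternate.
    for key, value in zip(parts[1::2], parts[2::2]):
        result[key] = value.strip()
    return result


fields = parse_swissprot_description(
    "Pentatricopeptide repeat-containing protein At3g14730 "
    "OS=Arabidopsis thaliana OX=3702 GN=PCMP-E31 PE=2 SV=1"
)
print(fields)
```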

BLAST of Cla97C01G007680 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 1.1e-111
Identity = 216/629 (34.34%), Positives = 355/629 (56.44%), Query Frame = 0

Query: 79  KQLHSLMITSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNAIIAGF-- 138
           + +H+ +I SGF +       LI+ YSKC  +E    VF D   +RN++ +N+++ G   
Sbjct: 40  RYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVF-DKMPQRNIYTWNSVVTGLTK 99

Query: 139 ------------------------VANGLAAH-----GFQFYKRMRSVGIMPDKFTFPCV 198
                                   + +G A H        ++  M   G + ++++F  V
Sbjct: 100 LGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASV 159

Query: 199 VRACCEFMEVRK---IHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDV 258
           + AC    ++ K   +H  + K     DV++GSALV+ Y K   V DA++VF+E+ +R+V
Sbjct: 160 LSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNV 219

Query: 259 VLWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGI 318
           V WN++I  + Q G   +A+  F+ M E  + P   T+  ++S    +  I  G+ +HG 
Sbjct: 220 VSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGR 279

Query: 319 VTKMG-YSSCVAVSNALIDMYGKCKHIEDALMIFD------------MING--------- 378
           V K     + + +SNA +DMY KC  I++A  IFD            MI+G         
Sbjct: 280 VVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKA 339

Query: 379 ----------KDLFSWNSIISAHEQCGDHDGTLRLFGKMLGSRVLPDVVTITAVLPACSH 438
                     +++ SWN++I+ + Q G+++  L LF  +    V P   +   +L AC+ 
Sbjct: 340 ARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACAD 399

Query: 439 LAALMHGREIHGHMIVNGLGKNENG--DDVLLNNAVMDMYAKCGCMKNADIVFDLMSNKD 498
           LA L  G + H H++ +G  K ++G  DD+ + N+++DMY KCGC++   +VF  M  +D
Sbjct: 400 LAELHLGMQAHVHVLKHGF-KFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERD 459

Query: 499 VASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFVGVLSACSHAGFVHQGRSFLT 558
             SWN MI+G+A +GYG EAL++F  M E+  KPD IT +GVLSAC HAGFV +GR + +
Sbjct: 460 CVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFS 519

Query: 559 RMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPFQDNLVLWMALLGACRLHGNA 618
            M  +FGV P  +HYTC++D+LGRAG +EEA  + + +P Q + V+W +LL AC++H N 
Sbjct: 520 SMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNI 579

Query: 619 ELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEALEVRRTMKEQNIKKTPGCSWI 637
            LGK V EK+  +EP +  SG Y+L+SNMY  +G++E+ + VR++M+++ + K PGCSWI
Sbjct: 580 TLGKYVAEKLLEVEPSN--SGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWI 639

BLAST of Cla97C01G007680 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 396.0 bits (1016), Expect = 8.4e-109
Identity = 202/600 (33.67%), Positives = 340/600 (56.67%), Query Frame = 0

Query: 73  KNLTEGKQLHSLMI-TSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNA 132
           K+ ++ KQLH+  I T    H  +SI  +I++Y+    + +A+L+F        V A+ +
Sbjct: 19  KSKSQAKQLHAQFIRTQSLSHTSASI--VISIYTNLKLLHEALLLF-KTLKSPPVLAWKS 78

Query: 133 IIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRACCEFMEVR---KIHGCLFKMG 192
           +I  F    L +     +  MR+ G  PD   FP V+++C   M++R    +HG + ++G
Sbjct: 79  VIRCFTDQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLG 138

Query: 193 LELDVFVGSALVNTYLKV--------------------------DV----------VEDA 252
           ++ D++ G+AL+N Y K+                          DV          ++  
Sbjct: 139 MDCDLYTGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSV 198

Query: 253 EKVFEELPERDVVLWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLM 312
            +VFE +P +DVV +N +I GY Q G    A+   ++MG   + P  FT++ +L IF+  
Sbjct: 199 RRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEY 258

Query: 313 GDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSII 372
            D+  G+ IHG V + G  S V + ++L+DMY K   IED+  +F  +  +D  SWNS++
Sbjct: 259 VDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLV 318

Query: 373 SAHEQCGDHDGTLRLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLG 432
           + + Q G ++  LRLF +M+ ++V P  V  ++V+PAC+HLA L  G+++HG+++  G G
Sbjct: 319 AGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFG 378

Query: 433 KNENGDDVLLNNAVMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALD 492
            N     + + +A++DMY+KCG +K A  +FD M+  D  SW  +IMG+A+HG+G EA+ 
Sbjct: 379 SN-----IFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVS 438

Query: 493 MFHHMCEAQIKPDAITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDML 552
           +F  M    +KP+ + FV VL+ACSH G V +   +   M   +G+   +EHY  + D+L
Sbjct: 439 LFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLL 498

Query: 553 GRAGRIEEAYELAQRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGS 612
           GRAG++EEAY    ++  +    +W  LL +C +H N EL + V EKI  ++ ++   G+
Sbjct: 499 GRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSEN--MGA 558

Query: 613 YILMSNMYGVIGRYEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNAL 633
           Y+LM NMY   GR++E  ++R  M+++ ++K P CSWIE+K+  + F  GDR+H  ++ +
Sbjct: 559 YVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKI 608

BLAST of Cla97C01G007680 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 3.2e-108
Identity = 207/578 (35.81%), Positives = 340/578 (58.82%), Query Frame = 0

Query: 55  IHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYSKCNQMEQAV 114
           +  D  T +   +S+++ +++  G+QLH  ++ SGF    S   SL+  Y K  +++ A 
Sbjct: 191 VEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSAR 250

Query: 115 LVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRACCEF 174
            VF D   ER+V ++N+II G+V+NGLA  G   + +M   GI  D  T   V   C + 
Sbjct: 251 KVF-DEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADS 310

Query: 175 MEV---RKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVVLWNAMI 234
             +   R +H    K     +    + L++ Y K   ++ A+ VF E+ +R VV + +MI
Sbjct: 311 RLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMI 370

Query: 235 NGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIVTKMGYS 294
            GY + G   +AV  F++M EEGISP  +T+T +L+       ++ G+ +H  + +    
Sbjct: 371 AGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLG 430

Query: 295 SCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTLRLFGKM 354
             + VSNAL+DMY KC  +++A ++F  +  KD+ SWN+II  + +    +  L LF  +
Sbjct: 431 FDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLL 490

Query: 355 L-GSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNAVMDMY 414
           L   R  PD  T+  VLPAC+ L+A   GREIHG+++ NG   + +     + N+++DMY
Sbjct: 491 LEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRH-----VANSLVDMY 550

Query: 415 AKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFV 474
           AKCG +  A ++FD +++KD+ SW +MI GY +HG+G+EA+ +F+ M +A I+ D I+FV
Sbjct: 551 AKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFV 610

Query: 475 GVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPF 534
            +L ACSH+G V +G  F   M  E  + PT+EHY CI+DML R G + +AY   + +P 
Sbjct: 611 SLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPI 670

Query: 535 QDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEAL 594
             +  +W ALL  CR+H + +L + V EK+  LEP++  +G Y+LM+N+Y    ++E+  
Sbjct: 671 PPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPEN--TGYYVLMANIYAEAEKWEQVK 730

Query: 595 EVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHE 629
            +R+ + ++ ++K PGCSWIE+K  + +F  GD ++ E
Sbjct: 731 RLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPE 760

BLAST of Cla97C01G007680 vs. ExPASy Swiss-Prot
Match: Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 1.3e-106
Identity = 214/636 (33.65%), Positives = 349/636 (54.87%), Query Frame = 0

Query: 11  RMSTHNSFSIRPPVEAFCDFWCFNGTSLNSDHSSFLPSRLL-VFQIHYDVATCNFFLQSY 70
           +MS  N FS    V  +     F       D +  L  R+L V  +  DV T    L++ 
Sbjct: 154 KMSERNLFSWNVLVGGYAKQGYF-------DEAMCLYHRMLWVGGVKPDVYTFPCVLRTC 213

Query: 71  ANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAY 130
               +L  GK++H  ++  G+      + +LI MY KC  ++ A L+F D    R++ ++
Sbjct: 214 GGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLF-DRMPRRDIISW 273

Query: 131 NAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRACCEFMEVRK----IHGCLF 190
           NA+I+G+  NG+   G + +  MR + + PD  T   V+ A CE +  R+    IH  + 
Sbjct: 274 NAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISA-CELLGDRRLGRDIHAYVI 333

Query: 191 KMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVVLWNAMINGYTQIGRLNKAVV 250
             G  +D+ V ++L   YL      +AEK+F  +  +D+V W  MI+GY      +KA+ 
Sbjct: 334 TTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAID 393

Query: 251 FFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYG 310
            ++ M ++ + P   T+  +LS    +GD++ G  +H +  K    S V V+N LI+MY 
Sbjct: 394 TYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINMYS 453

Query: 311 KCKHIEDALMIFDMINGKDLFSWNSIISA---HEQCGDHDGTLRLFGKMLGSRVLPDVVT 370
           KCK I+ AL IF  I  K++ SW SII+    + +C +      +F + +   + P+ +T
Sbjct: 454 KCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFE----ALIFLRQMKMTLQPNAIT 513

Query: 371 ITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNAVMDMYAKCGCMKNADIV 430
           +TA L AC+ + ALM G+EIH H++  G+G      D  L NA++DMY +CG M  A   
Sbjct: 514 LTAALAACARIGALMCGKEIHAHVLRTGVGL-----DDFLPNALLDMYVRCGRMNTAWSQ 573

Query: 431 FDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFVGVLSACSHAGFV 490
           F+    KDV SWNI++ GY+  G G   +++F  M +++++PD ITF+ +L  CS +  V
Sbjct: 574 FN-SQKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGCSKSQMV 633

Query: 491 HQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPFQDNLVLWMALLG 550
            QG  + ++ME ++GV P ++HY C++D+LGRAG ++EA++  Q++P   +  +W ALL 
Sbjct: 634 RQGLMYFSKME-DYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAVWGALLN 693

Query: 551 ACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEALEVRRTMKEQNIK 610
           ACR+H   +LG++  + I  L+ K    G YIL+ N+Y   G++ E  +VRR MKE  + 
Sbjct: 694 ACRIHHKIDLGELSAQHIFELDKK--SVGYYILLCNLYADCGKWREVAKVRRMMKENGLT 753

Query: 611 KTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCG 639
              GCSW+E+K  ++ F   D+ H +   +   L G
Sbjct: 754 VDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLEG 767

BLAST of Cla97C01G007680 vs. ExPASy TrEMBL
Match: A0A1S3CQH1 (pentatricopeptide repeat-containing protein At3g14730-like OS=Cucumis melo OX=3656 GN=LOC103503636 PE=4 SV=1)

HSP 1 Score: 1151.0 bits (2976), Expect = 0.0e+00
Identity = 546/605 (90.25%), Positives = 576/605 (95.21%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPSRLLVFQIHYDVATCN FLQSYANHKNLT+GKQLHSLM+TSGFIHLPSSITSLINMYS
Sbjct: 28  LPSRLLVFQIHYDVATCNLFLQSYANHKNLTKGKQLHSLMVTSGFIHLPSSITSLINMYS 87

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           KCNQME+AVLVFHDPY ERNVFAYNAIIAGFVANGLAA GFQFYKRMRSVG+MPDKFTFP
Sbjct: 88  KCNQMEEAVLVFHDPYRERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFP 147

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
           CVVRACCEFME+RKIHGCLFKMGLEL+VFVGSALVNTYLKVD +EDAEKVFEELPERDVV
Sbjct: 148 CVVRACCEFMEIRKIHGCLFKMGLELNVFVGSALVNTYLKVDGMEDAEKVFEELPERDVV 207

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMINGY +IG LNKAV  FKKMGEEGIS  RFT T ILS+FT MGDINNGRAIHGIV
Sbjct: 208 LWNAMINGYIKIGHLNKAVAVFKKMGEEGISLSRFTTTSILSVFTSMGDINNGRAIHGIV 267

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYSSCVAVSNALIDMYGKCKH +DAL+IF+MIN KDLFSWNSIISAHEQCGDHDGTL
Sbjct: 268 TKMGYSSCVAVSNALIDMYGKCKHNKDALLIFEMINEKDLFSWNSIISAHEQCGDHDGTL 327

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNA 405
           RLFGKML SRVLPDV+TIT VLPACSHLAALMHGREIHG+MIVNGLGKNEN DDVLLNNA
Sbjct: 328 RLFGKMLASRVLPDVITITVVLPACSHLAALMHGREIHGYMIVNGLGKNENSDDVLLNNA 387

Query: 406 VMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPD 465
           VMDMYAKCGCMKNA I+FDLM NKDVASWNIMIMGYA+HGYG EALDMFH MCEAQIKP+
Sbjct: 388 VMDMYAKCGCMKNAGIIFDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCEAQIKPN 447

Query: 466 AITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELA 525
            +TFVGVLSACSHAGFVHQGRSFLTRMELEFGV+PTIEHYTCIIDMLGRAG++ EAY+LA
Sbjct: 448 VVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGQLGEAYDLA 507

Query: 526 QRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGR 585
           QRIP QDNL+LWMALLGACRLHGNAELG VVGEKI +LEPK+CGSGSYILMS++YGV+GR
Sbjct: 508 QRIPLQDNLILWMALLGACRLHGNAELGNVVGEKIRQLEPKNCGSGSYILMSSLYGVVGR 567

Query: 586 YEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 645
           YEEALEVRRTMKEQN+KKTPGCSWIELK+GLYVFSMGDRTHHELNALINCLCG  YFHDE
Sbjct: 568 YEEALEVRRTMKEQNVKKTPGCSWIELKNGLYVFSMGDRTHHELNALINCLCGFQYFHDE 627

Query: 646 VMHSF 651
           VMHSF
Sbjct: 628 VMHSF 632

BLAST of Cla97C01G007680 vs. ExPASy TrEMBL
Match: A0A6J1FNY8 (pentatricopeptide repeat-containing protein At3g14730-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445604 PE=4 SV=1)

HSP 1 Score: 1145.2 bits (2961), Expect = 0.0e+00
Identity = 544/604 (90.07%), Positives = 575/604 (95.20%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPSRLLVFQ+HYDVATCNFFLQSYANHKNL EGKQLHS+MITSGF+HLPSSITSLINMYS
Sbjct: 28  LPSRLLVFQMHYDVATCNFFLQSYANHKNLPEGKQLHSVMITSGFMHLPSSITSLINMYS 87

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVAN L AHGFQFYKRMRSVG+MPDKFTFP
Sbjct: 88  KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANRLPAHGFQFYKRMRSVGVMPDKFTFP 147

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
           CVVRACCEFMEVRKIHGCLFK+GLELD+FV SALVNTYLK D++EDA+KVF+ELPERDVV
Sbjct: 148 CVVRACCEFMEVRKIHGCLFKLGLELDMFVSSALVNTYLKFDLMEDAKKVFKELPERDVV 207

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMING  +IG LNKAVV FKKMGEEGISPCRFT+TGILSIF+LMGDINNGRAIHGIV
Sbjct: 208 LWNAMINGCAKIGHLNKAVVVFKKMGEEGISPCRFTITGILSIFSLMGDINNGRAIHGIV 267

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYSSCVAVSNALIDMYGKCKHIEDAL+IF+MIN KDLFSWNSIISAHEQC DHDGTL
Sbjct: 268 TKMGYSSCVAVSNALIDMYGKCKHIEDALVIFEMINEKDLFSWNSIISAHEQCVDHDGTL 327

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNA 405
           R F KML SRVLPDV+TITAVLPACS+ AALMHGREIHG+M VNGLGKNE+GDDVLLNNA
Sbjct: 328 RFFDKMLASRVLPDVITITAVLPACSYFAALMHGREIHGYMTVNGLGKNEDGDDVLLNNA 387

Query: 406 VMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPD 465
           VMDMYAKCGC+KNA  VFD  SNKDVASWNIMIMGYA HGYG+EALDMFHHMCEAQIKPD
Sbjct: 388 VMDMYAKCGCLKNAHRVFDRTSNKDVASWNIMIMGYATHGYGQEALDMFHHMCEAQIKPD 447

Query: 466 AITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELA 525
           AITFVGVLSACSHAGF+ QGRSFL RMELEFGVVPTIEHYTCIIDMLGRAG + EAYELA
Sbjct: 448 AITFVGVLSACSHAGFLRQGRSFLARMELEFGVVPTIEHYTCIIDMLGRAGHLGEAYELA 507

Query: 526 QRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGR 585
           +RIP QDNLVLWMALLGACRLHGNA+LGKVVGEKI RLEPKHCGSGSY+LMS+MYGV+GR
Sbjct: 508 ERIPLQDNLVLWMALLGACRLHGNADLGKVVGEKIMRLEPKHCGSGSYVLMSSMYGVVGR 567

Query: 586 YEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 645
           YEEAL+VRRTMKEQN+KKTPGCSWIELKDGLYVFSMGDRTH ELNALI+CLCGIGY HDE
Sbjct: 568 YEEALQVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHPELNALIHCLCGIGYLHDE 627

Query: 646 VMHS 650
           VMHS
Sbjct: 628 VMHS 631

BLAST of Cla97C01G007680 vs. ExPASy TrEMBL
Match: A0A6J1JQW1 (pentatricopeptide repeat-containing protein At3g14730-like OS=Cucurbita maxima OX=3661 GN=LOC111489017 PE=4 SV=1)

HSP 1 Score: 1131.7 bits (2926), Expect = 0.0e+00
Identity = 538/604 (89.07%), Positives = 573/604 (94.87%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPSRLLVFQ+HYDVATCNF LQSYANHKNL EGKQLHS+MITSGF+HLPSSITSLINMYS
Sbjct: 28  LPSRLLVFQMHYDVATCNFLLQSYANHKNLPEGKQLHSVMITSGFMHLPSSITSLINMYS 87

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVAN L A+GFQFYKRMR+VG+MPDKFTFP
Sbjct: 88  KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANRLPANGFQFYKRMRAVGVMPDKFTFP 147

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
           CVVRACCEFMEVRKIHGCLFK+GLELD+FV SALVNTYLK D++E+A+KVFEELP RDVV
Sbjct: 148 CVVRACCEFMEVRKIHGCLFKLGLELDMFVSSALVNTYLKFDLMENAKKVFEELPVRDVV 207

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMINGY QIG LNKAVV FKKMGEEGISPCRFT+TGILSIF+LMGD+NNGRAIHGIV
Sbjct: 208 LWNAMINGYAQIGHLNKAVVVFKKMGEEGISPCRFTITGILSIFSLMGDVNNGRAIHGIV 267

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYSSCVAVSNALIDMYGKCKHIEDAL+IF+MIN KDLFSWNSIISAHEQC DHDGTL
Sbjct: 268 TKMGYSSCVAVSNALIDMYGKCKHIEDALVIFEMINEKDLFSWNSIISAHEQCVDHDGTL 327

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNA 405
           R F KML SRVLPDV+TITAVLPACS+ AALMHGREIHG+M VNGLGKNE+GDDVLLNNA
Sbjct: 328 RFFDKMLASRVLPDVITITAVLPACSYFAALMHGREIHGYMTVNGLGKNEDGDDVLLNNA 387

Query: 406 VMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPD 465
           VMDMYAKCGC+KNA  VFD  SNKDVASWNIMIMGYA+HGYG+EALDMFHHM EAQIKPD
Sbjct: 388 VMDMYAKCGCLKNAHRVFDRTSNKDVASWNIMIMGYAMHGYGQEALDMFHHMREAQIKPD 447

Query: 466 AITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELA 525
           AITFVGVLSACSHAGF+ QGRSFL RMELEFGVVPTIEHYTCIIDMLGRAG + EAYELA
Sbjct: 448 AITFVGVLSACSHAGFLRQGRSFLARMELEFGVVPTIEHYTCIIDMLGRAGHLGEAYELA 507

Query: 526 QRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGR 585
           +RIP QDNLVLWMALLGACRLHGNA+LGKVVGEKI RLEPKHCGSGSY+LMS+MYGV+GR
Sbjct: 508 ERIPLQDNLVLWMALLGACRLHGNADLGKVVGEKIMRLEPKHCGSGSYVLMSSMYGVVGR 567

Query: 586 YEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYFHDE 645
           YEEAL+VRR MKEQN+KKTPGCSWIELKDGLYVFSMGDRTH ELNALI+CLCGIGY HDE
Sbjct: 568 YEEALQVRRMMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHPELNALIHCLCGIGYLHDE 627

Query: 646 VMHS 650
           VM+S
Sbjct: 628 VMNS 631

BLAST of Cla97C01G007680 vs. ExPASy TrEMBL
Match: A0A0A0KN02 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G129340 PE=4 SV=1)

HSP 1 Score: 1094.3 bits (2829), Expect = 0.0e+00
Identity = 518/566 (91.52%), Positives = 542/566 (95.76%), Query Frame = 0

Query: 85  MITSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAH 144
           M+TSGFIHLPSSITSLINMYS+CNQME+AVLVF DPYHERNVFAYNAIIAGFVANGLAA 
Sbjct: 1   MVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAAD 60

Query: 145 GFQFYKRMRSVGIMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYL 204
           GFQFYKRMRSVG+MPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLEL+VFVGSALVNTYL
Sbjct: 61  GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSALVNTYL 120

Query: 205 KVDVVEDAEKVFEELPERDVVLWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTG 264
           KVD  EDAEKVFEELPERDVVLWNAMINGYT+IG LNKAVV FK+MGEEGIS  RFT T 
Sbjct: 121 KVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTS 180

Query: 265 ILSIFTLMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKD 324
           ILSI T MGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH EDALMIF+MIN KD
Sbjct: 181 ILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFEMINEKD 240

Query: 325 LFSWNSIISAHEQCGDHDGTLRLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHG 384
           LFSWNSIISAHEQC DHDGTLRLFGKMLGSRVLPDV+TITAVLPACSHLAALMHGREIHG
Sbjct: 241 LFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHG 300

Query: 385 HMIVNGLGKNENGDDVLLNNAVMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIH 444
           +MIVNGLGKNENGDDVLLNNA+MDMYAKCGCMKNADI+FDLM NKDVASWNIMIMGYA+H
Sbjct: 301 YMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMH 360

Query: 445 GYGREALDMFHHMCEAQIKPDAITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEH 504
           GYG EALDMFH MCEAQIKPD +TFVGVLSACSHAGFVHQGRSFLTRMELEFGV+PTIEH
Sbjct: 361 GYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEH 420

Query: 505 YTCIIDMLGRAGRIEEAYELAQRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLE 564
           YTCIIDMLGRAG + EAY+LAQRIP +DNL+LWMALLGACRLHGNAELG VVGEKIT+LE
Sbjct: 421 YTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLE 480

Query: 565 PKHCGSGSYILMSNMYGVIGRYEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDR 624
           PKHCGSGSYILMS++YGV+GRYEEALEVRRTMKEQN+KKTPGCSWIELKDGLYVFSMGDR
Sbjct: 481 PKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDR 540

Query: 625 THHELNALINCLCGIGYFHDEVMHSF 651
           THHELNALINCLCG GYFHDEVMHSF
Sbjct: 541 THHELNALINCLCGFGYFHDEVMHSF 566

BLAST of Cla97C01G007680 vs. ExPASy TrEMBL
Match: A0A6J1CXY2 (pentatricopeptide repeat-containing protein At3g14730-like OS=Momordica charantia OX=3673 GN=LOC111015602 PE=4 SV=1)

HSP 1 Score: 1082.0 bits (2797), Expect = 0.0e+00
Identity = 521/605 (86.12%), Positives = 561/605 (92.73%), Query Frame = 0

Query: 46  LPSRLLVFQIHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYS 105
           LPS+L VFQIHYDVA+CNF LQSYANHK+LT+GKQLHSLMITSGFIHLPSS+TSLINMYS
Sbjct: 25  LPSKLFVFQIHYDVASCNFSLQSYANHKHLTKGKQLHSLMITSGFIHLPSSVTSLINMYS 84

Query: 106 KCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFP 165
           KCNQMEQAVLVFHDPYH+RNVFAYNAIIAGFVANGLA  G +FYKRM+S G+MPDKFTFP
Sbjct: 85  KCNQMEQAVLVFHDPYHDRNVFAYNAIIAGFVANGLAGQGIRFYKRMKSAGVMPDKFTFP 144

Query: 166 CVVRACCEFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVV 225
            VVRACCE MEVRKIHGCLFKMGLE DVFV SALVNTYLK+D++EDAEKVFEELP RD  
Sbjct: 145 SVVRACCEVMEVRKIHGCLFKMGLESDVFVASALVNTYLKIDLMEDAEKVFEELPIRDAA 204

Query: 226 LWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIV 285
           LWNAMINGY QIG LNKA+  F+KMGEEGISPCRFT+TGILSIF+LMGDINNGRAIHGIV
Sbjct: 205 LWNAMINGYAQIGCLNKALDVFRKMGEEGISPCRFTITGILSIFSLMGDINNGRAIHGIV 264

Query: 286 TKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTL 345
           TKMGYS CVAVSNALIDMYGKCK+IEDAL+IF+MIN KDLFSWNSIIS HEQCG+HD TL
Sbjct: 265 TKMGYSPCVAVSNALIDMYGKCKNIEDALVIFEMINEKDLFSWNSIISVHEQCGNHDDTL 324

Query: 346 RLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGD---DVLL 405
           RLFGKMLGSRVLPDVVTIT+VLPACSHLAALMHGREIHG+MIVNGLGK+EN +   DVLL
Sbjct: 325 RLFGKMLGSRVLPDVVTITSVLPACSHLAALMHGREIHGYMIVNGLGKDENSEDAVDVLL 384

Query: 406 NNAVMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQI 465
           NNAVMDMYAKCGCMKNA IVFD MSNKDVASWNIMI GYA+HGYG+EALDMF HMCEAQI
Sbjct: 385 NNAVMDMYAKCGCMKNALIVFDRMSNKDVASWNIMITGYAMHGYGKEALDMFLHMCEAQI 444

Query: 466 KPDAITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAY 525
           KPDA+TFVGVLSACSHAGFVHQGRSFL +MEL+FGVVP+IEHYTCIIDMLGRAG + EAY
Sbjct: 445 KPDAVTFVGVLSACSHAGFVHQGRSFLAQMELKFGVVPSIEHYTCIIDMLGRAGHLGEAY 504

Query: 526 ELAQRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGV 585
           ELAQRIP Q NLVLWMALLGACRLHG+A+LGKVVGEKI +LEPK+CGSGSY LMS+MYGV
Sbjct: 505 ELAQRIPLQGNLVLWMALLGACRLHGDADLGKVVGEKIMQLEPKNCGSGSYTLMSSMYGV 564

Query: 586 IGRYEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGIGYF 645
           +GRYEEA EVRRTM+EQN+KKTPGCSWIELKDGL VFS GDRTH ELNALINCL GIGY 
Sbjct: 565 VGRYEEAFEVRRTMREQNVKKTPGCSWIELKDGLRVFSTGDRTHPELNALINCL-GIGYI 624

Query: 646 HDEVM 648
            DE +
Sbjct: 625 LDEAV 628

BLAST of Cla97C01G007680 vs. TAIR 10
Match: AT3G14730.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 644.4 bits (1661), Expect = 9.6e-185
Identity = 306/581 (52.67%), Positives = 416/581 (71.60%), Query Frame = 0

Query: 56  HYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFI-HLPSSITSLINMYSKCNQMEQAV 115
           H++VATC   LQ  A  K+   G+Q+H  M+  GF+   P + TSL+NMY+KC  M +AV
Sbjct: 57  HHNVATCIATLQRCAQRKDYVSGQQIHGFMVRKGFLDDSPRAGTSLVNMYAKCGLMRRAV 116

Query: 116 LVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRA--CC 175
           LVF     ER+VF YNA+I+GFV NG      + Y+ MR+ GI+PDK+TFP +++     
Sbjct: 117 LVFGG--SERDVFGYNALISGFVVNGSPLDAMETYREMRANGILPDKYTFPSLLKGSDAM 176

Query: 176 EFMEVRKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPER-DVVLWNAMI 235
           E  +V+K+HG  FK+G + D +VGS LV +Y K   VEDA+KVF+ELP+R D VLWNA++
Sbjct: 177 ELSDVKKVHGLAFKLGFDSDCYVGSGLVTSYSKFMSVEDAQKVFDELPDRDDSVLWNALV 236

Query: 236 NGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIVTKMGYS 295
           NGY+QI R   A++ F KM EEG+   R T+T +LS FT+ GDI+NGR+IHG+  K G  
Sbjct: 237 NGYSQIFRFEDALLVFSKMREEGVGVSRHTITSVLSAFTVSGDIDNGRSIHGLAVKTGSG 296

Query: 296 SCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTLRLFGKM 355
           S + VSNALIDMYGK K +E+A  IF+ ++ +DLF+WNS++  H+ CGDHDGTL LF +M
Sbjct: 297 SDIVVSNALIDMYGKSKWLEEANSIFEAMDERDLFTWNSVLCVHDYCGDHDGTLALFERM 356

Query: 356 LGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNAVMDMYA 415
           L S + PD+VT+T VLP C  LA+L  GREIHG+MIV+GL  N    +  ++N++MDMY 
Sbjct: 357 LCSGIRPDIVTLTTVLPTCGRLASLRQGREIHGYMIVSGL-LNRKSSNEFIHNSLMDMYV 416

Query: 416 KCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFVG 475
           KCG +++A +VFD M  KD ASWNIMI GY +   G  ALDMF  MC A +KPD ITFVG
Sbjct: 417 KCGDLRDARMVFDSMRVKDSASWNIMINGYGVQSCGELALDMFSCMCRAGVKPDEITFVG 476

Query: 476 VLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPFQ 535
           +L ACSH+GF+++GR+FL +ME  + ++PT +HY C+IDMLGRA ++EEAYELA   P  
Sbjct: 477 LLQACSHSGFLNEGRNFLAQMETVYNILPTSDHYACVIDMLGRADKLEEAYELAISKPIC 536

Query: 536 DNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEALE 595
           DN V+W ++L +CRLHGN +L  V G+++  LEP+HC  G Y+LMSN+Y   G+YEE L+
Sbjct: 537 DNPVVWRSILSSCRLHGNKDLALVAGKRLHELEPEHC--GGYVLMSNVYVEAGKYEEVLD 596

Query: 596 VRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNAL 633
           VR  M++QN+KKTPGCSWI LK+G++ F  G++TH E  ++
Sbjct: 597 VRDAMRQQNVKKTPGCSWIVLKNGVHTFFTGNQTHPEFKSI 632

BLAST of Cla97C01G007680 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 405.6 bits (1041), Expect = 7.5e-113
Identity = 216/629 (34.34%), Positives = 355/629 (56.44%), Query Frame = 0

Query: 79  KQLHSLMITSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNAIIAGF-- 138
           + +H+ +I SGF +       LI+ YSKC  +E    VF D   +RN++ +N+++ G   
Sbjct: 40  RYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVF-DKMPQRNIYTWNSVVTGLTK 99

Query: 139 ------------------------VANGLAAH-----GFQFYKRMRSVGIMPDKFTFPCV 198
                                   + +G A H        ++  M   G + ++++F  V
Sbjct: 100 LGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASV 159

Query: 199 VRACCEFMEVRK---IHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDV 258
           + AC    ++ K   +H  + K     DV++GSALV+ Y K   V DA++VF+E+ +R+V
Sbjct: 160 LSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNV 219

Query: 259 VLWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGI 318
           V WN++I  + Q G   +A+  F+ M E  + P   T+  ++S    +  I  G+ +HG 
Sbjct: 220 VSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGR 279

Query: 319 VTKMG-YSSCVAVSNALIDMYGKCKHIEDALMIFD------------MING--------- 378
           V K     + + +SNA +DMY KC  I++A  IFD            MI+G         
Sbjct: 280 VVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKA 339

Query: 379 ----------KDLFSWNSIISAHEQCGDHDGTLRLFGKMLGSRVLPDVVTITAVLPACSH 438
                     +++ SWN++I+ + Q G+++  L LF  +    V P   +   +L AC+ 
Sbjct: 340 ARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACAD 399

Query: 439 LAALMHGREIHGHMIVNGLGKNENG--DDVLLNNAVMDMYAKCGCMKNADIVFDLMSNKD 498
           LA L  G + H H++ +G  K ++G  DD+ + N+++DMY KCGC++   +VF  M  +D
Sbjct: 400 LAELHLGMQAHVHVLKHGF-KFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERD 459

Query: 499 VASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFVGVLSACSHAGFVHQGRSFLT 558
             SWN MI+G+A +GYG EAL++F  M E+  KPD IT +GVLSAC HAGFV +GR + +
Sbjct: 460 CVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFS 519

Query: 559 RMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPFQDNLVLWMALLGACRLHGNA 618
            M  +FGV P  +HYTC++D+LGRAG +EEA  + + +P Q + V+W +LL AC++H N 
Sbjct: 520 SMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNI 579

Query: 619 ELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEALEVRRTMKEQNIKKTPGCSWI 637
            LGK V EK+  +EP +  SG Y+L+SNMY  +G++E+ + VR++M+++ + K PGCSWI
Sbjct: 580 TLGKYVAEKLLEVEPSN--SGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWI 639

BLAST of Cla97C01G007680 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 396.0 bits (1016), Expect = 6.0e-110
Identity = 202/600 (33.67%), Positives = 340/600 (56.67%), Query Frame = 0

Query: 73  KNLTEGKQLHSLMI-TSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNA 132
           K+ ++ KQLH+  I T    H  +SI  +I++Y+    + +A+L+F        V A+ +
Sbjct: 19  KSKSQAKQLHAQFIRTQSLSHTSASI--VISIYTNLKLLHEALLLF-KTLKSPPVLAWKS 78

Query: 133 IIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRACCEFMEVR---KIHGCLFKMG 192
           +I  F    L +     +  MR+ G  PD   FP V+++C   M++R    +HG + ++G
Sbjct: 79  VIRCFTDQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLG 138

Query: 193 LELDVFVGSALVNTYLKV--------------------------DV----------VEDA 252
           ++ D++ G+AL+N Y K+                          DV          ++  
Sbjct: 139 MDCDLYTGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSV 198

Query: 253 EKVFEELPERDVVLWNAMINGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLM 312
            +VFE +P +DVV +N +I GY Q G    A+   ++MG   + P  FT++ +L IF+  
Sbjct: 199 RRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEY 258

Query: 313 GDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSII 372
            D+  G+ IHG V + G  S V + ++L+DMY K   IED+  +F  +  +D  SWNS++
Sbjct: 259 VDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLV 318

Query: 373 SAHEQCGDHDGTLRLFGKMLGSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLG 432
           + + Q G ++  LRLF +M+ ++V P  V  ++V+PAC+HLA L  G+++HG+++  G G
Sbjct: 319 AGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFG 378

Query: 433 KNENGDDVLLNNAVMDMYAKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALD 492
            N     + + +A++DMY+KCG +K A  +FD M+  D  SW  +IMG+A+HG+G EA+ 
Sbjct: 379 SN-----IFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVS 438

Query: 493 MFHHMCEAQIKPDAITFVGVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDML 552
           +F  M    +KP+ + FV VL+ACSH G V +   +   M   +G+   +EHY  + D+L
Sbjct: 439 LFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLL 498

Query: 553 GRAGRIEEAYELAQRIPFQDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGS 612
           GRAG++EEAY    ++  +    +W  LL +C +H N EL + V EKI  ++ ++   G+
Sbjct: 499 GRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSEN--MGA 558

Query: 613 YILMSNMYGVIGRYEEALEVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHELNAL 633
           Y+LM NMY   GR++E  ++R  M+++ ++K P CSWIE+K+  + F  GDR+H  ++ +
Sbjct: 559 YVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKI 608

BLAST of Cla97C01G007680 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 394.0 bits (1011), Expect = 2.3e-109
Identity = 207/578 (35.81%), Positives = 340/578 (58.82%), Query Frame = 0

Query: 55  IHYDVATCNFFLQSYANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYSKCNQMEQAV 114
           +  D  T +   +S+++ +++  G+QLH  ++ SGF    S   SL+  Y K  +++ A 
Sbjct: 191 VEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSAR 250

Query: 115 LVFHDPYHERNVFAYNAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRACCEF 174
            VF D   ER+V ++N+II G+V+NGLA  G   + +M   GI  D  T   V   C + 
Sbjct: 251 KVF-DEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADS 310

Query: 175 MEV---RKIHGCLFKMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVVLWNAMI 234
             +   R +H    K     +    + L++ Y K   ++ A+ VF E+ +R VV + +MI
Sbjct: 311 RLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMI 370

Query: 235 NGYTQIGRLNKAVVFFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIVTKMGYS 294
            GY + G   +AV  F++M EEGISP  +T+T +L+       ++ G+ +H  + +    
Sbjct: 371 AGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLG 430

Query: 295 SCVAVSNALIDMYGKCKHIEDALMIFDMINGKDLFSWNSIISAHEQCGDHDGTLRLFGKM 354
             + VSNAL+DMY KC  +++A ++F  +  KD+ SWN+II  + +    +  L LF  +
Sbjct: 431 FDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLL 490

Query: 355 L-GSRVLPDVVTITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNAVMDMY 414
           L   R  PD  T+  VLPAC+ L+A   GREIHG+++ NG   + +     + N+++DMY
Sbjct: 491 LEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRH-----VANSLVDMY 550

Query: 415 AKCGCMKNADIVFDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFV 474
           AKCG +  A ++FD +++KD+ SW +MI GY +HG+G+EA+ +F+ M +A I+ D I+FV
Sbjct: 551 AKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFV 610

Query: 475 GVLSACSHAGFVHQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPF 534
            +L ACSH+G V +G  F   M  E  + PT+EHY CI+DML R G + +AY   + +P 
Sbjct: 611 SLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPI 670

Query: 535 QDNLVLWMALLGACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEAL 594
             +  +W ALL  CR+H + +L + V EK+  LEP++  +G Y+LM+N+Y    ++E+  
Sbjct: 671 PPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPEN--TGYYVLMANIYAEAEKWEQVK 730

Query: 595 EVRRTMKEQNIKKTPGCSWIELKDGLYVFSMGDRTHHE 629
            +R+ + ++ ++K PGCSWIE+K  + +F  GD ++ E
Sbjct: 731 RLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPE 760

BLAST of Cla97C01G007680 vs. TAIR 10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 388.7 bits (997), Expect = 9.5e-108
Identity = 214/636 (33.65%), Positives = 349/636 (54.87%), Query Frame = 0

Query: 11  RMSTHNSFSIRPPVEAFCDFWCFNGTSLNSDHSSFLPSRLL-VFQIHYDVATCNFFLQSY 70
           +MS  N FS    V  +     F       D +  L  R+L V  +  DV T    L++ 
Sbjct: 154 KMSERNLFSWNVLVGGYAKQGYF-------DEAMCLYHRMLWVGGVKPDVYTFPCVLRTC 213

Query: 71  ANHKNLTEGKQLHSLMITSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAY 130
               +L  GK++H  ++  G+      + +LI MY KC  ++ A L+F D    R++ ++
Sbjct: 214 GGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLF-DRMPRRDIISW 273

Query: 131 NAIIAGFVANGLAAHGFQFYKRMRSVGIMPDKFTFPCVVRACCEFMEVRK----IHGCLF 190
           NA+I+G+  NG+   G + +  MR + + PD  T   V+ A CE +  R+    IH  + 
Sbjct: 274 NAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISA-CELLGDRRLGRDIHAYVI 333

Query: 191 KMGLELDVFVGSALVNTYLKVDVVEDAEKVFEELPERDVVLWNAMINGYTQIGRLNKAVV 250
             G  +D+ V ++L   YL      +AEK+F  +  +D+V W  MI+GY      +KA+ 
Sbjct: 334 TTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAID 393

Query: 251 FFKKMGEEGISPCRFTMTGILSIFTLMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYG 310
            ++ M ++ + P   T+  +LS    +GD++ G  +H +  K    S V V+N LI+MY 
Sbjct: 394 TYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINMYS 453

Query: 311 KCKHIEDALMIFDMINGKDLFSWNSIISA---HEQCGDHDGTLRLFGKMLGSRVLPDVVT 370
           KCK I+ AL IF  I  K++ SW SII+    + +C +      +F + +   + P+ +T
Sbjct: 454 KCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFE----ALIFLRQMKMTLQPNAIT 513

Query: 371 ITAVLPACSHLAALMHGREIHGHMIVNGLGKNENGDDVLLNNAVMDMYAKCGCMKNADIV 430
           +TA L AC+ + ALM G+EIH H++  G+G      D  L NA++DMY +CG M  A   
Sbjct: 514 LTAALAACARIGALMCGKEIHAHVLRTGVGL-----DDFLPNALLDMYVRCGRMNTAWSQ 573

Query: 431 FDLMSNKDVASWNIMIMGYAIHGYGREALDMFHHMCEAQIKPDAITFVGVLSACSHAGFV 490
           F+    KDV SWNI++ GY+  G G   +++F  M +++++PD ITF+ +L  CS +  V
Sbjct: 574 FN-SQKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGCSKSQMV 633

Query: 491 HQGRSFLTRMELEFGVVPTIEHYTCIIDMLGRAGRIEEAYELAQRIPFQDNLVLWMALLG 550
            QG  + ++ME ++GV P ++HY C++D+LGRAG ++EA++  Q++P   +  +W ALL 
Sbjct: 634 RQGLMYFSKME-DYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAVWGALLN 693

Query: 551 ACRLHGNAELGKVVGEKITRLEPKHCGSGSYILMSNMYGVIGRYEEALEVRRTMKEQNIK 610
           ACR+H   +LG++  + I  L+ K    G YIL+ N+Y   G++ E  +VRR MKE  + 
Sbjct: 694 ACRIHHKIDLGELSAQHIFELDKK--SVGYYILLCNLYADCGKWREVAKVRRMMKENGLT 753

Query: 611 KTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCG 639
              GCSW+E+K  ++ F   D+ H +   +   L G
Sbjct: 754 VDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLEG 767

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038877905.1 | 0.0e+00 | 94.88 | pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida] >...
XP_004149501.2 | 0.0e+00 | 91.40 | pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_0317...
XP_008466127.1 | 0.0e+00 | 90.25 | PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis m...
XP_022939865.1 | 0.0e+00 | 90.07 | pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita...
XP_023551696.1 | 0.0e+00 | 89.74 | pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita...
Match Name | E-value | Identity (%) | Description
Q9LUC2 | 1.4e-183 | 52.67 | Pentatricopeptide repeat-containing protein At3g14730 OS=Arabidopsis thaliana OX...
Q9SIT7 | 1.1e-111 | 34.34 | Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9LW63 | 8.4e-109 | 33.67 | Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
Q9SN39 | 3.2e-108 | 35.81 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9M9E2 | 1.3e-106 | 33.65 | Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
Match Name | E-value | Identity (%) | Description
A0A1S3CQH1 | 0.0e+00 | 90.25 | pentatricopeptide repeat-containing protein At3g14730-like OS=Cucumis melo OX=36...
A0A6J1FNY8 | 0.0e+00 | 90.07 | pentatricopeptide repeat-containing protein At3g14730-like isoform X1 OS=Cucurbi...
A0A6J1JQW1 | 0.0e+00 | 89.07 | pentatricopeptide repeat-containing protein At3g14730-like OS=Cucurbita maxima O...
A0A0A0KN02 | 0.0e+00 | 91.52 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G129340 PE=4 SV=1
A0A6J1CXY2 | 0.0e+00 | 86.12 | pentatricopeptide repeat-containing protein At3g14730-like OS=Momordica charanti...
Match Name | E-value | Identity (%) | Description
AT3G14730.1 | 9.6e-185 | 52.67 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G13600.1 | 7.5e-113 | 34.34 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G23330.1 | 6.0e-110 | 33.67 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1 | 2.3e-109 | 35.81 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G15510.1 | 9.5e-108 | 33.65 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
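IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 298..320; e-value: 0.017; score: 15.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 572..601; e-value: 0.083; score: 13.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 326..354; e-value: 8.8E-4; score: 19.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 98..118; e-value: 0.12; score: 12.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 429..477; e-value: 3.1E-10; score: 40.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 222..266; e-value: 5.2E-9; score: 36.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 124..172; e-value: 3.4E-10; score: 40.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 128..160; e-value: 1.3E-4; score: 19.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 225..257; e-value: 3.9E-7; score: 27.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 326..360; e-value: 5.6E-4; score: 17.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 433..466; e-value: 9.9E-7; score: 26.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 497..524; e-value: 2.5E-5; score: 23.9
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 223..257; score: 12.824779
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 430..464; score: 10.807899
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 125..159; score: 10.64348
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 324..358; score: 10.369448
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 276..377; e-value: 1.7E-19; score: 71.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 178..275; e-value: 9.9E-19; score: 69.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 59..177; e-value: 1.6E-19; score: 72.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 495..614; e-value: 1.6E-10; score: 42.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 378..494; e-value: 1.1E-20; score: 76.3
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 55..627
None | No IPR available | PANTHER | PTHR47928:SF61 | OS01G0818200 PROTEIN | coord: 55..627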
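
The `coord` values above give the start and end positions of each hit on the protein. As a small sketch of how to read them, the snippet below computes the span of each PROSITE PS51375 hit, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The names `PROSITE_PPR_HITS`, `span_length`, and `summarize` are illustrative only.

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro table.
# Assumption: positions are 1-based and inclusive on the protein sequence.
PROSITE_PPR_HITS = [(223, 257), (430, 464), (125, 159), (324, 358)]


def span_length(start, end):
    """Number of residues covered by an inclusive start..end interval."""
    return end - start + 1


def summarize(hits):
    """Return (start, end, length) tuples ordered by start position."""
    return [(start, end, span_length(start, end)) for start, end in sorted(hits)]


if __name__ == "__main__":
    for start, end, length in summarize(PROSITE_PPR_HITS):
        print(f"{start}..{end}\t{length} aa")
```

Under that assumption every PROSITE hit spans 35 residues, which is consistent with the roughly 35-residue motif that gives pentatricopeptide repeats their name.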

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C01G007680.2 | Cla97C01G007680.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding