Cla97C10G203940 (gene) Watermelon (97103) v2.5

Overview
NameCla97C10G203940
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCla97Chr10: 33486106 .. 33492550 (-)
RNA-Seq ExpressionCla97C10G203940
SyntenyCla97C10G203940
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGGAATGCGAAGCCCAAAAACCTTCAAACCTCAGTTCCCGCCAGCGTCTTTCTTCCATGGGCTCTGCAGGCGCTCCACCGCACCGACGGGATGAATTACGGCGCTTATGGCCGCCTTATCCAGCTCTGCACCGACCACCTCTTCGTCCGCCTCGGTAAGCAGCTTCACGCTCGTCTTGTTCTATCCTCCGTCGCTCCCGATAACTTCCTCGGATCGAAACTCATCGCCTTCTACTCAAAATCCGGCAGCCTTAGAGATGCCTACAATGTGTTCGGTAAAATTTCTCACAAAAATATTTTCAGTTGGAATGCTTTGTTCATCAGTTACACTCTTCACAATATGCACACTGATATGCTGAAGCTGTTTTCGTCTTTGGTTAATTCAAATTCGACGGATGTGAAGCCCGATAAGTTTACTGTCACTTGTGTTTTGAAAGCGTTGGCGTCTTTGTTTTCTAATTCGGTTTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGAGGGCTTGAGTCTGATATTTTTGTTGCCAATGCTTTGATCACTTTTTACTCGAGGTGCGATGAGCTGGTTTTAGCAAGAATTATGTTTGATAGAATGCCTGGGAGAGATATAGTGTCTTGGAATGCGATGTTGGCTGGGTACTCTCAGGCTGGGCTCTATGAGGAGTGCAAGAAACTATTTAAAGCGATGTTGAGTTCAGTGGAGCTGAAGCCTAATGCATTAACCGCAGTCAGTGTTTTGCAAGCTTGTGCTCAGTCAAATGATCTCATTTTTGGAATGGAAGTTCACAGATTCGTCAATGAAAGCCAGGTTGTAATGGATGTTTCACTATGCAATGCTGTTATTGGATTATATGCAAAGTGTGGTAGCTTGGATTATGCTCGTGAGTTGTTTGAAGAAATGCCTGAGAAGGATGAGGTCACCTATGGCGCTATGATATCAGGCTACATGGTCCATGGTTTTGTTAACCAAGCAATGGATCTTTTTCGAGAACTGGAAAGGCCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTCTGGTTCAGAACAACCAGCAAGATAGAGTCGTAGATATATTTCGAGCAATGCAGTCACATGGTTGCAGACCAAATACCGTGACTCTTGCGAGCGTTCTTCCCGTTTTCTCACATTTTTCAACCCTGAAAGGTGGGAAAGAAATTCATGCTTATGCCATTAGAAACGCTTACAATGGGAATATTTATGTTGCTACTGCCATCATCGATTCTTATGCTAAGTCTGGTTACCTCCATGGGGCACGACAAGTTTTTGATCAAATAAAAGGTAGGAGTCTAATCATCTGGACAGCAATAATCTCAGCATATGCTGCACATGGAGATGCCAATGTGGCTCTTAGTCTTTTCTATGAGATGCTGACAAACGGGATTCAGCCTGACCCGGTAACCTTTACATCAGTATTGGTTGCCTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAATGTCTTGTTACTAGAGTATGGGATTCAACCACTAGTCGAGCATTATGCTTGCATGGTAGGAGTCCTTAGTCGAGCAGGAAAGCTCTCTAATGCTGTTGAATTTATTTCTAAAATGCCATTTGAACCCACCGCAAAAGTTTGGGGTGCTTTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAACTTGGAAAGTATGTTTTTGATCGTCTCTTTGAGATTGAGCCTGAAAATACTGGTAACTACATCATCATGGCTAACTTATATTCACAATTTGGAAGGTGGAAAGAAGCTGACAAGATTAGGAATTTGATGAAGGAAGTTGGATTGAAGAAGATCCCGGGAAGTAGCTGGATAGAAACAAGGGGAGGGTTGCAGAGTTTTGTAGCTAGAGACACTTCAAATGACAGGACTCCAGAGATCTATGGAATGTTGGAAGGATTACTTGGGTTGATGAAAGAAGAAGGAATCATTCTGCAACATGAGATAGATGACGACTGTGGGAGTGGTTAGTACATGGCCACATTTGCCTGCCATTTCTGTGCTTGAATCATGCTCCTCGAAGATAAACCACTTTACTTAGAGGTCTTATTTTTTGACCAGAGTCCTAGCATTAGTTATTCCTTGAAATGTTGTCGACTTATGATTTCATCTGACCATGCCAATCGAGAACCATATGTTACCTACCGAGCTGCAGCAAACTGATTTCTTTCTTGACCACAGTTATTCTTTCTACTGGCAATTAACTATACCTCCTCTATTTATGAGCCAAGGGATTTTCGTAGGGAGAATATGTTCACAGGAGCATAATGTTGAAGACTGCTAGAATTCAACCTGAAGAGGTAAATATATTGATGAAAGCAGCTACAAATGAAAAGGAATCTGATATTTGGCTCTGCTGGTGGGTAATAAGGACTTTTTAGTTCATAACAGGTGTGATTCAAGAGTTCCAGGTTGTATTCCTAACATGAGAGAAAAGTAAGTAGCGAGTTGAATTGAATGGTTATTTAATTTTATGCGTCAGTAGCCTGATGAAGATGGTAATATATCGTCAGTTGAGAGCTAGACGACACTGCAGCTCGTAGACTTCGTTAACTTTGGATTCCCAAAGGATACTGAAAATGAACCCAGATCGATGCTTACTGTATATAATATTCGATAACCCCAACAAAAGCAACACTTTATGCTCTGTATAATTGTAAGGTGACTTTGGAATGCGCTTTAGTGTTTCTTTCAATGGAGGTGGGTGACATTTCTCACTTTCAAAAGTAAGAATGGTGTCTGCTTGTTTCGGTAAAGTTCAGCTAACTGTTTGAGATCTTAACAGGGTACCTACTTTGTGATCTTTAACTTGATCGCGTGCCATCGGCCACTGCGTTGCTTGTTTCAGATATCTCAATCATCATGCCTTCAGTTAAAAAAATCGTCGTCCCACTTGCCGTGATGATGAAATTGGCGAAGCCCATTCCCAACTGAGACCACTCAACTTCCCTCAAGCAGGCAGCTTGCAGGTATTGAAAGGAAACTTTCTAATATTCAATTCTTTCACACTACTGCCACTAATACTTAAAAGAATCGAAGCTTTATAAACTATAATAAATGTAGTTGATTTAGCTAAGTATAACAGAATAATATTCCATATTAAGAAGCTTGTTCACTAAATCCTTTAGTTTTTATATGATAAATACAATGGAATATAGATGTCTTGGAACTCGTGTGAATGGATCTATATCGTCTTCAATTTCCACAATACAAATTTTATTGCTTCTGACTCATTAATTTTTTTCTTCATGCTATATTTTAGTCTTAGCATATATCTTTGAATTTCGTTATTTTTGTCATATCATCTATTCACCATCTATATAGATTGATCGAATTACTAGAGATGCCAACATAAGCATAACTCAACTTGACATTATGTATGGTCCTCACTCCCACACGGGTGAACTCAAAAAACTACCTAGACGACAAACAAATATGAATTGATATACCTCATTTCTAGATTTGAGAATGTTAGGTATCTAAATGCGAAAAAATGTAGGAAAACACCAAAAGTTACTATATGCAATATCAATGAAGAAATTACAATATAAGCAAATAATTCCTTTGGAAATCTCTTCCAAACCTCTTAATTCACCCACCAAAATAATGTCTAAATTCATCCCTTCCCACCCTCTATTTATAACTAATTTCCCTAACTACTTTCCTAACTAATTACTAACATACCATTAGAATATCCTTAATTACATTCCTTATTGAAGGATGTAAATTTTAACCAAAGGACAAAAGAGATTATTAAGGCCCAACATTTCAAGGAAGTAAACTAGGTTTCACACCCTACTAGTTGCGTAGCAATCAAATTAGATTGAAACACGAACACAAAGGAGAAATAAAACTTTAGGACACATGGGAAGGAGCAATGATTTGTCAACCATGTGGGACTAAGCAATTTGCTAAAAAAAAAAAATAAAAATCTTTTGCCCATAACACAAGGCTAAGCTGGGAATTTGGAAAGAAAAGGTTGCAGACAGGGGGCCACGATTGGTGCAAAGAGAGAGAGAGAGAAATTACACATGTGGTTGGGTCTTGTGGAATTCCACCGGCTTTGACGAAAGAGTGTTAGGTTGGAGGGGACAAAAGGAAGAGCCGTGGATGGGCCACCCTTCCCTTTACTCTTTTCTGTTTGTTTTGTTGCGTTTTACAGAAAAAAGGCAATGACTATCAGCTTATGGAAAAGAAAGGGAGTGAATTATAGCTTGGATTTGGCCCTTTGCTTTAGCTAGATAAATAATTGTTGTCATTGTAAGTGTCACATACACCATATTGCACATTGGACATGTATCAATATCAATACTTTCCTCCTCCTTCTCCCTTCTATTGTTATCCATCTTTACTGGGAGAATTTGGATTCATTTAGTTTGGTTTTTTCATGGTAGATAACACTTTCATTCTTGTAATTATCAAATGTAATAGAGGAGAACGATTTATTCCCATTTTATTTGATGACTAGGTGCATATTGAAAGCTACGTAATAATAAATTCTAAGGATTTTCAAAGAAAGAAAATACGTCTTCTTACCATGAAAAAGGTTAGTTTTGTTGAGTTTAGCCACTAATTGCTTGTCTTGCTTCAAGTATGCTTAATTTTGGCGAGTGTTCCATCTAATATTTTATGAATTTAATGAAGAAACTTTAGATTTCTAAGTTCGTAATATGGGTTTACCTTCCTTTATGTTGATAATTTACTTGTAACTCTCAAGTCTATCTATATAATAAGTCTGTAAATTTTTTTGAGAGCATGATTTATGAAGAACTCAACTCTATTAAAATAGAGATTTATCTATTGATAAATTTAGATAAACATGATTCTCTATCTCATAATCTAATCTTCTATCAATATCAATCAAATATTGTTTTTTTCTCTCTATATTTATGTGCAATTTCGTATTCTATAATTGACTTTCTATCGGTATTAATTGATAATCGTTTTGTTCTTTTTAAATCTATACTAAGTTTTAGAAGTATTAACAGGAAATAGATTGACCTTTTGATCCTTAGAGCAAGTGGGAAAGAAAAACCTCGATAAAAAAAAAAAGACATGAAAGACATGTTTGACTCAAAGAGTAGAAGTTGTTAACTCAATTCCTTGTTTGGCCAAAGTAGTTCTTAATTTAAGTGTGAGCTTTATGAATTAAAGCATCACTTTTATGACTTATTAACTCTTTACGACTCTTTAGACTTAACGACGGAGACTAATTAACTTCATTCCTAAAATGCTTACACAAGTTTAACTATGGACCCATAAATAAGTAGACATTAAAAATGAAGATATCTTGTTACTCACTTGCACACTTGTGGACAACAAAAGAAGAATTTATTTTCAAGAATAAAATAAGAAAAAGAATTAATGGTAGATTTTCTTTAATTTCAATTTTTCAGTAGTATTTGTTTTGTTTTTCCTTTTAATAATAAGATCCATTAGTAAGATAATATGAAATATTTCAGCCCAATATGCCATGGATGGGTAAATTGGGATCCACATAAAAAGGTTCATATCTCTCTCTGAAGTGAAGGAGCAATACGGCTCACGAGATCCCATAATGGAAGAATTAGAACCCCCCACTGTCATCATGGCCGCCCTCTCCGCCCTCTCTCCGCCGTGTCTCTCCGATCTCTCCCACTCCATTTTCTCCGACATTCACCACCACCGCCGCCGCCTCACCTTCATCCTCTCTTCCCCAACACTCTTCTCCCTCACTCTTCGCCACCTCAATTCCCTCTCCCTCTCCCACAAATCTCTCCTCCTTGCCCGCTTCCTCCTCTCCGCCCTCCGCCGCCTCTCCCGCCCCTTCCAGCCACCATCCAAGCTCCTCCCTTACCACCCTTCCACCGCCGCCATCTCCCCTCAAGACCTCGACGCCGCCGTCCTCCTCCTCCTCCTCTGCGAGGTCCGGCAACACAATCCAGCCGCCCTCCGAACTCCGATCACCAAATGGCGTGCGACCCTCTGTAGAATCTACTCCGATTCCCTCTTGACGATCTCAGGTGTCGCCACAGGCGGGGGTGGGGCTTTGATTCCGTTCATTGAGACGGTGGTGAGATGTTGGAAGTTCGTGGGGTTTGTTGGGAGTTGCGGAGGGAAGGCCCGGAGAGAGGTGGCGGCGTCTCCTGTGGCGGTGGTGGAGCTGCCGTCAGTGGCGGTGGGTGGTGGCGGTGGTGCGGCGGTGGAATGTGTGATTTGTAAAGAGGAGATGAGAGAAGGGAGAGACGCGTGTAAATTGCCTTGTGACCACTTATTCCATTGGCTCTGTATTTTGCCATGGCTGAGGAAACGGAACACGTGTCCCTGTTGTAGGTTTCAGCTTCCCACTGATGATATTTTCGGAGAGATCCAACGGCTCTGGGAGATCCTCCTCAAAGTGGGCTCCACGATGTGTCCATCTGATGGAGATTAA

mRNA sequence

ATGAGGAATGCGAAGCCCAAAAACCTTCAAACCTCAGTTCCCGCCAGCGTCTTTCTTCCATGGGCTCTGCAGGCGCTCCACCGCACCGACGGGATGAATTACGGCGCTTATGGCCGCCTTATCCAGCTCTGCACCGACCACCTCTTCGTCCGCCTCGGTAAGCAGCTTCACGCTCGTCTTGTTCTATCCTCCGTCGCTCCCGATAACTTCCTCGGATCGAAACTCATCGCCTTCTACTCAAAATCCGGCAGCCTTAGAGATGCCTACAATGTGTTCGGTAAAATTTCTCACAAAAATATTTTCAGTTGGAATGCTTTGTTCATCAGTTACACTCTTCACAATATGCACACTGATATGCTGAAGCTGTTTTCGTCTTTGGTTAATTCAAATTCGACGGATGTGAAGCCCGATAAGTTTACTGTCACTTGTGTTTTGAAAGCGTTGGCGTCTTTGTTTTCTAATTCGGTTTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGAGGGCTTGAGTCTGATATTTTTGTTGCCAATGCTTTGATCACTTTTTACTCGAGGTGCGATGAGCTGGTTTTAGCAAGAATTATGTTTGATAGAATGCCTGGGAGAGATATAGTGTCTTGGAATGCGATGTTGGCTGGGTACTCTCAGGCTGGGCTCTATGAGGAGTGCAAGAAACTATTTAAAGCGATGTTGAGTTCAGTGGAGCTGAAGCCTAATGCATTAACCGCAGTCAGTGTTTTGCAAGCTTGTGCTCAGTCAAATGATCTCATTTTTGGAATGGAAGTTCACAGATTCGTCAATGAAAGCCAGGTTGTAATGGATGTTTCACTATGCAATGCTGTTATTGGATTATATGCAAAGTGTGGTAGCTTGGATTATGCTCGTGAGTTGTTTGAAGAAATGCCTGAGAAGGATGAGGTCACCTATGGCGCTATGATATCAGGCTACATGGTCCATGGTTTTGTTAACCAAGCAATGGATCTTTTTCGAGAACTGGAAAGGCCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTCTGGTTCAGAACAACCAGCAAGATAGAGTCGTAGATATATTTCGAGCAATGCAGTCACATGGTTGCAGACCAAATACCGTGACTCTTGCGAGCGTTCTTCCCGTTTTCTCACATTTTTCAACCCTGAAAGGTGGGAAAGAAATTCATGCTTATGCCATTAGAAACGCTTACAATGGGAATATTTATGTTGCTACTGCCATCATCGATTCTTATGCTAAGTCTGGTTACCTCCATGGGGCACGACAAGTTTTTGATCAAATAAAAGGTAGGAGTCTAATCATCTGGACAGCAATAATCTCAGCATATGCTGCACATGGAGATGCCAATGTGGCTCTTAGTCTTTTCTATGAGATGCTGACAAACGGGATTCAGCCTGACCCGGTAACCTTTACATCAGTATTGGTTGCCTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAATGTCTTGTTACTAGAGTATGGGATTCAACCACTAGTCGAGCATTATGCTTGCATGGTAGGAGTCCTTAGTCGAGCAGGAAAGCTCTCTAATGCTGTTGAATTTATTTCTAAAATGCCATTTGAACCCACCGCAAAAGTTTGGGGTGCTTTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAACTTGGAAAGTATGTTTTTGATCGTCTCTTTGAGATTGAGCCTGAAAATACTGGTAACTACATCATCATGGCTAACTTATATTCACAATTTGGAAGGTGGAAAGAAGCTGACAAGATTAGGAATTTGATGAAGGAAGTTGGATTGAAGAAGATCCCGGGAAGTAGCTGGATAGAAACAAGGGGAGGGTTGCAGAGTTTTGTAGCTAGAGACACTTCAAATGACAGGACTCCAGAGATCTATGGAATGTTGGAAGGATTACTTGGGTTGATGAAAGAAGAAGGAATCATTCTGCAACATGAGATAGATGACGACTGTGGGAGTGTGAAGGAGCAATACGGCTCACGAGATCCCATAATGGAAGAATTAGAACCCCCCACTGTCATCATGGCCGCCCTCTCCGCCCTCTCTCCGCCGTGTCTCTCCGATCTCTCCCACTCCATTTTCTCCGACATTCACCACCACCGCCGCCGCCTCACCTTCATCCTCTCTTCCCCAACACTCTTCTCCCTCACTCTTCGCCACCTCAATTCCCTCTCCCTCTCCCACAAATCTCTCCTCCTTGCCCGCTTCCTCCTCTCCGCCCTCCGCCGCCTCTCCCGCCCCTTCCAGCCACCATCCAAGCTCCTCCCTTACCACCCTTCCACCGCCGCCATCTCCCCTCAAGACCTCGACGCCGCCGTCCTCCTCCTCCTCCTCTGCGAGGTCCGGCAACACAATCCAGCCGCCCTCCGAACTCCGATCACCAAATGGCGTGCGACCCTCTGTAGAATCTACTCCGATTCCCTCTTGACGATCTCAGGTGTCGCCACAGGCGGGGGTGGGGCTTTGATTCCGTTCATTGAGACGGTGGTGAGATGTTGGAAGTTCGTGGGGTTTGTTGGGAGTTGCGGAGGGAAGGCCCGGAGAGAGGTGGCGGCGTCTCCTGTGGCGGTGGTGGAGCTGCCGTCAGTGGCGGTGGGTGGTGGCGGTGGTGCGGCGGTGGAATGTGTGATTTGTAAAGAGGAGATGAGAGAAGGGAGAGACGCGTGTAAATTGCCTTGTGACCACTTATTCCATTGGCTCTGTATTTTGCCATGGCTGAGGAAACGGAACACGTGTCCCTGTTGTAGGTTTCAGCTTCCCACTGATGATATTTTCGGAGAGATCCAACGGCTCTGGGAGATCCTCCTCAAAGTGGGCTCCACGATGTGTCCATCTGATGGAGATTAA

Coding sequence (CDS)

ATGAGGAATGCGAAGCCCAAAAACCTTCAAACCTCAGTTCCCGCCAGCGTCTTTCTTCCATGGGCTCTGCAGGCGCTCCACCGCACCGACGGGATGAATTACGGCGCTTATGGCCGCCTTATCCAGCTCTGCACCGACCACCTCTTCGTCCGCCTCGGTAAGCAGCTTCACGCTCGTCTTGTTCTATCCTCCGTCGCTCCCGATAACTTCCTCGGATCGAAACTCATCGCCTTCTACTCAAAATCCGGCAGCCTTAGAGATGCCTACAATGTGTTCGGTAAAATTTCTCACAAAAATATTTTCAGTTGGAATGCTTTGTTCATCAGTTACACTCTTCACAATATGCACACTGATATGCTGAAGCTGTTTTCGTCTTTGGTTAATTCAAATTCGACGGATGTGAAGCCCGATAAGTTTACTGTCACTTGTGTTTTGAAAGCGTTGGCGTCTTTGTTTTCTAATTCGGTTTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGAGGGCTTGAGTCTGATATTTTTGTTGCCAATGCTTTGATCACTTTTTACTCGAGGTGCGATGAGCTGGTTTTAGCAAGAATTATGTTTGATAGAATGCCTGGGAGAGATATAGTGTCTTGGAATGCGATGTTGGCTGGGTACTCTCAGGCTGGGCTCTATGAGGAGTGCAAGAAACTATTTAAAGCGATGTTGAGTTCAGTGGAGCTGAAGCCTAATGCATTAACCGCAGTCAGTGTTTTGCAAGCTTGTGCTCAGTCAAATGATCTCATTTTTGGAATGGAAGTTCACAGATTCGTCAATGAAAGCCAGGTTGTAATGGATGTTTCACTATGCAATGCTGTTATTGGATTATATGCAAAGTGTGGTAGCTTGGATTATGCTCGTGAGTTGTTTGAAGAAATGCCTGAGAAGGATGAGGTCACCTATGGCGCTATGATATCAGGCTACATGGTCCATGGTTTTGTTAACCAAGCAATGGATCTTTTTCGAGAACTGGAAAGGCCAGCATTGAGCACATGGAATGCTGTGATTTCTGGTCTGGTTCAGAACAACCAGCAAGATAGAGTCGTAGATATATTTCGAGCAATGCAGTCACATGGTTGCAGACCAAATACCGTGACTCTTGCGAGCGTTCTTCCCGTTTTCTCACATTTTTCAACCCTGAAAGGTGGGAAAGAAATTCATGCTTATGCCATTAGAAACGCTTACAATGGGAATATTTATGTTGCTACTGCCATCATCGATTCTTATGCTAAGTCTGGTTACCTCCATGGGGCACGACAAGTTTTTGATCAAATAAAAGGTAGGAGTCTAATCATCTGGACAGCAATAATCTCAGCATATGCTGCACATGGAGATGCCAATGTGGCTCTTAGTCTTTTCTATGAGATGCTGACAAACGGGATTCAGCCTGACCCGGTAACCTTTACATCAGTATTGGTTGCCTGTGCCCATTCAGGAGAGTTAGATGAAGCCTGGAAGATATTTAATGTCTTGTTACTAGAGTATGGGATTCAACCACTAGTCGAGCATTATGCTTGCATGGTAGGAGTCCTTAGTCGAGCAGGAAAGCTCTCTAATGCTGTTGAATTTATTTCTAAAATGCCATTTGAACCCACCGCAAAAGTTTGGGGTGCTTTGCTCAATGGGGCTTCTGTTGCTGGTGATGTTGAACTTGGAAAGTATGTTTTTGATCGTCTCTTTGAGATTGAGCCTGAAAATACTGGTAACTACATCATCATGGCTAACTTATATTCACAATTTGGAAGGTGGAAAGAAGCTGACAAGATTAGGAATTTGATGAAGGAAGTTGGATTGAAGAAGATCCCGGGAAGTAGCTGGATAGAAACAAGGGGAGGGTTGCAGAGTTTTGTAGCTAGAGACACTTCAAATGACAGGACTCCAGAGATCTATGGAATGTTGGAAGGATTACTTGGGTTGATGAAAGAAGAAGGAATCATTCTGCAACATGAGATAGATGACGACTGTGGGAGTGTGAAGGAGCAATACGGCTCACGAGATCCCATAATGGAAGAATTAGAACCCCCCACTGTCATCATGGCCGCCCTCTCCGCCCTCTCTCCGCCGTGTCTCTCCGATCTCTCCCACTCCATTTTCTCCGACATTCACCACCACCGCCGCCGCCTCACCTTCATCCTCTCTTCCCCAACACTCTTCTCCCTCACTCTTCGCCACCTCAATTCCCTCTCCCTCTCCCACAAATCTCTCCTCCTTGCCCGCTTCCTCCTCTCCGCCCTCCGCCGCCTCTCCCGCCCCTTCCAGCCACCATCCAAGCTCCTCCCTTACCACCCTTCCACCGCCGCCATCTCCCCTCAAGACCTCGACGCCGCCGTCCTCCTCCTCCTCCTCTGCGAGGTCCGGCAACACAATCCAGCCGCCCTCCGAACTCCGATCACCAAATGGCGTGCGACCCTCTGTAGAATCTACTCCGATTCCCTCTTGACGATCTCAGGTGTCGCCACAGGCGGGGGTGGGGCTTTGATTCCGTTCATTGAGACGGTGGTGAGATGTTGGAAGTTCGTGGGGTTTGTTGGGAGTTGCGGAGGGAAGGCCCGGAGAGAGGTGGCGGCGTCTCCTGTGGCGGTGGTGGAGCTGCCGTCAGTGGCGGTGGGTGGTGGCGGTGGTGCGGCGGTGGAATGTGTGATTTGTAAAGAGGAGATGAGAGAAGGGAGAGACGCGTGTAAATTGCCTTGTGACCACTTATTCCATTGGCTCTGTATTTTGCCATGGCTGAGGAAACGGAACACGTGTCCCTGTTGTAGGTTTCAGCTTCCCACTGATGATATTTTCGGAGAGATCCAACGGCTCTGGGAGATCCTCCTCAAAGTGGGCTCCACGATGTGTCCATCTGATGGAGATTAA

Protein sequence

MRNAKPKNLQTSVPASVFLPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQSHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSVKEQYGSRDPIMEELEPPTVIMAALSALSPPCLSDLSHSIFSDIHHHRRRLTFILSSPTLFSLTLRHLNSLSLSHKSLLLARFLLSALRRLSRPFQPPSKLLPYHPSTAAISPQDLDAAVLLLLLCEVRQHNPAALRTPITKWRATLCRIYSDSLLTISGVATGGGGALIPFIETVVRCWKFVGFVGSCGGKARREVAASPVAVVELPSVAVGGGGGAAVECVICKEEMREGRDACKLPCDHLFHWLCILPWLRKRNTCPCCRFQLPTDDIFGEIQRLWEILLKVGSTMCPSDGD
Homology
BLAST of Cla97C10G203940 vs. NCBI nr
Match: RXH80631.1 (hypothetical protein DVH24_004545 [Malus domestica])

HSP 1 Score: 1234.6 bits (3193), Expect = 0.0e+00
Identity = 608/951 (63.93%), Postives = 740/951 (77.81%), Query Frame = 0

Query: 4   AKPKNLQTSVPASVFLPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLS 63
           +K  N+Q S   + ++  ALQ LH  DG++ GAYG  IQ CT H  VR  KQLHARLVL 
Sbjct: 4   SKSLNIQISAATNGYVQRALQILHGIDGLDCGAYGHFIQHCTVHRLVRQAKQLHARLVLF 63

Query: 64  SVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLF 123
           SV P NFL SKLI FYSK+ ++  A  VF +I   N FSWNA+ I Y+++NMH D LK F
Sbjct: 64  SVTPGNFLASKLINFYSKTNNINYARKVFDQIPRPNAFSWNAMLIGYSINNMHADTLKWF 123

Query: 124 SSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITF 183
           S++V+S S   KPD FTVTCVLKAL  L S S LAKEVHCF+LR G +SD+FV N+LIT+
Sbjct: 124 SAMVSSCSDQAKPDNFTVTCVLKALGVLLSGSKLAKEVHCFVLRSGFDSDVFVVNSLITY 183

Query: 184 YSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALT 243
           YSRCDE+ LAR +FDRMP RDIVSWN+M+AGYSQAG Y+ECK+L++ ML   + KP  LT
Sbjct: 184 YSRCDEVGLARALFDRMPERDIVSWNSMIAGYSQAGYYDECKELYRMMLGLEKFKPVGLT 243

Query: 244 AVSVLQACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMP 303
            VSVLQAC QSNDL+ GMEVH+FV E+Q+ MDV +CNA+IGLYA+CGSLDYA+ELF+EM 
Sbjct: 244 VVSVLQACLQSNDLMLGMEVHQFVIENQIEMDVLVCNALIGLYARCGSLDYAQELFDEMS 303

Query: 304 EKDEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRA 363
           EKDEVTYG+++SGYM HGFV++AM +FR+ ++P LSTWNAVISGLVQNNQ +  +++ R 
Sbjct: 304 EKDEVTYGSLVSGYMFHGFVDKAMGVFRDSKKPKLSTWNAVISGLVQNNQHEEALNLIRE 363

Query: 364 MQSHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGY 423
           MQ+ GC+PNTVTL+S+LP  S+FS LK GKE+HAYA+RN ++ NIYVATAIID+YAKSG 
Sbjct: 364 MQACGCKPNTVTLSSILPTISYFSNLKVGKEVHAYAVRNNFDWNIYVATAIIDTYAKSGL 423

Query: 424 LHGARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVA 483
           L+GA++VFDQ KG+SLIIWT+IISAYA+HGD + ++ LFYEML +GIQPD VT T+VL A
Sbjct: 424 LYGAQRVFDQAKGKSLIIWTSIISAYASHGDGHTSIGLFYEMLNSGIQPDQVTITAVLTA 483

Query: 484 CAHSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAK 543
           CAHSG +DEAWKIF+ +  EYGIQP VEHYACMVG+LSRAGKL+ A +FI KMP EP+AK
Sbjct: 484 CAHSGVVDEAWKIFDAMFPEYGIQPSVEHYACMVGILSRAGKLTEAADFIHKMPVEPSAK 543

Query: 544 VWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMK 603
           VWGALLNGASV+ DVELG++V  RLF+IEPENTGNYIIMANLYSQ GRW+EADK+R  MK
Sbjct: 544 VWGALLNGASVSRDVELGEFVCHRLFQIEPENTGNYIIMANLYSQAGRWEEADKVRERMK 603

Query: 604 EVGLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDD 663
           EVGL+KIPGSSWIET  GLQSF+ +DTSN+RT EIY  LEGLLG+MKE+G+         
Sbjct: 604 EVGLRKIPGSSWIETSKGLQSFIVKDTSNERTEEIYETLEGLLGMMKEKGL--------- 663

Query: 664 CGSVKEQYGSRDPIMEELEPPT-VIMAALSALSPPCLSDLSHSIFSDIHHHRRRLTFILS 723
                + + + DPIMEE+   T  IMAAL+ L+PP LS L+H+I S  HHH  RL+ +LS
Sbjct: 664 ---KAKVHKAHDPIMEEIATATATIMAALATLTPPQLSHLTHTILSHTHHHHHRLSSLLS 723

Query: 724 SPTLFSLTLRHLNSLSLSHKSLLLARFLLSALRRLSRPFQPPSKLLPYHPSTAAISPQDL 783
           SP LFSLTL  LNSL L HK+LL+A  LLS+L  L+  F P +      P    +  +DL
Sbjct: 724 SPILFSLTLHRLNSLPLPHKTLLIANHLLSSLYHLTLHFHPYTN----PPPPRVVRKRDL 783

Query: 784 DAAVLLLLLCEVRQHNPAALRTPITKWRATLCRIYSDSLLTISGVATGGGGALIPFIETV 843
           D+ +LLLLLCEV QHNP AL+ P  KWR  L ++YSD++LT+SG+    G AL+ +IE +
Sbjct: 784 DSVLLLLLLCEVHQHNPEALQAPTIKWREILSKLYSDNMLTVSGIGVYNGSALVSYIEVL 843

Query: 844 VRCWKFVGFVGSC-GGKARREVAASPVAVVELPSVAVGGGGGAAVECVICKEEMREGRDA 903
            RC +FV  +G C GGKA REVAASP AVV LPSV V  GG    EC+ICKEEMRE RD 
Sbjct: 844 TRCLRFVSVMGFCYGGKAGREVAASPAAVVALPSVKVSSGGS---ECMICKEEMREDRDV 903

Query: 904 CKLPCDHLFHWLCILPWLRKRNTCPCCRFQLPTDDIFGEIQRLWEILLKVG 953
           C+LPC HLFHW+CIL WLRKRNTCPCCRF LPTDD+FGEIQRLWEIL+K+G
Sbjct: 904 CELPCRHLFHWMCILRWLRKRNTCPCCRFTLPTDDVFGEIQRLWEILVKMG 935

BLAST of Cla97C10G203940 vs. NCBI nr
Match: XP_038905794.1 (pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905795.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905796.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_038905797.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida])

HSP 1 Score: 1233.0 bits (3189), Expect = 0.0e+00
Identity = 606/661 (91.68%), Postives = 634/661 (95.92%), Query Frame = 0

Query: 6   PKNLQTSVPASVFLPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSV 65
           PK  +  VPA+V L WALQAL RTD MNYGAYGRLIQ CTD LFVRLGKQLHARLVLSSV
Sbjct: 4   PKTFKPQVPATVCLSWALQALRRTDEMNYGAYGRLIQHCTDQLFVRLGKQLHARLVLSSV 63

Query: 66  APDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFSS 125
           APDNFLGSKLIAFYSKSGSLRDAYNVFG ISHKNIF+WNALFISYTLHNMH DML+LFSS
Sbjct: 64  APDNFLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFTWNALFISYTLHNMHIDMLRLFSS 123

Query: 126 LVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITFYS 185
           LVNSNSTDVKPDKFT+TCVLKALASLFSNSVLAKEVHCFILRR LE DIFV NALITFYS
Sbjct: 124 LVNSNSTDVKPDKFTITCVLKALASLFSNSVLAKEVHCFILRRELEFDIFVVNALITFYS 183

Query: 186 RCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAV 245
           RCDELVLARI+FDRMP +DIVSWNAM+AGYSQ G YEECK+LFKAMLSSVELKPNALT V
Sbjct: 184 RCDELVLARIVFDRMPEKDIVSWNAMVAGYSQGGFYEECKELFKAMLSSVELKPNALTTV 243

Query: 246 SVLQACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEK 305
           SVLQACAQSNDLIFGMEVHRFV+ESQ+ MDVSLCNAVIGLYAKCGSLDYARELFEEMP+K
Sbjct: 244 SVLQACAQSNDLIFGMEVHRFVSESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPKK 303

Query: 306 DEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQ 365
           DEVTYG+MISGYMV+GFVNQAMDLFRELERP LSTWNAVISGLVQNNQQD V+DIFRAMQ
Sbjct: 304 DEVTYGSMISGYMVYGFVNQAMDLFRELERPVLSTWNAVISGLVQNNQQDEVLDIFRAMQ 363

Query: 366 SHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLH 425
           SHGCRPNTVTLASVLP+FSHFST+KGGKEIHAYAIR AY+GNIYVAT II+SYAKSGYLH
Sbjct: 364 SHGCRPNTVTLASVLPIFSHFSTIKGGKEIHAYAIRKAYDGNIYVATGIINSYAKSGYLH 423

Query: 426 GARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACA 485
           GARQVFDQ+KGRSLIIWTAIISAYAAHGDANVALSLFYEML NGIQPDPVTFTSVLVACA
Sbjct: 424 GARQVFDQLKGRSLIIWTAIISAYAAHGDANVALSLFYEMLANGIQPDPVTFTSVLVACA 483

Query: 486 HSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVW 545
           HSGELDEAWKIFNVLL +YGIQP VEHYACMVGVLSRAGKLS+AVEFISKMPFEPTAKVW
Sbjct: 484 HSGELDEAWKIFNVLLPKYGIQPPVEHYACMVGVLSRAGKLSDAVEFISKMPFEPTAKVW 543

Query: 546 GALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEV 605
           GALLNGASVAGDVELGKYVFDRLFEIEPENTGNY+IMANLYSQFGRWKEADK+R+LMKEV
Sbjct: 544 GALLNGASVAGDVELGKYVFDRLFEIEPENTGNYVIMANLYSQFGRWKEADKVRDLMKEV 603

Query: 606 GLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCG 665
           GLKKIPG+SWIETRGGLQSF+ARDTSN+RTPEIYGMLEGLLGLMKEEGIILQHEIDDDCG
Sbjct: 604 GLKKIPGNSWIETRGGLQSFIARDTSNNRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCG 663

Query: 666 S 667
           S
Sbjct: 664 S 664

BLAST of Cla97C10G203940 vs. NCBI nr
Match: XP_022145703.1 (pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia])

HSP 1 Score: 1187.6 bits (3071), Expect = 0.0e+00
Identity = 580/657 (88.28%), Postives = 617/657 (93.91%), Query Frame = 0

Query: 10  QTSVPASVFLPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSVAPDN 69
           Q S+PA   +PWALQA+ R DGMNY AYGRLIQ C D  F+RLGKQLHARLVL SV PDN
Sbjct: 3   QISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDN 62

Query: 70  FLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNS 129
           FLGSKLIAFYSKSGSLRDAYNVFG ISHKNIFSWNALFISYTLHNMH+DMLKLFSSLVNS
Sbjct: 63  FLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNS 122

Query: 130 NSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITFYSRCDE 189
           N+ DVKPDKFT+TCVLKALAS F++S+LAKEVHCF+LRRGLESDIFV NAL+T+YSRC+E
Sbjct: 123 NAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEE 182

Query: 190 LVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQ 249
           +VLARI+F RMP RDIVSWNAM+AG+SQ G YEECK+LFK MLSSVELKPNALTAVSVLQ
Sbjct: 183 VVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQ 242

Query: 250 ACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT 309
           ACAQSNDLIFGMEVHRFVNESQ+ MDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT
Sbjct: 243 ACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT 302

Query: 310 YGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQSHGC 369
           YG+MISGYMVHG VNQAMDLF+EL++PALSTWNAVISGLVQNNQQD V+DIFRAMQSHGC
Sbjct: 303 YGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGC 362

Query: 370 RPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQ 429
           RPN VTLASVLPVFSHFSTLKGGKEIHAYA+RN YNGNIYVATAIIDSYAKSGYLHGA Q
Sbjct: 363 RPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQ 422

Query: 430 VFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGE 489
           VFD +KGRSLIIWTAIISAYAAHGDANVALSLFYEML NGIQPDPVTFTSVLVACAHSGE
Sbjct: 423 VFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGE 482

Query: 490 LDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALL 549
           LDEAWKIFN++L EYGIQPLVEHYACMVGVLSRAGKLS+AV+FISKMP EP+AKVWGALL
Sbjct: 483 LDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALL 542

Query: 550 NGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKK 609
           NGASVAGDVELGKYVFDRL EIEPENTG YIIMANLYSQ GRWKEADK+R+LMKEVGL+K
Sbjct: 543 NGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRK 602

Query: 610 IPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGS 667
           IPGSSWIET GGL SFVARDTSND TPEIY MLEGLLGLMKEEG ILQ+EID+DCGS
Sbjct: 603 IPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGS 659

BLAST of Cla97C10G203940 vs. NCBI nr
Match: KAG6580575.1 (ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1186.8 bits (3069), Expect = 0.0e+00
Identity = 590/661 (89.26%), Postives = 616/661 (93.19%), Query Frame = 0

Query: 13   VPASVF-------LPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSV 72
            +PAS F       +  ALQ + R+DGMNYGAYGRLIQ CTD  F RLGKQLHARLVLSSV
Sbjct: 703  IPASGFGSLHLFDIDGALQLIRRSDGMNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSV 762

Query: 73   APDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFSS 132
            APDNFLGSKLIA YSKSGSLRDAYNVF  ISHKNIFSWNALFISYTLHNMH DMLKLFSS
Sbjct: 763  APDNFLGSKLIALYSKSGSLRDAYNVFDSISHKNIFSWNALFISYTLHNMHADMLKLFSS 822

Query: 133  LVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITFYS 192
            LVN NSTDVKPDKFTVTCVLKALASLF+NS+LAKEVHCF+LRRGLESDIFV NALITFYS
Sbjct: 823  LVNLNSTDVKPDKFTVTCVLKALASLFTNSILAKEVHCFVLRRGLESDIFVVNALITFYS 882

Query: 193  RCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAV 252
            RCDELVLARIMFDR P RDIVSWNAM+AGYSQ G YE+CK+LFKAML S E KPNALTAV
Sbjct: 883  RCDELVLARIMFDRTPERDIVSWNAMVAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAV 942

Query: 253  SVLQACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEK 312
            SVLQACAQSNDLIFGMEVH+FVNES + MDVSL NAVIGLYAKCGSLDYARELFE MPEK
Sbjct: 943  SVLQACAQSNDLIFGMEVHKFVNESGIEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEK 1002

Query: 313  DEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQ 372
            DEVTYG+MISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQD VVDIFRAMQ
Sbjct: 1003 DEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQ 1062

Query: 373  SHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLH 432
             HGCRPNTVTLASVLP+FSHFSTLKGGKEIHAYA+RNAY+GNIYVATAIIDSYAKSGYLH
Sbjct: 1063 LHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLH 1122

Query: 433  GARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACA 492
            GARQVFDQ K RSLIIWTAIISAYAAHGDAN  LSLFYEMLTNGI+PDPVTFTSVLVACA
Sbjct: 1123 GARQVFDQSKRRSLIIWTAIISAYAAHGDANATLSLFYEMLTNGIRPDPVTFTSVLVACA 1182

Query: 493  HSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVW 552
            HSGELDEAWKIFNVLL E+GIQPLVEHYACMVGVLSRAGKLS+AVEFISKMP EPTAKVW
Sbjct: 1183 HSGELDEAWKIFNVLLPEFGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVW 1242

Query: 553  GALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEV 612
            GALLNGASVAGDVELGKYVFDRL +IEPENTGNYIIMANLYSQFGRWKEAD++R+LMKEV
Sbjct: 1243 GALLNGASVAGDVELGKYVFDRLLDIEPENTGNYIIMANLYSQFGRWKEADRVRDLMKEV 1302

Query: 613  GLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCG 667
            GLKKIPG+SWIETRGGLQSFVARDTSNDRTPEIYG LEGL+ LMKEEG+I QHEIDDDCG
Sbjct: 1303 GLKKIPGNSWIETRGGLQSFVARDTSNDRTPEIYGTLEGLVRLMKEEGLIQQHEIDDDCG 1362

BLAST of Cla97C10G203940 vs. NCBI nr
Match: KAG7017327.1 (ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1183.7 bits (3061), Expect = 0.0e+00
Identity = 589/661 (89.11%), Postives = 615/661 (93.04%), Query Frame = 0

Query: 13   VPASVF-------LPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSV 72
            +PAS F       +  ALQ + R+DGMNYGAYGRLIQ CTD  F RLGKQLHARLVLSSV
Sbjct: 722  IPASGFGSLHLFDIDGALQLIRRSDGMNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSV 781

Query: 73   APDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFSS 132
            APDNFLGSKLIA YSKSGSLRDAYNVF  ISHKNIFSWNALFISYTLHNMH DMLKLFSS
Sbjct: 782  APDNFLGSKLIALYSKSGSLRDAYNVFDSISHKNIFSWNALFISYTLHNMHADMLKLFSS 841

Query: 133  LVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITFYS 192
            LVN NSTDVKPDKFTVTCVLKALASLF+NS+LAKEVHCF+LRRGLESDIFV NALITFYS
Sbjct: 842  LVNLNSTDVKPDKFTVTCVLKALASLFTNSILAKEVHCFVLRRGLESDIFVVNALITFYS 901

Query: 193  RCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAV 252
            RCDELVLARIMFDR P RDIVSWNAM+AGYSQ G YE+CK+LFKAML S E KPNALTAV
Sbjct: 902  RCDELVLARIMFDRTPERDIVSWNAMVAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAV 961

Query: 253  SVLQACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEK 312
            SVLQACAQSNDLIFGMEVH+FVNES + MDVSL NAVIGLYAKCGSLDYARELFE MPEK
Sbjct: 962  SVLQACAQSNDLIFGMEVHKFVNESGIEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEK 1021

Query: 313  DEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQ 372
            DEVTYG+MISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQD VVDIFRAMQ
Sbjct: 1022 DEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQ 1081

Query: 373  SHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLH 432
             HGCRPNTVTLASVLP+FSHFSTLKGGKEIHAYA+RNAY+GNIYVATAIIDSYAKSGYL 
Sbjct: 1082 LHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQ 1141

Query: 433  GARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACA 492
            GARQVFDQ K RSLIIWTAIISAYAAHGDAN  LSLFYEMLTNGI+PDPVTFTSVLVACA
Sbjct: 1142 GARQVFDQSKRRSLIIWTAIISAYAAHGDANATLSLFYEMLTNGIRPDPVTFTSVLVACA 1201

Query: 493  HSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVW 552
            HSGELDEAWKIFNVLL E+GIQPLVEHYACMVGVLSRAGKLS+AVEFISKMP EPTAKVW
Sbjct: 1202 HSGELDEAWKIFNVLLPEFGIQPLVEHYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVW 1261

Query: 553  GALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEV 612
            GALLNGASVAGDVELGKYVFDRL +IEPENTGNYIIMANLYSQFGRWKEAD++R+LMKEV
Sbjct: 1262 GALLNGASVAGDVELGKYVFDRLLDIEPENTGNYIIMANLYSQFGRWKEADRVRDLMKEV 1321

Query: 613  GLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCG 667
            GLKKIPG+SWIETRGGLQSFVARDTSNDRTPEIYG LEGL+ LMKEEG+I QHEIDDDCG
Sbjct: 1322 GLKKIPGNSWIETRGGLQSFVARDTSNDRTPEIYGTLEGLVRLMKEEGLIQQHEIDDDCG 1381

BLAST of Cla97C10G203940 vs. ExPASy Swiss-Prot
Match: Q9ZUT5 (Pentatricopeptide repeat-containing protein At2g37310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E49 PE=2 SV=1)

HSP 1 Score: 754.2 bits (1946), Expect = 1.8e-216
Identity = 372/646 (57.59%), Postives = 486/646 (75.23%), Query Frame = 0

Query: 22  ALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSK 81
           ALQ L     ++ GAYG LIQ  T H       QLHAR+V+ S+ PDNFL SKLI+FY++
Sbjct: 10  ALQGLLNKAAVDGGAYGHLIQHFTRHRLPLHVLQLHARIVVFSIKPDNFLASKLISFYTR 69

Query: 82  SGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNS---NSTDVKPDK 141
               R A +VF +I+ +N FS+NAL I+YT   M+ D   LF S + S   +S   +PD 
Sbjct: 70  QDRFRQALHVFDEITVRNAFSYNALLIAYTSREMYFDAFSLFLSWIGSSCYSSDAARPDS 129

Query: 142 FTVTCVLKALASL--FSNSVLAKEVHCFILRRGLESDIFVANALITFYSRCDELVLARIM 201
            +++CVLKAL+    F    LA++VH F++R G +SD+FV N +IT+Y++CD +  AR +
Sbjct: 130 ISISCVLKALSGCDDFWLGSLARQVHGFVIRGGFDSDVFVGNGMITYYTKCDNIESARKV 189

Query: 202 FDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSND 261
           FD M  RD+VSWN+M++GYSQ+G +E+CKK++KAML+  + KPN +T +SV QAC QS+D
Sbjct: 190 FDEMSERDVVSWNSMISGYSQSGSFEDCKKMYKAMLACSDFKPNGVTVISVFQACGQSSD 249

Query: 262 LIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISG 321
           LIFG+EVH+ + E+ + MD+SLCNAVIG YAKCGSLDYAR LF+EM EKD VTYGA+ISG
Sbjct: 250 LIFGLEVHKKMIENHIQMDLSLCNAVIGFYAKCGSLDYARALFDEMSEKDSVTYGAIISG 309

Query: 322 YMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQSHGCRPNTVTL 381
           YM HG V +AM LF E+E   LSTWNA+ISGL+QNN  + V++ FR M   G RPNTVTL
Sbjct: 310 YMAHGLVKEAMALFSEMESIGLSTWNAMISGLMQNNHHEEVINSFREMIRCGSRPNTVTL 369

Query: 382 ASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKG 441
           +S+LP  ++ S LKGGKEIHA+AIRN  + NIYV T+IID+YAK G+L GA++VFD  K 
Sbjct: 370 SSLLPSLTYSSNLKGGKEIHAFAIRNGADNNIYVTTSIIDNYAKLGFLLGAQRVFDNCKD 429

Query: 442 RSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKI 501
           RSLI WTAII+AYA HGD++ A SLF +M   G +PD VT T+VL A AHSG+ D A  I
Sbjct: 430 RSLIAWTAIITAYAVHGDSDSACSLFDQMQCLGTKPDDVTLTAVLSAFAHSGDSDMAQHI 489

Query: 502 FNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAG 561
           F+ +L +Y I+P VEHYACMV VLSRAGKLS+A+EFISKMP +P AKVWGALLNGASV G
Sbjct: 490 FDSMLTKYDIEPGVEHYACMVSVLSRAGKLSDAMEFISKMPIDPIAKVWGALLNGASVLG 549

Query: 562 DVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWI 621
           D+E+ ++  DRLFE+EPENTGNY IMANLY+Q GRW+EA+ +RN MK +GLKKIPG+SWI
Sbjct: 550 DLEIARFACDRLFEMEPENTGNYTIMANLYTQAGRWEEAEMVRNKMKRIGLKKIPGTSWI 609

Query: 622 ETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDD 663
           ET  GL+SF+A+D+S +R+ E+Y ++EGL+  M ++  I + E+D+
Sbjct: 610 ETEKGLRSFIAKDSSCERSKEMYEIIEGLVESMSDKEYIRKQELDE 655

BLAST of Cla97C10G203940 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 5.9e-111
Identity = 213/617 (34.52%), Postives = 356/617 (57.70%), Query Frame = 0

Query: 40  LIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAF--YSKSGSLRDAYNVFGKISH 99
           LI+ C     +R  KQ H  ++ +    D +  SKL A    S   SL  A  VF +I  
Sbjct: 36  LIERCVS---LRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPK 95

Query: 100 KNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVL 159
            N F+WN L  +Y   +    +L +++ L   + +   P+K+T   ++KA A + S S L
Sbjct: 96  PNSFAWNTLIRAYA--SGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLS-L 155

Query: 160 AKEVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQ 219
            + +H   ++  + SD+FVAN+LI  Y  C +L  A  +F  +  +D+VSWN+M+ G+ Q
Sbjct: 156 GQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 215

Query: 220 AGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVVMDVS 279
            G  ++  +LFK M  S ++K + +T V VL ACA+  +L FG +V  ++ E++V ++++
Sbjct: 216 KGSPDKALELFKKM-ESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLT 275

Query: 280 LCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMVHGFVNQAMDLFRELERPA 339
           L NA++ +Y KCGS++ A+ LF+ M EKD VT+  M+ GY +      A ++   + +  
Sbjct: 276 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 335

Query: 340 LSTWNAVISGLVQNNQQDRVVDIFRAMQ-SHGCRPNTVTLASVLPVFSHFSTLKGGKEIH 399
           +  WNA+IS   QN + +  + +F  +Q     + N +TL S L   +    L+ G+ IH
Sbjct: 336 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 395

Query: 400 AYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKGRSLIIWTAIISAYAAHGDAN 459
           +Y  ++    N +V +A+I  Y+K G L  +R+VF+ ++ R + +W+A+I   A HG  N
Sbjct: 396 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 455

Query: 460 VALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKIFNVLLLEYGIQPLVEHYACM 519
            A+ +FY+M    ++P+ VTFT+V  AC+H+G +DEA  +F+ +   YGI P  +HYAC+
Sbjct: 456 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 515

Query: 520 VGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENT 579
           V VL R+G L  AV+FI  MP  P+  VWGALL    +  ++ L +    RL E+EP N 
Sbjct: 516 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 575

Query: 580 GNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWIETRGGLQSFVARDTSNDRTP 639
           G +++++N+Y++ G+W+   ++R  M+  GLKK PG S IE  G +  F++ D ++  + 
Sbjct: 576 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 635

Query: 640 EIYGMLEGLLGLMKEEG 654
           ++YG L  ++  +K  G
Sbjct: 636 KVYGKLHEVMEKLKSNG 645

BLAST of Cla97C10G203940 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 402.1 bits (1032), Expect = 1.7e-110
Identity = 227/686 (33.09%), Postives = 354/686 (51.60%), Query Frame = 0

Query: 40  LIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKN 99
           ++QLC D   ++ GK++   +  +    D+ LGSKL   Y+  G L++A  VF ++  + 
Sbjct: 100 VLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEK 159

Query: 100 IFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAK 159
              WN L          +  + LF  +++S    V+ D +T +CV K+ +SL S     +
Sbjct: 160 ALFWNILMNELAKSGDFSGSIGLFKKMMSSG---VEMDSYTFSCVSKSFSSLRSVHG-GE 219

Query: 160 EVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAG 219
           ++H FIL+ G      V N+L+ FY +   +  AR +FD M  RD++SWN+++ GY   G
Sbjct: 220 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 279

Query: 220 LYEECKKLFKAML-SSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVVMDVSL 279
           L E+   +F  ML S +E+  +  T VSV   CA S  +  G  VH    ++    +   
Sbjct: 280 LAEKGLSVFVQMLVSGIEI--DLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 339

Query: 280 CNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMVHGFVNQAMDLFRELERPAL 339
           CN ++ +Y+KCG LD A+ +F EM ++  V+Y +MI+GY   G   +A+ LF E+E   +
Sbjct: 340 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 399

Query: 340 S----------------------------------------------------------- 399
           S                                                           
Sbjct: 400 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 459

Query: 400 -----------TWNAVISGLVQNNQQDRVVDIFR-AMQSHGCRPNTVTLASVLPVFSHFS 459
                      +WN +I G  +N   +  + +F   ++     P+  T+A VLP  +  S
Sbjct: 460 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 519

Query: 460 TLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKGRSLIIWTAIIS 519
               G+EIH Y +RN Y  + +VA +++D YAK G L  A  +FD I  + L+ WT +I+
Sbjct: 520 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 579

Query: 520 AYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKIFNVLLLEYGIQ 579
            Y  HG    A++LF +M   GI+ D ++F S+L AC+HSG +DE W+ FN++  E  I+
Sbjct: 580 GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIE 639

Query: 580 PLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAGDVELGKYVFDR 639
           P VEHYAC+V +L+R G L  A  FI  MP  P A +WGALL G  +  DV+L + V ++
Sbjct: 640 PTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEK 699

Query: 640 LFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWIETRGGLQSFVA 654
           +FE+EPENTG Y++MAN+Y++  +W++  ++R  + + GL+K PG SWIE +G +  FVA
Sbjct: 700 VFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVA 759

BLAST of Cla97C10G203940 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 5.2e-107
Identity = 222/673 (32.99%), Postives = 356/673 (52.90%), Query Frame = 0

Query: 40  LIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKN 99
           + + C +   VR G+  HA  +++    + F+G+ L+A YS+  SL DA  VF ++S  +
Sbjct: 133 VFKACGEISSVRCGESAHALSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWD 192

Query: 100 IFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAK 159
           + SWN++  SY         L++FS +  +N    +PD  T+  VL   ASL ++S L K
Sbjct: 193 VVSWNSIIESYAKLGKPKVALEMFSRM--TNEFGCRPDNITLVNVLPPCASLGTHS-LGK 252

Query: 160 EVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAG 219
           ++HCF +   +  ++FV N L+  Y++C  +  A  +F  M  +D+VSWNAM+AGYSQ G
Sbjct: 253 QLHCFAVTSEMIQNMFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIG 312

Query: 220 LYEECKKLFKAM----------------------------------LSSVELKPNALTAV 279
            +E+  +LF+ M                                  + S  +KPN +T +
Sbjct: 313 RFEDAVRLFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLI 372

Query: 280 SVLQACAQSNDLIFGMEVHRFVNESQVVM-------DVSLCNAVIGLYAKCGSLDYAREL 339
           SVL  CA    L+ G E+H +  +  + +       +  + N +I +YAKC  +D AR +
Sbjct: 373 SVLSGCASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAM 432

Query: 340 FEEM--PEKDEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDR 399
           F+ +   E+D VT+  MI GY  HG  N+A++L  E+      T                
Sbjct: 433 FDSLSPKERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQT---------------- 492

Query: 400 VVDIFRAMQSHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNG-NIYVATAII 459
                        RPN  T++  L   +  + L+ GK+IHAYA+RN  N   ++V+  +I
Sbjct: 493 -------------RPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAVPLFVSNCLI 552

Query: 460 DSYAKSGYLHGARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPV 519
           D YAK G +  AR VFD +  ++ + WT++++ Y  HG    AL +F EM   G + D V
Sbjct: 553 DMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMRRIGFKLDGV 612

Query: 520 TFTSVLVACAHSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISK 579
           T   VL AC+HSG +D+  + FN +   +G+ P  EHYAC+V +L RAG+L+ A+  I +
Sbjct: 613 TLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRLNAALRLIEE 672

Query: 580 MPFEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEA 639
           MP EP   VW A L+   + G VELG+Y  +++ E+   + G+Y +++NLY+  GRWK+ 
Sbjct: 673 MPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLYANAGRWKDV 732

Query: 640 DKIRNLMKEVGLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGII 664
            +IR+LM+  G+KK PG SW+E   G  +F   D ++    EIY +L   +  +K+ G +
Sbjct: 733 TRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKTHPHAKEIYQVLLDHMQRIKDIGYV 773

BLAST of Cla97C10G203940 vs. ExPASy Swiss-Prot
Match: Q9LNU6 (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 380.9 bits (977), Expect = 4.1e-104
Identity = 203/638 (31.82%), Postives = 346/638 (54.23%), Query Frame = 0

Query: 55  QLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHN 114
           Q HAR++ S    D ++ +KLIA YS      DA  V   I    I+S+++L  + T   
Sbjct: 36  QAHARILKSGAQNDGYISAKLIASYSNYNCFNDADLVLQSIPDPTIYSFSSLIYALTKAK 95

Query: 115 MHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDI 174
           + T  + +FS +    S  + PD   +  + K  A L +  V  K++HC     GL+ D 
Sbjct: 96  LFTQSIGVFSRMF---SHGLIPDSHVLPNLFKVCAELSAFKV-GKQIHCVSCVSGLDMDA 155

Query: 175 FVANALITFYSRCDELVLARIMFDRMPGRD------------------------------ 234
           FV  ++   Y RC  +  AR +FDRM  +D                              
Sbjct: 156 FVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAYARKGCLEEVVRILSEMESS 215

Query: 235 -----IVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSNDLIF 294
                IVSWN +L+G++++G ++E   +F+  +  +   P+ +T  SVL +   S  L  
Sbjct: 216 GIEANIVSWNGILSGFNRSGYHKEAVVMFQ-KIHHLGFCPDQVTVSSVLPSVGDSEMLNM 275

Query: 295 GMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMV 354
           G  +H +V +  ++ D  + +A+I +Y K G +     LF +    +     A I+G   
Sbjct: 276 GRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGLSR 335

Query: 355 HGFVNQAMDLFRELERPALS----TWNAVISGLVQNNQQDRVVDIFRAMQSHGCRPNTVT 414
           +G V++A+++F   +   +     +W ++I+G  QN +    +++FR MQ  G +PN VT
Sbjct: 336 NGLVDKALEMFELFKEQTMELNVVSWTSIIAGCAQNGKDIEALELFREMQVAGVKPNHVT 395

Query: 415 LASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIK 474
           + S+LP   + + L  G+  H +A+R     N++V +A+ID YAK G ++ ++ VF+ + 
Sbjct: 396 IPSMLPACGNIAALGHGRSTHGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMP 455

Query: 475 GRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWK 534
            ++L+ W ++++ ++ HG A   +S+F  ++   ++PD ++FTS+L AC   G  DE WK
Sbjct: 456 TKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEGWK 515

Query: 535 IFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVA 594
            F ++  EYGI+P +EHY+CMV +L RAGKL  A + I +MPFEP + VWGALLN   + 
Sbjct: 516 YFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCRLQ 575

Query: 595 GDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSW 654
            +V+L +   ++LF +EPEN G Y++++N+Y+  G W E D IRN M+ +GLKK PG SW
Sbjct: 576 NNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGCSW 635

BLAST of Cla97C10G203940 vs. ExPASy TrEMBL
Match: M5W6C8 (RING-type domain-containing protein OS=Prunus persica OX=3760 GN=PRUPE_ppa001024mg PE=4 SV=1)

HSP 1 Score: 1258.8 bits (3256), Expect = 0.0e+00
Identity = 615/950 (64.74%), Postives = 746/950 (78.53%), Query Frame = 0

Query: 5   KPKNLQTSVPASVFLPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSS 64
           KP N+Q S   + ++  ALQ LHR D ++ GAYG+LIQ CTD   +R  KQLHARLVL +
Sbjct: 5   KPLNIQISATTNGYVQRALQGLHRVDVLDCGAYGQLIQHCTDRRLLRQAKQLHARLVLFA 64

Query: 65  VAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFS 124
           V P NFL SKLI  YSK+  L  A  VF +I HKN FSWNA+ I Y+ +NMH+D LKLFS
Sbjct: 65  VVPSNFLASKLITLYSKTNHLSQARKVFDQIPHKNTFSWNAMLIGYSFNNMHSDTLKLFS 124

Query: 125 SLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITFY 184
           ++++S S +VK D FTVTCVLKAL +L   S LA+EVHCF+LR G +SD+FV N+LIT+Y
Sbjct: 125 AMMSSCSDEVKTDNFTVTCVLKALGALLYGSRLAQEVHCFVLRHGFDSDVFVTNSLITYY 184

Query: 185 SRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTA 244
           SRCDEL  AR +FDRMP RD VSWN+M+AGYSQAG Y ECK+LF+ ML    L+PN LT 
Sbjct: 185 SRCDELGWARTLFDRMPDRDTVSWNSMIAGYSQAGYYAECKELFREMLRLGRLRPNGLTV 244

Query: 245 VSVLQACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPE 304
           VSVLQAC QSNDLIFGMEVH+FVNESQ+ MD+ LCNA+IGLYA+CGSLDYA ELF  M E
Sbjct: 245 VSVLQACLQSNDLIFGMEVHQFVNESQIEMDIILCNALIGLYARCGSLDYAEELFHGMSE 304

Query: 305 KDEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAM 364
           KDEVTYG++ISGYM HGFV++AMDLFRE ++P LSTWN++ISGLVQNN+ +  +D+ R M
Sbjct: 305 KDEVTYGSLISGYMFHGFVDKAMDLFRESKKPRLSTWNSMISGLVQNNRHEAALDLIREM 364

Query: 365 QSHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYL 424
           Q+ G +PNTVTL+S+LP  S+ S LK GKE+HAY++RN ++ NIYVATAIID+YAKSG +
Sbjct: 365 QACGYKPNTVTLSSILPAISYLSNLKAGKELHAYSVRNNFDANIYVATAIIDTYAKSGLV 424

Query: 425 HGARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVAC 484
           HGA+QVF+Q +G+SLIIWTAIISAYA+HGDA++AL LFYEML NGIQPD VTFT+VL AC
Sbjct: 425 HGAQQVFNQSRGKSLIIWTAIISAYASHGDADMALGLFYEMLNNGIQPDQVTFTAVLTAC 484

Query: 485 AHSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKV 544
           AHSG +DE+WKIF+ +  +YGIQP VEHYACMVGVLSRAG+LS A++FI KMP EP+AKV
Sbjct: 485 AHSGVVDESWKIFDAMFPKYGIQPSVEHYACMVGVLSRAGRLSEAIDFIHKMPVEPSAKV 544

Query: 545 WGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKE 604
           WGALLNGASV+GDVELGK+V DRLF+IEP+NTGNYIIMANLYSQ GRW+EADK+R  MKE
Sbjct: 545 WGALLNGASVSGDVELGKFVCDRLFQIEPDNTGNYIIMANLYSQAGRWEEADKVRERMKE 604

Query: 605 VGLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDC 664
           VGL+KIPG SWIET  GLQSF+A+D SN RT EIY +LEGLLGLMKE+G +LQ E+D++ 
Sbjct: 605 VGLRKIPGGSWIETSDGLQSFIAKDVSNGRTEEIYEILEGLLGLMKEKGYVLQDELDEET 664

Query: 665 GSVKEQYGSRDPIMEELEPPTVIMAALSALSPPCLSDLSHSIFSDIHHHRRRLTFILSSP 724
            +                  T IMAAL+ LSPP LSDL+H+I S  HHH  RL+F+LSSP
Sbjct: 665 TT------------------TTIMAALATLSPPQLSDLTHTILSHTHHHLLRLSFLLSSP 724

Query: 725 TLFSLTLRHLNSLSLSHKSLLLARFLLSALRRLSRPFQPPSKLLPYHPSTAAISPQDLDA 784
            LFSLTL  LNS+SL HK+LL+A  LLS+L  L+  FQP       +P    +  +DLD+
Sbjct: 725 ILFSLTLHRLNSISLPHKTLLIANHLLSSLHHLTLHFQP-------NPPPRRVKQRDLDS 784

Query: 785 AVLLLLLCEVRQHNPAALRTPITKWRATLCRIYSDSLLTISGVATGGGGALIPFIETVVR 844
            +LLLLLC+V QHNP AL+ P +KWR  L  +YSD +LT+SG+    G AL+ +IE + R
Sbjct: 785 VLLLLLLCDVHQHNPEALQAPTSKWREILSNLYSDDMLTVSGIGVYNGSALVSYIEVLTR 844

Query: 845 CWKFVGFVGSC-GGKARREVAASPVAVVELPSVAVGGGGGAAVECVICKEEMREGRDACK 904
           C +FV  +G C GGK  REVAASP  VV LPSV V GGG    ECVICKEEMRE RD C+
Sbjct: 845 CLRFVSVMGFCYGGKVGREVAASPAVVVALPSVEVRGGGS---ECVICKEEMRENRDVCE 904

Query: 905 LPCDHLFHWLCILPWLRKRNTCPCCRFQLPTDDIFGEIQRLWEILLKVGS 954
           LPC HLFHW+CIL WL+KRNTCPCCRF+LPTDD+FGEIQRL E+L+K+G+
Sbjct: 905 LPCRHLFHWMCILRWLKKRNTCPCCRFRLPTDDVFGEIQRLLEVLVKIGN 926

BLAST of Cla97C10G203940 vs. ExPASy TrEMBL
Match: A0A498IGK0 (RING-type domain-containing protein OS=Malus domestica OX=3750 GN=DVH24_004545 PE=4 SV=1)

HSP 1 Score: 1234.6 bits (3193), Expect = 0.0e+00
Identity = 608/951 (63.93%), Postives = 740/951 (77.81%), Query Frame = 0

Query: 4   AKPKNLQTSVPASVFLPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLS 63
           +K  N+Q S   + ++  ALQ LH  DG++ GAYG  IQ CT H  VR  KQLHARLVL 
Sbjct: 4   SKSLNIQISAATNGYVQRALQILHGIDGLDCGAYGHFIQHCTVHRLVRQAKQLHARLVLF 63

Query: 64  SVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLF 123
           SV P NFL SKLI FYSK+ ++  A  VF +I   N FSWNA+ I Y+++NMH D LK F
Sbjct: 64  SVTPGNFLASKLINFYSKTNNINYARKVFDQIPRPNAFSWNAMLIGYSINNMHADTLKWF 123

Query: 124 SSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITF 183
           S++V+S S   KPD FTVTCVLKAL  L S S LAKEVHCF+LR G +SD+FV N+LIT+
Sbjct: 124 SAMVSSCSDQAKPDNFTVTCVLKALGVLLSGSKLAKEVHCFVLRSGFDSDVFVVNSLITY 183

Query: 184 YSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALT 243
           YSRCDE+ LAR +FDRMP RDIVSWN+M+AGYSQAG Y+ECK+L++ ML   + KP  LT
Sbjct: 184 YSRCDEVGLARALFDRMPERDIVSWNSMIAGYSQAGYYDECKELYRMMLGLEKFKPVGLT 243

Query: 244 AVSVLQACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMP 303
            VSVLQAC QSNDL+ GMEVH+FV E+Q+ MDV +CNA+IGLYA+CGSLDYA+ELF+EM 
Sbjct: 244 VVSVLQACLQSNDLMLGMEVHQFVIENQIEMDVLVCNALIGLYARCGSLDYAQELFDEMS 303

Query: 304 EKDEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRA 363
           EKDEVTYG+++SGYM HGFV++AM +FR+ ++P LSTWNAVISGLVQNNQ +  +++ R 
Sbjct: 304 EKDEVTYGSLVSGYMFHGFVDKAMGVFRDSKKPKLSTWNAVISGLVQNNQHEEALNLIRE 363

Query: 364 MQSHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGY 423
           MQ+ GC+PNTVTL+S+LP  S+FS LK GKE+HAYA+RN ++ NIYVATAIID+YAKSG 
Sbjct: 364 MQACGCKPNTVTLSSILPTISYFSNLKVGKEVHAYAVRNNFDWNIYVATAIIDTYAKSGL 423

Query: 424 LHGARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVA 483
           L+GA++VFDQ KG+SLIIWT+IISAYA+HGD + ++ LFYEML +GIQPD VT T+VL A
Sbjct: 424 LYGAQRVFDQAKGKSLIIWTSIISAYASHGDGHTSIGLFYEMLNSGIQPDQVTITAVLTA 483

Query: 484 CAHSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAK 543
           CAHSG +DEAWKIF+ +  EYGIQP VEHYACMVG+LSRAGKL+ A +FI KMP EP+AK
Sbjct: 484 CAHSGVVDEAWKIFDAMFPEYGIQPSVEHYACMVGILSRAGKLTEAADFIHKMPVEPSAK 543

Query: 544 VWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMK 603
           VWGALLNGASV+ DVELG++V  RLF+IEPENTGNYIIMANLYSQ GRW+EADK+R  MK
Sbjct: 544 VWGALLNGASVSRDVELGEFVCHRLFQIEPENTGNYIIMANLYSQAGRWEEADKVRERMK 603

Query: 604 EVGLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDD 663
           EVGL+KIPGSSWIET  GLQSF+ +DTSN+RT EIY  LEGLLG+MKE+G+         
Sbjct: 604 EVGLRKIPGSSWIETSKGLQSFIVKDTSNERTEEIYETLEGLLGMMKEKGL--------- 663

Query: 664 CGSVKEQYGSRDPIMEELEPPT-VIMAALSALSPPCLSDLSHSIFSDIHHHRRRLTFILS 723
                + + + DPIMEE+   T  IMAAL+ L+PP LS L+H+I S  HHH  RL+ +LS
Sbjct: 664 ---KAKVHKAHDPIMEEIATATATIMAALATLTPPQLSHLTHTILSHTHHHHHRLSSLLS 723

Query: 724 SPTLFSLTLRHLNSLSLSHKSLLLARFLLSALRRLSRPFQPPSKLLPYHPSTAAISPQDL 783
           SP LFSLTL  LNSL L HK+LL+A  LLS+L  L+  F P +      P    +  +DL
Sbjct: 724 SPILFSLTLHRLNSLPLPHKTLLIANHLLSSLYHLTLHFHPYTN----PPPPRVVRKRDL 783

Query: 784 DAAVLLLLLCEVRQHNPAALRTPITKWRATLCRIYSDSLLTISGVATGGGGALIPFIETV 843
           D+ +LLLLLCEV QHNP AL+ P  KWR  L ++YSD++LT+SG+    G AL+ +IE +
Sbjct: 784 DSVLLLLLLCEVHQHNPEALQAPTIKWREILSKLYSDNMLTVSGIGVYNGSALVSYIEVL 843

Query: 844 VRCWKFVGFVGSC-GGKARREVAASPVAVVELPSVAVGGGGGAAVECVICKEEMREGRDA 903
            RC +FV  +G C GGKA REVAASP AVV LPSV V  GG    EC+ICKEEMRE RD 
Sbjct: 844 TRCLRFVSVMGFCYGGKAGREVAASPAAVVALPSVKVSSGGS---ECMICKEEMREDRDV 903

Query: 904 CKLPCDHLFHWLCILPWLRKRNTCPCCRFQLPTDDIFGEIQRLWEILLKVG 953
           C+LPC HLFHW+CIL WLRKRNTCPCCRF LPTDD+FGEIQRLWEIL+K+G
Sbjct: 904 CELPCRHLFHWMCILRWLRKRNTCPCCRFTLPTDDVFGEIQRLWEILVKMG 935

BLAST of Cla97C10G203940 vs. ExPASy TrEMBL
Match: A0A6J1CWN9 (pentatricopeptide repeat-containing protein At2g37310 OS=Momordica charantia OX=3673 GN=LOC111015095 PE=4 SV=1)

HSP 1 Score: 1187.6 bits (3071), Expect = 0.0e+00
Identity = 580/657 (88.28%), Postives = 617/657 (93.91%), Query Frame = 0

Query: 10  QTSVPASVFLPWALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSVAPDN 69
           Q S+PA   +PWALQA+ R DGMNY AYGRLIQ C D  F+RLGKQLHARLVL SV PDN
Sbjct: 3   QISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDN 62

Query: 70  FLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNS 129
           FLGSKLIAFYSKSGSLRDAYNVFG ISHKNIFSWNALFISYTLHNMH+DMLKLFSSLVNS
Sbjct: 63  FLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNS 122

Query: 130 NSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDIFVANALITFYSRCDE 189
           N+ DVKPDKFT+TCVLKALAS F++S+LAKEVHCF+LRRGLESDIFV NAL+T+YSRC+E
Sbjct: 123 NAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEE 182

Query: 190 LVLARIMFDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQ 249
           +VLARI+F RMP RDIVSWNAM+AG+SQ G YEECK+LFK MLSSVELKPNALTAVSVLQ
Sbjct: 183 VVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQ 242

Query: 250 ACAQSNDLIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT 309
           ACAQSNDLIFGMEVHRFVNESQ+ MDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT
Sbjct: 243 ACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT 302

Query: 310 YGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQSHGC 369
           YG+MISGYMVHG VNQAMDLF+EL++PALSTWNAVISGLVQNNQQD V+DIFRAMQSHGC
Sbjct: 303 YGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGC 362

Query: 370 RPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQ 429
           RPN VTLASVLPVFSHFSTLKGGKEIHAYA+RN YNGNIYVATAIIDSYAKSGYLHGA Q
Sbjct: 363 RPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQ 422

Query: 430 VFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGE 489
           VFD +KGRSLIIWTAIISAYAAHGDANVALSLFYEML NGIQPDPVTFTSVLVACAHSGE
Sbjct: 423 VFDLVKGRSLIIWTAIISAYAAHGDANVALSLFYEMLRNGIQPDPVTFTSVLVACAHSGE 482

Query: 490 LDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALL 549
           LDEAWKIFN++L EYGIQPLVEHYACMVGVLSRAGKLS+AV+FISKMP EP+AKVWGALL
Sbjct: 483 LDEAWKIFNIMLPEYGIQPLVEHYACMVGVLSRAGKLSDAVDFISKMPIEPSAKVWGALL 542

Query: 550 NGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKK 609
           NGASVAGDVELGKYVFDRL EIEPENTG YIIMANLYSQ GRWKEADK+R+LMKEVGL+K
Sbjct: 543 NGASVAGDVELGKYVFDRLLEIEPENTGTYIIMANLYSQSGRWKEADKVRDLMKEVGLRK 602

Query: 610 IPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGS 667
           IPGSSWIET GGL SFVARDTSND TPEIY MLEGLLGLMKEEG ILQ+EID+DCGS
Sbjct: 603 IPGSSWIETSGGLHSFVARDTSNDSTPEIYEMLEGLLGLMKEEGYILQNEIDEDCGS 659

BLAST of Cla97C10G203940 vs. ExPASy TrEMBL
Match: A0A6J1F110 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita moschata OX=3662 GN=LOC111441405 PE=4 SV=1)

HSP 1 Score: 1168.3 bits (3021), Expect = 0.0e+00
Identity = 578/636 (90.88%), Postives = 600/636 (94.34%), Query Frame = 0

Query: 32  MNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNV 91
           MNYGAYGRLIQ CTD  F RLGKQLHARLVLSSVAPDNFLGSKLIA YSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 60

Query: 92  FGKISHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASL 151
           F  ISHKNIFSWNALFISYTLHNMH DMLKLFSSLVN NSTDVKPDKFTVTCVLKALASL
Sbjct: 61  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNVNSTDVKPDKFTVTCVLKALASL 120

Query: 152 FSNSVLAKEVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAM 211
           F+NS+LAKEVHCF+LRRGLESDIFV NALITFYSRCDEL LARIMFDR P RDIVSWNAM
Sbjct: 121 FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELALARIMFDRTPERDIVSWNAM 180

Query: 212 LAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 271
           +AGYSQ G YE+CK+LFKAML S E KPNALTAVSVLQACA SNDLIFGMEVH+FVNES 
Sbjct: 181 VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAHSNDLIFGMEVHKFVNESG 240

Query: 272 VVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMVHGFVNQAMDLFR 331
           + MDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYG+MISGYMVHGFVNQAMDLFR
Sbjct: 241 IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 300

Query: 332 ELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQSHGCRPNTVTLASVLPVFSHFSTLKG 391
           ELERPALSTWNAVISGLVQNNQQD VVDIFRAMQ HGCRPNTVTLASVLP+FSHFSTLKG
Sbjct: 301 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 360

Query: 392 GKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKGRSLIIWTAIISAYAA 451
           GKEIHAYA+RNAY+GNIYVATAIIDSYAKSGYL GARQVFDQ+K RSLIIWTAIISAYAA
Sbjct: 361 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQLKRRSLIIWTAIISAYAA 420

Query: 452 HGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKIFNVLLLEYGIQPLVE 511
           HGDAN  LSLFYEMLTNGI+PDPVTFTSVLVACAHSGELDEAWKIFNVLL E+GIQPLVE
Sbjct: 421 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELDEAWKIFNVLLPEFGIQPLVE 480

Query: 512 HYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAGDVELGKYVFDRLFEI 571
           HYACMVGVLSRAGKLS+AVEFISKMP EPTAKVWGALLNGASVAGDVELGKYVFDRL +I
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 540

Query: 572 EPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWIETRGGLQSFVARDTS 631
           EPENTGNYIIMANLYSQFGRWKEAD +R+LMKEVGLKKIPG+SWIETR GLQSFVARDTS
Sbjct: 541 EPENTGNYIIMANLYSQFGRWKEADNVRDLMKEVGLKKIPGNSWIETREGLQSFVARDTS 600

Query: 632 NDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGSV 668
           NDRTPEIYG LEGL+GLMKEEG+I QHEIDDDCGSV
Sbjct: 601 NDRTPEIYGTLEGLVGLMKEEGLIQQHEIDDDCGSV 636

BLAST of Cla97C10G203940 vs. ExPASy TrEMBL
Match: A0A6J1J0S5 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita maxima OX=3661 GN=LOC111482423 PE=4 SV=1)

HSP 1 Score: 1155.6 bits (2988), Expect = 0.0e+00
Identity = 575/635 (90.55%), Postives = 598/635 (94.17%), Query Frame = 0

Query: 32  MNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNV 91
           MNYGAYGRLIQ CTD  F RLGKQLHARLVLSSVAPDNFLGSKLIA YSKSGSLRDAYNV
Sbjct: 1   MNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKSGSLRDAYNV 60

Query: 92  FGKISHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASL 151
           F  ISHKNIFSWNALFISYTLHNMH DMLKLFSSLVN NSTDVKPDKFTVTCVLKALASL
Sbjct: 61  FDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDKFTVTCVLKALASL 120

Query: 152 FSNSVLAKEVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAM 211
           F+NS+LAKEVHCF+LRRGLESDIFV NALITFYSRCDELVLARIMF R P RDIVSWNAM
Sbjct: 121 FTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFHRTPERDIVSWNAM 180

Query: 212 LAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQ 271
           +AGYSQ G YE+CK+LFKAML S E KPNALTAVSVLQACAQSNDLIFGMEVH+FVNES 
Sbjct: 181 VAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLIFGMEVHKFVNESG 240

Query: 272 VVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMVHGFVNQAMDLFR 331
           + MDVSL NAVIGLYAKCGSLDYARELFE MPEKDEVTYG+MISGYMVHGFVNQAMDLFR
Sbjct: 241 IEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFR 300

Query: 332 ELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQSHGCRPNTVTLASVLPVFSHFSTLKG 391
           ELERPALSTWNAVISGLVQNNQQD VVDIFRAMQ HGCRPNTVTLASVLP+FSHFSTLKG
Sbjct: 301 ELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKG 360

Query: 392 GKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKGRSLIIWTAIISAYAA 451
           GKEIHAYA+RNAY+GNIYVATAIIDSYAKSGYL GARQVFDQ K RSLIIWTAIISAYAA
Sbjct: 361 GKEIHAYAVRNAYDGNIYVATAIIDSYAKSGYLQGARQVFDQSKRRSLIIWTAIISAYAA 420

Query: 452 HGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKIFNVLLLEYGIQPLVE 511
           HGDAN  LSLFYEMLTNGI+PDPVTFTSVLVACAHSGEL+EAWKIFNVLL E+GIQPLVE
Sbjct: 421 HGDANATLSLFYEMLTNGIRPDPVTFTSVLVACAHSGELNEAWKIFNVLLPEFGIQPLVE 480

Query: 512 HYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAGDVELGKYVFDRLFEI 571
           HYACMVGVLSRAGKLS+AVEFISKMP EPTAKVWGALLNGASVAGDVELGKYVFDRL +I
Sbjct: 481 HYACMVGVLSRAGKLSDAVEFISKMPIEPTAKVWGALLNGASVAGDVELGKYVFDRLLDI 540

Query: 572 EPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWIETRGGLQSFVARDTS 631
           EPENTGNYIIMANLYSQFG WKEAD +R+LMKEVGLKKIPG+SWIETRGGLQSFVARDTS
Sbjct: 541 EPENTGNYIIMANLYSQFGWWKEADHVRDLMKEVGLKKIPGNSWIETRGGLQSFVARDTS 600

Query: 632 NDRTPEIYGMLEGLLGLMKEEGIILQHEIDDDCGS 667
           NDRTPEIYG LEGL+GLMK EG+I QHEIDD+CGS
Sbjct: 601 NDRTPEIYGTLEGLVGLMK-EGLIQQHEIDDECGS 634

BLAST of Cla97C10G203940 vs. TAIR 10
Match: AT2G37310.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 754.2 bits (1946), Expect = 1.3e-217
Identity = 372/646 (57.59%), Postives = 486/646 (75.23%), Query Frame = 0

Query: 22  ALQALHRTDGMNYGAYGRLIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSK 81
           ALQ L     ++ GAYG LIQ  T H       QLHAR+V+ S+ PDNFL SKLI+FY++
Sbjct: 10  ALQGLLNKAAVDGGAYGHLIQHFTRHRLPLHVLQLHARIVVFSIKPDNFLASKLISFYTR 69

Query: 82  SGSLRDAYNVFGKISHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNS---NSTDVKPDK 141
               R A +VF +I+ +N FS+NAL I+YT   M+ D   LF S + S   +S   +PD 
Sbjct: 70  QDRFRQALHVFDEITVRNAFSYNALLIAYTSREMYFDAFSLFLSWIGSSCYSSDAARPDS 129

Query: 142 FTVTCVLKALASL--FSNSVLAKEVHCFILRRGLESDIFVANALITFYSRCDELVLARIM 201
            +++CVLKAL+    F    LA++VH F++R G +SD+FV N +IT+Y++CD +  AR +
Sbjct: 130 ISISCVLKALSGCDDFWLGSLARQVHGFVIRGGFDSDVFVGNGMITYYTKCDNIESARKV 189

Query: 202 FDRMPGRDIVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSND 261
           FD M  RD+VSWN+M++GYSQ+G +E+CKK++KAML+  + KPN +T +SV QAC QS+D
Sbjct: 190 FDEMSERDVVSWNSMISGYSQSGSFEDCKKMYKAMLACSDFKPNGVTVISVFQACGQSSD 249

Query: 262 LIFGMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISG 321
           LIFG+EVH+ + E+ + MD+SLCNAVIG YAKCGSLDYAR LF+EM EKD VTYGA+ISG
Sbjct: 250 LIFGLEVHKKMIENHIQMDLSLCNAVIGFYAKCGSLDYARALFDEMSEKDSVTYGAIISG 309

Query: 322 YMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDRVVDIFRAMQSHGCRPNTVTL 381
           YM HG V +AM LF E+E   LSTWNA+ISGL+QNN  + V++ FR M   G RPNTVTL
Sbjct: 310 YMAHGLVKEAMALFSEMESIGLSTWNAMISGLMQNNHHEEVINSFREMIRCGSRPNTVTL 369

Query: 382 ASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKG 441
           +S+LP  ++ S LKGGKEIHA+AIRN  + NIYV T+IID+YAK G+L GA++VFD  K 
Sbjct: 370 SSLLPSLTYSSNLKGGKEIHAFAIRNGADNNIYVTTSIIDNYAKLGFLLGAQRVFDNCKD 429

Query: 442 RSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKI 501
           RSLI WTAII+AYA HGD++ A SLF +M   G +PD VT T+VL A AHSG+ D A  I
Sbjct: 430 RSLIAWTAIITAYAVHGDSDSACSLFDQMQCLGTKPDDVTLTAVLSAFAHSGDSDMAQHI 489

Query: 502 FNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAG 561
           F+ +L +Y I+P VEHYACMV VLSRAGKLS+A+EFISKMP +P AKVWGALLNGASV G
Sbjct: 490 FDSMLTKYDIEPGVEHYACMVSVLSRAGKLSDAMEFISKMPIDPIAKVWGALLNGASVLG 549

Query: 562 DVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWI 621
           D+E+ ++  DRLFE+EPENTGNY IMANLY+Q GRW+EA+ +RN MK +GLKKIPG+SWI
Sbjct: 550 DLEIARFACDRLFEMEPENTGNYTIMANLYTQAGRWEEAEMVRNKMKRIGLKKIPGTSWI 609

Query: 622 ETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGIILQHEIDD 663
           ET  GL+SF+A+D+S +R+ E+Y ++EGL+  M ++  I + E+D+
Sbjct: 610 ETEKGLRSFIAKDSSCERSKEMYEIIEGLVESMSDKEYIRKQELDE 655

BLAST of Cla97C10G203940 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 403.7 bits (1036), Expect = 4.2e-112
Identity = 213/617 (34.52%), Postives = 356/617 (57.70%), Query Frame = 0

Query: 40  LIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAF--YSKSGSLRDAYNVFGKISH 99
           LI+ C     +R  KQ H  ++ +    D +  SKL A    S   SL  A  VF +I  
Sbjct: 36  LIERCVS---LRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPK 95

Query: 100 KNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVL 159
            N F+WN L  +Y   +    +L +++ L   + +   P+K+T   ++KA A + S S L
Sbjct: 96  PNSFAWNTLIRAYA--SGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLS-L 155

Query: 160 AKEVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQ 219
            + +H   ++  + SD+FVAN+LI  Y  C +L  A  +F  +  +D+VSWN+M+ G+ Q
Sbjct: 156 GQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 215

Query: 220 AGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVVMDVS 279
            G  ++  +LFK M  S ++K + +T V VL ACA+  +L FG +V  ++ E++V ++++
Sbjct: 216 KGSPDKALELFKKM-ESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLT 275

Query: 280 LCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMVHGFVNQAMDLFRELERPA 339
           L NA++ +Y KCGS++ A+ LF+ M EKD VT+  M+ GY +      A ++   + +  
Sbjct: 276 LANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKD 335

Query: 340 LSTWNAVISGLVQNNQQDRVVDIFRAMQ-SHGCRPNTVTLASVLPVFSHFSTLKGGKEIH 399
           +  WNA+IS   QN + +  + +F  +Q     + N +TL S L   +    L+ G+ IH
Sbjct: 336 IVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIH 395

Query: 400 AYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKGRSLIIWTAIISAYAAHGDAN 459
           +Y  ++    N +V +A+I  Y+K G L  +R+VF+ ++ R + +W+A+I   A HG  N
Sbjct: 396 SYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 455

Query: 460 VALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKIFNVLLLEYGIQPLVEHYACM 519
            A+ +FY+M    ++P+ VTFT+V  AC+H+G +DEA  +F+ +   YGI P  +HYAC+
Sbjct: 456 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 515

Query: 520 VGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENT 579
           V VL R+G L  AV+FI  MP  P+  VWGALL    +  ++ L +    RL E+EP N 
Sbjct: 516 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 575

Query: 580 GNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWIETRGGLQSFVARDTSNDRTP 639
           G +++++N+Y++ G+W+   ++R  M+  GLKK PG S IE  G +  F++ D ++  + 
Sbjct: 576 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 635

Query: 640 EIYGMLEGLLGLMKEEG 654
           ++YG L  ++  +K  G
Sbjct: 636 KVYGKLHEVMEKLKSNG 645

BLAST of Cla97C10G203940 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 402.1 bits (1032), Expect = 1.2e-111
Identity = 227/686 (33.09%), Postives = 354/686 (51.60%), Query Frame = 0

Query: 40  LIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKN 99
           ++QLC D   ++ GK++   +  +    D+ LGSKL   Y+  G L++A  VF ++  + 
Sbjct: 100 VLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEK 159

Query: 100 IFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAK 159
              WN L          +  + LF  +++S    V+ D +T +CV K+ +SL S     +
Sbjct: 160 ALFWNILMNELAKSGDFSGSIGLFKKMMSSG---VEMDSYTFSCVSKSFSSLRSVHG-GE 219

Query: 160 EVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAG 219
           ++H FIL+ G      V N+L+ FY +   +  AR +FD M  RD++SWN+++ GY   G
Sbjct: 220 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 279

Query: 220 LYEECKKLFKAML-SSVELKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQVVMDVSL 279
           L E+   +F  ML S +E+  +  T VSV   CA S  +  G  VH    ++    +   
Sbjct: 280 LAEKGLSVFVQMLVSGIEI--DLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 339

Query: 280 CNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMVHGFVNQAMDLFRELERPAL 339
           CN ++ +Y+KCG LD A+ +F EM ++  V+Y +MI+GY   G   +A+ LF E+E   +
Sbjct: 340 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 399

Query: 340 S----------------------------------------------------------- 399
           S                                                           
Sbjct: 400 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 459

Query: 400 -----------TWNAVISGLVQNNQQDRVVDIFR-AMQSHGCRPNTVTLASVLPVFSHFS 459
                      +WN +I G  +N   +  + +F   ++     P+  T+A VLP  +  S
Sbjct: 460 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 519

Query: 460 TLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIKGRSLIIWTAIIS 519
               G+EIH Y +RN Y  + +VA +++D YAK G L  A  +FD I  + L+ WT +I+
Sbjct: 520 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 579

Query: 520 AYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWKIFNVLLLEYGIQ 579
            Y  HG    A++LF +M   GI+ D ++F S+L AC+HSG +DE W+ FN++  E  I+
Sbjct: 580 GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIE 639

Query: 580 PLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVAGDVELGKYVFDR 639
           P VEHYAC+V +L+R G L  A  FI  MP  P A +WGALL G  +  DV+L + V ++
Sbjct: 640 PTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEK 699

Query: 640 LFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSWIETRGGLQSFVA 654
           +FE+EPENTG Y++MAN+Y++  +W++  ++R  + + GL+K PG SWIE +G +  FVA
Sbjct: 700 VFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVA 759

BLAST of Cla97C10G203940 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 390.6 bits (1002), Expect = 3.7e-108
Identity = 222/673 (32.99%), Postives = 356/673 (52.90%), Query Frame = 0

Query: 40  LIQLCTDHLFVRLGKQLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKN 99
           + + C +   VR G+  HA  +++    + F+G+ L+A YS+  SL DA  VF ++S  +
Sbjct: 133 VFKACGEISSVRCGESAHALSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWD 192

Query: 100 IFSWNALFISYTLHNMHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAK 159
           + SWN++  SY         L++FS +  +N    +PD  T+  VL   ASL ++S L K
Sbjct: 193 VVSWNSIIESYAKLGKPKVALEMFSRM--TNEFGCRPDNITLVNVLPPCASLGTHS-LGK 252

Query: 160 EVHCFILRRGLESDIFVANALITFYSRCDELVLARIMFDRMPGRDIVSWNAMLAGYSQAG 219
           ++HCF +   +  ++FV N L+  Y++C  +  A  +F  M  +D+VSWNAM+AGYSQ G
Sbjct: 253 QLHCFAVTSEMIQNMFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIG 312

Query: 220 LYEECKKLFKAM----------------------------------LSSVELKPNALTAV 279
            +E+  +LF+ M                                  + S  +KPN +T +
Sbjct: 313 RFEDAVRLFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLI 372

Query: 280 SVLQACAQSNDLIFGMEVHRFVNESQVVM-------DVSLCNAVIGLYAKCGSLDYAREL 339
           SVL  CA    L+ G E+H +  +  + +       +  + N +I +YAKC  +D AR +
Sbjct: 373 SVLSGCASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAM 432

Query: 340 FEEM--PEKDEVTYGAMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLVQNNQQDR 399
           F+ +   E+D VT+  MI GY  HG  N+A++L  E+      T                
Sbjct: 433 FDSLSPKERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQT---------------- 492

Query: 400 VVDIFRAMQSHGCRPNTVTLASVLPVFSHFSTLKGGKEIHAYAIRNAYNG-NIYVATAII 459
                        RPN  T++  L   +  + L+ GK+IHAYA+RN  N   ++V+  +I
Sbjct: 493 -------------RPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAVPLFVSNCLI 552

Query: 460 DSYAKSGYLHGARQVFDQIKGRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPV 519
           D YAK G +  AR VFD +  ++ + WT++++ Y  HG    AL +F EM   G + D V
Sbjct: 553 DMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMRRIGFKLDGV 612

Query: 520 TFTSVLVACAHSGELDEAWKIFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISK 579
           T   VL AC+HSG +D+  + FN +   +G+ P  EHYAC+V +L RAG+L+ A+  I +
Sbjct: 613 TLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRLNAALRLIEE 672

Query: 580 MPFEPTAKVWGALLNGASVAGDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEA 639
           MP EP   VW A L+   + G VELG+Y  +++ E+   + G+Y +++NLY+  GRWK+ 
Sbjct: 673 MPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLYANAGRWKDV 732

Query: 640 DKIRNLMKEVGLKKIPGSSWIETRGGLQSFVARDTSNDRTPEIYGMLEGLLGLMKEEGII 664
            +IR+LM+  G+KK PG SW+E   G  +F   D ++    EIY +L   +  +K+ G +
Sbjct: 733 TRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKTHPHAKEIYQVLLDHMQRIKDIGYV 773

BLAST of Cla97C10G203940 vs. TAIR 10
Match: AT1G20230.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 380.9 bits (977), Expect = 2.9e-105
Identity = 203/638 (31.82%), Postives = 346/638 (54.23%), Query Frame = 0

Query: 55  QLHARLVLSSVAPDNFLGSKLIAFYSKSGSLRDAYNVFGKISHKNIFSWNALFISYTLHN 114
           Q HAR++ S    D ++ +KLIA YS      DA  V   I    I+S+++L  + T   
Sbjct: 36  QAHARILKSGAQNDGYISAKLIASYSNYNCFNDADLVLQSIPDPTIYSFSSLIYALTKAK 95

Query: 115 MHTDMLKLFSSLVNSNSTDVKPDKFTVTCVLKALASLFSNSVLAKEVHCFILRRGLESDI 174
           + T  + +FS +    S  + PD   +  + K  A L +  V  K++HC     GL+ D 
Sbjct: 96  LFTQSIGVFSRMF---SHGLIPDSHVLPNLFKVCAELSAFKV-GKQIHCVSCVSGLDMDA 155

Query: 175 FVANALITFYSRCDELVLARIMFDRMPGRD------------------------------ 234
           FV  ++   Y RC  +  AR +FDRM  +D                              
Sbjct: 156 FVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAYARKGCLEEVVRILSEMESS 215

Query: 235 -----IVSWNAMLAGYSQAGLYEECKKLFKAMLSSVELKPNALTAVSVLQACAQSNDLIF 294
                IVSWN +L+G++++G ++E   +F+  +  +   P+ +T  SVL +   S  L  
Sbjct: 216 GIEANIVSWNGILSGFNRSGYHKEAVVMFQ-KIHHLGFCPDQVTVSSVLPSVGDSEMLNM 275

Query: 295 GMEVHRFVNESQVVMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVTYGAMISGYMV 354
           G  +H +V +  ++ D  + +A+I +Y K G +     LF +    +     A I+G   
Sbjct: 276 GRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGLSR 335

Query: 355 HGFVNQAMDLFRELERPALS----TWNAVISGLVQNNQQDRVVDIFRAMQSHGCRPNTVT 414
           +G V++A+++F   +   +     +W ++I+G  QN +    +++FR MQ  G +PN VT
Sbjct: 336 NGLVDKALEMFELFKEQTMELNVVSWTSIIAGCAQNGKDIEALELFREMQVAGVKPNHVT 395

Query: 415 LASVLPVFSHFSTLKGGKEIHAYAIRNAYNGNIYVATAIIDSYAKSGYLHGARQVFDQIK 474
           + S+LP   + + L  G+  H +A+R     N++V +A+ID YAK G ++ ++ VF+ + 
Sbjct: 396 IPSMLPACGNIAALGHGRSTHGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMP 455

Query: 475 GRSLIIWTAIISAYAAHGDANVALSLFYEMLTNGIQPDPVTFTSVLVACAHSGELDEAWK 534
            ++L+ W ++++ ++ HG A   +S+F  ++   ++PD ++FTS+L AC   G  DE WK
Sbjct: 456 TKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEGWK 515

Query: 535 IFNVLLLEYGIQPLVEHYACMVGVLSRAGKLSNAVEFISKMPFEPTAKVWGALLNGASVA 594
            F ++  EYGI+P +EHY+CMV +L RAGKL  A + I +MPFEP + VWGALLN   + 
Sbjct: 516 YFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCRLQ 575

Query: 595 GDVELGKYVFDRLFEIEPENTGNYIIMANLYSQFGRWKEADKIRNLMKEVGLKKIPGSSW 654
            +V+L +   ++LF +EPEN G Y++++N+Y+  G W E D IRN M+ +GLKK PG SW
Sbjct: 576 NNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGCSW 635

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RXH80631.10.0e+0063.93hypothetical protein DVH24_004545 [Malus domestica][more]
XP_038905794.10.0e+0091.68pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida] >XP_03... [more]
XP_022145703.10.0e+0088.28pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia][more]
KAG6580575.10.0e+0089.26ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. soror... [more]
KAG7017327.10.0e+0089.11ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyr... [more]
Match NameE-valueIdentityDescription
Q9ZUT51.8e-21657.59Pentatricopeptide repeat-containing protein At2g37310 OS=Arabidopsis thaliana OX... [more]
O823805.9e-11134.52Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9SN391.7e-11033.09Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9LFL55.2e-10732.99Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX... [more]
Q9LNU64.1e-10431.82Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
M5W6C80.0e+0064.74RING-type domain-containing protein OS=Prunus persica OX=3760 GN=PRUPE_ppa001024... [more]
A0A498IGK00.0e+0063.93RING-type domain-containing protein OS=Malus domestica OX=3750 GN=DVH24_004545 P... [more]
A0A6J1CWN90.0e+0088.28pentatricopeptide repeat-containing protein At2g37310 OS=Momordica charantia OX=... [more]
A0A6J1F1100.0e+0090.88pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita moschata OX=3... [more]
A0A6J1J0S50.0e+0090.55pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita maxima OX=366... [more]
Match NameE-valueIdentityDescription
AT2G37310.11.3e-21757.59Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.14.2e-11234.52Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G18750.11.2e-11133.09Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G16860.13.7e-10832.99Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G20230.12.9e-10531.82Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001841Zinc finger, RING-typeSMARTSM00184ring_2coord: 888..928
e-value: 3.4E-5
score: 33.2
IPR001841Zinc finger, RING-typePFAMPF13639zf-RING_2coord: 887..929
e-value: 2.7E-10
score: 40.4
IPR001841Zinc finger, RING-typePROSITEPS50089ZF_RING_2coord: 888..929
score: 12.076194
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3D3.30.40.10Zinc/RING finger domain, C3HC4 (zinc finger)coord: 858..935
e-value: 1.2E-16
score: 62.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 415..574
e-value: 8.7E-37
score: 129.1
coord: 23..152
e-value: 2.9E-12
score: 48.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 335..391
e-value: 3.8E-9
score: 38.1
coord: 153..259
e-value: 5.4E-20
score: 73.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 260..334
e-value: 3.5E-15
score: 58.2
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 450..598
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 340..373
e-value: 1.7E-6
score: 25.8
coord: 278..305
e-value: 5.9E-4
score: 17.8
coord: 308..334
e-value: 0.0019
score: 16.2
coord: 475..508
e-value: 9.4E-4
score: 17.2
coord: 206..240
e-value: 2.5E-6
score: 25.3
coord: 441..473
e-value: 7.1E-7
score: 27.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 579..606
e-value: 0.046
score: 14.0
coord: 308..335
e-value: 4.3E-5
score: 23.5
coord: 412..437
e-value: 0.013
score: 15.7
coord: 279..305
e-value: 1.1E-4
score: 22.2
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 339..379
e-value: 1.6E-7
score: 31.4
coord: 204..252
e-value: 2.2E-8
score: 34.1
coord: 440..485
e-value: 3.9E-10
score: 39.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 275..309
score: 10.029647
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 438..472
score: 11.005202
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 473..508
score: 10.906551
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 575..609
score: 8.593717
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 337..371
score: 11.092894
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 204..234
score: 10.457138
NoneNo IPR availablePANTHERPTHR47925:SF76BNAA04G21330D PROTEINcoord: 29..662
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 29..662
NoneNo IPR availableSUPERFAMILY57850RING/U-boxcoord: 882..935

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C10G203940.2Cla97C10G203940.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding