Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGGGAATGGATTTGTTAAACGTCTCACGGGTGACCTTTTATTTCTTGATTCATCGCCTCTTTCCAAGCTCTTGAACCAATGTGCTCGCTCGAAGTCAGCTAGAGACACGAGTCGTGTTCATGCTTGCATAATTAAATCGCCCTTTGCGTCCGAAGTTTTTATCCAAAATAGGCTCATTGATGTATATGGGAAATGTGGATGTGTGGGTGTTGCTCGCAAGGTGTTTGATAGAATGCTTGAGAGAAATATTTTCTCTTGGAACTCCATTATTTGTGCATTCACTAAGTCCGGATTTCTTGATGATGCTGTCCACATCTTTGAGAAGATGCCTCAAGTTGACCAATGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACATGATTGCTTTGATGAAGCTTTAAAATATTTTGTTCAAATGCACGGTCATGGATTTTTTATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATTTGAAAATGGGTTCCCAAATCCACAGTTTAATATATAGGTCAAATTATTTATCAGATATGTATATGGGCTCTGCTCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGATTGTGCTCGAAGTGTTTTTGATGGAATGACTGTGAGAAGTAGAGTTTCCTGGAATAGCTTGATTACGTGTTATGAACAGAACGGCCCAGTTGATGAGGCTCTTAGTATTTTTGTCGAGATGATCGAATGTGGGGTCGAACCTGATGAAGTAACTCTTGCTAGTGTTGTTAGTGCATGTGCAACTGTCTCGGCAATCAAAGAAGGTCAGCAGATTCATGCTCGAGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCGTTGCTTGATATGTATGCTAAATGTAATAGGATTAACGAGGCTAGAATAGTTTTCGATCGGATGCCAATTAGGAGTGTGGTGTCTGAAACCTCAATGGTAAGTGGATATGCAAAAGCATCAAGTGTTAAAGCTGCAAGATCTATGTTTTCAAATATGATGGTGAAAGATGTAATTACTTGGAATGCTCTTATTGCCGGATGTACACAAAATGGAGAGAATGAAGAGGCACTTACACTCTTCCGTCTTTTGAAAAGGGAGTCTGTTTGGCCTACACACTACACATTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCGACAGGCTCACTCTCATGTTTTAAAACACGGATTTCGATTCCGATATGGAGATGAGTCAGATATCTTTGTTGGGAATTCTCTAATAGATATGTATATGAAATGTGGATCAGTTGAGAGTGGTTGTAGGGTGTTTGAACGTATGTTGGAAAGGGATTGTGTGTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAACAAGGCCCTTGGAATTTTCAGTGAAATGTTAGAATCAGGAGAGAAACCAGATCATGTAACAATGATTGGTGTTCTGTCTGCCTGTAGTCATGCCGGGCTGCTCGATGAAGGCCGCCATTACTTTCGATCGATGCGTGCACGACATGGTTTGGTACCGTTAAAGGATCATTATACATGTATGGTTGATTTACTTGGGCGAGCTGGCTGCCTTGAAGAAGCAAAAAATATAATAGAGGAGATGCCAATGCAGCCTGATGCTATCGTTTGGGGATCATTGCTTGCGGCTTGTAAAGTCCATAGGAACATAAAATTGGGGGAATATGTAGTGGAGAAGCTTTTAGAGGTAGATCCTGAGAATTCTGGGCCATATGTTCTTCTCTCTAATATGTATGCTGAACGTGGAGATTGGGGGAATGTTATGAGAATAAGGAAGCTGATGAGACAGAGAGGAGTGGTTAAACAACCAGGTTGCAGTTGGATTGAAATTCAGGGTGAGTTGAATGTTTTTATGGTTAAAGATAAAAGACATGCAAGGAAGCAAGAAATCTACATGCTTTTGAGGACACTTTTACAGCAGATGAAACGGGCTGGATATGTCCCATATGTTGGCAATGACGAGATTGATGAAGAACAATAGAAGATACACGACAAACCCTCATTGGTTAATTTATGAAATCCCACGTTGGTTGGAGAGGAGAACGAAACACTCTTTATAAGGGTGTGGAAATCTCTTCTTAGTGGACACGTCTTAAAAACCTTGAGTGAAAGCCGAAAGAAAAAACTCAAAGATGACAATATCTGCTAACAGTGGGCTTGGACCGTTACAAATTTAAATAAGAAAATGTCCTTGTTTTGATGACGTTTTTTAACTAACTATGCATGTTATGTGATGTTTTCACTTATAAAAATGTATAGATCATTTGTTATCCTCATTAAGGATGTATCTCATTCAATCACATACGAATATTCAAATTTATATTTAAATATCTAAACAAATATATAAATTTGAAAGTTTCATTAGATATATTAA
mRNA sequence
ATGGCGGGGAATGGATTTGTTAAACGTCTCACGGGTGACCTTTTATTTCTTGATTCATCGCCTCTTTCCAAGCTCTTGAACCAATGTGCTCGCTCGAAGTCAGCTAGAGACACGAGTCGTGTTCATGCTTGCATAATTAAATCGCCCTTTGCGTCCGAAGTTTTTATCCAAAATAGGCTCATTGATGTATATGGGAAATGTGGATGTGTGGGTGTTGCTCGCAAGGTGTTTGATAGAATGCTTGAGAGAAATATTTTCTCTTGGAACTCCATTATTTGTGCATTCACTAAGTCCGGATTTCTTGATGATGCTGTCCACATCTTTGAGAAGATGCCTCAAGTTGACCAATGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACATGATTGCTTTGATGAAGCTTTAAAATATTTTGTTCAAATGCACGGTCATGGATTTTTTATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATTTGAAAATGGGTTCCCAAATCCACAGTTTAATATATAGGTCAAATTATTTATCAGATATGTATATGGGCTCTGCTCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGATTGTGCTCGAAGTGTTTTTGATGGAATGACTGTGAGAAGTAGAGTTTCCTGGAATAGCTTGATTACGTGTTATGAACAGAACGGCCCAGTTGATGAGGCTCTTAGTATTTTTGTCGAGATGATCGAATGTGGGGTCGAACCTGATGAAGTAACTCTTGCTAGTGTTGTTAGTGCATGTGCAACTGTCTCGGCAATCAAAGAAGGTCAGCAGATTCATGCTCGAGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCGTTGCTTGATATGTATGCTAAATGTAATAGGATTAACGAGGCTAGAATAGTTTTCGATCGGATGCCAATTAGGAGTGTGGTGTCTGAAACCTCAATGGTAAGTGGATATGCAAAAGCATCAAGTGTTAAAGCTGCAAGATCTATGTTTTCAAATATGATGGTGAAAGATGTAATTACTTGGAATGCTCTTATTGCCGGATGTACACAAAATGGAGAGAATGAAGAGGCACTTACACTCTTCCGTCTTTTGAAAAGGGAGTCTGTTTGGCCTACACACTACACATTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCGACAGGCTCACTCTCATGTTTTAAAACACGGATTTCGATTCCGATATGGAGATGAGTCAGATATCTTTGTTGGGAATTCTCTAATAGATATGTATATGAAATGTGGATCAGTTGAGAGTGGTTGTAGGGTGTTTGAACGTATGTTGGAAAGGGATTGTGTGTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAACAAGGCCCTTGGAATTTTCAGTGAAATGTTAGAATCAGGAGAGAAACCAGATCATGTAACAATGATTGGTGTTCTGTCTGCCTGTAGTCATGCCGGGCTGCTCGATGAAGGCCGCCATTACTTTCGATCGATGCGTGCACGACATGGTTTGGTACCGTTAAAGGATCATTATACATGTATGGTTGATTTACTTGGGCGAGCTGGCTGCCTTGAAGAAGCAAAAAATATAATAGAGGAGATGCCAATGCAGCCTGATGCTATCGTTTGGGGATCATTGCTTGCGGCTTGTAAAGTCCATAGGAACATAAAATTGGGGGAATATGTAGTGGAGAAGCTTTTAGAGGTAGATCCTGAGAATTCTGGGCCATATGTTCTTCTCTCTAATATGTATGCTGAACGTGGAGATTGGGGGAATGTTATGAGAATAAGGAAGCTGATGAGACAGAGAGGAGTGGTTAAACAACCAGGTTGCAGTTGGATTGAAATTCAGGGTGAGTTGAATGTTTTTATGGTTAAAGATAAAAGACATGCAAGGAAGCAAGAAATCTACATGCTTTTGAGGACACTTTTACAGCAGATGAAACGGGCTGGATATATATATTAA
Coding sequence (CDS)
ATGGCGGGGAATGGATTTGTTAAACGTCTCACGGGTGACCTTTTATTTCTTGATTCATCGCCTCTTTCCAAGCTCTTGAACCAATGTGCTCGCTCGAAGTCAGCTAGAGACACGAGTCGTGTTCATGCTTGCATAATTAAATCGCCCTTTGCGTCCGAAGTTTTTATCCAAAATAGGCTCATTGATGTATATGGGAAATGTGGATGTGTGGGTGTTGCTCGCAAGGTGTTTGATAGAATGCTTGAGAGAAATATTTTCTCTTGGAACTCCATTATTTGTGCATTCACTAAGTCCGGATTTCTTGATGATGCTGTCCACATCTTTGAGAAGATGCCTCAAGTTGACCAATGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACATGATTGCTTTGATGAAGCTTTAAAATATTTTGTTCAAATGCACGGTCATGGATTTTTTATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATTTGAAAATGGGTTCCCAAATCCACAGTTTAATATATAGGTCAAATTATTTATCAGATATGTATATGGGCTCTGCTCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGATTGTGCTCGAAGTGTTTTTGATGGAATGACTGTGAGAAGTAGAGTTTCCTGGAATAGCTTGATTACGTGTTATGAACAGAACGGCCCAGTTGATGAGGCTCTTAGTATTTTTGTCGAGATGATCGAATGTGGGGTCGAACCTGATGAAGTAACTCTTGCTAGTGTTGTTAGTGCATGTGCAACTGTCTCGGCAATCAAAGAAGGTCAGCAGATTCATGCTCGAGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCGTTGCTTGATATGTATGCTAAATGTAATAGGATTAACGAGGCTAGAATAGTTTTCGATCGGATGCCAATTAGGAGTGTGGTGTCTGAAACCTCAATGGTAAGTGGATATGCAAAAGCATCAAGTGTTAAAGCTGCAAGATCTATGTTTTCAAATATGATGGTGAAAGATGTAATTACTTGGAATGCTCTTATTGCCGGATGTACACAAAATGGAGAGAATGAAGAGGCACTTACACTCTTCCGTCTTTTGAAAAGGGAGTCTGTTTGGCCTACACACTACACATTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCGACAGGCTCACTCTCATGTTTTAAAACACGGATTTCGATTCCGATATGGAGATGAGTCAGATATCTTTGTTGGGAATTCTCTAATAGATATGTATATGAAATGTGGATCAGTTGAGAGTGGTTGTAGGGTGTTTGAACGTATGTTGGAAAGGGATTGTGTGTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAACAAGGCCCTTGGAATTTTCAGTGAAATGTTAGAATCAGGAGAGAAACCAGATCATGTAACAATGATTGGTGTTCTGTCTGCCTGTAGTCATGCCGGGCTGCTCGATGAAGGCCGCCATTACTTTCGATCGATGCGTGCACGACATGGTTTGGTACCGTTAAAGGATCATTATACATGTATGGTTGATTTACTTGGGCGAGCTGGCTGCCTTGAAGAAGCAAAAAATATAATAGAGGAGATGCCAATGCAGCCTGATGCTATCGTTTGGGGATCATTGCTTGCGGCTTGTAAAGTCCATAGGAACATAAAATTGGGGGAATATGTAGTGGAGAAGCTTTTAGAGGTAGATCCTGAGAATTCTGGGCCATATGTTCTTCTCTCTAATATGTATGCTGAACGTGGAGATTGGGGGAATGTTATGAGAATAAGGAAGCTGATGAGACAGAGAGGAGTGGTTAAACAACCAGGTTGCAGTTGGATTGAAATTCAGGGTGAGTTGAATGTTTTTATGGTTAAAGATAAAAGACATGCAAGGAAGCAAGAAATCTACATGCTTTTGAGGACACTTTTACAGCAGATGAAACGGGCTGGATATATATATTAA
Protein sequence
MAGNGFVKRLTGDLLFLDSSPLSKLLNQCARSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLLRTLLQQMKRAGYIY
Homology
BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match:
Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)
HSP 1 Score: 932.9 bits (2410), Expect = 2.0e-270
Identity = 446/669 (66.67%), Postives = 541/669 (80.87%), Query Frame = 0
Query: 1 MAGNGFVKRLTGDLLFLDSSPLSKLLNQCARSK-SARDTSRVHACIIKSPFASEVFIQNR 60
MA F+K F DSSP +KLL+ C +SK SA VHA +IKS F++E+FIQNR
Sbjct: 1 MATKSFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNR 60
Query: 61 LIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSW 120
LID Y KCG + R+VFD+M +RNI++WNS++ TK GFLD+A +F MP+ DQC+W
Sbjct: 61 LIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTW 120
Query: 121 NSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYR 180
NSM+SGF QHD +EAL YF MH GF +NEYSF S LSAC+GL D+ G Q+HSLI +
Sbjct: 121 NSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAK 180
Query: 181 SNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSI 240
S +LSD+Y+GSALVDMYSKCG V+ A+ VFD M R+ VSWNSLITC+EQNGP EAL +
Sbjct: 181 SPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDV 240
Query: 241 FVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYA 300
F M+E VEPDEVTLASV+SACA++SAIK GQ++H RVVK D+ RND+IL NA +DMYA
Sbjct: 241 FQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYA 300
Query: 301 KCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAG 360
KC+RI EAR +FD MPIR+V++ETSM+SGYA A+S KAAR MF+ M ++V++WNALIAG
Sbjct: 301 KCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAG 360
Query: 361 CTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFR 420
TQNGENEEAL+LF LLKRESV PTHY+F N+L ACA+LA+L LG QAH HVLKHGF+F+
Sbjct: 361 YTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQ 420
Query: 421 YGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGI 480
G+E DIFVGNSLIDMY+KCG VE G VF +M+ERDCVSWNAMI+G+AQNG+GN+AL +
Sbjct: 421 SGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALEL 480
Query: 481 FSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLG 540
F EMLESGEKPDH+TMIGVLSAC HAG ++EGRHYF SM G+ PL+DHYTCMVDLLG
Sbjct: 481 FREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLG 540
Query: 541 RAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVL 600
RAG LEEAK++IEEMPMQPD+++WGSLLAACKVHRNI LG+YV EKLLEV+P NSGPYVL
Sbjct: 541 RAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVL 600
Query: 601 LSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYML 660
LSNMYAE G W +VM +RK MR+ GV KQPGCSWI+IQG +VFMVKDK H RK++I+ L
Sbjct: 601 LSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSL 660
Query: 661 LRTLLQQMK 669
L L+ +M+
Sbjct: 661 LDILIAEMR 669
BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match:
Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)
HSP 1 Score: 479.9 bits (1234), Expect = 4.6e-134
Identity = 248/687 (36.10%), Postives = 397/687 (57.79%), Query Frame = 0
Query: 23 SKLLNQCARSKSARDTSR-VHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRML 82
+ LL + + R T++ VH +IKS V++ N L++VY K G ARK+FD M
Sbjct: 17 TNLLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMP 76
Query: 83 ERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQ 142
R FSWN+++ A++K G +D F+++PQ D SW +MI G++ + +A++
Sbjct: 77 LRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGD 136
Query: 143 MHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGR 202
M G +++ + L++ A + ++ G ++HS I + ++ + ++L++MY+KCG
Sbjct: 137 MVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGD 196
Query: 203 VDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGV------------- 262
A+ VFD M VR SWN++I + Q G +D A++ F +M E +
Sbjct: 197 PMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQR 256
Query: 263 -------------------EPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLI 322
PD TLASV+SACA + + G+QIH+ +V + ++
Sbjct: 257 GYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIV 316
Query: 323 LGNALLDMYAKCNRINEARIVFDRMPIRSVVSE--TSMVSGYAKASSVKAARSMFSNMMV 382
L NAL+ MY++C + AR + ++ + + E T+++ GY K + A+++F ++
Sbjct: 317 L-NALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKD 376
Query: 383 KDVITWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQA 442
+DV+ W A+I G Q+G EA+ LFR + P YT +L+ ++LA L G+Q
Sbjct: 377 RDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQI 436
Query: 443 HSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERM-LERDCVSWNAMIVG 502
H +K G+ + V N+LI MY K G++ S R F+ + ERD VSW +MI+
Sbjct: 437 HGSAVKS------GEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIA 496
Query: 503 YAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVP 562
AQ+G +AL +F ML G +PDH+T +GV SAC+HAGL+++GR YF M+ ++P
Sbjct: 497 LAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIP 556
Query: 563 LKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKL 622
HY CMVDL GRAG L+EA+ IE+MP++PD + WGSLL+AC+VH+NI LG+ E+L
Sbjct: 557 TLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERL 616
Query: 623 LEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVK 674
L ++PENSG Y L+N+Y+ G W +IRK M+ V K+ G SWIE++ +++VF V+
Sbjct: 617 LLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVE 676
BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match:
Q9SY02 (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)
HSP 1 Score: 472.6 bits (1215), Expect = 7.3e-132
Identity = 250/616 (40.58%), Postives = 377/616 (61.20%), Query Frame = 0
Query: 58 NRLIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQC 117
N +I Y + G +ARK+FD M ER++ SWN +I + ++ L A +FE MP+ D C
Sbjct: 99 NGMISGYLRNGEFELARKLFDEMPERDLVSWNVMIKGYVRNRNLGKARELFEIMPERDVC 158
Query: 118 SWNSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLI 177
SWN+M+SG+ Q+ C D+A F +M N+ S+ + LSA +Q+ KM + ++
Sbjct: 159 SWNTMLSGYAQNGCVDDARSVFDRMPE----KNDVSWNALLSAY--VQNSKM--EEACML 218
Query: 178 YRSNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEAL 237
++S + + L+ + K ++ AR FD M VR VSWN++IT Y Q+G +DEA
Sbjct: 219 FKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEAR 278
Query: 238 SIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDM 297
+F E D T ++VS ++E +++ ++ + +E + NA+L
Sbjct: 279 QLFDE----SPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNE-----VSWNAMLAG 338
Query: 298 YAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALI 357
Y + R+ A+ +FD MP R+V + +M++GYA+ + A+++F M +D ++W A+I
Sbjct: 339 YVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMI 398
Query: 358 AGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFR 417
AG +Q+G + EAL LF ++RE +F + L+ CA++ L+LG+Q H ++K G+
Sbjct: 399 AGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGY- 458
Query: 418 FRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKAL 477
E+ FVGN+L+ MY KCGS+E +F+ M +D VSWN MI GY+++GFG AL
Sbjct: 459 -----ETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVAL 518
Query: 478 GIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDL 537
F M G KPD TM+ VLSACSH GL+D+GR YF +M +G++P HY CMVDL
Sbjct: 519 RFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDL 578
Query: 538 LGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPY 597
LGRAG LE+A N+++ MP +PDA +WG+LL A +VH N +L E +K+ ++PENSG Y
Sbjct: 579 LGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMY 638
Query: 598 VLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIY 657
VLLSN+YA G WG+V ++R MR +GV K PG SWIEIQ + + F V D+ H K EI+
Sbjct: 639 VLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIF 691
Query: 658 MLLRTLLQQMKRAGYI 674
L L +MK+AGY+
Sbjct: 699 AFLEELDLRMKKAGYV 691
BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match:
Q9FRI5 (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H74 PE=2 SV=1)
HSP 1 Score: 457.2 bits (1175), Expect = 3.2e-127
Identity = 248/682 (36.36%), Postives = 383/682 (56.16%), Query Frame = 0
Query: 31 RSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNIFSWNS 90
R S + VH II F I NRLIDVY K + AR++FD + E + + +
Sbjct: 26 RRTSLQLARAVHGNIITFGFQPRAHILNRLIDVYCKSSELNYARQLFDEISEPDKIARTT 85
Query: 91 IICAFTKSGFLDDAVHIFEKMP--QVDQCSWNSMISGFEQHDCFDEALKYFVQMHGHGFF 150
++ + SG + A +FEK P D +N+MI+GF ++ A+ F +M GF
Sbjct: 86 MVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSHNNDGYSAINLFCKMKHEGFK 145
Query: 151 MNEYSFGSALSACAGL-QDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGR----VD 210
+ ++F S L+ A + D K Q H+ +S + +ALV +YSKC +
Sbjct: 146 PDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITSVSNALVSVYSKCASSPSLLH 205
Query: 211 CARSVFDGMTVRSRVSWNSLITCYEQNGPVD----------------------------- 270
AR VFD + + SW +++T Y +NG D
Sbjct: 206 SARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLEGMDDNMKLVAYNAMISGYVNRG 265
Query: 271 ---EALSIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILG 330
EAL + M+ G+E DE T SV+ ACAT ++ G+Q+HA V++ ++F
Sbjct: 266 FYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQLGKQVHAYVLRREDF--SFHFD 325
Query: 331 NALLDMYAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVI 390
N+L+ +Y KC + +EAR +F++MP + +VS +++SGY + + A+ +F M K+++
Sbjct: 326 NSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYVSSGHIGEAKLIFKEMKEKNIL 385
Query: 391 TWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHV 450
+W +I+G +NG EE L LF +KRE P Y F + +CA L G+Q H+ +
Sbjct: 386 SWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQL 445
Query: 451 LKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNG 510
LK GF +S + GN+LI MY KCG VE +VF M D VSWNA+I Q+G
Sbjct: 446 LKIGF------DSSLSAGNALITMYAKCGVVEEARQVFRTMPCLDSVSWNALIAALGQHG 505
Query: 511 FGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHY 570
G +A+ ++ EML+ G +PD +T++ VL+ACSHAGL+D+GR YF SM + + P DHY
Sbjct: 506 HGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYFDSMETVYRIPPGADHY 565
Query: 571 TCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDP 630
++DLL R+G +A+++IE +P +P A +W +LL+ C+VH N++LG +KL + P
Sbjct: 566 ARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCRVHGNMELGIIAADKLFGLIP 625
Query: 631 ENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHA 674
E+ G Y+LLSNM+A G W V R+RKLMR RGV K+ CSWIE++ +++ F+V D H
Sbjct: 626 EHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVACSWIEMETQVHTFLVDDTSHP 685
BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match:
Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)
HSP 1 Score: 447.6 bits (1150), Expect = 2.5e-124
Identity = 232/649 (35.75%), Postives = 373/649 (57.47%), Query Frame = 0
Query: 26 LNQCARSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNI 85
L+ CA+S++ + ++H I+K +A ++F+QN L+ Y +CG + ARKVFD M ERN+
Sbjct: 141 LSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNV 200
Query: 86 FSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQMHGH 145
SW S+IC + + F DAV +F +M + ++ + NS+
Sbjct: 201 VSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSV----------------------- 260
Query: 146 GFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGRVDCA 205
+ +SACA L+DL+ G ++++ I S + M SALVDMY KC +D A
Sbjct: 261 -------TMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVA 320
Query: 206 RSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGVEPDEVTLASVVSACATV 265
+ +FD + N++ + Y + G EAL +F M++ GV PD +++ S +S+C+ +
Sbjct: 321 KRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQL 380
Query: 266 SAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAKCNRINEARIVFDRMPIRSVVSETSM 325
I G+ H V++ + F + + NAL+DMY KC+R + A +FDRM ++VV+ S+
Sbjct: 381 RNILWGKSCHGYVLR-NGFESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSI 440
Query: 326 VSGYAKASSVKAARSMFSNMMVKDVITWNALIAGCTQNGENEEALTLF-RLLKRESVWPT 385
V+GY + V AA F M K++++WN +I+G Q EEA+ +F + +E V
Sbjct: 441 VAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNAD 500
Query: 386 HYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVES 445
T ++ +AC +L L L + + ++ K+G + D+ +G +L+DM+ +CG ES
Sbjct: 501 GVTMMSIASACGHLGALDLAKWIYYYIEKNGIQL------DVRLGTTLVDMFSRCGDPES 560
Query: 446 GCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSH 505
+F + RD +W A I A G +A+ +F +M+E G KPD V +G L+ACSH
Sbjct: 561 AMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSH 620
Query: 506 AGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWG 565
GL+ +G+ F SM HG+ P HY CMVDLLGRAG LEEA +IE+MPM+P+ ++W
Sbjct: 621 GGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWN 680
Query: 566 SLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRG 625
SLLAAC+V N+++ Y EK+ + PE +G YVLLSN+YA G W ++ ++R M+++G
Sbjct: 681 SLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKG 740
Query: 626 VVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLLRTLLQQMKRAGYI 674
+ K PG S I+I+G+ + F D+ H I +L + Q+ G++
Sbjct: 741 LRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHV 752
BLAST of CmaCh16G012950 vs. TAIR 10
Match:
AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 932.9 bits (2410), Expect = 1.4e-271
Identity = 446/669 (66.67%), Postives = 541/669 (80.87%), Query Frame = 0
Query: 1 MAGNGFVKRLTGDLLFLDSSPLSKLLNQCARSK-SARDTSRVHACIIKSPFASEVFIQNR 60
MA F+K F DSSP +KLL+ C +SK SA VHA +IKS F++E+FIQNR
Sbjct: 1 MATKSFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNR 60
Query: 61 LIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSW 120
LID Y KCG + R+VFD+M +RNI++WNS++ TK GFLD+A +F MP+ DQC+W
Sbjct: 61 LIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTW 120
Query: 121 NSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYR 180
NSM+SGF QHD +EAL YF MH GF +NEYSF S LSAC+GL D+ G Q+HSLI +
Sbjct: 121 NSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAK 180
Query: 181 SNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSI 240
S +LSD+Y+GSALVDMYSKCG V+ A+ VFD M R+ VSWNSLITC+EQNGP EAL +
Sbjct: 181 SPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDV 240
Query: 241 FVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYA 300
F M+E VEPDEVTLASV+SACA++SAIK GQ++H RVVK D+ RND+IL NA +DMYA
Sbjct: 241 FQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYA 300
Query: 301 KCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAG 360
KC+RI EAR +FD MPIR+V++ETSM+SGYA A+S KAAR MF+ M ++V++WNALIAG
Sbjct: 301 KCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAG 360
Query: 361 CTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFR 420
TQNGENEEAL+LF LLKRESV PTHY+F N+L ACA+LA+L LG QAH HVLKHGF+F+
Sbjct: 361 YTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQ 420
Query: 421 YGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGI 480
G+E DIFVGNSLIDMY+KCG VE G VF +M+ERDCVSWNAMI+G+AQNG+GN+AL +
Sbjct: 421 SGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALEL 480
Query: 481 FSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLG 540
F EMLESGEKPDH+TMIGVLSAC HAG ++EGRHYF SM G+ PL+DHYTCMVDLLG
Sbjct: 481 FREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLG 540
Query: 541 RAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVL 600
RAG LEEAK++IEEMPMQPD+++WGSLLAACKVHRNI LG+YV EKLLEV+P NSGPYVL
Sbjct: 541 RAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVL 600
Query: 601 LSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYML 660
LSNMYAE G W +VM +RK MR+ GV KQPGCSWI+IQG +VFMVKDK H RK++I+ L
Sbjct: 601 LSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSL 660
Query: 661 LRTLLQQMK 669
L L+ +M+
Sbjct: 661 LDILIAEMR 669
BLAST of CmaCh16G012950 vs. TAIR 10
Match:
AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )
HSP 1 Score: 479.9 bits (1234), Expect = 3.2e-135
Identity = 248/687 (36.10%), Postives = 397/687 (57.79%), Query Frame = 0
Query: 23 SKLLNQCARSKSARDTSR-VHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRML 82
+ LL + + R T++ VH +IKS V++ N L++VY K G ARK+FD M
Sbjct: 17 TNLLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMP 76
Query: 83 ERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQ 142
R FSWN+++ A++K G +D F+++PQ D SW +MI G++ + +A++
Sbjct: 77 LRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGD 136
Query: 143 MHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGR 202
M G +++ + L++ A + ++ G ++HS I + ++ + ++L++MY+KCG
Sbjct: 137 MVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGD 196
Query: 203 VDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGV------------- 262
A+ VFD M VR SWN++I + Q G +D A++ F +M E +
Sbjct: 197 PMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQR 256
Query: 263 -------------------EPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLI 322
PD TLASV+SACA + + G+QIH+ +V + ++
Sbjct: 257 GYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIV 316
Query: 323 LGNALLDMYAKCNRINEARIVFDRMPIRSVVSE--TSMVSGYAKASSVKAARSMFSNMMV 382
L NAL+ MY++C + AR + ++ + + E T+++ GY K + A+++F ++
Sbjct: 317 L-NALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKD 376
Query: 383 KDVITWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQA 442
+DV+ W A+I G Q+G EA+ LFR + P YT +L+ ++LA L G+Q
Sbjct: 377 RDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQI 436
Query: 443 HSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERM-LERDCVSWNAMIVG 502
H +K G+ + V N+LI MY K G++ S R F+ + ERD VSW +MI+
Sbjct: 437 HGSAVKS------GEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIA 496
Query: 503 YAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVP 562
AQ+G +AL +F ML G +PDH+T +GV SAC+HAGL+++GR YF M+ ++P
Sbjct: 497 LAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIP 556
Query: 563 LKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKL 622
HY CMVDL GRAG L+EA+ IE+MP++PD + WGSLL+AC+VH+NI LG+ E+L
Sbjct: 557 TLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERL 616
Query: 623 LEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVK 674
L ++PENSG Y L+N+Y+ G W +IRK M+ V K+ G SWIE++ +++VF V+
Sbjct: 617 LLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVE 676
BLAST of CmaCh16G012950 vs. TAIR 10
Match:
AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 472.6 bits (1215), Expect = 5.2e-133
Identity = 250/616 (40.58%), Postives = 377/616 (61.20%), Query Frame = 0
Query: 58 NRLIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQC 117
N +I Y + G +ARK+FD M ER++ SWN +I + ++ L A +FE MP+ D C
Sbjct: 99 NGMISGYLRNGEFELARKLFDEMPERDLVSWNVMIKGYVRNRNLGKARELFEIMPERDVC 158
Query: 118 SWNSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLI 177
SWN+M+SG+ Q+ C D+A F +M N+ S+ + LSA +Q+ KM + ++
Sbjct: 159 SWNTMLSGYAQNGCVDDARSVFDRMPE----KNDVSWNALLSAY--VQNSKM--EEACML 218
Query: 178 YRSNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEAL 237
++S + + L+ + K ++ AR FD M VR VSWN++IT Y Q+G +DEA
Sbjct: 219 FKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEAR 278
Query: 238 SIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDM 297
+F E D T ++VS ++E +++ ++ + +E + NA+L
Sbjct: 279 QLFDE----SPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNE-----VSWNAMLAG 338
Query: 298 YAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALI 357
Y + R+ A+ +FD MP R+V + +M++GYA+ + A+++F M +D ++W A+I
Sbjct: 339 YVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMI 398
Query: 358 AGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFR 417
AG +Q+G + EAL LF ++RE +F + L+ CA++ L+LG+Q H ++K G+
Sbjct: 399 AGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGY- 458
Query: 418 FRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKAL 477
E+ FVGN+L+ MY KCGS+E +F+ M +D VSWN MI GY+++GFG AL
Sbjct: 459 -----ETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVAL 518
Query: 478 GIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDL 537
F M G KPD TM+ VLSACSH GL+D+GR YF +M +G++P HY CMVDL
Sbjct: 519 RFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDL 578
Query: 538 LGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPY 597
LGRAG LE+A N+++ MP +PDA +WG+LL A +VH N +L E +K+ ++PENSG Y
Sbjct: 579 LGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMY 638
Query: 598 VLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIY 657
VLLSN+YA G WG+V ++R MR +GV K PG SWIEIQ + + F V D+ H K EI+
Sbjct: 639 VLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIF 691
Query: 658 MLLRTLLQQMKRAGYI 674
L L +MK+AGY+
Sbjct: 699 AFLEELDLRMKKAGYV 691
BLAST of CmaCh16G012950 vs. TAIR 10
Match:
AT1G25360.1 (Pentatricopeptide repeat (PPR) superfamily protein )
HSP 1 Score: 457.2 bits (1175), Expect = 2.3e-128
Identity = 248/682 (36.36%), Postives = 383/682 (56.16%), Query Frame = 0
Query: 31 RSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNIFSWNS 90
R S + VH II F I NRLIDVY K + AR++FD + E + + +
Sbjct: 26 RRTSLQLARAVHGNIITFGFQPRAHILNRLIDVYCKSSELNYARQLFDEISEPDKIARTT 85
Query: 91 IICAFTKSGFLDDAVHIFEKMP--QVDQCSWNSMISGFEQHDCFDEALKYFVQMHGHGFF 150
++ + SG + A +FEK P D +N+MI+GF ++ A+ F +M GF
Sbjct: 86 MVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSHNNDGYSAINLFCKMKHEGFK 145
Query: 151 MNEYSFGSALSACAGL-QDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGR----VD 210
+ ++F S L+ A + D K Q H+ +S + +ALV +YSKC +
Sbjct: 146 PDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITSVSNALVSVYSKCASSPSLLH 205
Query: 211 CARSVFDGMTVRSRVSWNSLITCYEQNGPVD----------------------------- 270
AR VFD + + SW +++T Y +NG D
Sbjct: 206 SARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLEGMDDNMKLVAYNAMISGYVNRG 265
Query: 271 ---EALSIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILG 330
EAL + M+ G+E DE T SV+ ACAT ++ G+Q+HA V++ ++F
Sbjct: 266 FYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQLGKQVHAYVLRREDF--SFHFD 325
Query: 331 NALLDMYAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVI 390
N+L+ +Y KC + +EAR +F++MP + +VS +++SGY + + A+ +F M K+++
Sbjct: 326 NSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYVSSGHIGEAKLIFKEMKEKNIL 385
Query: 391 TWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHV 450
+W +I+G +NG EE L LF +KRE P Y F + +CA L G+Q H+ +
Sbjct: 386 SWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQL 445
Query: 451 LKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNG 510
LK GF +S + GN+LI MY KCG VE +VF M D VSWNA+I Q+G
Sbjct: 446 LKIGF------DSSLSAGNALITMYAKCGVVEEARQVFRTMPCLDSVSWNALIAALGQHG 505
Query: 511 FGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHY 570
G +A+ ++ EML+ G +PD +T++ VL+ACSHAGL+D+GR YF SM + + P DHY
Sbjct: 506 HGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYFDSMETVYRIPPGADHY 565
Query: 571 TCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDP 630
++DLL R+G +A+++IE +P +P A +W +LL+ C+VH N++LG +KL + P
Sbjct: 566 ARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCRVHGNMELGIIAADKLFGLIP 625
Query: 631 ENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHA 674
E+ G Y+LLSNM+A G W V R+RKLMR RGV K+ CSWIE++ +++ F+V D H
Sbjct: 626 EHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVACSWIEMETQVHTFLVDDTSHP 685
BLAST of CmaCh16G012950 vs. TAIR 10
Match:
AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )
HSP 1 Score: 447.6 bits (1150), Expect = 1.8e-125
Identity = 232/649 (35.75%), Postives = 373/649 (57.47%), Query Frame = 0
Query: 26 LNQCARSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNI 85
L+ CA+S++ + ++H I+K +A ++F+QN L+ Y +CG + ARKVFD M ERN+
Sbjct: 141 LSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNV 200
Query: 86 FSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQMHGH 145
SW S+IC + + F DAV +F +M + ++ + NS+
Sbjct: 201 VSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSV----------------------- 260
Query: 146 GFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGRVDCA 205
+ +SACA L+DL+ G ++++ I S + M SALVDMY KC +D A
Sbjct: 261 -------TMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVA 320
Query: 206 RSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGVEPDEVTLASVVSACATV 265
+ +FD + N++ + Y + G EAL +F M++ GV PD +++ S +S+C+ +
Sbjct: 321 KRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQL 380
Query: 266 SAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAKCNRINEARIVFDRMPIRSVVSETSM 325
I G+ H V++ + F + + NAL+DMY KC+R + A +FDRM ++VV+ S+
Sbjct: 381 RNILWGKSCHGYVLR-NGFESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSI 440
Query: 326 VSGYAKASSVKAARSMFSNMMVKDVITWNALIAGCTQNGENEEALTLF-RLLKRESVWPT 385
V+GY + V AA F M K++++WN +I+G Q EEA+ +F + +E V
Sbjct: 441 VAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNAD 500
Query: 386 HYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVES 445
T ++ +AC +L L L + + ++ K+G + D+ +G +L+DM+ +CG ES
Sbjct: 501 GVTMMSIASACGHLGALDLAKWIYYYIEKNGIQL------DVRLGTTLVDMFSRCGDPES 560
Query: 446 GCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSH 505
+F + RD +W A I A G +A+ +F +M+E G KPD V +G L+ACSH
Sbjct: 561 AMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSH 620
Query: 506 AGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWG 565
GL+ +G+ F SM HG+ P HY CMVDLLGRAG LEEA +IE+MPM+P+ ++W
Sbjct: 621 GGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWN 680
Query: 566 SLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRG 625
SLLAAC+V N+++ Y EK+ + PE +G YVLLSN+YA G W ++ ++R M+++G
Sbjct: 681 SLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKG 740
Query: 626 VVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLLRTLLQQMKRAGYI 674
+ K PG S I+I+G+ + F D+ H I +L + Q+ G++
Sbjct: 741 LRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHV 752
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9SIT7 | 2.0e-270 | 66.67 | Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... | [more] |
Q9SHZ8 | 4.6e-134 | 36.10 | Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... | [more] |
Q9SY02 | 7.3e-132 | 40.58 | Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX... | [more] |
Q9FRI5 | 3.2e-127 | 36.36 | Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX... | [more] |
Q9LUJ2 | 2.5e-124 | 35.75 | Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX... | [more] |
Match Name | E-value | Identity | Description | |
AT2G13600.1 | 1.4e-271 | 66.67 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT2G22070.1 | 3.2e-135 | 36.10 | pentatricopeptide (PPR) repeat-containing protein | [more] |
AT4G02750.1 | 5.2e-133 | 40.58 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT1G25360.1 | 2.3e-128 | 36.36 | Pentatricopeptide repeat (PPR) superfamily protein | [more] |
AT3G22690.1 | 1.8e-125 | 35.75 | CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012... | [more] |