Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGCCATTCTCATCTTCCTCCTTGCTTCATCTGTTTTCTCTGACAGGAACGGAGCAATGTCGTCGATTTCTCATCTTTTAATCCTTTTCTTCGTCGTCTTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCTGAAACAAGAAAGCGATGCCAATAATGAAGAACAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGCTATCACGATCAATCTCGCCTCCGAGCCATCTCCGCCCACCTGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGTGAAGGAGGCGTCGGGTTCGAATCATCCTCCACATTCGCAGACGCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAAGTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAAAATGAGAGAGAGATTCAATTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCCAACACCCCTTGTTCCTATACCTACAGGTATTAATAATTAATATTATTATTATTATTATTATTTTAAATAAAATTTTGGTGGGGACCATTATAACAATAAATGGATGTTGGTTGGTCTGTTTAAAAAATAAATAAATAACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACAGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATGACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCTTAGGCTCTAGCATCTACTCCTTCGTCTACAAAGCCGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCTCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAAACTCTTCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGTCCGGAATCTCCGTCGACGGACAGATCCTGAACATCCCCCCTCACGTTTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACGGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAAGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACTGAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGACGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCGATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAGACTTTCATTTGGAAATATGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGATTGCGCCTAGACCTTCTCCATTTTCTTTCATTTATTACTTCCTTCTTATTAATTATACTCATATAAT
mRNA sequence
CGCCATTCTCATCTTCCTCCTTGCTTCATCTGTTTTCTCTGACAGGAACGGAGCAATGTCGTCGATTTCTCATCTTTTAATCCTTTTCTTCGTCGTCTTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCTGAAACAAGAAAGCGATGCCAATAATGAAGAACAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGCTATCACGATCAATCTCGCCTCCGAGCCATCTCCGCCCACCTGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGTGAAGGAGGCGTCGGGTTCGAATCATCCTCCACATTCGCAGACGCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAAGTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAAAATGAGAGAGAGATTCAATTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCCAACACCCCTTGTTCCTATACCTACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACAGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATGACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCTTAGGCTCTAGCATCTACTCCTTCGTCTACAAAGCCGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCTCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAAACTCTTCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGTCCGGAATCTCCGTCGACGGACAGATCCTGAACATCCCCCCTCACGTTTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACGGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAAGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACTGAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGACGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCGATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAGACTTTCATTTGGAAATATGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGATTGCGCCTAGACCTTCTCCATTTTCTTTCATTTATTACTTCCTTCTTATTAATTATACTCATATAAT
Coding sequence (CDS)
ATGTCGTCGATTTCTCATCTTTTAATCCTTTTCTTCGTCGTCTTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCTGAAACAAGAAAGCGATGCCAATAATGAAGAACAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGCTATCACGATCAATCTCGCCTCCGAGCCATCTCCGCCCACCTGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGTGAAGGAGGCGTCGGGTTCGAATCATCCTCCACATTCGCAGACGCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAAGTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAAAATGAGAGAGAGATTCAATTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCCAACACCCCTTGTTCCTATACCTACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACAGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATGACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCTTAGGCTCTAGCATCTACTCCTTCGTCTACAAAGCCGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCTCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAAACTCTTCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGTCCGGAATCTCCGTCGACGGACAGATCCTGAACATCCCCCCTCACGTTTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACGGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAAGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACTGAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGACGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCGATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAGACTTTCATTTGGAAATATGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGATTGCGCCTAG
Protein sequence
MSSISHLLILFFVVFFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEKVKEASGSNHPPHSQTPIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNKMRERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTGGRYSCYYGVQLSGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMEKDVKGEREKNFKLCFNDTEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA
Homology
BLAST of CmaCh02G006910 vs. ExPASy Swiss-Prot
Match:
Q9LTW4 (Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE=1 SV=1)
HSP 1 Score: 273.1 bits (697), Expect = 6.6e-72
Identity = 163/466 (34.98%), Postives = 241/466 (51.72%), Query Frame = 0
Query: 60 VKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEKVKEASGSNHPPHS 119
+K H + + K RI+D+ DQ R IS N+ VK GS
Sbjct: 51 LKLAHRDTLLPKPLSRIEDVIGADQKRHSLISRK-------RNSTVGVKMDLGS------ 110
Query: 120 QTPIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS 179
G D+G++++F +++VGTP +KF ++ DTGS+L W CRYR
Sbjct: 111 ----------GIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYR---------- 170
Query: 180 PIHKMRNKMRERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYS 239
R K R A++S SF + C ++ C D L CPTP+TPCSY Y
Sbjct: 171 ----ARGKDNRR---VFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 230
Query: 240 YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYS 299
Y G A G+FA ET+TV LTNG+ +L L GC+ T F GADG++GL S +S
Sbjct: 231 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSF-QGADGVLGLAFSDFS 290
Query: 300 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTG 359
F A + G FSYCL DHL N +Y +FG+ + ++P+
Sbjct: 291 FT-STATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLT-------- 350
Query: 360 GRYSCYYGVQLSGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA 419
R +Y + + GIS+ +L+IP VW+ SG GTILD+GTSLT+L A+ V+ +A
Sbjct: 351 -RIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLA 410
Query: 420 PKIEKFGRMEKDVKGEREKNFKLCFNDTE-WNFGMLPKLGFHFEDGAVFEPPDRSYIVSA 479
+ + R++ + + CF+ T +N LP+L FH + GA FEP +SY+V A
Sbjct: 411 RYLVELKRVKPE-----GVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDA 460
Query: 480 SYQCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDC 525
+ C+ S P+ N++GNI+QQ ++W++DL+ +++FAPS C
Sbjct: 471 APGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460
BLAST of CmaCh02G006910 vs. ExPASy Swiss-Prot
Match:
Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)
HSP 1 Score: 158.3 bits (399), Expect = 2.4e-37
Identity = 127/398 (31.91%), Postives = 183/398 (45.98%), Query Frame = 0
Query: 134 GSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNKMRERFN 193
G E+ + + +GTP F+ I DTGSDL+WT+C C S P+PI FN
Sbjct: 92 GDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQC--EPCTQCFSQPTPI----------FN 151
Query: 194 YALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYSYLSGDRAMGIFATE 253
SSSFS +PC S+ C QD P N C YTY Y G G ATE
Sbjct: 152 ----PQDSSSFSTLPCESQYC-QDL------PSETCNNNECQYTYGYGDGSTTQGYMATE 211
Query: 254 TVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYSFVYKAAENNVG-GG 313
T T ++ + +I +GC E+ +GA GLIG+G S + +G G
Sbjct: 212 TFTFETSS-----VPNIAFGCGEDNQGFGQGNGA-GLIGMGWGPLSL-----PSQLGVGQ 271
Query: 314 FSYCLADHLRNITAISYFVFGTPSPKTF---SASTSSPIGPPATTKLFTGGRYSCYYGVQ 373
FSYC+ +G+ SP T SA++ P G P+TT L YY +
Sbjct: 272 FSYCMTS------------YGSSSPSTLALGSAASGVPEGSPSTT-LIHSSLNPTYYYIT 331
Query: 374 LSGISVDGQILNIPPHVWNIKSG--CGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGR 433
L GI+V G L IP + ++ G I+D+GT+LT L A++AV +A +I
Sbjct: 332 LQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI----- 391
Query: 434 MEKDVKGEREKNFKLCFND-TEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSASYQCSCIA 493
E CF ++ + +P++ F DG V +++ ++S + C+A
Sbjct: 392 -NLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQF-DGGVLNLGEQNILISPAEGVICLA 435
Query: 494 ITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDC 525
+ S I+I GNI QQ YDL +V+F P+ C
Sbjct: 452 MGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
BLAST of CmaCh02G006910 vs. ExPASy Swiss-Prot
Match:
Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)
HSP 1 Score: 148.7 bits (374), Expect = 1.9e-34
Identity = 113/391 (28.90%), Postives = 177/391 (45.27%), Query Frame = 0
Query: 135 SSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNKMRERFNY 194
S E+ + + +GTPP IADTGSDLLWT+C +P ++ F+
Sbjct: 87 SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQC------------APCDDCYTQVDPLFD- 146
Query: 195 ALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYSYLSGDRAMGIFATET 254
SS++ + CSS QC + L Q C T + CSY+ SY G A +T
Sbjct: 147 ---PKTSSTYKDVSCSSSQC----TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 206
Query: 255 VTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFS 314
+T+ ++ + QLK+I+ GC + F G++GLG S + K +++ G FS
Sbjct: 207 LTLGSSDTRPMQLKNIIIGCGHNNAGT-FNKKGSGIVGLGGGPVSLI-KQLGDSIDGKFS 266
Query: 315 YCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTGGRYSCYYGVQLSGIS 374
YCL S FGT + + S S+P+ A+ + F Y + L IS
Sbjct: 267 YCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETF--------YYLTLKSIS 326
Query: 375 VDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMEKDVKG 434
V + + G I+D+GT+LT+L + + +A+A I + + K
Sbjct: 327 VGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSI------DAEKKQ 386
Query: 435 EREKNFKLCFNDTEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSASYQCSCIAITSLPFPS 494
+ + LC++ T +P + HF DGA + + V S C A PS
Sbjct: 387 DPQSGLSLCYSAT--GDLKVPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRG--SPS 435
Query: 495 INILGNIIQQTFIWKYDLLKGSVTFAPSDCA 526
+I GN+ Q F+ YD + +V+F P+DCA
Sbjct: 447 FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
BLAST of CmaCh02G006910 vs. ExPASy Swiss-Prot
Match:
Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)
HSP 1 Score: 142.1 bits (357), Expect = 1.8e-32
Identity = 120/398 (30.15%), Postives = 173/398 (43.47%), Query Frame = 0
Query: 134 GSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNKMRERFN 193
G E+ + L +GTP Q F+ I DTGSDL+WT+C+ P + N+ FN
Sbjct: 91 GDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ------------PCTQCFNQSTPIFN 150
Query: 194 YALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYSYLSGDRAMGIFATE 253
SSSFS +PCSS+ C + P C N C YTY Y G G TE
Sbjct: 151 ----PQGSSSFSTLPCSSQLC-----QALSSPTC--SNNFCQYTYGYGDGSETQGSMGTE 210
Query: 254 TVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGF 313
T+T + + +I +GC E +GA GL+G+G S + F
Sbjct: 211 TLTFGSVS-----IPNITFGCGENNQGFGQGNGA-GLVGMGRGPLSLPSQLDVTK----F 270
Query: 314 SYCLADHLRNITAISYFVFGTPSPKTF---SASTSSPIGPPATTKLFTGGRYSCYYGVQL 373
SYC+ G+ +P S + S G P TT L + +Y + L
Sbjct: 271 SYCMTP------------IGSSTPSNLLLGSLANSVTAGSPNTT-LIQSSQIPTFYYITL 330
Query: 374 SGISVDGQILNIPPHVWNIKSGCGT---ILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGR 433
+G+SV L I P + + S GT I+D+GT+LT A+ +V + E +
Sbjct: 331 NGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSV------RQEFISQ 390
Query: 434 MEKDVKGEREKNFKLCF-NDTEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSASYQCSCIA 493
+ V F LCF ++ + +P HF DG E P +Y +S S C+A
Sbjct: 391 INLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHF-DGGDLELPSENYFISPSNGLICLA 434
Query: 494 ITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDC 525
+ S ++I GNI QQ + YD V+FA + C
Sbjct: 451 MGS-SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
BLAST of CmaCh02G006910 vs. ExPASy Swiss-Prot
Match:
Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)
HSP 1 Score: 138.7 bits (348), Expect = 1.9e-31
Identity = 121/400 (30.25%), Postives = 175/400 (43.75%), Query Frame = 0
Query: 130 GADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNKMR 189
G GS E+F +L VGTP + M+ DTGSD++W +C CR S PI R
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC--APCRRCYSQSDPIFDPR---- 193
Query: 190 ERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYSYLSGDRAMGI 249
+S +++ IPCSS C + S C T C Y SY G +G
Sbjct: 194 ----------KSKTYATIPCSSPHCRRLDS-----AGCNTRRKTCLYQVSYGDGSFTVGD 253
Query: 250 FATETVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYSFVYKAAENNV 309
F+TET+T R ++K + GC + + GA GL+GLG SF + +
Sbjct: 254 FSTETLTFR-----RNRVKGVALGCGHD--NEGLFVGAAGLLGLGKGKLSFPGQTG-HRF 313
Query: 310 GGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTGGRYSCYYGVQ 369
FSYCL D + PS F + S I T L + + +Y V
Sbjct: 314 NQKFSYCLVDRSAS---------SKPSSVVFGNAAVSRIA--RFTPLLSNPKLDTFYYVG 373
Query: 370 LSGISVDG-QILNIPPHVWNIK--SGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFG 429
L GISV G ++ + ++ + G I+D+GTS+T L PA+ A+ +A F
Sbjct: 374 LLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDA-------FR 433
Query: 430 RMEKDVKGEREKN-FKLCFNDTEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSASYQCSCI 489
K +K + + F CF+ + N +P + HF GA P +Y++
Sbjct: 434 VGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFC 485
Query: 490 AITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA 526
+ ++I+GNI QQ F YDL V FAP CA
Sbjct: 494 FAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
BLAST of CmaCh02G006910 vs. TAIR 10
Match:
AT3G12700.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 273.1 bits (697), Expect = 4.7e-73
Identity = 163/466 (34.98%), Postives = 241/466 (51.72%), Query Frame = 0
Query: 60 VKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEKVKEASGSNHPPHS 119
+K H + + K RI+D+ DQ R IS N+ VK GS
Sbjct: 51 LKLAHRDTLLPKPLSRIEDVIGADQKRHSLISRK-------RNSTVGVKMDLGS------ 110
Query: 120 QTPIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS 179
G D+G++++F +++VGTP +KF ++ DTGS+L W CRYR
Sbjct: 111 ----------GIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYR---------- 170
Query: 180 PIHKMRNKMRERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYS 239
R K R A++S SF + C ++ C D L CPTP+TPCSY Y
Sbjct: 171 ----ARGKDNRR---VFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 230
Query: 240 YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYS 299
Y G A G+FA ET+TV LTNG+ +L L GC+ T F GADG++GL S +S
Sbjct: 231 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSF-QGADGVLGLAFSDFS 290
Query: 300 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTG 359
F A + G FSYCL DHL N +Y +FG+ + ++P+
Sbjct: 291 FT-STATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLT-------- 350
Query: 360 GRYSCYYGVQLSGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA 419
R +Y + + GIS+ +L+IP VW+ SG GTILD+GTSLT+L A+ V+ +A
Sbjct: 351 -RIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLA 410
Query: 420 PKIEKFGRMEKDVKGEREKNFKLCFNDTE-WNFGMLPKLGFHFEDGAVFEPPDRSYIVSA 479
+ + R++ + + CF+ T +N LP+L FH + GA FEP +SY+V A
Sbjct: 411 RYLVELKRVKPE-----GVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDA 460
Query: 480 SYQCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDC 525
+ C+ S P+ N++GNI+QQ ++W++DL+ +++FAPS C
Sbjct: 471 APGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460
BLAST of CmaCh02G006910 vs. TAIR 10
Match:
AT3G25700.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 202.6 bits (514), Expect = 7.8e-52
Identity = 133/411 (32.36%), Postives = 199/411 (48.42%), Query Frame = 0
Query: 130 GADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNKMR 189
GA GS ++FV L++G PPQ +IADTGSDL+W +C CR +CS+ SP
Sbjct: 76 GAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKC--SACR-NCSHHSPA-------- 135
Query: 190 ERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD-CPTPN-----TPCSYTYSYLSG 249
+ SS+FSP C C L +PD P N + C Y Y Y G
Sbjct: 136 ----TVFFPRHSSTFSPAHCYDPVC-----RLVPKPDRAPICNHTRIHSTCHYEYGYADG 195
Query: 250 DRAMGIFATETVTVRLTNGKEKQLKDILYGC----TEEMTDSQFLDGADGLIGLGSSIYS 309
G+FA ET +++ ++GKE +LK + +GC + + +GA+G++GLG S
Sbjct: 196 SLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPIS 255
Query: 310 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTG 369
F + G FSYCL D+ + SY + G + I T L T
Sbjct: 256 FASQLG-RRFGNKFSYCLMDYTLSPPPTSYLIIG---------NGGDGISKLFFTPLLTN 315
Query: 370 GRYSCYYGVQLSGISVDGQILNIPPHVWNI--KSGCGTILDTGTSLTMLTAPAHDAVIEA 429
+Y V+L + V+G L I P +W I GT++D+GT+L L PA+ +VI A
Sbjct: 316 PLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAA 375
Query: 430 MAPKIEKFGRMEKDVKGEREKNFKLCFN--DTEWNFGMLPKLGFHFEDGAVFEPPDRSYI 489
+ R++ + F LC N +LP+L F F GAVF PP R+Y
Sbjct: 376 VR------RRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYF 435
Query: 490 VSASYQCSCIAITSL-PFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA 526
+ Q C+AI S+ P +++GN++QQ F++++D + + F+ CA
Sbjct: 436 IETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450
BLAST of CmaCh02G006910 vs. TAIR 10
Match:
AT2G42980.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 173.7 bits (439), Expect = 3.9e-43
Identity = 149/492 (30.28%), Postives = 217/492 (44.11%), Query Frame = 0
Query: 55 HHPEVVK---RLHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEKVK--- 114
H E VK R+ E K + + D++ D +R++ + A N +K +N + + K
Sbjct: 73 HTRESVKPQSRIKQETK--RTTHSVVDLQIQDLTRIKTLHARFNKSKKQKNEKVRKKITS 132
Query: 115 EASGSNHPPHSQTPIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRY 174
+ S P S + G GS E+F+ + VGTPP+ F++I DTGSDL W +C
Sbjct: 133 DISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQC-- 192
Query: 175 RRCRGDCSNPSPIHKMRNKMRERFNYALY-ANQSSSFSPIPCSSKQCIQDFSELGGQPD- 234
C DC H+ N Y S+SF I C+ +C L PD
Sbjct: 193 LPCY-DC-----FHQ---------NGMFYDPKTSASFKNITCNDPRC-----SLISSPDP 252
Query: 235 ---CPTPNTPCSYTYSYLSGDRAMGIFATETVTVRLT----NGKEKQLKDILYGCTEEMT 294
C + N C Y Y Y G FA ET TV LT E ++ ++++GC
Sbjct: 253 PVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGH--W 312
Query: 295 DSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKT 354
+ GA GL+GLG SF ++ G FSYCL D N S +FG
Sbjct: 313 NRGLFSGASGLLGLGRGPLSF-SSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLL 372
Query: 355 FSASTSSPIGPPATTKLFTGGRYS--CYYGVQLSGISVDGQILNIPPHVWNIKS--GCGT 414
+ + T G S +Y +Q+ I V G+ L+IP WNI S GT
Sbjct: 373 NHTNLN-------FTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGGT 432
Query: 415 ILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMEKDVKGEREKNFKLCFN--DTEWNFGM 474
I+D+GT+L+ PA++ + A K+++ + +D CFN E N
Sbjct: 433 IIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDF-----PVLDPCFNVSGIEENNIH 492
Query: 475 LPKLGFHFEDGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTFIWKYDLL 526
LP+LG F DG V+ P + + S C+AI P + +I+GN QQ F YD
Sbjct: 493 LPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTK 525
BLAST of CmaCh02G006910 vs. TAIR 10
Match:
AT3G59080.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 167.5 bits (423), Expect = 2.8e-41
Identity = 151/538 (28.07%), Postives = 230/538 (42.75%), Query Frame = 0
Query: 11 FFVVFFFSPLTVAVADQSNANNL------KQESDANNEEQEFVRLDLIHRHHPEVVKRLH 70
F + F +P+ A S +N+ K+ + E + V+ L R K
Sbjct: 38 FSGIDFPNPMRFGSASSSTSNDCGFSSPEKEPTKERTGENKTVKFHLKRRETTTTEKATT 97
Query: 71 DEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWT---KVVENAEEKVKEASGSNHPPHSQT 130
+ + +E +I+D+ R + + T K +N +E V ++
Sbjct: 98 NSV----LELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAG 157
Query: 131 PIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 190
+ G GS E+F+ + VG+PP+ F++I DTGSDL W +C C DC
Sbjct: 158 QLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQC--LPCY-DCFQQ--- 217
Query: 191 HKMRNKMRERFNYALY-ANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTP----NTPCSY 250
N A Y S+S+ I C+ ++C L PD P P N C Y
Sbjct: 218 -----------NGAFYDPKASASYKNITCNDQRC-----NLVSSPDPPMPCKSDNQSCPY 277
Query: 251 TYSYLSGDRAMGIFATETVTVRL-TNGKEKQL---KDILYGCTEEMTDSQFLDGADGLIG 310
Y Y G FA ET TV L TNG +L +++++GC + GA GL+G
Sbjct: 278 YYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAGLLG 337
Query: 311 LGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPA 370
LG SF ++ G FSYCL D + S +FG P
Sbjct: 338 LGRGPLSF-SSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSH--------PNL 397
Query: 371 TTKLFTGGR---YSCYYGVQLSGISVDGQILNIPPHVWNIKS--GCGTILDTGTSLTMLT 430
F G+ +Y VQ+ I V G++LNIP WNI S GTI+D+GT+L+
Sbjct: 398 NFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFA 457
Query: 431 APAHDAVIEAMAPKIEKFGRMEKDVKGEREKNFKLCFNDTEWNFGMLPKLGFHFEDGAVF 490
PA++ + +A K + + +D CFN + + LP+LG F DGAV+
Sbjct: 458 EPAYEFIKNKIAEKAKGKYPVYRDF-----PILDPCFNVSGIHNVQLPELGIAFADGAVW 517
Query: 491 EPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA 526
P + + + C+A+ P + +I+GN QQ F YD + + +AP+ CA
Sbjct: 518 NFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 533
BLAST of CmaCh02G006910 vs. TAIR 10
Match:
AT3G59080.2 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 150.6 bits (379), Expect = 3.5e-36
Identity = 139/533 (26.08%), Postives = 214/533 (40.15%), Query Frame = 0
Query: 11 FFVVFFFSPLTVAVADQSNANNL------KQESDANNEEQEFVRLDLIHRHHPEVVKRLH 70
F + F +P+ A S +N+ K+ + E + V+ L R K
Sbjct: 38 FSGIDFPNPMRFGSASSSTSNDCGFSSPEKEPTKERTGENKTVKFHLKRRETTTTEKATT 97
Query: 71 DEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWT---KVVENAEEKVKEASGSNHPPHSQT 130
+ + +E +I+D+ R + + T K +N +E V ++
Sbjct: 98 NSV----LELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAG 157
Query: 131 PIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 190
+ G GS E+F+ + VG+PP+ F++I DTGSDL W +C
Sbjct: 158 QLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQC--------------- 217
Query: 191 HKMRNKMRERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYSYL 250
+PC C Q N C Y Y Y
Sbjct: 218 -------------------------LPC--YDCFQQ-----------NDNQSCPYYYWYG 277
Query: 251 SGDRAMGIFATETVTVRL-TNGKEKQL---KDILYGCTEEMTDSQFLDGADGLIGLGSSI 310
G FA ET TV L TNG +L +++++GC + GA GL+GLG
Sbjct: 278 DSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAGLLGLGRGP 337
Query: 311 YSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLF 370
SF ++ G FSYCL D + S +FG P F
Sbjct: 338 LSF-SSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSH--------PNLNFTSF 397
Query: 371 TGGR---YSCYYGVQLSGISVDGQILNIPPHVWNIKS--GCGTILDTGTSLTMLTAPAHD 430
G+ +Y VQ+ I V G++LNIP WNI S GTI+D+GT+L+ PA++
Sbjct: 398 VAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYE 457
Query: 431 AVIEAMAPKIEKFGRMEKDVKGEREKNFKLCFNDTEWNFGMLPKLGFHFEDGAVFEPPDR 490
+ +A K + + +D CFN + + LP+LG F DGAV+ P
Sbjct: 458 FIKNKIAEKAKGKYPVYRDF-----PILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTE 497
Query: 491 SYIVSASYQCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA 526
+ + + C+A+ P + +I+GN QQ F YD + + +AP+ CA
Sbjct: 518 NSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 497
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9LTW4 | 6.6e-72 | 34.98 | Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE... | [more] |
Q766C2 | 2.4e-37 | 31.91 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... | [more] |
Q6XBF8 | 1.9e-34 | 28.90 | Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1 | [more] |
Q766C3 | 1.8e-32 | 30.15 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... | [more] |
Q9LNJ3 | 1.9e-31 | 30.25 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... | [more] |