Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAGACTAAACAATTATATGCCCATTTTCGTAATTTCACAACACAATTTTGTTATAAATCTACCATACCCCCATGCAAAAACTCCGGTAACCGGTAACCTTCAAATTTTCGTTTTCACCTCTTCCACCGCCGTCTCTCTCAACTCCGGCCACCGAGTTGAGTCCGAGTCATGGCCGTCAAATTTTTCTTCTTCGTCTTTTTAGCCCTTCTGGCACTAAACTCCAACGCTTCCGATCTCTGCGCCGCCGGATCGGACGGCTCCGGTGATCTCTCCGTCATCCCCATCTACGGAAAATGCTCGCCGTTCACGGCGCCGAAGTCAGAATCTTGGGTGAACACGGTGATTGATATGGCTTCGAAAGACCCAGCCCGAATTAAGTACTTGTCGAGCCTCGCCGCCCAGAAGACGGTGGCGGCGCCTATCGCCTCCGGGCAACATGCTCTCAATATTGGGAATTATGTGGTGAGAGTTCAATTGGGTACTCCGGGTCAAGCTATGTATATGGTTCTTGACACCAGTAGTGACGCCGCCTGGGCACCGTGCTCCGGCTGCTCCGGCTGCTCCGCCACCACTTTCTTGTCTAAGAATTCCTCCACTTTTGCCACTTTGGATTGCTCCAAACCGCAGTGTAGCCAGGTTTGTTGAATCGCTACTCAAATTATTTAATTTACATTTATTATAATTATTATTTATGTTTAAAAAAAAAAAAAATCTGTCGTTTAATTTTCAAAATAGCAATTTTAGTGTTTAGAATTTATAAATTAATTAATTGAATCTCAGTTTAAATAATTGTTTAAAAAACATAAAATTAGATCTAATTACTTATTAATATTAACTCGAATTTAAATTATTTTGGTATATTTTATAAAATGATCCAAAATTTTCTATTTTTTATCATCAAATAAATTACAAATATTTAAATATTTATTTGTAATTGAGGACTAAATATCTTGAACATGTGAATTAAATTTATTTGTTAATTTTCTTCAGGCTCGGGGTCTTTCCTGTCCGACAACCGGTAGCGTCGATTGCTTATTCAACCAAACATACGGCGGCGACTCATCGTTTTCCGCCACCCTAGTTCAAGACACTCTCCACTTGGGAACCGATGTCATTCCAAATTTCTCATTCGGTTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCTTATTGGGTCTCGGCCGTGGTCCCCTCTCACTCATCTCACAATCCACTTCACTCTACTCCGGTTTATTCTCGTACTGTCTCCCGAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGCCCGGTCGGACAACCTAAGTCAATCCGAACCACCCCACTCCTTAAGAACCCGCACAGACCGTCTCTCTACTACGTCAACCTAACAGGCATCAGTGTCGGCCGGGTTCTCGTCCCAATTCCACCAGAGACTCTCGCATTCGACCCGAACACTGGTGCGGGAACCATCATCGACTCGGGGACGGTAATAACCCGGTTCGTATACCCTGTCTACACAGCGGTTCGAGACGAATTTAGAAAGCAAGTGGGCGGTTCGTTTTCGCCATTGGGAGCTTTCGACACGTGTTTTACAACGAGCAATGAAATGGCGGCGCCCGGCATTACGTTCCATTTGAGTGGATTGGACTTGAAATTGCCGATGGAGAACAGTTTGATTCACAGCAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCCAACAATGTGAACTCTGTGTTGAATGTTATCGCCAATTTGCAGCAACAAAATCATCGGATTTTGTTTGATATTGCAAATTCTAAGCTGGGGATTGCTCGCGAGCTCTGTAATTAGCTGAATCACTGACAATGGAAGCTATGAATAATTTGGAAAATGTATCGTTTCTTTTAATTTAATGTCCAATCCAATGTATGGTTAAAGTGGGTTTTAAAGATGGTGGCTTTTAGTATCCATTTTAATGGGCTTGGCCCAATATCATTTGGGCCCAAGGTCCACATTTGGATGAAAGTGGTTTTTAAAGATAGTGGCTTTTTAGTATCTACTTTAATGGGCTTGGCCCAATGTCATCTTGGGCCCAAGGTCCACAATTGGATTTGTCTTCTCTTGGTGTTATCATTTCGTTTAACAAAATTGGAG
mRNA sequence
CAGACTAAACAATTATATGCCCATTTTCGTAATTTCACAACACAATTTTGTTATAAATCTACCATACCCCCATGCAAAAACTCCGGTAACCGGTAACCTTCAAATTTTCGTTTTCACCTCTTCCACCGCCGTCTCTCTCAACTCCGGCCACCGAGTTGAGTCCGAGTCATGGCCGTCAAATTTTTCTTCTTCGTCTTTTTAGCCCTTCTGGCACTAAACTCCAACGCTTCCGATCTCTGCGCCGCCGGATCGGACGGCTCCGGTGATCTCTCCGTCATCCCCATCTACGGAAAATGCTCGCCGTTCACGGCGCCGAAGTCAGAATCTTGGGTGAACACGGTGATTGATATGGCTTCGAAAGACCCAGCCCGAATTAAGTACTTGTCGAGCCTCGCCGCCCAGAAGACGGTGGCGGCGCCTATCGCCTCCGGGCAACATGCTCTCAATATTGGGAATTATGTGGTGAGAGTTCAATTGGGTACTCCGGGTCAAGCTATGTATATGGTTCTTGACACCAGTAGTGACGCCGCCTGGGCACCGTGCTCCGGCTGCTCCGGCTGCTCCGCCACCACTTTCTTGTCTAAGAATTCCTCCACTTTTGCCACTTTGGATTGCTCCAAACCGCAGTGTAGCCAGGCTCGGGGTCTTTCCTGTCCGACAACCGGTAGCGTCGATTGCTTATTCAACCAAACATACGGCGGCGACTCATCGTTTTCCGCCACCCTAGTTCAAGACACTCTCCACTTGGGAACCGATGTCATTCCAAATTTCTCATTCGGTTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCTTATTGGGTCTCGGCCGTGGTCCCCTCTCACTCATCTCACAATCCACTTCACTCTACTCCGGTTTATTCTCGTACTGTCTCCCGAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGCCCGGTCGGACAACCTAAGTCAATCCGAACCACCCCACTCCTTAAGAACCCGCACAGACCGTCTCTCTACTACGTCAACCTAACAGGCATCAGTGTCGGCCGGGTTCTCGTCCCAATTCCACCAGAGACTCTCGCATTCGACCCGAACACTGGTGCGGGAACCATCATCGACTCGGGGACGGTAATAACCCGGTTCGTATACCCTGTCTACACAGCGGTTCGAGACGAATTTAGAAAGCAAGTGGGCGGTTCGTTTTCGCCATTGGGAGCTTTCGACACGTGTTTTACAACGAGCAATGAAATGGCGGCGCCCGGCATTACGTTCCATTTGAGTGGATTGGACTTGAAATTGCCGATGGAGAACAGTTTGATTCACAGCAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCCAACAATGTGAACTCTGTGTTGAATGTTATCGCCAATTTGCAGCAACAAAATCATCGGATTTTGTTTGATATTGCAAATTCTAAGCTGGGGATTGCTCGCGAGCTCTGTAATTAGCTGAATCACTGACAATGGAAGCTATGAATAATTTGGAAAATGTATCGTTTCTTTTAATTTAATGTCCAATCCAATGTATGGTTAAAGTGGGTTTTAAAGATGGTGGCTTTTAGTATCCATTTTAATGGGCTTGGCCCAATATCATTTGGGCCCAAGGTCCACATTTGGATGAAAGTGGTTTTTAAAGATAGTGGCTTTTTAGTATCTACTTTAATGGGCTTGGCCCAATGTCATCTTGGGCCCAAGGTCCACAATTGGATTTGTCTTCTCTTGGTGTTATCATTTCGTTTAACAAAATTGGAG
Coding sequence (CDS)
ATGGCCGTCAAATTTTTCTTCTTCGTCTTTTTAGCCCTTCTGGCACTAAACTCCAACGCTTCCGATCTCTGCGCCGCCGGATCGGACGGCTCCGGTGATCTCTCCGTCATCCCCATCTACGGAAAATGCTCGCCGTTCACGGCGCCGAAGTCAGAATCTTGGGTGAACACGGTGATTGATATGGCTTCGAAAGACCCAGCCCGAATTAAGTACTTGTCGAGCCTCGCCGCCCAGAAGACGGTGGCGGCGCCTATCGCCTCCGGGCAACATGCTCTCAATATTGGGAATTATGTGGTGAGAGTTCAATTGGGTACTCCGGGTCAAGCTATGTATATGGTTCTTGACACCAGTAGTGACGCCGCCTGGGCACCGTGCTCCGGCTGCTCCGGCTGCTCCGCCACCACTTTCTTGTCTAAGAATTCCTCCACTTTTGCCACTTTGGATTGCTCCAAACCGCAGTGTAGCCAGGCTCGGGGTCTTTCCTGTCCGACAACCGGTAGCGTCGATTGCTTATTCAACCAAACATACGGCGGCGACTCATCGTTTTCCGCCACCCTAGTTCAAGACACTCTCCACTTGGGAACCGATGTCATTCCAAATTTCTCATTCGGTTGCATCAGCTCCGCCTCCGGCAGCTCCATTCCGCCGCAAGGCTTATTGGGTCTCGGCCGTGGTCCCCTCTCACTCATCTCACAATCCACTTCACTCTACTCCGGTTTATTCTCGTACTGTCTCCCGAGTTTCAAATCCTACTACTTCTCCGGTTCACTAAAACTCGGCCCGGTCGGACAACCTAAGTCAATCCGAACCACCCCACTCCTTAAGAACCCGCACAGACCGTCTCTCTACTACGTCAACCTAACAGGCATCAGTGTCGGCCGGGTTCTCGTCCCAATTCCACCAGAGACTCTCGCATTCGACCCGAACACTGGTGCGGGAACCATCATCGACTCGGGGACGGTAATAACCCGGTTCGTATACCCTGTCTACACAGCGGTTCGAGACGAATTTAGAAAGCAAGTGGGCGGTTCGTTTTCGCCATTGGGAGCTTTCGACACGTGTTTTACAACGAGCAATGAAATGGCGGCGCCCGGCATTACGTTCCATTTGAGTGGATTGGACTTGAAATTGCCGATGGAGAACAGTTTGATTCACAGCAGCGCCGGTTCGTTGGCTTGTTTGGCGATGGCGGCGGCGCCCAACAATGTGAACTCTGTGTTGAATGTTATCGCCAATTTGCAGCAACAAAATCATCGGATTTTGTTTGATATTGCAAATTCTAAGCTGGGGATTGCTCGCGAGCTCTGTAATTAG
Protein sequence
MAVKFFFFVFLALLALNSNASDLCAAGSDGSGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASKDPARIKYLSSLAAQKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCSGCSGCSATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGSFSPLGAFDTCFTTSNEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELCN
Homology
BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match:
O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)
HSP 1 Score: 560.8 bits (1444), Expect = 1.3e-158
Identity = 296/446 (66.37%), Postives = 351/446 (78.70%), Query Frame = 0
Query: 2 AVKFFFFVFLAL-LALNSNASDLCA-AGSDGSGDLSVIPIYGKCSPFTAPK-SESWVNTV 61
++ FFFF+ L L + D CA A DGS DLS+IPI KCSPF S S ++TV
Sbjct: 5 SLHFFFFLTLLLPFTFTTATRDTCATAAPDGSDDLSIIPINAKCSPFAPTHVSASVIDTV 64
Query: 62 IDMASKDPARIKYLSSLAA--QKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDT 121
+ MAS D R+ YLSSL A K + P+ASG + L+IGNYVVR +LGTP Q M+MVLDT
Sbjct: 65 LHMASSDSHRLTYLSSLVAGKPKPTSVPVASG-NQLHIGNYVVRAKLGTPPQLMFMVLDT 124
Query: 122 SSDAAWAPCSGCSGCS--ATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVD--CLF 181
S+DA W PCSGCSGCS +T+F + +SST++T+ CS QC+QARGL+CP++ C F
Sbjct: 125 SNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSF 184
Query: 182 NQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQ 241
NQ+YGGDSSFSA+LVQDTL L DVIPNFSFGCI+SASG+S+PPQGL+GLGRGP+SL+SQ
Sbjct: 185 NQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQ 244
Query: 242 STSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISV 301
+TSLYSG+FSYCLPSF+S+YFSGSLKLG +GQPKSIR TPLL+NP RPSLYYVNLTG+SV
Sbjct: 245 TTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSV 304
Query: 302 GRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVG-GSFSPLGAF 361
G V VP+ P L FD N+GAGTIIDSGTVITRF PVY A+RDEFRKQV SFS LGAF
Sbjct: 305 GSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAF 364
Query: 362 DTCFTTSNEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIA 421
DTCF+ NE AP IT H++ LDLKLPMEN+LIHSSAG+L CL+MA N N+VLNVIA
Sbjct: 365 DTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIA 424
Query: 422 NLQQQNHRILFDIANSKLGIARELCN 438
NLQQQN RILFD+ NS++GIA E CN
Sbjct: 425 NLQQQNLRILFDVPNSRIGIAPEPCN 449
BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match:
Q6F4N5 (Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1)
HSP 1 Score: 414.1 bits (1063), Expect = 2.0e-114
Identity = 226/410 (55.12%), Postives = 293/410 (71.46%), Query Frame = 0
Query: 49 PKSESWVNTVIDMASKDPARIKYLSSLAAQKTV-AAPIASGQHALNIGNYVVRVQLGTPG 108
P S S + ++I +A D AR+ +LSS AA V +AP+ASGQ +YVVR LG+P
Sbjct: 33 PSSPSPLESIIALARDDDARLLFLSSKAATAGVSSAPVASGQAP---PSYVVRAGLGSPS 92
Query: 109 QAMYMVLDTSSDAAWAPCSGCSGC-SATTFLSKNSSTFATLDCSKPQCSQARGLSCPT-T 168
Q + + LDTS+DA WA CS C C S++ F NSS++A+L CS C +G +CP
Sbjct: 93 QQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPLFQGQACPAPQ 152
Query: 169 GSVD----------CLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASG--SS 228
G D C F++ + D+SF A L DTL LG D IPN++FGC+SS +G ++
Sbjct: 153 GGGDAAPPPATLPTCAFSKPF-ADASFQAALASDTLRLGKDAIPNYTFGCVSSVTGPTTN 212
Query: 229 IPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGP-VGQPKSIRTTP 288
+P QGLLGLGRGP++L+SQ+ SLY+G+FSYCLPS++SYYFSGSL+LG GQP+S+R TP
Sbjct: 213 MPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQPRSVRYTP 272
Query: 289 LLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTA 348
+L+NPHR SLYYVN+TG+SVG V +P + AFD TGAGT++DSGTVITR+ PVY A
Sbjct: 273 MLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAA 332
Query: 349 VRDEFRKQVG--GSFSPLGAFDTCFTTSNEMA--APGITFHL-SGLDLKLPMENSLIHSS 408
+R+EFR+QV ++ LGAFDTCF T A AP +T H+ G+DL LPMEN+LIHSS
Sbjct: 333 LREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSS 392
Query: 409 AGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELCN 438
A LACLAMA AP NVNSV+NVIANLQQQN R++FD+ANS++G A+E CN
Sbjct: 393 ATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESCN 438
BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match:
Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)
HSP 1 Score: 202.2 bits (513), Expect = 1.2e-50
Identity = 138/394 (35.03%), Postives = 185/394 (46.95%), Query Frame = 0
Query: 64 KDPARIKYLSSLAAQ----KTVAAPIASGQHALNI-------GNYVVRVQLGTPGQAMYM 123
+D R+K +++LAAQ AP G + + G Y R+ +GTP + +YM
Sbjct: 98 RDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYM 157
Query: 124 VLDTSSDAAW---APCSGCSGCSATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVD 183
VLDT SD W APC C S F + S T+AT+ CS P C + C T
Sbjct: 158 VLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKT- 217
Query: 184 CLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSL 243
CL+ +YG S +TL + + + GC G + GLLGLG+G LS
Sbjct: 218 CLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSF 277
Query: 244 ISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTG 303
Q+ ++ FSYCL + S+ G + R TPLL NP + YYV L G
Sbjct: 278 PGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLG 337
Query: 304 ISVGRVLVP-IPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGS---- 363
ISVG VP + D G IIDSGT +TR + P Y A+RD FR VG
Sbjct: 338 ISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKR 397
Query: 364 FSPLGAFDTCFTTS--NEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNN 423
FDTCF S NE+ P + H G D+ LP N LI C A A
Sbjct: 398 APDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG 457
Query: 424 VNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
L++I N+QQQ R+++D+A+S++G A C
Sbjct: 458 ----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match:
Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)
HSP 1 Score: 190.7 bits (483), Expect = 3.6e-47
Identity = 132/357 (36.97%), Postives = 182/357 (50.98%), Query Frame = 0
Query: 95 GNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCSGCSGC---SATTFLSKNSSTFATLDCSK 154
G Y++ + +GTP Q ++DT SD W C C+ C S F + SS+F+TL CS
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSS 152
Query: 155 PQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASG 214
C + LS PT + C + YG S ++ +TL G+ IPN +FGC + G
Sbjct: 153 QLC---QALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQG 212
Query: 215 -SSIPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRT 274
GL+G+GRGPLSL SQ L FSYC+ S S +L LG + + +
Sbjct: 213 FGQGNGAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGSSTPS-NLLLGSLANSVTAGS 272
Query: 275 --TPLLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFDPNTG-AGTIIDSGTVITRFVY 334
T L+++ P+ YY+ L G+SVG +PI P A + N G G IIDSGT +T FV
Sbjct: 273 PNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVN 332
Query: 335 PVYTAVRDEFRKQ-----VGGSFSPLGAFDTCFTTSNE---MAAPGITFHLSGLDLKLPM 394
Y +VR EF Q V GS S FD CF T ++ + P H G DL+LP
Sbjct: 333 NAYQSVRQEFISQINLPVVNGSSS---GFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPS 392
Query: 395 ENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
EN I S G L CLAM ++ +++ N+QQQN +++D NS + A C
Sbjct: 393 ENYFISPSNG-LICLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
BLAST of CmaCh05G005530 vs. ExPASy Swiss-Prot
Match:
Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)
HSP 1 Score: 182.2 bits (461), Expect = 1.3e-44
Identity = 114/350 (32.57%), Postives = 162/350 (46.29%), Query Frame = 0
Query: 95 GNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCSGCSGC---SATTFLSKNSSTFATLDCSK 154
G Y VR+ +G+P + YMV+D+ SD W C C C S F S ++ + C
Sbjct: 129 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGS 188
Query: 155 PQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASG 214
C + C + G C + YG S TL +TL V+ N + GC G
Sbjct: 189 SVCDRIENSGCHSGG---CRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRG 248
Query: 215 SSIPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 274
I GLLG+G G +S + Q + G F YCL S + +GSL G P
Sbjct: 249 MFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWV 308
Query: 275 PLLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYT 334
PL++NP PS YYV L G+ VG V +P+P G ++D+GT +TR Y
Sbjct: 309 PLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYV 368
Query: 335 AVRDEFRKQVGG--SFSPLGAFDTCFTTSN--EMAAPGITFHLS-GLDLKLPMENSLIHS 394
A RD F+ Q S + FDTC+ S + P ++F+ + G L LP N L+
Sbjct: 369 AFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPV 428
Query: 395 SAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
C A AA+P L++I N+QQ+ ++ FD AN +G +C
Sbjct: 429 DDSGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
BLAST of CmaCh05G005530 vs. TAIR 10
Match:
AT1G09750.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 560.8 bits (1444), Expect = 9.4e-160
Identity = 296/446 (66.37%), Postives = 351/446 (78.70%), Query Frame = 0
Query: 2 AVKFFFFVFLAL-LALNSNASDLCA-AGSDGSGDLSVIPIYGKCSPFTAPK-SESWVNTV 61
++ FFFF+ L L + D CA A DGS DLS+IPI KCSPF S S ++TV
Sbjct: 5 SLHFFFFLTLLLPFTFTTATRDTCATAAPDGSDDLSIIPINAKCSPFAPTHVSASVIDTV 64
Query: 62 IDMASKDPARIKYLSSLAA--QKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDT 121
+ MAS D R+ YLSSL A K + P+ASG + L+IGNYVVR +LGTP Q M+MVLDT
Sbjct: 65 LHMASSDSHRLTYLSSLVAGKPKPTSVPVASG-NQLHIGNYVVRAKLGTPPQLMFMVLDT 124
Query: 122 SSDAAWAPCSGCSGCS--ATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVD--CLF 181
S+DA W PCSGCSGCS +T+F + +SST++T+ CS QC+QARGL+CP++ C F
Sbjct: 125 SNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSF 184
Query: 182 NQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQ 241
NQ+YGGDSSFSA+LVQDTL L DVIPNFSFGCI+SASG+S+PPQGL+GLGRGP+SL+SQ
Sbjct: 185 NQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQ 244
Query: 242 STSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISV 301
+TSLYSG+FSYCLPSF+S+YFSGSLKLG +GQPKSIR TPLL+NP RPSLYYVNLTG+SV
Sbjct: 245 TTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSV 304
Query: 302 GRVLVPIPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVG-GSFSPLGAF 361
G V VP+ P L FD N+GAGTIIDSGTVITRF PVY A+RDEFRKQV SFS LGAF
Sbjct: 305 GSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAF 364
Query: 362 DTCFTTSNEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIA 421
DTCF+ NE AP IT H++ LDLKLPMEN+LIHSSAG+L CL+MA N N+VLNVIA
Sbjct: 365 DTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIA 424
Query: 422 NLQQQNHRILFDIANSKLGIARELCN 438
NLQQQN RILFD+ NS++GIA E CN
Sbjct: 425 NLQQQNLRILFDVPNSRIGIAPEPCN 449
BLAST of CmaCh05G005530 vs. TAIR 10
Match:
AT3G54400.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 441.4 bits (1134), Expect = 8.3e-124
Identity = 245/430 (56.98%), Postives = 299/430 (69.53%), Query Frame = 0
Query: 9 VFLALLALNSNASDLCAAGSDGSGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASKDPAR 68
+ ++LL L S + + C S S DL V I CSPF S SW +T++ +D AR
Sbjct: 8 LLISLLILKSESIN-CNEKSH-SSDLRVFHINSLCSPFKT--SVSWADTLL----QDKAR 67
Query: 69 IKYLSSLAAQKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCSGC 128
YLSSLA + + PIASG+ + Y+VR +GTP Q M + LDTS+DAAW PCSGC
Sbjct: 68 FLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC 127
Query: 129 SGCSATT-FLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSATLV 188
GCS++ F SS+ TL C PQC QA SC T S C FN TYGG S+ A L
Sbjct: 128 VGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSC--TVSKSCGFNMTYGG-STIEAYLT 187
Query: 189 QDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSLISQSTSLYSGLFSYCLPS 248
QDTL L +DVIPN++FGCI+ ASG+S+P QGL+GLGRGPLSLISQS +LY FSYCLP+
Sbjct: 188 QDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPN 247
Query: 249 FKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISVGRVLVPIPPETLAFD 308
KS FSGSL+LGP QP I+TTPLLKNP R SLYYVNL GI VG +V IP LAFD
Sbjct: 248 SKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFD 307
Query: 309 PNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQV-GGSFSPLGAFDTCFTTSNEMAAPGI 368
P TGAGTI DSGTV TR V P Y AVR+EFR++V + + LG FDTC+ S + P +
Sbjct: 308 PATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCY--SGSVVFPSV 367
Query: 369 TFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRILFDIAN 428
TF +G+++ LP +N LIHSSAG+L+CLAMAAAP NVNSVLNVIA++QQQNHR+L D+ N
Sbjct: 368 TFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPN 424
Query: 429 SKLGIARELC 437
S+LGI+RE C
Sbjct: 428 SRLGISRETC 424
BLAST of CmaCh05G005530 vs. TAIR 10
Match:
AT5G07030.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 417.5 bits (1072), Expect = 1.3e-116
Identity = 224/437 (51.26%), Postives = 289/437 (66.13%), Query Frame = 0
Query: 7 FFVFLALLALNSNASDLCAAGSDGSGDLSVIPIYGKCSPFTAPKSESWVNTVIDMASKDP 66
F + L LN DL GS L + I CSPF + SW V+ ++D
Sbjct: 27 FSILPLALGLNHPNCDLTKTQDQGS-TLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQDQ 86
Query: 67 ARIKYLSSLAAQKTVAAPIASGQHALNIGNYVVRVQLGTPGQAMYMVLDTSSDAAWAPCS 126
AR++YLSSL A ++V PIASG+ L Y+V+ +GTP Q + + +DTSSD AW PCS
Sbjct: 87 ARLQYLSSLVAGRSV-VPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCS 146
Query: 127 GCSGC-SATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVDCLFNQTYGGDSSFSAT 186
GC GC S T F S++F + CS PQC Q PT G+ C FN TY G SS +A
Sbjct: 147 GCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPN---PTCGARACSFNLTY-GSSSIAAN 206
Query: 187 LVQDTLHLGTDVIPNFSFGCISSASGSSI--PPQGLLGLGRGPLSLISQSTSLYSGLFSY 246
L QDT+ L D I F+FGC++ +G PPQGLLGLGRGPLSL+SQ+ S+Y FSY
Sbjct: 207 LSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSY 266
Query: 247 CLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTGISVGRVLVPIPPET 306
CLPSF+S FSGSL+LGP QP+ ++ T LL+NP R SLYYVNL I VGR +V +PP
Sbjct: 267 CLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAA 326
Query: 307 LAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGS---FSPLGAFDTCFTTSNE 366
+AF+P+TGAGTI DSGTV TR PVY AVR+EFRK+V + + LG FDTC+ S +
Sbjct: 327 IAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY--SGQ 386
Query: 367 MAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNHRI 426
+ P ITF G+++ +P +N ++HS+AGS +CLAMAAAP NVNSV+NVIA++QQQNHR+
Sbjct: 387 VKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRV 446
Query: 427 LFDIANSKLGIARELCN 438
L D+ N +LG+ARE C+
Sbjct: 447 LIDVPNGRLGLARERCS 455
BLAST of CmaCh05G005530 vs. TAIR 10
Match:
AT1G01300.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 202.2 bits (513), Expect = 8.5e-52
Identity = 138/394 (35.03%), Postives = 185/394 (46.95%), Query Frame = 0
Query: 64 KDPARIKYLSSLAAQ----KTVAAPIASGQHALNI-------GNYVVRVQLGTPGQAMYM 123
+D R+K +++LAAQ AP G + + G Y R+ +GTP + +YM
Sbjct: 98 RDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYM 157
Query: 124 VLDTSSDAAW---APCSGCSGCSATTFLSKNSSTFATLDCSKPQCSQARGLSCPTTGSVD 183
VLDT SD W APC C S F + S T+AT+ CS P C + C T
Sbjct: 158 VLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKT- 217
Query: 184 CLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRGPLSL 243
CL+ +YG S +TL + + + GC G + GLLGLG+G LS
Sbjct: 218 CLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSF 277
Query: 244 ISQSTSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPSLYYVNLTG 303
Q+ ++ FSYCL + S+ G + R TPLL NP + YYV L G
Sbjct: 278 PGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLG 337
Query: 304 ISVGRVLVP-IPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQVGGS---- 363
ISVG VP + D G IIDSGT +TR + P Y A+RD FR VG
Sbjct: 338 ISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKR 397
Query: 364 FSPLGAFDTCFTTS--NEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNN 423
FDTCF S NE+ P + H G D+ LP N LI C A A
Sbjct: 398 APDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGG 457
Query: 424 VNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
L++I N+QQQ R+++D+A+S++G A C
Sbjct: 458 ----LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
BLAST of CmaCh05G005530 vs. TAIR 10
Match:
AT3G61820.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 196.4 bits (498), Expect = 4.7e-50
Identity = 140/402 (34.83%), Postives = 186/402 (46.27%), Query Frame = 0
Query: 64 KDPARIKYLSSLAAQKT--------------VAAPIASGQHALNIGNYVVRVQLGTPGQA 123
+D R+K ++SLAA T + + SG + G Y +R+ +GTP
Sbjct: 89 RDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVISGL-SQGSGEYFMRLGVGTPATN 148
Query: 124 MYMVLDTSSDAAWAPCSGCSGCSATT---FLSKNSSTFATLDCSKPQCSQARGLS-CPTT 183
+YMVLDT SD W CS C C T F K S TFAT+ C C + S C T
Sbjct: 149 VYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTR 208
Query: 184 GSVDCLFNQTYGGDSSFSATLVQDTLHLGTDVIPNFSFGCISSASGSSIPPQGLLGLGRG 243
S CL+ +YG S +TL + + GC G + GLLGLGRG
Sbjct: 209 RSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRG 268
Query: 244 PLSLISQSTSLYSGLFSYCL----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLKNPHRPS 303
LS SQ+ + Y+G FSYCL S S ++ G PK+ TPLL NP +
Sbjct: 269 GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDT 328
Query: 304 LYYVNLTGISVGRVLVP-IPPETLAFDPNTGAGTIIDSGTVITRFVYPVYTAVRDEFRKQ 363
YY+ L GISVG VP + D G IIDSGT +TR P Y A+RD FR
Sbjct: 329 FYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFR-- 388
Query: 364 VGGS----FSPLGAFDTCFTTS--NEMAAPGITFHLSGLDLKLPMENSLIHSSAGSLACL 423
+G + FDTCF S + P + FH G ++ LP N LI + C
Sbjct: 389 LGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCF 448
Query: 424 AMAAAPNNVNSVLNVIANLQQQNHRILFDIANSKLGIARELC 437
A A + L++I N+QQQ R+ +D+ S++G C
Sbjct: 449 AFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
O04496 | 1.3e-158 | 66.37 | Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1 | [more] |
Q6F4N5 | 2.0e-114 | 55.12 | Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1 | [more] |
Q9LNJ3 | 1.2e-50 | 35.03 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... | [more] |
Q766C3 | 3.6e-47 | 36.97 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... | [more] |
Q9LHE3 | 1.3e-44 | 32.57 | Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... | [more] |