Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATAATATTCATCGCAACTTTTAGCTCACAGACAGCTTCACACACACACACACAGTACCTCTTCTCTCTCTCTAGCGTCTCTTTCTCTCTCTTCCGAGTTTCCCACCGGCGCCTGAATACCACCGCATTCCCCGGCCGCCATGGATTTCACCATCCGTGGCGCCTCGAAGCTCTGGTTTGCTGTACTTTTTGTGGTGTTACCGATTTTTGTGGCGGTTGCATTGCCGGAAAATTCGACTTCTAGGTCGTCGTCTCCAGCAATGTCCGGCCAAATCAACTCAAATTCCATCCTCGTCGCTTTACTGGACTCTCATTATACGGAGCTGGCCGAGATAGTTGAGAAGGCTATGCTATTGCAGACCCTAGAGGACGCCGTCGGCAACCACAATATCACCATTTTCGCCCCTAAAAATGAAGCTCTGGAGCGCGATCTAGACCCTGAGTTTAAGAGGTTCCTTCTTGAACCGGGGAACTTGAAGTCGCTCCAGACCTTGCTGATGTTCCATGTGATTCCGACTCGGATCGGATCGAAAGACTGGCCGAGTCATTCCAAGTCCGCACGGCAATCCACGCTTTCCAAGCACGTTCTTCATCTCGTTGGTCATGATACCGGGGAGAAGACCGTCGATCTCGCCAATGTGATCCAATCCGATGCAATTACCAGACCGGACGGCGTTATTCACGGGATTGAACGATTGTTAATCCCTCAGTCCGTACAGGATGATTTCAATCGCAGACGGAATCTACAGGCTATAACCGCCGTGAAGCCGGAGGGAGCGCCTGAAGTGGATCCCAGGACTCACCGGCTAAAGAAACCTGCTCCACCTGCCGAACCTGGTTCAACGCCTGTCATTCCGATATACGATGCGTTGGCTCCAGGACCTAGTTTGGCTCCGGCTCCAGCACCAGGGCCCGGCGGAGCTCATCGTCATTTCAACGGCGAGAGGCAAGTTAAAGATTTCATCCACACTCTGCTACACTATGGTGGATATAACGAAATGGCCGACATTCTCGTTAACCTAACATCGCTAGCAACGGAAATGGGGCGGCTAGTATCGGAGGGTTATGTGCTCACTGTTCTGGCTCCTAACGATGAAGCCATGGCTAAACTCACTACAGACCAGTTGAGCGAGCCAGGGGCACCAGAGCAGATTATGTACTATCATCTAATACCCGAGTATCAGACGGAAGAGAGCATGTACAACGCTGTAAGGAGGTTCGGGAAGGTGCGATACGACTCCTTGAGGTTGCCGCACAAAGTAGTGGCTCAAGAGGCCGATGGGTCGGTCAAGTTCGGTAATGGAGATGGATCGGCATATTTGTTTGATCCTGATATTTACACGGATGGACGAATCTCGGTGCAGGGCATTGATGGAGTATTATTCCCGCCGGAGGAGACAGAGGCCAAGAACGCACCAAAGGCAGTTCAACCCACCAAAGTCGCGGCCAAGCCCCGGAGAGGTAATGGTTTTGTTAAATTTGAAGTGCATTTTCTACGTTATTTTGATCAATTCCTGACTGGTTTGGATAACGCTTGTTTAGAGTTTACTTCGTCAAGAACTGTAGCATAAATTCTCGAATTCCTTTTAAGAACTTAGAGATTGATGATGCCCATTAGGATTATATCCTTGTGATTTCACCCATCGAGAAGACATTACTTCTGCCATAACCCATGAATTAGACTTGTTCGTCCTAACATTGTTATCAATTGCTATGAAATAAAGAATGTGCTTGACTGTAACCCATTTGGCAAGTTGAATATAATAGATTAGTTACACATATAGGTGCGTGTATTTGACTGATTGGGTGTAGCCTTCATGTGCATTAAAAGTTACTCATTGGGGTTGCTTCAAAATTATGCCTCATCGCAATCAAAGTTGAAAATGAAATGTGAGGATTTCTAGAGACGTATCCATAGGGACAAAATTTCCTCCCTGGGATATTAGCCTTTGCAGGAGGAAGGGTTAACTAGAATTTGGTTGATTGCTGTGTTGTTCGGGCTGTTTTTTTGGACCTTTAGATACTAAAGTAATAATCTGGAGAACGTTAGAGTTTCCAAAAGTTCCGGTAGTCTGGGAAGAGGGTGAGATATTGTCTGCTTTTTATCAGCAGTAGACCTCACAATTTAAGAAGTTATACTAGTGAGAGGTTTGCACATCTTTACCCTCCACAACCAATGTGAGATCTCACAATCCATCTCCTTTAAGTGTCCAGGGACTGTGAGATCTCACAATCCATCTCCTTTAAGTGTCCATGGTCCCTGCTGTCATACTGCTCGGTATTTGACCATGATACTATTTGTAACCGCTCAAGCCCACCACTAACAAAAGATTGTCCGTTTTAGCTCGTAAAGTATCACCCCAGCCTCACAGTTTTAAATGCCAGCCCCCTCTCTAATACATTTTAAAACTTTGAGGGAAAATCCGAAACGAAAACTTAAATAGGATAATATGTACTTTTGGCTTTATATTAGGATGTTTTTTCAATTTAATTTCTTATGTTGCAATTTAGTCTATTGTGTTTCCCTATTTTTTAATTATTCCCTTTCCATCCATAAATGTTAAAATTGACAAATGACAAGCTTTTATAGCATGATACTTCATGAGTTGACTGTAATTTAGAGGAAATATTGGGTTACTTGATTGACGAAAAGTCAAAATTTCTACCAACCTCTTTAACATTTGGAAGAACTAGATGATTTCTCCAGTTAAATGACCTAATTCGTTCTCTAATTTTTTTCCAATGAACTAAATATTGTGTAATTTAGGCTTTATCATAAGTCAATTTTAACATTCAGATGGGCATTATTGAAGCTTAAAATATAAGGGATGAGATCCGAAGCTTTTTAGTAGGAGCTGGCATTGTACTTAAAAGCATGTATTGCTCAAGATATTTAGTAATAAATTATAGTTTACCAAATATATTATATTTGATATTTTATTATAAGGCAAATATCTAATATTTGCTTTATTACCATAGTTATCTTTCCCTCTATTGTTATTTCTATTTATTATTATTTCCATATATTTGTAATTTATTTGATTATAAATAAGATAAATTTTCACCATTTAGGTGTGGTGGATTAATCAAACATTTTTATGGTATCACCAAATATTTTTATGGTATCAAAGCCTTTCGGTTTAACAACCCAATCCTTTATTTTCAATGGTTTTTTAATCCTCTAATCTTCTTCAACACCACCCCTGTTACTGGCTTTGACGCCGCAGAGGCAGTCTTCATGGCTGCCGTAGGCGGAGCCGATTGATTACTCTTCGGCTCCACCCTATTGACCTTCAAATCACCTTTCGTCATATTCTGTACATACAACCTCACCTTTTTCAACTCAATCGGTTTTTCCTTGGCCAATATAGTCTCTCTCATTTTCTTCCCAATTAACCTTTCATCCTTCACAGAAACCCTAATCTCAGAATCCTCATTAGTGTTCTCTTCAGAAATATACGGAATCTCGACAAGCCCATCAACCTTTTGCAGATATTTTCCTTCGGAATTTGGGTGCCACACTGGCCTCTTCGCCAACCAAGTTGCCGTCGCCATTCCTGCCAAAATTGGCTCCTTCGCCCAATGTGGCGACTCTTTCTGACTGCCCTTCTTGCCAAAATTGCCTCCTTCGCCCAATGTGATGATTCTTTCGGCTTCAACATTGTCTTTGTCCCTTGCTCTAGGTTTCCCCTTCTACTTCTTAAGGTTAGACATACTCTCACCAGAGCCAAGGCAGCATCGTCGATGAGCGACAATCCCTCTTCATGGGTTAGACATATTCTCGCCGAAGCCATGGCAGCATAGACGCTAAGCGGCGATCCCTCTGCAATTTTCAATGGCCAAGAAAGTGTTTCTCTACGCCTCTTCTGAACCAGGTTGTTGACATCTTCACGAAAAGTGTTTCTCAACCTCTTTGAATTTTTTTAGATTCAAGCTTCACGTTCGTTTAAATCCGACGCTTAGCTTGCGGTGGGGTGTTAAGGATATTTAGTAATAAATTATAGTTTACTATATATATTATATTTTATTATAGGGCAAATATTAGATATTTGCCTTAATACCATAGTTATCTTTCCATTTATTGTTATTTTCATTGTAAATAAGATAAATTTTCACCATTTAGGTGTGGTGGATTAATCAAACATTCTTATATAGCCTAAAACTTAATATTCGTGCAGATGATTGGCTCACGATTCCTTAATTGCTTTTCTTAATTTAATACTACTGGTTTGAAACTTATCAAATTTTTCGAAATGGCAGGGAAACTACTGGAAGTTACTTGCCGGATGTTGAGAACTTTTGGACAGGATTCCTCTTTCACAGCCTGTCACTGA
mRNA sequence
ATAATATTCATCGCAACTTTTAGCTCACAGACAGCTTCACACACACACACACAGTACCTCTTCTCTCTCTCTAGCGTCTCTTTCTCTCTCTTCCGAGTTTCCCACCGGCGCCTGAATACCACCGCATTCCCCGGCCGCCATGGATTTCACCATCCGTGGCGCCTCGAAGCTCTGGTTTGCTGTACTTTTTGTGGTGTTACCGATTTTTGTGGCGGTTGCATTGCCGGAAAATTCGACTTCTAGGTCGTCGTCTCCAGCAATGTCCGGCCAAATCAACTCAAATTCCATCCTCGTCGCTTTACTGGACTCTCATTATACGGAGCTGGCCGAGATAGTTGAGAAGGCTATGCTATTGCAGACCCTAGAGGACGCCGTCGGCAACCACAATATCACCATTTTCGCCCCTAAAAATGAAGCTCTGGAGCGCGATCTAGACCCTGAGTTTAAGAGGTTCCTTCTTGAACCGGGGAACTTGAAGTCGCTCCAGACCTTGCTGATGTTCCATGTGATTCCGACTCGGATCGGATCGAAAGACTGGCCGAGTCATTCCAAGTCCGCACGGCAATCCACGCTTTCCAAGCACGTTCTTCATCTCGTTGGTCATGATACCGGGGAGAAGACCGTCGATCTCGCCAATGTGATCCAATCCGATGCAATTACCAGACCGGACGGCGTTATTCACGGGATTGAACGATTGTTAATCCCTCAGTCCGTACAGGATGATTTCAATCGCAGACGGAATCTACAGGCTATAACCGCCGTGAAGCCGGAGGGAGCGCCTGAAGTGGATCCCAGGACTCACCGGCTAAAGAAACCTGCTCCACCTGCCGAACCTGGTTCAACGCCTGTCATTCCGATATACGATGCGTTGGCTCCAGGACCTAGTTTGGCTCCGGCTCCAGCACCAGGGCCCGGCGGAGCTCATCGTCATTTCAACGGCGAGAGGCAAGTTAAAGATTTCATCCACACTCTGCTACACTATGGTGGATATAACGAAATGGCCGACATTCTCGTTAACCTAACATCGCTAGCAACGGAAATGGGGCGGCTAGTATCGGAGGGTTATGTGCTCACTGTTCTGGCTCCTAACGATGAAGCCATGGCTAAACTCACTACAGACCAGTTGAGCGAGCCAGGGGCACCAGAGCAGATTATGTACTATCATCTAATACCCGAGTATCAGACGGAAGAGAGCATGTACAACGCTGTAAGGAGGTTCGGGAAGGTGCGATACGACTCCTTGAGGTTGCCGCACAAAGTAGTGGCTCAAGAGGCCGATGGGTCGGTCAAGTTCGGTAATGGAGATGGATCGGCATATTTGTTTGATCCTGATATTTACACGGATGGACGAATCTCGGTGCAGGGCATTGATGGAGTATTATTCCCGCCGGAGGAGACAGAGGCCAAGAACGCACCAAAGGCAGTTCAACCCACCAAAGTCGCGGCCAAGCCCCGGAGAGGGAAACTACTGGAAGTTACTTGCCGGATGTTGAGAACTTTTGGACAGGATTCCTCTTTCACAGCCTGTCACTGA
Coding sequence (CDS)
ATGGATTTCACCATCCGTGGCGCCTCGAAGCTCTGGTTTGCTGTACTTTTTGTGGTGTTACCGATTTTTGTGGCGGTTGCATTGCCGGAAAATTCGACTTCTAGGTCGTCGTCTCCAGCAATGTCCGGCCAAATCAACTCAAATTCCATCCTCGTCGCTTTACTGGACTCTCATTATACGGAGCTGGCCGAGATAGTTGAGAAGGCTATGCTATTGCAGACCCTAGAGGACGCCGTCGGCAACCACAATATCACCATTTTCGCCCCTAAAAATGAAGCTCTGGAGCGCGATCTAGACCCTGAGTTTAAGAGGTTCCTTCTTGAACCGGGGAACTTGAAGTCGCTCCAGACCTTGCTGATGTTCCATGTGATTCCGACTCGGATCGGATCGAAAGACTGGCCGAGTCATTCCAAGTCCGCACGGCAATCCACGCTTTCCAAGCACGTTCTTCATCTCGTTGGTCATGATACCGGGGAGAAGACCGTCGATCTCGCCAATGTGATCCAATCCGATGCAATTACCAGACCGGACGGCGTTATTCACGGGATTGAACGATTGTTAATCCCTCAGTCCGTACAGGATGATTTCAATCGCAGACGGAATCTACAGGCTATAACCGCCGTGAAGCCGGAGGGAGCGCCTGAAGTGGATCCCAGGACTCACCGGCTAAAGAAACCTGCTCCACCTGCCGAACCTGGTTCAACGCCTGTCATTCCGATATACGATGCGTTGGCTCCAGGACCTAGTTTGGCTCCGGCTCCAGCACCAGGGCCCGGCGGAGCTCATCGTCATTTCAACGGCGAGAGGCAAGTTAAAGATTTCATCCACACTCTGCTACACTATGGTGGATATAACGAAATGGCCGACATTCTCGTTAACCTAACATCGCTAGCAACGGAAATGGGGCGGCTAGTATCGGAGGGTTATGTGCTCACTGTTCTGGCTCCTAACGATGAAGCCATGGCTAAACTCACTACAGACCAGTTGAGCGAGCCAGGGGCACCAGAGCAGATTATGTACTATCATCTAATACCCGAGTATCAGACGGAAGAGAGCATGTACAACGCTGTAAGGAGGTTCGGGAAGGTGCGATACGACTCCTTGAGGTTGCCGCACAAAGTAGTGGCTCAAGAGGCCGATGGGTCGGTCAAGTTCGGTAATGGAGATGGATCGGCATATTTGTTTGATCCTGATATTTACACGGATGGACGAATCTCGGTGCAGGGCATTGATGGAGTATTATTCCCGCCGGAGGAGACAGAGGCCAAGAACGCACCAAAGGCAGTTCAACCCACCAAAGTCGCGGCCAAGCCCCGGAGAGGGAAACTACTGGAAGTTACTTGCCGGATGTTGAGAACTTTTGGACAGGATTCCTCTTTCACAGCCTGTCACTGA
Protein sequence
MDFTIRGASKLWFAVLFVVLPIFVAVALPENSTSRSSSPAMSGQINSNSILVALLDSHYTELAEIVEKAMLLQTLEDAVGNHNITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLMFHVIPTRIGSKDWPSHSKSARQSTLSKHVLHLVGHDTGEKTVDLANVIQSDAITRPDGVIHGIERLLIPQSVQDDFNRRRNLQAITAVKPEGAPEVDPRTHRLKKPAPPAEPGSTPVIPIYDALAPGPSLAPAPAPGPGGAHRHFNGERQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHLIPEYQTEESMYNAVRRFGKVRYDSLRLPHKVVAQEADGSVKFGNGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEETEAKNAPKAVQPTKVAAKPRRGKLLEVTCRMLRTFGQDSSFTACH
Homology
BLAST of CmaCh02G012940 vs. ExPASy Swiss-Prot
Match:
Q66GR0 (Fasciclin-like arabinogalactan protein 17 OS=Arabidopsis thaliana OX=3702 GN=FLA17 PE=2 SV=1)
HSP 1 Score: 648.3 bits (1671), Expect = 6.7e-185
Identity = 337/464 (72.63%), Postives = 379/464 (81.68%), Query Frame = 0
Query: 1 MDFTIRGASKLWFAVLFVVLPIFVAVALPENSTSRSSSPAMSGQINSNSILVALLDSHYT 60
MD I G S + LF + IF A + + S SS SGQINSNS+LVALLDS YT
Sbjct: 1 MDRRIYGGSAVIHLFLFFSVLIFSAASALSKNQSPSSG---SGQINSNSVLVALLDSRYT 60
Query: 61 ELAEIVEKAMLLQTLEDAVGNHNITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLM 120
ELAE+VEKA+LLQTLEDAVG HNITIFAP+NEALERDLDPEFKRFLLEPGNLKSLQTLLM
Sbjct: 61 ELAELVEKALLLQTLEDAVGRHNITIFAPRNEALERDLDPEFKRFLLEPGNLKSLQTLLM 120
Query: 121 FHVIPTRIGSKDWPS-HSKSARQSTLSKHVLHLVGHDTGEKTVDLANVIQSDAITRPDGV 180
FH+IP R+GS WPS S + TL + L + G+K VDLA +I+ D +TRPDG+
Sbjct: 121 FHIIPNRVGSNQWPSEESGRVKHHTLGNDQVRL-SNGQGKKMVDLAEIIRPDDLTRPDGL 180
Query: 181 IHGIERLLIPQSVQDDFNRRRNLQAITAVKPEGAPEVDPRTHRLKKPAPPAEPGSTPVIP 240
IHGIERLLIP+SVQ+DFNRRR+LQ+I+AV PEGAPEVDPRT+RLKKPA P GS P +P
Sbjct: 181 IHGIERLLIPRSVQEDFNRRRSLQSISAVLPEGAPEVDPRTNRLKKPAAPVPAGSPPALP 240
Query: 241 IYDALAPGPSLAPAPAPGPGGAHRHFNGERQVKDFIHTLLHYGGYNEMADILVNLTSLAT 300
I A+APGPSLAPAPAPGPGG HF+GE QVKDFIHTLLHYGGYNEMADILVNLTSLAT
Sbjct: 241 IQSAMAPGPSLAPAPAPGPGGKQHHFDGEAQVKDFIHTLLHYGGYNEMADILVNLTSLAT 300
Query: 301 EMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHLIPEYQTEESMYNAVRR 360
EMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQI+YYH+IPEYQTEESMYN+VRR
Sbjct: 301 EMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIVYYHIIPEYQTEESMYNSVRR 360
Query: 361 FGKVRYDSLRLPHKVVAQEADGSVKFGNGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEE 420
FGKV++D+LR PHKV A+EADGSVKFG+G+ SAYLFDPDIYTDGRISVQGIDGVLFP EE
Sbjct: 361 FGKVKFDTLRFPHKVAAKEADGSVKFGDGEKSAYLFDPDIYTDGRISVQGIDGVLFPQEE 420
Query: 421 TEAKNAPKAVQPTKVAAKPRRGKLLEVTCRMLRTFGQDSSFTAC 464
++ K P K +PRRGKLLEV C ML FG+D+ + C
Sbjct: 421 EVVESVKK---PVKKIVQPRRGKLLEVACSMLGAFGKDTYLSKC 457
BLAST of CmaCh02G012940 vs. ExPASy Swiss-Prot
Match:
Q8RWC5 (Fasciclin-like arabinogalactan protein 16 OS=Arabidopsis thaliana OX=3702 GN=FLA16 PE=2 SV=1)
HSP 1 Score: 636.3 bits (1640), Expect = 2.6e-181
Identity = 330/445 (74.16%), Postives = 369/445 (82.92%), Query Frame = 0
Query: 7 GASKLWFAVLFVVLPIFVAVALPENSTSRSSSPAMSGQINSNSILVALLDSHYTELAEIV 66
GA+K +L + L +A ALP+N + GQINSNS+LVALLDSHYTELAE+V
Sbjct: 6 GATKF---LLLLFLTTSIATALPDNK-------PVPGQINSNSVLVALLDSHYTELAELV 65
Query: 67 EKAMLLQTLEDAVGNHNITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLMFHVIPT 126
EKA+LLQTLE+AVG HNITIFAP+N+ALER+LDP FK FLLEP NLKSLQ+LLMFH++P
Sbjct: 66 EKALLLQTLEEAVGKHNITIFAPRNDALERNLDPLFKSFLLEPRNLKSLQSLLMFHILPK 125
Query: 127 RIGSKDWPSHSKSARQSTLSKHVLHLVGHDTGEKTVDLANVIQSDAITRPDGVIHGIERL 186
RI S WPS S R TLS LHL D VD A +I+ D + RPDG+IHGIERL
Sbjct: 126 RITSPQWPSLSHHHR--TLSNDHLHLT-VDVNTLKVDSAEIIRPDDVIRPDGIIHGIERL 185
Query: 187 LIPQSVQDDFNRRRNLQAITAVKPEGAPEVDPRTHRLKKPAPPAEPGSTPVIPIYDALAP 246
LIP+SVQ+DFNRRR+L++I+AV PEGAPEVDPRTHRLKKP+P G+ PV+PIYDA++P
Sbjct: 186 LIPRSVQEDFNRRRSLRSISAVIPEGAPEVDPRTHRLKKPSPAVPAGAPPVLPIYDAMSP 245
Query: 247 GPSLAPAPAPGPGGAHRHFNGERQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVS 306
GPSLAPAPAPGPGG HFNG+ QVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVS
Sbjct: 246 GPSLAPAPAPGPGGPRGHFNGDAQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVS 305
Query: 307 EGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHLIPEYQTEESMYNAVRRFGKVRYD 366
EGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYH+IPEYQTEESMYNAVRRFGKV+YD
Sbjct: 306 EGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHIIPEYQTEESMYNAVRRFGKVKYD 365
Query: 367 SLRLPHKVVAQEADGSVKFGNGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEETEAKNAP 426
SLR PHKV+AQEADGSVKFG+GDGSAYLFDPDIYTDGRISVQGIDGVLFP EET A
Sbjct: 366 SLRFPHKVLAQEADGSVKFGHGDGSAYLFDPDIYTDGRISVQGIDGVLFPKEETPATEIK 425
Query: 427 KAVQPTKVAAKPRRGKLLEVTCRML 452
A K +K RRGKL+EV CRM+
Sbjct: 426 PAAPVVKKVSKSRRGKLMEVACRMM 437
BLAST of CmaCh02G012940 vs. ExPASy Swiss-Prot
Match:
Q93W32 (Fasciclin-like arabinogalactan protein 18 OS=Arabidopsis thaliana OX=3702 GN=FLA18 PE=2 SV=1)
HSP 1 Score: 623.6 bits (1607), Expect = 1.8e-177
Identity = 318/428 (74.30%), Postives = 359/428 (83.88%), Query Frame = 0
Query: 42 SGQINSNSILVALLDSHYTELAEIVEKAMLLQTLEDAVGNHNITIFAPKNEALERDLDPE 101
SGQINSNS+LVALLDS YTELAE+VEKA+LLQTLEDAVG HNITIFAP+NEALERDLDP+
Sbjct: 36 SGQINSNSVLVALLDSRYTELAELVEKALLLQTLEDAVGRHNITIFAPRNEALERDLDPD 95
Query: 102 FKRFLLEPGNLKSLQTLLMFHVIPTRIGSKDWPSHSKSARQSTLSKH--VLHL--VGHDT 161
FKRFLL+PGNLKSLQTLL+ H+IP R+GS WP + + H VLHL +
Sbjct: 96 FKRFLLQPGNLKSLQTLLLSHIIPKRVGSNQWPEENSGRVKHVTLGHDQVLHLSKLKGTN 155
Query: 162 GEKTVDLANVIQSDAITRPDGVIHGIERLLIPQSVQDDFNRRRNLQAITAVKPEGAPEVD 221
G++ V+ A + + D +TRPDG+IHGIERLLIP+SVQ+DFNRRRNL++I+AV PEGAPE+D
Sbjct: 156 GKRLVNSAVITRPDDLTRPDGLIHGIERLLIPRSVQEDFNRRRNLRSISAVLPEGAPEID 215
Query: 222 PRTHRLKKPAPPAE--PGSTPVIPIYDALAPGPSLAPAPAPGPGGAHRHFNGERQVKDFI 281
PRT+RLKK A GS PV+PI A+APGPSLAPAPAPGPGGAH+HFNG+ QVKDFI
Sbjct: 216 PRTNRLKKSATAVSVPAGSPPVLPIESAMAPGPSLAPAPAPGPGGAHKHFNGDAQVKDFI 275
Query: 282 HTLLHYGGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAP 341
HTLLHYGGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAM KLTTDQLSEPGAP
Sbjct: 276 HTLLHYGGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAMGKLTTDQLSEPGAP 335
Query: 342 EQIMYYHLIPEYQTEESMYNAVRRFGKVRYDSLRLPHKVVAQEADGSVKFGNGDGSAYLF 401
EQIMYYH+IPEYQTEESMYN+VRRFGKV+Y++LR PHKV A+EADGSVKFG+GD SAYLF
Sbjct: 336 EQIMYYHIIPEYQTEESMYNSVRRFGKVKYETLRFPHKVGAKEADGSVKFGSGDRSAYLF 395
Query: 402 DPDIYTDGRISVQGIDGVLFPPEETEAKNAPKAVQPTKVAAKPRRGKLLEVTCRMLRTFG 461
DPDIYTDGRISVQGIDGVLF PEE E + K P K +PRRGKLLEV C ML G
Sbjct: 396 DPDIYTDGRISVQGIDGVLF-PEEKEEETVKKPTGPVKKVVQPRRGKLLEVACSMLGAIG 455
Query: 462 QDSSFTAC 464
+DS + C
Sbjct: 456 KDSYLSRC 462
BLAST of CmaCh02G012940 vs. ExPASy Swiss-Prot
Match:
Q9FT45 (Fasciclin-like arabinogalactan protein 15 OS=Arabidopsis thaliana OX=3702 GN=FLA15 PE=2 SV=1)
HSP 1 Score: 616.7 bits (1589), Expect = 2.2e-175
Identity = 326/455 (71.65%), Postives = 365/455 (80.22%), Query Frame = 0
Query: 9 SKLWFAVLFVVLPIFVAVALPENSTSRSSSPAMSGQINSNSILVALLDSHYTELAEIVEK 68
SKL F F++L I + ALP+ SGQINSNS+LVALLDSHYTELAE+VEK
Sbjct: 5 SKLLF---FLLLTISITTALPDKPG--------SGQINSNSVLVALLDSHYTELAELVEK 64
Query: 69 AMLLQTLEDAVGNHNITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLMFHVIPTRI 128
A+LLQTLE+AVG HNITIFAP+N+ALE++LDPEFK FLL+P NLKSLQ+LLMFH++P RI
Sbjct: 65 ALLLQTLEEAVGQHNITIFAPRNDALEKNLDPEFKSFLLQPKNLKSLQSLLMFHILPKRI 124
Query: 129 GSKDWPSHSKSARQSTLSKHVLHLVGHDTGEKTVDLANVIQSDAITRPDGVIHGIERLLI 188
S + S S R TLS LH V+ A + + D +TRPDG+IHGIERLLI
Sbjct: 125 TSPQFSSAVVSHR--TLSNDHLHFT-----NGKVNSAEITKPDDLTRPDGIIHGIERLLI 184
Query: 189 PQSVQDDFNRRRNLQAITAVKPEGAPEVDPRTHRLKKPAPPAEPGSTPVIPIYDALAPGP 248
P+SVQ+DFNRRR+L++I AV PEGAPEVDPRTHRLKK P G+ PV+P+YDA++PGP
Sbjct: 185 PRSVQEDFNRRRSLRSIAAVLPEGAPEVDPRTHRLKKKPAPIPAGAPPVLPVYDAMSPGP 244
Query: 249 SLAPAPAPGPGGAHRHFNGERQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVSEG 308
SLAPAPAPGPGG HFNGE QVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVSEG
Sbjct: 245 SLAPAPAPGPGGPRHHFNGEAQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVSEG 304
Query: 309 YVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHLIPEYQTEESMYNAVRRFGKVRYDSL 368
YVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYH+IPEYQTEESMYN+VRRFGK+RYDSL
Sbjct: 305 YVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHIIPEYQTEESMYNSVRRFGKIRYDSL 364
Query: 369 RLPHKVVAQEADGSVKFGNGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEETEAKNAPKA 428
R PHKV AQEADGSVKFG+GDGSAYLFDPDIYTDGRISVQGIDGVLFP E+T +
Sbjct: 365 RFPHKVEAQEADGSVKFGHGDGSAYLFDPDIYTDGRISVQGIDGVLFPEEKTPVEK-KTG 424
Query: 429 VQPTKVAAKPRRGKLLEVTCRMLRTFGQDSSFTAC 464
V K A KPRRGKL+EV C ML S F C
Sbjct: 425 VPVVKKAPKPRRGKLMEVACTML-----GSQFPTC 435
BLAST of CmaCh02G012940 vs. TAIR 10
Match:
AT5G06390.1 (FASCICLIN-like arabinogalactan protein 17 precursor )
HSP 1 Score: 648.3 bits (1671), Expect = 4.7e-186
Identity = 337/464 (72.63%), Postives = 379/464 (81.68%), Query Frame = 0
Query: 1 MDFTIRGASKLWFAVLFVVLPIFVAVALPENSTSRSSSPAMSGQINSNSILVALLDSHYT 60
MD I G S + LF + IF A + + S SS SGQINSNS+LVALLDS YT
Sbjct: 1 MDRRIYGGSAVIHLFLFFSVLIFSAASALSKNQSPSSG---SGQINSNSVLVALLDSRYT 60
Query: 61 ELAEIVEKAMLLQTLEDAVGNHNITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLM 120
ELAE+VEKA+LLQTLEDAVG HNITIFAP+NEALERDLDPEFKRFLLEPGNLKSLQTLLM
Sbjct: 61 ELAELVEKALLLQTLEDAVGRHNITIFAPRNEALERDLDPEFKRFLLEPGNLKSLQTLLM 120
Query: 121 FHVIPTRIGSKDWPS-HSKSARQSTLSKHVLHLVGHDTGEKTVDLANVIQSDAITRPDGV 180
FH+IP R+GS WPS S + TL + L + G+K VDLA +I+ D +TRPDG+
Sbjct: 121 FHIIPNRVGSNQWPSEESGRVKHHTLGNDQVRL-SNGQGKKMVDLAEIIRPDDLTRPDGL 180
Query: 181 IHGIERLLIPQSVQDDFNRRRNLQAITAVKPEGAPEVDPRTHRLKKPAPPAEPGSTPVIP 240
IHGIERLLIP+SVQ+DFNRRR+LQ+I+AV PEGAPEVDPRT+RLKKPA P GS P +P
Sbjct: 181 IHGIERLLIPRSVQEDFNRRRSLQSISAVLPEGAPEVDPRTNRLKKPAAPVPAGSPPALP 240
Query: 241 IYDALAPGPSLAPAPAPGPGGAHRHFNGERQVKDFIHTLLHYGGYNEMADILVNLTSLAT 300
I A+APGPSLAPAPAPGPGG HF+GE QVKDFIHTLLHYGGYNEMADILVNLTSLAT
Sbjct: 241 IQSAMAPGPSLAPAPAPGPGGKQHHFDGEAQVKDFIHTLLHYGGYNEMADILVNLTSLAT 300
Query: 301 EMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHLIPEYQTEESMYNAVRR 360
EMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQI+YYH+IPEYQTEESMYN+VRR
Sbjct: 301 EMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIVYYHIIPEYQTEESMYNSVRR 360
Query: 361 FGKVRYDSLRLPHKVVAQEADGSVKFGNGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEE 420
FGKV++D+LR PHKV A+EADGSVKFG+G+ SAYLFDPDIYTDGRISVQGIDGVLFP EE
Sbjct: 361 FGKVKFDTLRFPHKVAAKEADGSVKFGDGEKSAYLFDPDIYTDGRISVQGIDGVLFPQEE 420
Query: 421 TEAKNAPKAVQPTKVAAKPRRGKLLEVTCRMLRTFGQDSSFTAC 464
++ K P K +PRRGKLLEV C ML FG+D+ + C
Sbjct: 421 EVVESVKK---PVKKIVQPRRGKLLEVACSMLGAFGKDTYLSKC 457
BLAST of CmaCh02G012940 vs. TAIR 10
Match:
AT2G35860.1 (FASCICLIN-like arabinogalactan protein 16 precursor )
HSP 1 Score: 636.3 bits (1640), Expect = 1.9e-182
Identity = 330/445 (74.16%), Postives = 369/445 (82.92%), Query Frame = 0
Query: 7 GASKLWFAVLFVVLPIFVAVALPENSTSRSSSPAMSGQINSNSILVALLDSHYTELAEIV 66
GA+K +L + L +A ALP+N + GQINSNS+LVALLDSHYTELAE+V
Sbjct: 6 GATKF---LLLLFLTTSIATALPDNK-------PVPGQINSNSVLVALLDSHYTELAELV 65
Query: 67 EKAMLLQTLEDAVGNHNITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLMFHVIPT 126
EKA+LLQTLE+AVG HNITIFAP+N+ALER+LDP FK FLLEP NLKSLQ+LLMFH++P
Sbjct: 66 EKALLLQTLEEAVGKHNITIFAPRNDALERNLDPLFKSFLLEPRNLKSLQSLLMFHILPK 125
Query: 127 RIGSKDWPSHSKSARQSTLSKHVLHLVGHDTGEKTVDLANVIQSDAITRPDGVIHGIERL 186
RI S WPS S R TLS LHL D VD A +I+ D + RPDG+IHGIERL
Sbjct: 126 RITSPQWPSLSHHHR--TLSNDHLHLT-VDVNTLKVDSAEIIRPDDVIRPDGIIHGIERL 185
Query: 187 LIPQSVQDDFNRRRNLQAITAVKPEGAPEVDPRTHRLKKPAPPAEPGSTPVIPIYDALAP 246
LIP+SVQ+DFNRRR+L++I+AV PEGAPEVDPRTHRLKKP+P G+ PV+PIYDA++P
Sbjct: 186 LIPRSVQEDFNRRRSLRSISAVIPEGAPEVDPRTHRLKKPSPAVPAGAPPVLPIYDAMSP 245
Query: 247 GPSLAPAPAPGPGGAHRHFNGERQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVS 306
GPSLAPAPAPGPGG HFNG+ QVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVS
Sbjct: 246 GPSLAPAPAPGPGGPRGHFNGDAQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVS 305
Query: 307 EGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHLIPEYQTEESMYNAVRRFGKVRYD 366
EGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYH+IPEYQTEESMYNAVRRFGKV+YD
Sbjct: 306 EGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHIIPEYQTEESMYNAVRRFGKVKYD 365
Query: 367 SLRLPHKVVAQEADGSVKFGNGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEETEAKNAP 426
SLR PHKV+AQEADGSVKFG+GDGSAYLFDPDIYTDGRISVQGIDGVLFP EET A
Sbjct: 366 SLRFPHKVLAQEADGSVKFGHGDGSAYLFDPDIYTDGRISVQGIDGVLFPKEETPATEIK 425
Query: 427 KAVQPTKVAAKPRRGKLLEVTCRML 452
A K +K RRGKL+EV CRM+
Sbjct: 426 PAAPVVKKVSKSRRGKLMEVACRMM 437
BLAST of CmaCh02G012940 vs. TAIR 10
Match:
AT3G11700.1 (FASCICLIN-like arabinogalactan protein 18 precursor )
HSP 1 Score: 623.6 bits (1607), Expect = 1.3e-178
Identity = 318/428 (74.30%), Postives = 359/428 (83.88%), Query Frame = 0
Query: 42 SGQINSNSILVALLDSHYTELAEIVEKAMLLQTLEDAVGNHNITIFAPKNEALERDLDPE 101
SGQINSNS+LVALLDS YTELAE+VEKA+LLQTLEDAVG HNITIFAP+NEALERDLDP+
Sbjct: 36 SGQINSNSVLVALLDSRYTELAELVEKALLLQTLEDAVGRHNITIFAPRNEALERDLDPD 95
Query: 102 FKRFLLEPGNLKSLQTLLMFHVIPTRIGSKDWPSHSKSARQSTLSKH--VLHL--VGHDT 161
FKRFLL+PGNLKSLQTLL+ H+IP R+GS WP + + H VLHL +
Sbjct: 96 FKRFLLQPGNLKSLQTLLLSHIIPKRVGSNQWPEENSGRVKHVTLGHDQVLHLSKLKGTN 155
Query: 162 GEKTVDLANVIQSDAITRPDGVIHGIERLLIPQSVQDDFNRRRNLQAITAVKPEGAPEVD 221
G++ V+ A + + D +TRPDG+IHGIERLLIP+SVQ+DFNRRRNL++I+AV PEGAPE+D
Sbjct: 156 GKRLVNSAVITRPDDLTRPDGLIHGIERLLIPRSVQEDFNRRRNLRSISAVLPEGAPEID 215
Query: 222 PRTHRLKKPAPPAE--PGSTPVIPIYDALAPGPSLAPAPAPGPGGAHRHFNGERQVKDFI 281
PRT+RLKK A GS PV+PI A+APGPSLAPAPAPGPGGAH+HFNG+ QVKDFI
Sbjct: 216 PRTNRLKKSATAVSVPAGSPPVLPIESAMAPGPSLAPAPAPGPGGAHKHFNGDAQVKDFI 275
Query: 282 HTLLHYGGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAP 341
HTLLHYGGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAM KLTTDQLSEPGAP
Sbjct: 276 HTLLHYGGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAMGKLTTDQLSEPGAP 335
Query: 342 EQIMYYHLIPEYQTEESMYNAVRRFGKVRYDSLRLPHKVVAQEADGSVKFGNGDGSAYLF 401
EQIMYYH+IPEYQTEESMYN+VRRFGKV+Y++LR PHKV A+EADGSVKFG+GD SAYLF
Sbjct: 336 EQIMYYHIIPEYQTEESMYNSVRRFGKVKYETLRFPHKVGAKEADGSVKFGSGDRSAYLF 395
Query: 402 DPDIYTDGRISVQGIDGVLFPPEETEAKNAPKAVQPTKVAAKPRRGKLLEVTCRMLRTFG 461
DPDIYTDGRISVQGIDGVLF PEE E + K P K +PRRGKLLEV C ML G
Sbjct: 396 DPDIYTDGRISVQGIDGVLF-PEEKEEETVKKPTGPVKKVVQPRRGKLLEVACSMLGAIG 455
Query: 462 QDSSFTAC 464
+DS + C
Sbjct: 456 KDSYLSRC 462
BLAST of CmaCh02G012940 vs. TAIR 10
Match:
AT3G52370.1 (FASCICLIN-like arabinogalactan protein 15 precursor )
HSP 1 Score: 616.7 bits (1589), Expect = 1.5e-176
Identity = 326/455 (71.65%), Postives = 365/455 (80.22%), Query Frame = 0
Query: 9 SKLWFAVLFVVLPIFVAVALPENSTSRSSSPAMSGQINSNSILVALLDSHYTELAEIVEK 68
SKL F F++L I + ALP+ SGQINSNS+LVALLDSHYTELAE+VEK
Sbjct: 5 SKLLF---FLLLTISITTALPDKPG--------SGQINSNSVLVALLDSHYTELAELVEK 64
Query: 69 AMLLQTLEDAVGNHNITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLMFHVIPTRI 128
A+LLQTLE+AVG HNITIFAP+N+ALE++LDPEFK FLL+P NLKSLQ+LLMFH++P RI
Sbjct: 65 ALLLQTLEEAVGQHNITIFAPRNDALEKNLDPEFKSFLLQPKNLKSLQSLLMFHILPKRI 124
Query: 129 GSKDWPSHSKSARQSTLSKHVLHLVGHDTGEKTVDLANVIQSDAITRPDGVIHGIERLLI 188
S + S S R TLS LH V+ A + + D +TRPDG+IHGIERLLI
Sbjct: 125 TSPQFSSAVVSHR--TLSNDHLHFT-----NGKVNSAEITKPDDLTRPDGIIHGIERLLI 184
Query: 189 PQSVQDDFNRRRNLQAITAVKPEGAPEVDPRTHRLKKPAPPAEPGSTPVIPIYDALAPGP 248
P+SVQ+DFNRRR+L++I AV PEGAPEVDPRTHRLKK P G+ PV+P+YDA++PGP
Sbjct: 185 PRSVQEDFNRRRSLRSIAAVLPEGAPEVDPRTHRLKKKPAPIPAGAPPVLPVYDAMSPGP 244
Query: 249 SLAPAPAPGPGGAHRHFNGERQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVSEG 308
SLAPAPAPGPGG HFNGE QVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVSEG
Sbjct: 245 SLAPAPAPGPGGPRHHFNGEAQVKDFIHTLLHYGGYNEMADILVNLTSLATEMGRLVSEG 304
Query: 309 YVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHLIPEYQTEESMYNAVRRFGKVRYDSL 368
YVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYH+IPEYQTEESMYN+VRRFGK+RYDSL
Sbjct: 305 YVLTVLAPNDEAMAKLTTDQLSEPGAPEQIMYYHIIPEYQTEESMYNSVRRFGKIRYDSL 364
Query: 369 RLPHKVVAQEADGSVKFGNGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEETEAKNAPKA 428
R PHKV AQEADGSVKFG+GDGSAYLFDPDIYTDGRISVQGIDGVLFP E+T +
Sbjct: 365 RFPHKVEAQEADGSVKFGHGDGSAYLFDPDIYTDGRISVQGIDGVLFPEEKTPVEK-KTG 424
Query: 429 VQPTKVAAKPRRGKLLEVTCRMLRTFGQDSSFTAC 464
V K A KPRRGKL+EV C ML S F C
Sbjct: 425 VPVVKKAPKPRRGKLMEVACTML-----GSQFPTC 435
BLAST of CmaCh02G012940 vs. TAIR 10
Match:
AT5G05650.1 (BEST Arabidopsis thaliana protein match is: FASCICLIN-like arabinogalactan protein 17 precursor (TAIR:AT5G06390.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 112.5 bits (280), Expect = 9.4e-25
Identity = 53/92 (57.61%), Postives = 68/92 (73.91%), Query Frame = 0
Query: 327 DQLSEPGAPEQIMYYHLIPEYQTEESMYNAVRRFGKVRYDSLRLPHKVVAQEADGSVKFG 386
DQLSE +QI YYH+IPEYQTE+S Y VRR G +++D+ PH + A+E S+KFG
Sbjct: 2 DQLSE----KQIWYYHIIPEYQTEKSFYACVRRSGMIKFDTFYFPHMLSARETQRSIKFG 61
Query: 387 NGDGSAYLFDPDIYTDGRISVQGIDGVLFPPE 419
+G S L+DPDIYTDG+IS+QG+ GVLFP E
Sbjct: 62 DGVWSGCLYDPDIYTDGKISIQGVGGVLFPRE 89
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q66GR0 | 6.7e-185 | 72.63 | Fasciclin-like arabinogalactan protein 17 OS=Arabidopsis thaliana OX=3702 GN=FLA... | [more] |
Q8RWC5 | 2.6e-181 | 74.16 | Fasciclin-like arabinogalactan protein 16 OS=Arabidopsis thaliana OX=3702 GN=FLA... | [more] |
Q93W32 | 1.8e-177 | 74.30 | Fasciclin-like arabinogalactan protein 18 OS=Arabidopsis thaliana OX=3702 GN=FLA... | [more] |
Q9FT45 | 2.2e-175 | 71.65 | Fasciclin-like arabinogalactan protein 15 OS=Arabidopsis thaliana OX=3702 GN=FLA... | [more] |
Match Name | E-value | Identity | Description | |
AT5G06390.1 | 4.7e-186 | 72.63 | FASCICLIN-like arabinogalactan protein 17 precursor | [more] |
AT2G35860.1 | 1.9e-182 | 74.16 | FASCICLIN-like arabinogalactan protein 16 precursor | [more] |
AT3G11700.1 | 1.3e-178 | 74.30 | FASCICLIN-like arabinogalactan protein 18 precursor | [more] |
AT3G52370.1 | 1.5e-176 | 71.65 | FASCICLIN-like arabinogalactan protein 15 precursor | [more] |
AT5G05650.1 | 9.4e-25 | 57.61 | BEST Arabidopsis thaliana protein match is: FASCICLIN-like arabinogalactan prote... | [more] |