Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATGTAAAAACGACGTTTTTACATGGAGATCTAGATGAGGAGATCTATATGCAACAACCAGAAGGGTTTGCAGCTCCAAGCAAGGAGCACATGGTGTGTAAGCTCAATAAGAGCTTGTATGGACTGAAACAAGCACCGAGACAATGGTACAAGAAGTTTGACTCCTTCATGTGCAAAAGTGGTTTCCAAAGGAGTGAAAAGGATCAGTGTTGCTACCTCAAGAAATACACTGATTCTTATGTGTTTCTACTCCTGTATGTGAATGATATACTAATTGCTGGATCAAGTATGAGGGAGATAAATCACCTGAAGGCAAGCTTGTCTTCAGTATTTGAGATGAAAGATTTAGGTGCAGCGAAGTAGATTCTTGGGATGAGGATTTCTCGAGATAGATCTGCTGGCACATTAAATCTATCCCAAGAGCAGTACATTGAGAAGATGTTGTCCAAATTCAAGATGAATAACGCTAAACCCAGGACTACCCCCTTGGCAAATCATATTAAATTGTCAAAGGGGCAATCTCCCAAGACAGTTGAGGAACGTGAGCACATGGCATCAGTTACGTACGCTTCTGCAGTTGGGAGTTTGATGTATGCTATGGTCTGCACTACTTCTGCAGTTGGGAGTTTGATGTATGCTATGGTCTGCACTAGACCTGACATAACACATGCAGTGGGAGTTGTTAGCAAGTACATGGCAAATCTAGGGAAGGAACATTGGGAAGCTGTGAAGTGGCTTCTGAGATATCTGAGAGGTACATCCAATACTTCACTTTGTTATGGCAATGACAAAGTAGTTTTGCAAGGTTTTGTGGATGCTGATCTGAGTGGAGATGTAGACTCCAGCAATAGCACATCTGGATATATCTACAATATAGATGGAACAGCAGTGAGTTGGATGTCCAAGCTTCAGAAATGTATTGCTCTTTCATCTACTGAAGCTGAGTACGTGGCCATAACTGAAGCTAGAAAGAAGATGATATGA
mRNA sequence
ATGAATGTAAAAACGACGTTTTTACATGGAGATCTAGATGAGGAGATCTATATGCAACAACCAGAAGGGTTTGCAGCTCCAAGCAAGGAGCACATGGTGTGTAAGCTCAATAAGAGCTTGTATGGACTGAAACAAGCACCGAGACAATGGTACAAGAAGTTTGACTCCTTCATGTGCAAAAGTGGTTTCCAAAGGAGTGAAAAGGATCAGTGTTGCTACCTCAAGAAATACACTGATTCTTATGTGTTTCTACTCCTATCTGCTGGCACATTAAATCTATCCCAAGAGCAGTACATTGAGAAGATGTTGTCCAAATTCAAGATGAATAACGCTAAACCCAGGACTACCCCCTTGGCAAATCATATTAAATTGTCAAAGGGGCAATCTCCCAAGACAGTTGAGGAACGTGAGCACATGGCATCAGTTACGTACGCTTCTGCAGTTGGGAGTTTGATGTATGCTATGGTCTGCACTACTTCTGCAGTTGGGAGTTTGATGTATGCTATGGTCTGCACTAGACCTGACATAACACATGCAGTGGGAGTTGTTAGCAAGTACATGGCAAATCTAGGGAAGGAACATTGGGAAGCTGTGAAGTGGCTTCTGAGATATCTGAGAGGTACATCCAATACTTCACTTTGTTATGGCAATGACAAAGTAGTTTTGCAAGGTTTTGTGGATGCTGATCTGAGTGGAGATGTAGACTCCAGCAATAGCACATCTGGATATATCTACAATATAGATGGAACAGCAGTGAGTTGGATGTCCAAGCTTCAGAAATGTATTGCTCTTTCATCTACTGAAGCTGAGTACGTGGCCATAACTGAAGCTAGAAAGAAGATGATATGA
Coding sequence (CDS)
ATGAATGTAAAAACGACGTTTTTACATGGAGATCTAGATGAGGAGATCTATATGCAACAACCAGAAGGGTTTGCAGCTCCAAGCAAGGAGCACATGGTGTGTAAGCTCAATAAGAGCTTGTATGGACTGAAACAAGCACCGAGACAATGGTACAAGAAGTTTGACTCCTTCATGTGCAAAAGTGGTTTCCAAAGGAGTGAAAAGGATCAGTGTTGCTACCTCAAGAAATACACTGATTCTTATGTGTTTCTACTCCTATCTGCTGGCACATTAAATCTATCCCAAGAGCAGTACATTGAGAAGATGTTGTCCAAATTCAAGATGAATAACGCTAAACCCAGGACTACCCCCTTGGCAAATCATATTAAATTGTCAAAGGGGCAATCTCCCAAGACAGTTGAGGAACGTGAGCACATGGCATCAGTTACGTACGCTTCTGCAGTTGGGAGTTTGATGTATGCTATGGTCTGCACTACTTCTGCAGTTGGGAGTTTGATGTATGCTATGGTCTGCACTAGACCTGACATAACACATGCAGTGGGAGTTGTTAGCAAGTACATGGCAAATCTAGGGAAGGAACATTGGGAAGCTGTGAAGTGGCTTCTGAGATATCTGAGAGGTACATCCAATACTTCACTTTGTTATGGCAATGACAAAGTAGTTTTGCAAGGTTTTGTGGATGCTGATCTGAGTGGAGATGTAGACTCCAGCAATAGCACATCTGGATATATCTACAATATAGATGGAACAGCAGTGAGTTGGATGTCCAAGCTTCAGAAATGTATTGCTCTTTCATCTACTGAAGCTGAGTACGTGGCCATAACTGAAGCTAGAAAGAAGATGATATGA
Protein sequence
MNVKTTFLHGDLDEEIYMQQPEGFAAPSKEHMVCKLNKSLYGLKQAPRQWYKKFDSFMCKSGFQRSEKDQCCYLKKYTDSYVFLLLSAGTLNLSQEQYIEKMLSKFKMNNAKPRTTPLANHIKLSKGQSPKTVEEREHMASVTYASAVGSLMYAMVCTTSAVGSLMYAMVCTRPDITHAVGVVSKYMANLGKEHWEAVKWLLRYLRGTSNTSLCYGNDKVVLQGFVDADLSGDVDSSNSTSGYIYNIDGTAVSWMSKLQKCIALSSTEAEYVAITEARKKMI
Homology
BLAST of CmaCh02G004940 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 325.1 bits (832), Expect = 7.9e-88
Identity = 167/329 (50.76%), Postives = 215/329 (65.35%), Query Frame = 0
Query: 1 MNVKTTFLHGDLDEEIYMQQPEGFAAPSKEHMVCKLNKSLYGLKQAPRQWYKKFDSFMCK 60
++VKT FLHGDL+EEIYM+QPEGF K+HMVCKLNKSLYGLKQAPRQWY KFDSFM
Sbjct: 921 LDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKS 980
Query: 61 SGFQRSEKDQCCYLKKYTD-SYVFLLL--------------------------------- 120
+ ++ D C Y K++++ +++ LLL
Sbjct: 981 QTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGP 1040
Query: 121 -------------SAGTLNLSQEQYIEKMLSKFKMNNAKPRTTPLANHIKLSKGQSPKTV 180
++ L LSQE+YIE++L +F M NAKP +TPLA H+KLSK P TV
Sbjct: 1041 AQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTV 1100
Query: 181 EEREHMASVTYASAVGSLMYAMVCTTSAVGSLMYAMVCTRPDITHAVGVVSKYMANLGKE 240
EE+ +MA V Y +SAVGSLMYAMVCTRPDI HAVGVVS+++ N GKE
Sbjct: 1101 EEKGNMAKVPY--------------SSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKE 1160
Query: 241 HWEAVKWLLRYLRGTSNTSLCYGNDKVVLQGFVDADLSGDVDSSNSTSGYIYNIDGTAVS 283
HWEAVKW+LRYLRGT+ LC+G +L+G+ DAD++GD+D+ S++GY++ G A+S
Sbjct: 1161 HWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAIS 1220
BLAST of CmaCh02G004940 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 155.6 bits (392), Expect = 8.3e-37
Identity = 107/335 (31.94%), Postives = 160/335 (47.76%), Query Frame = 0
Query: 1 MNVKTTFLHGDLDEEIYMQQPEGFAAPSKEHMVCKLNKSLYGLKQAPRQWYKKFDSFMCK 60
M+VKT FL+G L EEIYM+ P+G + S VCKLNK++YGLKQA R W++ F+ + +
Sbjct: 1001 MDVKTAFLNGTLKEEIYMRLPQGISCNSDN--VCKLNKAIYGLKQAARCWFEVFEQALKE 1060
Query: 61 SGFQRSEKDQCCYLKK------------YTDSYVF------------------------- 120
F S D+C Y+ Y D V
Sbjct: 1061 CEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLN 1120
Query: 121 ---------LLLSAGTLNLSQEQYIEKMLSKFKMNNAKPRTTPLANHIKLSKGQSPKTVE 180
+ + + LSQ Y++K+LSKF M N +TPL + I S +
Sbjct: 1121 EIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDED-- 1180
Query: 181 EREHMASVTYASAVGSLMYAMVCTT---SAVGSLMYAMVCTRPDITHAVGVVSKYMANLG 240
C T S +G LMY M+CTRPD+T AV ++S+Y +
Sbjct: 1181 ----------------------CNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNN 1240
Query: 241 KEHWEAVKWLLRYLRGTSNTSLCYGNDKVV---LQGFVDADLSGDVDSSNSTSGYIYNI- 283
E W+ +K +LRYL+GT + L + + + G+VD+D +G ST+GY++ +
Sbjct: 1241 SELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMF 1300
BLAST of CmaCh02G004940 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 153.7 bits (387), Expect = 3.1e-36
Identity = 108/326 (33.13%), Postives = 153/326 (46.93%), Query Frame = 0
Query: 1 MNVKTTFLHGDLDEEIYMQQPEGFAAPSKEHMVCKLNKSLYGLKQAPRQWYKKFDSFMCK 60
++V FL G L +++YM QP GF + + VCKL K+LYGLKQAPR WY + +++
Sbjct: 1064 LDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLT 1123
Query: 61 SGFQRSEKDQCCYLKKYTDSYVFLL------LSAGT------------------------ 120
GF S D ++ + S V++L L G
Sbjct: 1124 IGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEEL 1183
Query: 121 --------------LNLSQEQYIEKMLSKFKMNNAKPRTTPLANHIKLSKGQSPKTVEER 180
L+LSQ +YI +L++ M AKP TTP+A KLS K +
Sbjct: 1184 HYFLGIEAKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPT 1243
Query: 181 EHMASVTYASAVGSLMYAMVCTTSAVGSLMYAMVCTRPDITHAVGVVSKYMANLGKEHWE 240
E Y VGSL Y + TRPDI++AV +S++M +EH +
Sbjct: 1244 E------YRGIVGSLQY---------------LAFTRPDISYAVNRLSQFMHMPTEEHLQ 1303
Query: 241 AVKWLLRYLRGTSNTSLCYGNDKVV-LQGFVDADLSGDVDSSNSTSGYIYNIDGTAVSWM 282
A+K +LRYL GT N + + L + DAD +GD D ST+GYI + +SW
Sbjct: 1304 ALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWS 1363
BLAST of CmaCh02G004940 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 149.1 bits (375), Expect = 7.7e-35
Identity = 102/326 (31.29%), Postives = 150/326 (46.01%), Query Frame = 0
Query: 1 MNVKTTFLHGDLDEEIYMQQPEGFAAPSKEHMVCKLNKSLYGLKQAPRQWYKKFDSFMCK 60
++V FL G L +E+YM QP GF + VC+L K++YGLKQAPR WY + +++
Sbjct: 1047 LDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLT 1106
Query: 61 SGFQRSEKDQCCY----------------------------------------LKKYTDS 120
GF S D + +K++ D
Sbjct: 1107 VGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDL 1166
Query: 121 YVFLLLSAGT----LNLSQEQYIEKMLSKFKMNNAKPRTTPLANHIKLSKGQSPKTVEER 180
+ FL + A L+LSQ +Y +L++ M AKP TP+A KL+ K +
Sbjct: 1167 HYFLGIEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPT 1226
Query: 181 EHMASVTYASAVGSLMYAMVCTTSAVGSLMYAMVCTRPDITHAVGVVSKYMANLGKEHWE 240
E Y VGSL Y + TRPD+++AV +S+YM +HW
Sbjct: 1227 E------YRGIVGSLQY---------------LAFTRPDLSYAVNRLSQYMHMPTDDHWN 1286
Query: 241 AVKWLLRYLRGTSNTSLCYGNDKVV-LQGFVDADLSGDVDSSNSTSGYIYNIDGTAVSWM 282
A+K +LRYL GT + + + L + DAD +GD D ST+GYI + +SW
Sbjct: 1287 ALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWS 1346
BLAST of CmaCh02G004940 vs. ExPASy Swiss-Prot
Match:
P0CV72 (Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 PE=2 SV=1)
HSP 1 Score: 116.3 bits (290), Expect = 5.6e-25
Identity = 61/126 (48.41%), Postives = 90/126 (71.43%), Query Frame = 0
Query: 160 SAVGSLMYAMVCTRPDITHAVGVVSKYMANLGKEHWEAVKWLLRYLRGTSNTSLCY---G 219
SAVG++MY MV TRPD+ AVGV+S++ ++ HW+A+K +LRYL+ T L + G
Sbjct: 8 SAVGAIMYLMVVTRPDLAAAVGVLSQFASDPCPTHWQALKRVLRYLQSTQTYGLEFTRAG 67
Query: 220 NDKVVLQGFVDADLSGDVDSSNSTSGYIYNIDGTAVSWMSKLQKCIALSSTEAEYVAITE 279
K+V G+ DAD +GDV+S STSGY++ ++G VSW SK Q+ +ALSSTE EY+A++E
Sbjct: 68 TAKLV--GYSDADWAGDVESRRSTSGYLFKLNGGCVSWRSKKQRTVALSSTEDEYMALSE 127
Query: 280 ARKKMI 283
A ++ +
Sbjct: 128 ATQEAV 131
BLAST of CmaCh02G004940 vs. TAIR 10
Match:
AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )
HSP 1 Score: 123.6 bits (309), Expect = 2.5e-28
Identity = 92/313 (29.39%), Postives = 156/313 (49.84%), Query Frame = 0
Query: 1 MNVKTTFLHGDLDEEIYMQQPEGFAAPSKEHM----VCKLNKSLYGLKQAPRQWYKKFDS 60
+++ FL+GDLDEEIYM+ P G+AA + + VC L KS+YGLKQA RQW+ KF
Sbjct: 193 LDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSV 252
Query: 61 FMCKSGFQRSEKDQCCYLKKYTDSYVFLLLSAGTL------NLSQEQYIEKMLSKFKMNN 120
+ GF +S D +LK ++ +L+ + + + ++ ++ S FK+ +
Sbjct: 253 TLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRD 312
Query: 121 AKPRTTPLANHIKLSKGQSPKTVEEREHM--------------------ASVTYASAVGS 180
P L +++++ + + +R++ SVT+++ G
Sbjct: 313 LGPLKYFLG--LEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGG 372
Query: 181 LMYAMVCTTSAVGSLMYAMVCTRPDITHAVGVVSKYMANLGKEHWEAVKWLLRYLRGTSN 240
+G LMY + TR DI+ AV +S++ H +AV +L Y++GT
Sbjct: 373 DFVDAKAYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVG 432
Query: 241 TSLCYGND-KVVLQGFVDADLSGDVDSSNSTSGYIYNIDGTAVSWMSKLQKCIALSSTEA 283
L Y + ++ LQ F DA D+ ST+GY + + +SW SK Q+ ++ SS EA
Sbjct: 433 QGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEA 492
BLAST of CmaCh02G004940 vs. TAIR 10
Match:
ATMG00810.1 (DNA/RNA polymerases superfamily protein )
HSP 1 Score: 78.2 bits (191), Expect = 1.2e-14
Identity = 58/185 (31.35%), Postives = 96/185 (51.89%), Query Frame = 0
Query: 91 LNLSQEQYIEKMLSKFKMNNAKPRTTPLANHIKLSKGQSPKTVEEREHMASVTYASAVGS 150
L LSQ +Y E++L+ M + KP +TPL +KL+ +V ++ + S VG+
Sbjct: 54 LFLSQTKYAEQILNNAGMLDCKPMSTPLP--LKLN-----SSVSTAKYPDPSDFRSIVGA 113
Query: 151 LMYAMVCTTSAVGSLMYAMVCTRPDITHAVGVVSKYMANLGKEHWEAVKWLLRYLRGTSN 210
L Y + TRPDI++AV +V + M ++ +K +LRY++GT
Sbjct: 114 LQY---------------LTLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIF 173
Query: 211 TSL-CYGNDKVVLQGFVDADLSGDVDSSNSTSGYIYNIDGTAVSWMSKLQKCIALSSTEA 270
L + N K+ +Q F D+D +G + ST+G+ + +SW +K Q ++ SSTE
Sbjct: 174 HGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTET 216
Query: 271 EYVAI 275
EY A+
Sbjct: 234 EYRAL 216
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
P10978 | 7.9e-88 | 50.76 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... | [more] |
P04146 | 8.3e-37 | 31.94 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3 | [more] |
Q94HW2 | 3.1e-36 | 33.13 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... | [more] |
Q9ZT94 | 7.7e-35 | 31.29 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... | [more] |
P0CV72 | 5.6e-25 | 48.41 | Secreted RxLR effector protein 161 OS=Plasmopara viticola OX=143451 GN=RXLR161 P... | [more] |
Match Name | E-value | Identity | Description | |
AT4G23160.1 | 2.5e-28 | 29.39 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | [more] |
ATMG00810.1 | 1.2e-14 | 31.35 | DNA/RNA polymerases superfamily protein | [more] |