Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CGATTCTTGAGTCAAGTTTTGATCTAAAATTTTCATGGTATCAGAGCTTGGTTACACGAAAATTATTCAAGATCAAACTTACAAGGATCTAATCATGGCCAAGAATGAAAATAGCCTTACCATCCAAACTGTTCATCAATGTTCCAGCCTTATCTCCATCAAGTTGTCTTCTAACAATTATCTCCTGTTGAAATCTCAAATATTGTCGTTGATTCGAACTATGGGAGTGGAACATCACCTTTATGAAGATCAACCACTCGAAAAAGAAATCACTGACATCAACGACAAAAAGGTTTCCAACCCACAGCATAATGTTTGGAAACACAATGATGGATTGTTAACATCGTGGATGTTGGGAACTATTACTGAAGAGGTACTTAGCATGATTGAAAACTCTAGCACCTCAAATCAAGTTTGGAATTCCTTAGAAAAGCAACTACTCACAATGACAAAAGAGAATGAACTCCATCTCAATGAAGCTCTTGTCAGCCTAAGAAAGGGAAATCTAAGTCTGGGGGAGTTTCTAAAGAAGTTCAAGGCTCTTTGTGACAAGGTGGCAACAATGAAAAATCTATTGGAGATGAAACAACCAAAGTTCTTCACCTAG
mRNA sequence
CGATTCTTGAGTCAAGTTTTGATCTAAAATTTTCATGGTATCAGAGCTTGGTTACACGAAAATTATTCAAGATCAAACTTACAAGGATCTAATCATGGCCAAGAATGAAAATAGCCTTACCATCCAAACTGTTCATCAATGTTCCAGCCTTATCTCCATCAAGTTGTCTTCTAACAATTATCTCCTGTTGAAATCTCAAATATTGTCGTTGATTCGAACTATGGGAGTGGAACATCACCTTTATGAAGATCAACCACTCGAAAAAGAAATCACTGACATCAACGACAAAAAGGTTTCCAACCCACAGCATAATGTTTGGAAACACAATGATGGATTGTTAACATCGTGGATGTTGGGAACTATTACTGAAGAGGTACTTAGCATGATTGAAAACTCTAGCACCTCAAATCAAGTTTGGAATTCCTTAGAAAAGCAACTACTCACAATGACAAAAGAGAATGAACTCCATCTCAATGAAGCTCTTGTCAGCCTAAGAAAGGGAAATCTAAGTCTGGGGGAGTTTCTAAAGAAGTTCAAGGCTCTTTGTGACAAGGTGGCAACAATGAAAAATCTATTGGAGATGAAACAACCAAAGTTCTTCACCTAG
Coding sequence (CDS)
ATGGTATCAGAGCTTGGTTACACGAAAATTATTCAAGATCAAACTTACAAGGATCTAATCATGGCCAAGAATGAAAATAGCCTTACCATCCAAACTGTTCATCAATGTTCCAGCCTTATCTCCATCAAGTTGTCTTCTAACAATTATCTCCTGTTGAAATCTCAAATATTGTCGTTGATTCGAACTATGGGAGTGGAACATCACCTTTATGAAGATCAACCACTCGAAAAAGAAATCACTGACATCAACGACAAAAAGGTTTCCAACCCACAGCATAATGTTTGGAAACACAATGATGGATTGTTAACATCGTGGATGTTGGGAACTATTACTGAAGAGGTACTTAGCATGATTGAAAACTCTAGCACCTCAAATCAAGTTTGGAATTCCTTAGAAAAGCAACTACTCACAATGACAAAAGAGAATGAACTCCATCTCAATGAAGCTCTTGTCAGCCTAAGAAAGGGAAATCTAAGTCTGGGGGAGTTTCTAAAGAAGTTCAAGGCTCTTTGTGACAAGGTGGCAACAATGAAAAATCTATTGGAGATGAAACAACCAAAGTTCTTCACCTAG
Protein sequence
MVSELGYTKIIQDQTYKDLIMAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEITDINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTKENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMKNLLEMKQPKFFT
Homology
BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match:
A0A6J1DMG5 (uncharacterized protein LOC111021379 OS=Momordica charantia OX=3673 GN=LOC111021379 PE=4 SV=1)
HSP 1 Score: 184.1 bits (466), Expect = 5.4e-43
Identity = 91/162 (56.17%), Postives = 125/162 (77.16%), Query Frame = 0
Query: 21 MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
MA EN LT+Q+ HQCSSLIS+KL+S+NYLL KSQ+L LIRT+G+EHHL E+ P+ E
Sbjct: 1 MALPENLLTVQSFHQCSSLISLKLNSSNYLLWKSQVLPLIRTLGLEHHLXEEAPVVDECK 60
Query: 81 DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
+ Q W +NDGLLTSW+LG I E+VL+++E + T+ +VW+SLE+ LLTMTK
Sbjct: 61 GKEGESAXXTQVRTWINNDGLLTSWLLGIIAEDVLTLLEGTETAKEVWHSLEELLLTMTK 120
Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMKNLLE 183
ENE+HLNEAL++L+KG+LS+ E+++KFK LCD++ MK L+
Sbjct: 121 ENEIHLNEALLTLKKGSLSMDEYIRKFKNLCDRLXAMKKPLD 162
BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match:
A0A2Z6P7T0 (Reverse transcriptase Ty1/copia-type domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_243070 PE=4 SV=1)
HSP 1 Score: 155.2 bits (391), Expect = 2.7e-34
Identity = 82/154 (53.25%), Postives = 110/154 (71.43%), Query Frame = 0
Query: 25 ENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYED-QPLEKEITDIN 84
E LTIQ+ HQCSSLISIKLS++N+LL KSQIL LIR++G+EHH+ D + EITD +
Sbjct: 13 EPKLTIQSFHQCSSLISIKLSTSNFLLWKSQILPLIRSLGLEHHITADTSKPDDEITDSS 72
Query: 85 DKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTKENE 144
K+ NP W NDGLLTSW+LG + EE +SMI T++ +W+SL +QLL T++ E
Sbjct: 73 GTKIKNPDAVQWILNDGLLTSWLLGNMKEETVSMILGGDTAHYIWSSLHEQLLPNTEDGE 132
Query: 145 LHLNEALVSLRKGNLSLGEFLKKFKALCDKVATM 178
L +L +L KGNLSL E+++KFK LCDK+ +
Sbjct: 133 AQLKNSLYALSKGNLSLDEYIRKFKELCDKLTAI 166
BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match:
A0A438E6Z5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1575 PE=4 SV=1)
HSP 1 Score: 151.4 bits (381), Expect = 3.9e-33
Identity = 75/158 (47.47%), Postives = 109/158 (68.99%), Query Frame = 0
Query: 21 MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
MA EN L+IQ HQCSSL+SIKL+ +N LL +SQ+L L+R++G+ HHL E++ KE
Sbjct: 46 MANPENVLSIQAFHQCSSLVSIKLNMSNLLLWRSQVLPLVRSLGLIHHLSENRHASKETM 105
Query: 81 DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
K+ + W HNDGLLTSW+LG +TEEV+ +++ + T+ VWNSL ++LL MTK
Sbjct: 106 GTETKETHDQSIETWSHNDGLLTSWLLGLMTEEVMLLLDGTETAYDVWNSLGEKLLPMTK 165
Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMK 179
E E+ L L ++KG SL E+L++FK +CD +A ++
Sbjct: 166 EKEVQLTNRLRGVKKGTRSLDEYLREFKGICDALAAVR 203
BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match:
A0A2K3PNP5 (Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g013628 PE=4 SV=1)
HSP 1 Score: 150.2 bits (378), Expect = 8.6e-33
Identity = 81/165 (49.09%), Postives = 115/165 (69.70%), Query Frame = 0
Query: 17 KDLI---MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYED- 76
KD++ A E LTIQ+ HQCSSL+S+KLS++N+LL KSQ+L LIR++G+EHH+ +
Sbjct: 811 KDIVTSPSAVEEPKLTIQSFHQCSSLVSLKLSTSNFLLWKSQMLPLIRSLGLEHHITTNT 870
Query: 77 QPLEKEITDINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLE 136
+ EITD + K +NP W NDGLLTSW+LG + EE LSMI T+ +W+SL
Sbjct: 871 SKPDDEITDSSGTKTNNPNAVQWGLNDGLLTSWLLGNMKEETLSMILGGDTAYYIWSSLH 930
Query: 137 KQLLTMTKENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATM 178
+QLL T++ E L +L +L KGNLSL E+++KFK LC+K++ +
Sbjct: 931 EQLLPNTEDGEAQLKNSLYALSKGNLSLDEYIRKFKELCNKLSAI 975
BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match:
A0A438JZ09 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_2967 PE=4 SV=1)
HSP 1 Score: 150.2 bits (378), Expect = 8.6e-33
Identity = 75/158 (47.47%), Postives = 109/158 (68.99%), Query Frame = 0
Query: 21 MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
MA EN L+IQ HQCSSL+SIKL+ +N LL +SQ+L L+R++G+ HHL E++ +E
Sbjct: 1 MANPENVLSIQAFHQCSSLLSIKLNMSNLLLWRSQVLPLVRSLGLIHHLSENRHASEETM 60
Query: 81 DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
K+ + W HNDGLLTSW+LG +TEEV+ +++ T+ VWNSL ++LL MTK
Sbjct: 61 GTETKETHDQSIETWSHNDGLLTSWLLGLMTEEVMLLLDGIETAYDVWNSLGEKLLPMTK 120
Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMK 179
E E+ L L ++KG SL E+L++FK +CD +AT++
Sbjct: 121 EKEVRLTNRLRGVKKGTRSLDEYLREFKGICDALATVR 158
BLAST of Lcy04g001810 vs. NCBI nr
Match:
XP_022154021.1 (uncharacterized protein LOC111021379 [Momordica charantia] >XP_022154022.1 uncharacterized protein LOC111021379 [Momordica charantia])
HSP 1 Score: 184.1 bits (466), Expect = 1.1e-42
Identity = 91/162 (56.17%), Postives = 125/162 (77.16%), Query Frame = 0
Query: 21 MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
MA EN LT+Q+ HQCSSLIS+KL+S+NYLL KSQ+L LIRT+G+EHHL E+ P+ E
Sbjct: 1 MALPENLLTVQSFHQCSSLISLKLNSSNYLLWKSQVLPLIRTLGLEHHLXEEAPVVDECK 60
Query: 81 DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
+ Q W +NDGLLTSW+LG I E+VL+++E + T+ +VW+SLE+ LLTMTK
Sbjct: 61 GKEGESAXXTQVRTWINNDGLLTSWLLGIIAEDVLTLLEGTETAKEVWHSLEELLLTMTK 120
Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMKNLLE 183
ENE+HLNEAL++L+KG+LS+ E+++KFK LCD++ MK L+
Sbjct: 121 ENEIHLNEALLTLKKGSLSMDEYIRKFKNLCDRLXAMKKPLD 162
BLAST of Lcy04g001810 vs. NCBI nr
Match:
GAU44375.1 (hypothetical protein TSUD_243070 [Trifolium subterraneum])
HSP 1 Score: 155.2 bits (391), Expect = 5.5e-34
Identity = 82/154 (53.25%), Postives = 110/154 (71.43%), Query Frame = 0
Query: 25 ENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYED-QPLEKEITDIN 84
E LTIQ+ HQCSSLISIKLS++N+LL KSQIL LIR++G+EHH+ D + EITD +
Sbjct: 13 EPKLTIQSFHQCSSLISIKLSTSNFLLWKSQILPLIRSLGLEHHITADTSKPDDEITDSS 72
Query: 85 DKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTKENE 144
K+ NP W NDGLLTSW+LG + EE +SMI T++ +W+SL +QLL T++ E
Sbjct: 73 GTKIKNPDAVQWILNDGLLTSWLLGNMKEETVSMILGGDTAHYIWSSLHEQLLPNTEDGE 132
Query: 145 LHLNEALVSLRKGNLSLGEFLKKFKALCDKVATM 178
L +L +L KGNLSL E+++KFK LCDK+ +
Sbjct: 133 AQLKNSLYALSKGNLSLDEYIRKFKELCDKLTAI 166
BLAST of Lcy04g001810 vs. NCBI nr
Match:
RVW43526.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])
HSP 1 Score: 151.4 bits (381), Expect = 8.0e-33
Identity = 75/158 (47.47%), Postives = 109/158 (68.99%), Query Frame = 0
Query: 21 MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
MA EN L+IQ HQCSSL+SIKL+ +N LL +SQ+L L+R++G+ HHL E++ KE
Sbjct: 46 MANPENVLSIQAFHQCSSLVSIKLNMSNLLLWRSQVLPLVRSLGLIHHLSENRHASKETM 105
Query: 81 DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
K+ + W HNDGLLTSW+LG +TEEV+ +++ + T+ VWNSL ++LL MTK
Sbjct: 106 GTETKETHDQSIETWSHNDGLLTSWLLGLMTEEVMLLLDGTETAYDVWNSLGEKLLPMTK 165
Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMK 179
E E+ L L ++KG SL E+L++FK +CD +A ++
Sbjct: 166 EKEVQLTNRLRGVKKGTRSLDEYLREFKGICDALAAVR 203
BLAST of Lcy04g001810 vs. NCBI nr
Match:
RVX14187.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])
HSP 1 Score: 150.2 bits (378), Expect = 1.8e-32
Identity = 75/158 (47.47%), Postives = 109/158 (68.99%), Query Frame = 0
Query: 21 MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
MA EN L+IQ HQCSSL+SIKL+ +N LL +SQ+L L+R++G+ HHL E++ +E
Sbjct: 1 MANPENVLSIQAFHQCSSLLSIKLNMSNLLLWRSQVLPLVRSLGLIHHLSENRHASEETM 60
Query: 81 DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
K+ + W HNDGLLTSW+LG +TEEV+ +++ T+ VWNSL ++LL MTK
Sbjct: 61 GTETKETHDQSIETWSHNDGLLTSWLLGLMTEEVMLLLDGIETAYDVWNSLGEKLLPMTK 120
Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMK 179
E E+ L L ++KG SL E+L++FK +CD +AT++
Sbjct: 121 EKEVRLTNRLRGVKKGTRSLDEYLREFKGICDALATVR 158
BLAST of Lcy04g001810 vs. NCBI nr
Match:
PNY16899.1 (copia-like polyprotein, partial [Trifolium pratense])
HSP 1 Score: 150.2 bits (378), Expect = 1.8e-32
Identity = 81/165 (49.09%), Postives = 115/165 (69.70%), Query Frame = 0
Query: 17 KDLI---MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYED- 76
KD++ A E LTIQ+ HQCSSL+S+KLS++N+LL KSQ+L LIR++G+EHH+ +
Sbjct: 811 KDIVTSPSAVEEPKLTIQSFHQCSSLVSLKLSTSNFLLWKSQMLPLIRSLGLEHHITTNT 870
Query: 77 QPLEKEITDINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLE 136
+ EITD + K +NP W NDGLLTSW+LG + EE LSMI T+ +W+SL
Sbjct: 871 SKPDDEITDSSGTKTNNPNAVQWGLNDGLLTSWLLGNMKEETLSMILGGDTAYYIWSSLH 930
Query: 137 KQLLTMTKENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATM 178
+QLL T++ E L +L +L KGNLSL E+++KFK LC+K++ +
Sbjct: 931 EQLLPNTEDGEAQLKNSLYALSKGNLSLDEYIRKFKELCNKLSAI 975
BLAST of Lcy04g001810 vs. TAIR 10
Match:
AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 42.0 bits (97), Expect = 6.4e-04
Identity = 39/156 (25.00%), Postives = 71/156 (45.51%), Query Frame = 0
Query: 18 DLIMAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEK 77
D ++ E S I + + +++ L+ NY + + +L + GV H+
Sbjct: 3 DTTLSSYEKSFGIMQI-RAYIFVTLDLNKLNYDVWRELFETLCLSFGVLGHI----DGSS 62
Query: 78 EITDINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVL-SMIENSSTSNQVWNSLEKQLL 137
T + +K+ WK DGL+ W+ GTIT+ +L ++I+ T+ +W SLE
Sbjct: 63 TPTPMTEKR--------WKERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFR 122
Query: 138 TMTKENELHLNEALVSLRKGNLSLGEFLKKFKALCD 173
+ L L + +LS+ E+ +K K+L D
Sbjct: 123 DNKEARALQFENELRTTTIDDLSVHEYCQKLKSLSD 145
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DMG5 | 5.4e-43 | 56.17 | uncharacterized protein LOC111021379 OS=Momordica charantia OX=3673 GN=LOC111021... | [more] |
A0A2Z6P7T0 | 2.7e-34 | 53.25 | Reverse transcriptase Ty1/copia-type domain-containing protein OS=Trifolium subt... | [more] |
A0A438E6Z5 | 3.9e-33 | 47.47 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... | [more] |
A0A2K3PNP5 | 8.6e-33 | 49.09 | Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g013628... | [more] |
A0A438JZ09 | 8.6e-33 | 47.47 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976... | [more] |
Match Name | E-value | Identity | Description | |
XP_022154021.1 | 1.1e-42 | 56.17 | uncharacterized protein LOC111021379 [Momordica charantia] >XP_022154022.1 uncha... | [more] |
GAU44375.1 | 5.5e-34 | 53.25 | hypothetical protein TSUD_243070 [Trifolium subterraneum] | [more] |
RVW43526.1 | 8.0e-33 | 47.47 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | [more] |
RVX14187.1 | 1.8e-32 | 47.47 | Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | [more] |
PNY16899.1 | 1.8e-32 | 49.09 | copia-like polyprotein, partial [Trifolium pratense] | [more] |
Match Name | E-value | Identity | Description | |
AT5G48050.1 | 6.4e-04 | 25.00 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... | [more] |