Lcy04g001810 (gene) Sponge gourd (P93075) v1

Overview
Name: Lcy04g001810
Type: gene
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Location: Chr04: 10566260 .. 10566866 (+)
RNA-Seq Expression: Lcy04g001810
Synteny: Lcy04g001810
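
The length of the feature follows directly from the Location coordinates. A minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this) and using a hypothetical helper name `span_length`:

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field above.
gene_length = span_length(10566260, 10566866)
print(gene_length)  # 607
```

The result can be compared with the length of the gene sequence listed under Sequences.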
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGATTCTTGAGTCAAGTTTTGATCTAAAATTTTCATGGTATCAGAGCTTGGTTACACGAAAATTATTCAAGATCAAACTTACAAGGATCTAATCATGGCCAAGAATGAAAATAGCCTTACCATCCAAACTGTTCATCAATGTTCCAGCCTTATCTCCATCAAGTTGTCTTCTAACAATTATCTCCTGTTGAAATCTCAAATATTGTCGTTGATTCGAACTATGGGAGTGGAACATCACCTTTATGAAGATCAACCACTCGAAAAAGAAATCACTGACATCAACGACAAAAAGGTTTCCAACCCACAGCATAATGTTTGGAAACACAATGATGGATTGTTAACATCGTGGATGTTGGGAACTATTACTGAAGAGGTACTTAGCATGATTGAAAACTCTAGCACCTCAAATCAAGTTTGGAATTCCTTAGAAAAGCAACTACTCACAATGACAAAAGAGAATGAACTCCATCTCAATGAAGCTCTTGTCAGCCTAAGAAAGGGAAATCTAAGTCTGGGGGAGTTTCTAAAGAAGTTCAAGGCTCTTTGTGACAAGGTGGCAACAATGAAAAATCTATTGGAGATGAAACAACCAAAGTTCTTCACCTAG

mRNA sequence

CGATTCTTGAGTCAAGTTTTGATCTAAAATTTTCATGGTATCAGAGCTTGGTTACACGAAAATTATTCAAGATCAAACTTACAAGGATCTAATCATGGCCAAGAATGAAAATAGCCTTACCATCCAAACTGTTCATCAATGTTCCAGCCTTATCTCCATCAAGTTGTCTTCTAACAATTATCTCCTGTTGAAATCTCAAATATTGTCGTTGATTCGAACTATGGGAGTGGAACATCACCTTTATGAAGATCAACCACTCGAAAAAGAAATCACTGACATCAACGACAAAAAGGTTTCCAACCCACAGCATAATGTTTGGAAACACAATGATGGATTGTTAACATCGTGGATGTTGGGAACTATTACTGAAGAGGTACTTAGCATGATTGAAAACTCTAGCACCTCAAATCAAGTTTGGAATTCCTTAGAAAAGCAACTACTCACAATGACAAAAGAGAATGAACTCCATCTCAATGAAGCTCTTGTCAGCCTAAGAAAGGGAAATCTAAGTCTGGGGGAGTTTCTAAAGAAGTTCAAGGCTCTTTGTGACAAGGTGGCAACAATGAAAAATCTATTGGAGATGAAACAACCAAAGTTCTTCACCTAG

Coding sequence (CDS)

ATGGTATCAGAGCTTGGTTACACGAAAATTATTCAAGATCAAACTTACAAGGATCTAATCATGGCCAAGAATGAAAATAGCCTTACCATCCAAACTGTTCATCAATGTTCCAGCCTTATCTCCATCAAGTTGTCTTCTAACAATTATCTCCTGTTGAAATCTCAAATATTGTCGTTGATTCGAACTATGGGAGTGGAACATCACCTTTATGAAGATCAACCACTCGAAAAAGAAATCACTGACATCAACGACAAAAAGGTTTCCAACCCACAGCATAATGTTTGGAAACACAATGATGGATTGTTAACATCGTGGATGTTGGGAACTATTACTGAAGAGGTACTTAGCATGATTGAAAACTCTAGCACCTCAAATCAAGTTTGGAATTCCTTAGAAAAGCAACTACTCACAATGACAAAAGAGAATGAACTCCATCTCAATGAAGCTCTTGTCAGCCTAAGAAAGGGAAATCTAAGTCTGGGGGAGTTTCTAAAGAAGTTCAAGGCTCTTTGTGACAAGGTGGCAACAATGAAAAATCTATTGGAGATGAAACAACCAAAGTTCTTCACCTAG

Protein sequence

MVSELGYTKIIQDQTYKDLIMAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEITDINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTKENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMKNLLEMKQPKFFT
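
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        if amino == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino)
    return "".join(protein)


# First and last codons of the CDS listed above.
print(translate("ATGGTATCAGAGCTTGGTTAC"))  # MVSELGY
print(translate("AAGTTCTTCACCTAG"))        # KFFT
```

Passing the full CDS string to `translate` should give a sequence that can be compared directly with the protein sequence shown above.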
Homology
BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match: A0A6J1DMG5 (uncharacterized protein LOC111021379 OS=Momordica charantia OX=3673 GN=LOC111021379 PE=4 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 5.4e-43
Identity = 91/162 (56.17%), Positives = 125/162 (77.16%), Query Frame = 0

Query: 21  MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
           MA  EN LT+Q+ HQCSSLIS+KL+S+NYLL KSQ+L LIRT+G+EHHL E+ P+  E  
Sbjct: 1   MALPENLLTVQSFHQCSSLISLKLNSSNYLLWKSQVLPLIRTLGLEHHLXEEAPVVDECK 60

Query: 81  DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
               +     Q   W +NDGLLTSW+LG I E+VL+++E + T+ +VW+SLE+ LLTMTK
Sbjct: 61  GKEGESAXXTQVRTWINNDGLLTSWLLGIIAEDVLTLLEGTETAKEVWHSLEELLLTMTK 120

Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMKNLLE 183
           ENE+HLNEAL++L+KG+LS+ E+++KFK LCD++  MK  L+
Sbjct: 121 ENEIHLNEALLTLKKGSLSMDEYIRKFKNLCDRLXAMKKPLD 162

BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match: A0A2Z6P7T0 (Reverse transcriptase Ty1/copia-type domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_243070 PE=4 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 2.7e-34
Identity = 82/154 (53.25%), Positives = 110/154 (71.43%), Query Frame = 0

Query: 25  ENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYED-QPLEKEITDIN 84
           E  LTIQ+ HQCSSLISIKLS++N+LL KSQIL LIR++G+EHH+  D    + EITD +
Sbjct: 13  EPKLTIQSFHQCSSLISIKLSTSNFLLWKSQILPLIRSLGLEHHITADTSKPDDEITDSS 72

Query: 85  DKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTKENE 144
             K+ NP    W  NDGLLTSW+LG + EE +SMI    T++ +W+SL +QLL  T++ E
Sbjct: 73  GTKIKNPDAVQWILNDGLLTSWLLGNMKEETVSMILGGDTAHYIWSSLHEQLLPNTEDGE 132

Query: 145 LHLNEALVSLRKGNLSLGEFLKKFKALCDKVATM 178
             L  +L +L KGNLSL E+++KFK LCDK+  +
Sbjct: 133 AQLKNSLYALSKGNLSLDEYIRKFKELCDKLTAI 166

BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match: A0A438E6Z5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1575 PE=4 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 3.9e-33
Identity = 75/158 (47.47%), Positives = 109/158 (68.99%), Query Frame = 0

Query: 21  MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
           MA  EN L+IQ  HQCSSL+SIKL+ +N LL +SQ+L L+R++G+ HHL E++   KE  
Sbjct: 46  MANPENVLSIQAFHQCSSLVSIKLNMSNLLLWRSQVLPLVRSLGLIHHLSENRHASKETM 105

Query: 81  DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
               K+  +     W HNDGLLTSW+LG +TEEV+ +++ + T+  VWNSL ++LL MTK
Sbjct: 106 GTETKETHDQSIETWSHNDGLLTSWLLGLMTEEVMLLLDGTETAYDVWNSLGEKLLPMTK 165

Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMK 179
           E E+ L   L  ++KG  SL E+L++FK +CD +A ++
Sbjct: 166 EKEVQLTNRLRGVKKGTRSLDEYLREFKGICDALAAVR 203

BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match: A0A2K3PNP5 (Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g013628 PE=4 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 8.6e-33
Identity = 81/165 (49.09%), Positives = 115/165 (69.70%), Query Frame = 0

Query: 17  KDLI---MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYED- 76
           KD++    A  E  LTIQ+ HQCSSL+S+KLS++N+LL KSQ+L LIR++G+EHH+  + 
Sbjct: 811 KDIVTSPSAVEEPKLTIQSFHQCSSLVSLKLSTSNFLLWKSQMLPLIRSLGLEHHITTNT 870

Query: 77  QPLEKEITDINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLE 136
              + EITD +  K +NP    W  NDGLLTSW+LG + EE LSMI    T+  +W+SL 
Sbjct: 871 SKPDDEITDSSGTKTNNPNAVQWGLNDGLLTSWLLGNMKEETLSMILGGDTAYYIWSSLH 930

Query: 137 KQLLTMTKENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATM 178
           +QLL  T++ E  L  +L +L KGNLSL E+++KFK LC+K++ +
Sbjct: 931 EQLLPNTEDGEAQLKNSLYALSKGNLSLDEYIRKFKELCNKLSAI 975

BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match: A0A438JZ09 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_2967 PE=4 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 8.6e-33
Identity = 75/158 (47.47%), Positives = 109/158 (68.99%), Query Frame = 0

Query: 21  MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
           MA  EN L+IQ  HQCSSL+SIKL+ +N LL +SQ+L L+R++G+ HHL E++   +E  
Sbjct: 1   MANPENVLSIQAFHQCSSLLSIKLNMSNLLLWRSQVLPLVRSLGLIHHLSENRHASEETM 60

Query: 81  DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
               K+  +     W HNDGLLTSW+LG +TEEV+ +++   T+  VWNSL ++LL MTK
Sbjct: 61  GTETKETHDQSIETWSHNDGLLTSWLLGLMTEEVMLLLDGIETAYDVWNSLGEKLLPMTK 120

Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMK 179
           E E+ L   L  ++KG  SL E+L++FK +CD +AT++
Sbjct: 121 EKEVRLTNRLRGVKKGTRSLDEYLREFKGICDALATVR 158

BLAST of Lcy04g001810 vs. NCBI nr
Match: XP_022154021.1 (uncharacterized protein LOC111021379 [Momordica charantia] >XP_022154022.1 uncharacterized protein LOC111021379 [Momordica charantia])

HSP 1 Score: 184.1 bits (466), Expect = 1.1e-42
Identity = 91/162 (56.17%), Positives = 125/162 (77.16%), Query Frame = 0

Query: 21  MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
           MA  EN LT+Q+ HQCSSLIS+KL+S+NYLL KSQ+L LIRT+G+EHHL E+ P+  E  
Sbjct: 1   MALPENLLTVQSFHQCSSLISLKLNSSNYLLWKSQVLPLIRTLGLEHHLXEEAPVVDECK 60

Query: 81  DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
               +     Q   W +NDGLLTSW+LG I E+VL+++E + T+ +VW+SLE+ LLTMTK
Sbjct: 61  GKEGESAXXTQVRTWINNDGLLTSWLLGIIAEDVLTLLEGTETAKEVWHSLEELLLTMTK 120

Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMKNLLE 183
           ENE+HLNEAL++L+KG+LS+ E+++KFK LCD++  MK  L+
Sbjct: 121 ENEIHLNEALLTLKKGSLSMDEYIRKFKNLCDRLXAMKKPLD 162

BLAST of Lcy04g001810 vs. NCBI nr
Match: GAU44375.1 (hypothetical protein TSUD_243070 [Trifolium subterraneum])

HSP 1 Score: 155.2 bits (391), Expect = 5.5e-34
Identity = 82/154 (53.25%), Positives = 110/154 (71.43%), Query Frame = 0

Query: 25  ENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYED-QPLEKEITDIN 84
           E  LTIQ+ HQCSSLISIKLS++N+LL KSQIL LIR++G+EHH+  D    + EITD +
Sbjct: 13  EPKLTIQSFHQCSSLISIKLSTSNFLLWKSQILPLIRSLGLEHHITADTSKPDDEITDSS 72

Query: 85  DKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTKENE 144
             K+ NP    W  NDGLLTSW+LG + EE +SMI    T++ +W+SL +QLL  T++ E
Sbjct: 73  GTKIKNPDAVQWILNDGLLTSWLLGNMKEETVSMILGGDTAHYIWSSLHEQLLPNTEDGE 132

Query: 145 LHLNEALVSLRKGNLSLGEFLKKFKALCDKVATM 178
             L  +L +L KGNLSL E+++KFK LCDK+  +
Sbjct: 133 AQLKNSLYALSKGNLSLDEYIRKFKELCDKLTAI 166

BLAST of Lcy04g001810 vs. NCBI nr
Match: RVW43526.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 151.4 bits (381), Expect = 8.0e-33
Identity = 75/158 (47.47%), Positives = 109/158 (68.99%), Query Frame = 0

Query: 21  MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
           MA  EN L+IQ  HQCSSL+SIKL+ +N LL +SQ+L L+R++G+ HHL E++   KE  
Sbjct: 46  MANPENVLSIQAFHQCSSLVSIKLNMSNLLLWRSQVLPLVRSLGLIHHLSENRHASKETM 105

Query: 81  DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
               K+  +     W HNDGLLTSW+LG +TEEV+ +++ + T+  VWNSL ++LL MTK
Sbjct: 106 GTETKETHDQSIETWSHNDGLLTSWLLGLMTEEVMLLLDGTETAYDVWNSLGEKLLPMTK 165

Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMK 179
           E E+ L   L  ++KG  SL E+L++FK +CD +A ++
Sbjct: 166 EKEVQLTNRLRGVKKGTRSLDEYLREFKGICDALAAVR 203

BLAST of Lcy04g001810 vs. NCBI nr
Match: RVX14187.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 150.2 bits (378), Expect = 1.8e-32
Identity = 75/158 (47.47%), Positives = 109/158 (68.99%), Query Frame = 0

Query: 21  MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEKEIT 80
           MA  EN L+IQ  HQCSSL+SIKL+ +N LL +SQ+L L+R++G+ HHL E++   +E  
Sbjct: 1   MANPENVLSIQAFHQCSSLLSIKLNMSNLLLWRSQVLPLVRSLGLIHHLSENRHASEETM 60

Query: 81  DINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTK 140
               K+  +     W HNDGLLTSW+LG +TEEV+ +++   T+  VWNSL ++LL MTK
Sbjct: 61  GTETKETHDQSIETWSHNDGLLTSWLLGLMTEEVMLLLDGIETAYDVWNSLGEKLLPMTK 120

Query: 141 ENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATMK 179
           E E+ L   L  ++KG  SL E+L++FK +CD +AT++
Sbjct: 121 EKEVRLTNRLRGVKKGTRSLDEYLREFKGICDALATVR 158

BLAST of Lcy04g001810 vs. NCBI nr
Match: PNY16899.1 (copia-like polyprotein, partial [Trifolium pratense])

HSP 1 Score: 150.2 bits (378), Expect = 1.8e-32
Identity = 81/165 (49.09%), Positives = 115/165 (69.70%), Query Frame = 0

Query: 17  KDLI---MAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYED- 76
           KD++    A  E  LTIQ+ HQCSSL+S+KLS++N+LL KSQ+L LIR++G+EHH+  + 
Sbjct: 811 KDIVTSPSAVEEPKLTIQSFHQCSSLVSLKLSTSNFLLWKSQMLPLIRSLGLEHHITTNT 870

Query: 77  QPLEKEITDINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVLSMIENSSTSNQVWNSLE 136
              + EITD +  K +NP    W  NDGLLTSW+LG + EE LSMI    T+  +W+SL 
Sbjct: 871 SKPDDEITDSSGTKTNNPNAVQWGLNDGLLTSWLLGNMKEETLSMILGGDTAYYIWSSLH 930

Query: 137 KQLLTMTKENELHLNEALVSLRKGNLSLGEFLKKFKALCDKVATM 178
           +QLL  T++ E  L  +L +L KGNLSL E+++KFK LC+K++ +
Sbjct: 931 EQLLPNTEDGEAQLKNSLYALSKGNLSLDEYIRKFKELCNKLSAI 975

BLAST of Lcy04g001810 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 42.0 bits (97), Expect = 6.4e-04
Identity = 39/156 (25.00%), Positives = 71/156 (45.51%), Query Frame = 0

Query: 18  DLIMAKNENSLTIQTVHQCSSLISIKLSSNNYLLLKSQILSLIRTMGVEHHLYEDQPLEK 77
           D  ++  E S  I  + +    +++ L+  NY + +    +L  + GV  H+        
Sbjct: 3   DTTLSSYEKSFGIMQI-RAYIFVTLDLNKLNYDVWRELFETLCLSFGVLGHI----DGSS 62

Query: 78  EITDINDKKVSNPQHNVWKHNDGLLTSWMLGTITEEVL-SMIENSSTSNQVWNSLEKQLL 137
             T + +K+        WK  DGL+  W+ GTIT+ +L ++I+   T+  +W SLE    
Sbjct: 63  TPTPMTEKR--------WKERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFR 122

Query: 138 TMTKENELHLNEALVSLRKGNLSLGEFLKKFKALCD 173
              +   L     L +    +LS+ E+ +K K+L D
Sbjct: 123 DNKEARALQFENELRTTTIDDLSVHEYCQKLKSLSD 145

The following BLAST results are available for this feature:

BLAST of Lcy04g001810 vs. ExPASy TrEMBL
Match Name      E-value  Identity  Description
A0A6J1DMG5      5.4e-43  56.17     uncharacterized protein LOC111021379 OS=Momordica charantia OX=3673 GN=LOC111021...
A0A2Z6P7T0      2.7e-34  53.25     Reverse transcriptase Ty1/copia-type domain-containing protein OS=Trifolium subt...
A0A438E6Z5      3.9e-33  47.47     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
A0A2K3PNP5      8.6e-33  49.09     Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g013628...
A0A438JZ09      8.6e-33  47.47     Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976...

BLAST of Lcy04g001810 vs. NCBI nr
Match Name      E-value  Identity  Description
XP_022154021.1  1.1e-42  56.17     uncharacterized protein LOC111021379 [Momordica charantia] >XP_022154022.1 uncha...
GAU44375.1      5.5e-34  53.25     hypothetical protein TSUD_243070 [Trifolium subterraneum]
RVW43526.1      8.0e-33  47.47     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
RVX14187.1      1.8e-32  47.47     Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]
PNY16899.1      1.8e-32  49.09     copia-like polyprotein, partial [Trifolium pratense]

BLAST of Lcy04g001810 vs. TAIR 10
Match Name      E-value  Identity  Description
AT5G48050.1     6.4e-04  25.00     CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...

InterPro
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (P93075) v1
Date Performed: 2021-12-06
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PFAM     PF14223        Retrotran_gag_2      coord: 95..179; e-value: 3.5E-8; score: 33.3
None      No IPR available  PANTHER  PTHR47481:SF2  SUBFAMILY NOT NAMED  coord: 83..179
None      No IPR available  PANTHER  PTHR47481      FAMILY NOT NAMED     coord: 83..179
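
The alignment coordinates can be used to pull the matched region out of the protein sequence. A minimal sketch for the PFAM hit PF14223 (coord: 95..179), assuming the coordinates are 1-based and inclusive, which the page does not state explicitly; `domain_segment` is a hypothetical helper.

```python
# Protein sequence of Lcy04g001810 as listed under "Protein sequence".
PROTEIN = (
    "MVSELGYTKIIQDQTYKDLIMAKNENSLTIQTVHQCSSLISIKLSSNNYL"
    "LLKSQILSLIRTMGVEHHLYEDQPLEKEITDINDKKVSNPQHNVWKHNDG"
    "LLTSWMLGTITEEVLSMIENSSTSNQVWNSLEKQLLTMTKENELHLNEAL"
    "VSLRKGNLSLGEFLKKFKALCDKVATMKNLLEMKQPKFFT"
)


def domain_segment(protein: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]


segment = domain_segment(PROTEIN, 95, 179)
print(len(segment))  # 85
print(segment)
```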

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lcy04g001810.1  Lcy04g001810.1  mRNA