Lag0029089 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0029089
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Reverse transcriptase domain-containing protein
Location: chr8: 35212969 .. 35213496 (-)
RNA-Seq Expression: Lag0029089
Synteny: Lag0029089
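
The Location field can be converted into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly); `parse_location` is a hypothetical helper written for this example, not part of any database API.

```python
import re

# Location string copied from the Overview. The pattern below is an assumption
# about this page's "chrom: start .. end (strand)" layout.
location = "chr8: 35212969 .. 35213496 (-)"

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its four parts."""
    match = re.fullmatch(
        r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip()
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location(location)
span = end - start + 1   # assumes 1-based, inclusive coordinates
codons = span // 3       # number of complete codons, stop codon included
print(chrom, strand, span, codons)
```

Under that assumption the feature spans 528 bp on the minus strand of chr8, which is 176 complete codons, consistent with a 175-residue protein followed by a stop codon.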
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTGAGCCAGCGAAAAGGGCGGAGGATCAACATCCGGATTCTGATCTCCTCAAATTTTTTGTCGATGCAGCCTGGTTATCCATCCCTCCAGAAACAGGAATTGGAGTCATCTGCATGAACCACCATGGTAATGTAGTTGGCGCTAGTGCTTCCTTCATAGAAATGGATTTTGAAGCCCCCCTTGCTGAGTTGAAGGCCATAAAAGAAGGGATGAAGCTTGCTTTAAATTTTGACTGCCCTAAAATCGAAATTTATTCTGACTGTCTACAAGCAATTAAATTCGTGTCAAAAGCCACAGAGCCTTGGTGTAATGTGGAGGTGCAAGTGGAGGATATTTGGAATTTGTCTAATCTGTTTGCAGAATGTAGTTTTAAGTATATCCCAAGAGAGCTGAATTCCCTAGCTGATAGCTTAGCTAAGAAGGCTAAACGATCGGGTATTAATGATGTATGGGTTGGTTCAATCCCAGAGTGGATTGTTTCGTTGGTCGAGTATGACCGTTCTATCTCTGCCCACGTGGCGTAA

mRNA sequence

ATGTTTGAGCCAGCGAAAAGGGCGGAGGATCAACATCCGGATTCTGATCTCCTCAAATTTTTTGTCGATGCAGCCTGGTTATCCATCCCTCCAGAAACAGGAATTGGAGTCATCTGCATGAACCACCATGGTAATGTAGTTGGCGCTAGTGCTTCCTTCATAGAAATGGATTTTGAAGCCCCCCTTGCTGAGTTGAAGGCCATAAAAGAAGGGATGAAGCTTGCTTTAAATTTTGACTGCCCTAAAATCGAAATTTATTCTGACTGTCTACAAGCAATTAAATTCGTGTCAAAAGCCACAGAGCCTTGGTGTAATGTGGAGGTGCAAGTGGAGGATATTTGGAATTTGTCTAATCTGTTTGCAGAATGTAGTTTTAAGTATATCCCAAGAGAGCTGAATTCCCTAGCTGATAGCTTAGCTAAGAAGGCTAAACGATCGGGTATTAATGATGTATGGGTTGGTTCAATCCCAGAGTGGATTGTTTCGTTGGTCGAGTATGACCGTTCTATCTCTGCCCACGTGGCGTAA

Coding sequence (CDS)

ATGTTTGAGCCAGCGAAAAGGGCGGAGGATCAACATCCGGATTCTGATCTCCTCAAATTTTTTGTCGATGCAGCCTGGTTATCCATCCCTCCAGAAACAGGAATTGGAGTCATCTGCATGAACCACCATGGTAATGTAGTTGGCGCTAGTGCTTCCTTCATAGAAATGGATTTTGAAGCCCCCCTTGCTGAGTTGAAGGCCATAAAAGAAGGGATGAAGCTTGCTTTAAATTTTGACTGCCCTAAAATCGAAATTTATTCTGACTGTCTACAAGCAATTAAATTCGTGTCAAAAGCCACAGAGCCTTGGTGTAATGTGGAGGTGCAAGTGGAGGATATTTGGAATTTGTCTAATCTGTTTGCAGAATGTAGTTTTAAGTATATCCCAAGAGAGCTGAATTCCCTAGCTGATAGCTTAGCTAAGAAGGCTAAACGATCGGGTATTAATGATGTATGGGTTGGTTCAATCCCAGAGTGGATTGTTTCGTTGGTCGAGTATGACCGTTCTATCTCTGCCCACGTGGCGTAA

Protein sequence

MFEPAKRAEDQHPDSDLLKFFVDAAWLSIPPETGIGVICMNHHGNVVGASASFIEMDFEAPLAELKAIKEGMKLALNFDCPKIEIYSDCLQAIKFVSKATEPWCNVEVQVEDIWNLSNLFAECSFKYIPRELNSLADSLAKKAKRSGINDVWVGSIPEWIVSLVEYDRSISAHVA
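
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 36 and last 15 nucleotides of the CDS are used as short inputs to keep the example compact.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Using this table is an assumption; the page does not name a translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3   # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 and last 15 nucleotides of the CDS listed above (both in frame).
cds_start = "ATGTTTGAGCCAGCGAAAAGGGCGGAGGATCAACAT"
cds_end = "GCCCACGTGGCGTAA"
print(translate(cds_start))  # MFEPAKRAEDQH
print(translate(cds_end))    # AHVA
```

Passing the full 528-nucleotide CDS to `translate` in the same way should reproduce the protein sequence shown above.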
Homology
BLAST of Lag0029089 vs. NCBI nr
Match: XP_038902513.1 (uncharacterized protein LOC120089172 [Benincasa hispida])

HSP 1 Score: 101.3 bits (251), Expect = 8.8e-18
Identity = 52/143 (36.36%), Positives = 78/143 (54.55%), Query Frame = 0

Query: 13  PDSDLLKFFVDAAWLSIPPETGIGVICMNHHGNVVGASASFIEMDFEAPLAELKAIKEGM 72
           P +  +K  VDAAW   P  +G   I  ++ G++     S I+  +  PLAE   + +G+
Sbjct: 59  PPTSFIKLNVDAAWKFSPYSSGFSAIIRDNQGSLKVVQISSIDAVYPPPLAEAFVVLQGL 118

Query: 73  KLALNFDCPKIEIYSDCLQAIKFVSKATEPWCNVEVQVEDIWNLSNLFAECSFKYIPREL 132
           +L    +  KI + SDC  AI    K      +V V +E+IW +S +F   +F YIPR +
Sbjct: 119 RLTSKMNFKKIIVKSDCSGAIDLFLKILISDSSVRVWLEEIWEISIVFYPINFAYIPRNI 178

Query: 133 NSLADSLAKKAKRSGINDVWVGS 156
           N LAD +AK+ +  GINDVW+ S
Sbjct: 179 NKLADIVAKRTRILGINDVWMDS 201
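
The Identity figure reported for this HSP can be recomputed directly from the aligned rows. This is a minimal sketch, assuming identity is the number of matching columns divided by the alignment length; `percent_identity` is a hypothetical helper. The Positives count is not reproduced here because it depends on the substitution matrix used for the search, which this page does not state.

```python
# Query and subject rows of HSP 1 (XP_038902513.1), concatenated across the
# three alignment blocks shown above. This HSP contains no gap characters.
query = (
    "PDSDLLKFFVDAAWLSIPPETGIGVICMNHHGNVVGASASFIEMDFEAPLAELKAIKEGM"
    "KLALNFDCPKIEIYSDCLQAIKFVSKATEPWCNVEVQVEDIWNLSNLFAECSFKYIPREL"
    "NSLADSLAKKAKRSGINDVWVGS"
)
sbjct = (
    "PPTSFIKLNVDAAWKFSPYSSGFSAIIRDNQGSLKVVQISSIDAVYPPPLAEAFVVLQGL"
    "RLTSKMNFKKIIVKSDCSGAIDLFLKILISDSSVRVWLEEIWEISIVFYPINFAYIPRNI"
    "NKLADIVAKRTRILGINDVWMDS"
)

def percent_identity(a, b):
    """Return (identical columns, alignment length, percent identity)."""
    if len(a) != len(b):
        raise ValueError("aligned rows must have equal length")
    # A column is identical only if both residues match and neither is a gap.
    identical = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return identical, len(a), 100.0 * identical / len(a)

identical, length, pct = percent_identity(query, sbjct)
print(f"Identity = {identical}/{length} ({pct:.2f}%)")
```

For this HSP the script prints `Identity = 52/143 (36.36%)`, matching the reported value.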

BLAST of Lag0029089 vs. NCBI nr
Match: XP_027093792.1 (uncharacterized protein LOC113714198 [Coffea arabica])

HSP 1 Score: 91.3 bits (225), Expect = 9.1e-15
Identity = 49/155 (31.61%), Positives = 74/155 (47.74%), Query Frame = 0

Query: 13  PDSDLLKFFVDAAWLSIPPETGIGVICMNHHGNVVGASASFIEMDFEAPLAELKAIKEGM 72
           P   +++   DAA  +I   TG+G+I  N HG +V A         EA   E  AI+  +
Sbjct: 88  PKEGIMRINTDAAISAIMVRTGLGIIARNWHGQIVKAQGIIGRRRSEAATEESLAIRSAL 147

Query: 73  KLALNFDCPKIEIYSDCLQAIKFVSKATEPWCNVEVQVEDIWNLSNLFAECSFKYIPREL 132
           ++       KIE+ SDC   +  ++      C ++  +EDI  L N F  C F ++PR +
Sbjct: 148 EMTQLAGWTKIEVQSDCKNIVSSINPDNVQDCKIQTILEDIEALKNSFDSCLFSFVPRTV 207

Query: 133 NSLADSLAKKAKRSGINDVWVGSIPEWIVSLVEYD 168
           N  + +LA+ A RS  N  W  S P W+  L   D
Sbjct: 208 NICSHALAQFAVRSVQNFEWKDSFPIWLSQLASKD 242

BLAST of Lag0029089 vs. NCBI nr
Match: XP_027119939.1 (uncharacterized protein LOC113736922 [Coffea arabica])

HSP 1 Score: 87.4 bits (215), Expect = 1.3e-13
Identity = 50/161 (31.06%), Positives = 81/161 (50.31%), Query Frame = 0

Query: 13  PDSDLLKFFVDAAWLSIPPETGIGVICMNHHGNVVGASASFIEMDFEAPLAELKAIKEGM 72
           PD+ ++K   DAA  +     G+G+I  + HGN+V A  +       A +    AI++G+
Sbjct: 89  PDAGVVKINTDAAIPTKLAGAGLGMIAKDDHGNLVEARGTRKYSRGGAEMEMADAIRQGL 148

Query: 73  KLALNFDCPKIEIYSDCLQAIKFVSKATEPWCNVEVQVEDIWNLSNLFAECSFKYIPREL 132
            +A      +IE+  DC  AI+ + K  E    ++  +EDI  LS +F  CSF ++ R+ 
Sbjct: 149 LMAKEVGWQRIEMQFDCKAAIEQIHKKGEEETPIDTIMEDIKQLSGMFQYCSFSFVYRDG 208

Query: 133 NSLADSLAKKAKRSGINDVWVGSIPEWIVSLVEYDRSISAH 174
           N  A  LA+ A +   N VW  S P W+   ++ D   + H
Sbjct: 209 NRCAYQLAQFATKLVSNIVWKQSFPLWLKESIQEDNRTNVH 249

BLAST of Lag0029089 vs. NCBI nr
Match: XP_027061983.1 (uncharacterized protein LOC113688376 [Coffea arabica])

HSP 1 Score: 85.5 bits (210), Expect = 5.0e-13
Identity = 46/155 (29.68%), Positives = 73/155 (47.10%), Query Frame = 0

Query: 13  PDSDLLKFFVDAAWLSIPPETGIGVICMNHHGNVVGASASFIEMDFEAPLAELKAIKEGM 72
           P   ++K   DAA  +    TG+G+I  N HG +V A         EA   E  AI+  +
Sbjct: 88  PKEGVMKINTDAAISATMVRTGLGIIARNWHGVIVRAKGITERKRGEAATEETLAIRGEL 147

Query: 73  KLALNFDCPKIEIYSDCLQAIKFVSKATEPWCNVEVQVEDIWNLSNLFAECSFKYIPREL 132
           ++A       IE+ SDC   +  ++      C ++  +EDI  L   F  C+F ++PR  
Sbjct: 148 EMAQIAGWTNIEVQSDCKNVVSLINTGNVQDCKLQTILEDIEILKMRFDRCAFSFVPRTA 207

Query: 133 NSLADSLAKKAKRSGINDVWVGSIPEWIVSLVEYD 168
           N  + ++A+ A +S  N  W  S P W+ +L   D
Sbjct: 208 NGCSHAMAQFAVKSVRNIEWESSFPNWLSALARKD 242

BLAST of Lag0029089 vs. NCBI nr
Match: XP_027088768.1 (uncharacterized protein LOC113710130 [Coffea arabica])

HSP 1 Score: 84.3 bits (207), Expect = 1.1e-12
Identity = 47/155 (30.32%), Positives = 76/155 (49.03%), Query Frame = 0

Query: 13  PDSDLLKFFVDAAWLSIPPETGIGVICMNHHGNVVGASASFIEMDFEAPLAELKAIKEGM 72
           P    +K   DAA+      TGIGV+  N  G ++   A       E  + E  AI+ GM
Sbjct: 38  PSRGTIKLNTDAAFSQNLERTGIGVVARNAEGELMKVWARAELKRSEPQVEEAAAIRMGM 97

Query: 73  KLALNFDCPKIEIYSDCLQAIKFVSKATEPWCNVEVQVEDIWNLSNLFAECSFKYIPREL 132
           ++A   +   +E+ SDC + +  ++K  +   N+ V +EDI N+  LF +C+F ++ R+ 
Sbjct: 98  QMAWKANWRAVELQSDCKEVVDMINKKQKQQNNIVVILEDIANMRCLFEQCTFSFVHRDG 157

Query: 133 NSLADSLAKKAKRSGINDVWVGSIPEWIVSLVEYD 168
           N  A S+AK A +   N  W    P W+    + D
Sbjct: 158 NRCAHSVAKFAVKLTTNVEWDECFPMWLQEEAQND 192

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038902513.1 | 8.8e-18 | 36.36 | uncharacterized protein LOC120089172 [Benincasa hispida]
XP_027093792.1 | 9.1e-15 | 31.61 | uncharacterized protein LOC113714198 [Coffea arabica]
XP_027119939.1 | 1.3e-13 | 31.06 | uncharacterized protein LOC113736922 [Coffea arabica]
XP_027061983.1 | 5.0e-13 | 29.68 | uncharacterized protein LOC113688376 [Coffea arabica]
XP_027088768.1 | 1.1e-12 | 30.32 | uncharacterized protein LOC113710130 [Coffea arabica]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002156 | Ribonuclease H domain | PFAM | PF13456 | RVT_3 | coord: 22..143; e-value: 8.7E-23; score: 80.5
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 17..144; e-value: 1.5E-17; score: 65.9
None | No IPR available | PANTHER | PTHR33033 | POLYNUCLEOTIDYL TRANSFERASE, RIBONUCLEASE H-LIKE SUPERFAMILY PROTEIN-RELATED | coord: 13..164
None | No IPR available | PANTHER | PTHR33033:SF67 | SUBFAMILY NOT NAMED | coord: 13..164
IPR044730 | Ribonuclease H-like domain, plant type | CDD | cd06222 | RNase_H_like | coord: 21..141; e-value: 7.27487E-27; score: 96.2292
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 17..145
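
The alignment coordinates listed above (for example `coord: 22..143` for PF13456) can be used to cut the matching region out of the protein sequence. This is a minimal sketch, assuming the coordinates are 1-based, inclusive positions on the protein; `extract_region` is a hypothetical helper written for this example.

```python
import re

# Protein sequence copied from the "Protein sequence" section of this record.
protein = (
    "MFEPAKRAEDQHPDSDLLKFFVDAAWLSIPPETGIGVICMNHHGNVVGASASFIEMDFEAPLAELKAIKEGM"
    "KLALNFDCPKIEIYSDCLQAIKFVSKATEPWCNVEVQVEDIWNLSNLFAECSFKYIPREL"
    "NSLADSLAKKAKRSGINDVWVGSIPEWIVSLVEYDRSISAHVA"
)

def extract_region(sequence, coord):
    """Slice a 'start..end' region, treating both ends as 1-based and inclusive."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*", coord)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {coord!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {coord} lies outside the sequence")
    return sequence[start - 1:end]

pfam_region = extract_region(protein, "22..143")  # PF13456 (RVT_3)
print(len(pfam_region), pfam_region[:5])
```

Under the stated assumption, the PF13456 match covers 122 of the 175 residues and begins at the `VDAAW` motif that is also conserved in the BLAST alignments.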

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0029089.1 | Lag0029089.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0004523 | RNA-DNA hybrid ribonuclease activity