Tan0019286 (gene) Snake gourd v1

Overview
NameTan0019286
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
LocationLG04: 42863079 .. 42863606 (+)
RNA-Seq ExpressionTan0019286
SyntenyTan0019286
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTATAATAGAATATGAGAAGAAGTTCACAGAGTTATCAAAGTATGCTAGCACTATTGTTGCAAGCGAGATAGATCGATGCAAGAGGTTCGAGGATGGTTTACGAGCAGAGATCCGAACACCTGTGACAGCAAGTTTTGAGTGGGCTGAGTTCTCCAAGCTTGTGGAGACGGCATTACGAGTAGAGCGAAGCCTAGTAGATGACAGAATGGGAAAAGGAGCTGTAGGTGGTGGTCATACCACTTATTCTGTTGGTTTACCACGAGACCGATTCCAACGAGGAGATAATAGGAGATTCACTCCAGGTATCTCTGGAAAAGGAAGCTTTAAACCCCTTCGTGGTGGGCAAACTACTCCAAGGACTGGTACAGGTGGACGTGGGGAAAGACAGAGGCAGCAGGGATCAAGTTTTCTGATAGGATCAGCAAGGGATTCCCGTACAGGGCAACCCAGTGAGTCAGTAGCTAGTTCAGCAAGGAAACCTTTGTGCAACACTTGTGGAAGTATCATTGGGGTCGATGTTTGA

mRNA sequence

ATGACTATAATAGAATATGAGAAGAAGTTCACAGAGTTATCAAAGTATGCTAGCACTATTGTTGCAAGCGAGATAGATCGATGCAAGAGGTTCGAGGATGGTTTACGAGCAGAGATCCGAACACCTGTGACAGCAAGTTTTGAGTGGGCTGAGTTCTCCAAGCTTGTGGAGACGGCATTACGAGTAGAGCGAAGCCTAGTAGATGACAGAATGGGAAAAGGAGCTGTAGGTGGTGGTCATACCACTTATTCTGTTGGTTTACCACGAGACCGATTCCAACGAGGAGATAATAGGAGATTCACTCCAGGTATCTCTGGAAAAGGAAGCTTTAAACCCCTTCGTGGTGGGCAAACTACTCCAAGGACTGGTACAGGTGGACGTGGGGAAAGACAGAGGCAGCAGGGATCAAGTTTTCTGATAGGATCAGCAAGGGATTCCCGTACAGGGCAACCCAGTGAGTCAGTAGCTAGTTCAGCAAGGAAACCTTTGTGCAACACTTGTGGAAGTATCATTGGGGTCGATGTTTGA

Coding sequence (CDS)

ATGACTATAATAGAATATGAGAAGAAGTTCACAGAGTTATCAAAGTATGCTAGCACTATTGTTGCAAGCGAGATAGATCGATGCAAGAGGTTCGAGGATGGTTTACGAGCAGAGATCCGAACACCTGTGACAGCAAGTTTTGAGTGGGCTGAGTTCTCCAAGCTTGTGGAGACGGCATTACGAGTAGAGCGAAGCCTAGTAGATGACAGAATGGGAAAAGGAGCTGTAGGTGGTGGTCATACCACTTATTCTGTTGGTTTACCACGAGACCGATTCCAACGAGGAGATAATAGGAGATTCACTCCAGGTATCTCTGGAAAAGGAAGCTTTAAACCCCTTCGTGGTGGGCAAACTACTCCAAGGACTGGTACAGGTGGACGTGGGGAAAGACAGAGGCAGCAGGGATCAAGTTTTCTGATAGGATCAGCAAGGGATTCCCGTACAGGGCAACCCAGTGAGTCAGTAGCTAGTTCAGCAAGGAAACCTTTGTGCAACACTTGTGGAAGTATCATTGGGGTCGATGTTTGA

Protein sequence

MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETALRVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPGISGKGSFKPLRGGQTTPRTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCGSIIGVDV
Homology
BLAST of Tan0019286 vs. NCBI nr
Match: KAA0060484.1 (Gag protease polyprotein-like protein [Cucumis melo var. makuwa] >TYK18569.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 133.7 bits (335), Expect = 1.6e-27
Identity = 75/168 (44.64%), Postives = 105/168 (62.50%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           MTI EYEKK+TELS YA+ ++  E++RCKRFE+GLR EIRTPVTA  +W +FSKLVE AL
Sbjct: 162 MTIAEYEKKYTELSMYATRVIEDEVERCKRFEEGLREEIRTPVTACADWNDFSKLVEAAL 221

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPGISGKGSFKPLRGGQTTP 120
           RVE+SL ++R  +        T+S  + R+R  +  + RF PG+S +G+FK    G +  
Sbjct: 222 RVEKSL-NERKQERETSKNVCTFSSSMHRNRQGKERSGRFVPGVSSRGNFKSQYNGSSFS 281

Query: 121 RTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
           ++G+ G    QR  GSS  I S   S   +    V+ S++  +C  CG
Sbjct: 282 KSGSSGGA--QRSSGSSHPISSTGGSHIARSDRVVSESSKSSVCYNCG 326

BLAST of Tan0019286 vs. NCBI nr
Match: KAA0035138.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 129.4 bits (324), Expect = 3.0e-26
Identity = 81/170 (47.65%), Postives = 106/170 (62.35%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           +++ EYEKK+TELS+YA  IVASE DRC+RFE GLR EIRTPVTA  +W  FS+LVETAL
Sbjct: 208 LSVAEYEKKYTELSRYADVIVASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETAL 267

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPG--ISGKGSFKPLRGGQT 120
           RVE+S+ +++       G  TT       + F+  + RRFTPG  IS +  FK   GGQ 
Sbjct: 268 RVEQSITEEKSAVELSRGTSTT-------NGFRGREQRRFTPGINISSRQDFKNRSGGQA 327

Query: 121 TPRTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
           +     G   +RQ Q+  S  I S   S+ GQ  ES+AS+ R+  C +CG
Sbjct: 328 SRNVSYGSVFQRQSQRIPSQPIRSTVRSQPGQ--ESIASTVRRTPCTSCG 368

BLAST of Tan0019286 vs. NCBI nr
Match: TYK15233.1 (uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa])

HSP 1 Score: 127.5 bits (319), Expect = 1.1e-25
Identity = 73/168 (43.45%), Postives = 101/168 (60.12%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           MT+ EYEKK+TELSKYA+ ++  E++R KRFE+GLR EIRT VTA  +W +FSKLVE AL
Sbjct: 292 MTVAEYEKKYTELSKYATRVIEDEVERYKRFEEGLREEIRTSVTACADWNDFSKLVEAAL 351

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPGISGKGSFKPLRGGQTTP 120
           RV +SL ++R  +        T+S  + R+R  +  + RF PG+  +G+FK    G    
Sbjct: 352 RVGKSL-NERKRERETSKNVRTFSSSMHRNRLGKERSGRFVPGVPSRGNFKSQYNGSYFS 411

Query: 121 RTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
            +G+G  GE QR  GSS  I S   S   +    V+ S +  +C  CG
Sbjct: 412 NSGSG--GEAQRSSGSSHPISSIGGSHIARSDRVVSESCKSSVCYNCG 456

BLAST of Tan0019286 vs. NCBI nr
Match: KAA0039476.1 (uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa])

HSP 1 Score: 127.5 bits (319), Expect = 1.1e-25
Identity = 73/168 (43.45%), Postives = 101/168 (60.12%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           MT+ EYEKK+TELSKYA+ ++  E++R KRFE+GLR EIRT VTA  +W +FSKLVE AL
Sbjct: 256 MTVAEYEKKYTELSKYATRVIEDEVERYKRFEEGLREEIRTSVTACADWNDFSKLVEAAL 315

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPGISGKGSFKPLRGGQTTP 120
           RV +SL ++R  +        T+S  + R+R  +  + RF PG+  +G+FK    G    
Sbjct: 316 RVGKSL-NERKRERETSKNVRTFSSSMHRNRLGKERSGRFVPGVPSRGNFKSQYNGSYFS 375

Query: 121 RTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
            +G+G  GE QR  GSS  I S   S   +    V+ S +  +C  CG
Sbjct: 376 NSGSG--GEAQRSSGSSHPISSIGGSHIARSDRVVSESCKSSVCYNCG 420

BLAST of Tan0019286 vs. NCBI nr
Match: KAA0036813.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 126.7 bits (317), Expect = 1.9e-25
Identity = 73/167 (43.71%), Postives = 102/167 (61.08%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           MT+ EYEKK+TELSKYA+ ++  E +RCKRFE+GLR EIRTPVTA  +W +FSKLVE AL
Sbjct: 140 MTVAEYEKKYTELSKYATRVIVDEGERCKRFEEGLREEIRTPVTACADWNDFSKLVEVAL 199

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPGISGKGSFKPLRGGQTTP 120
           RVE+SL ++R  +        T+S  + R+R  +  + RF P +S +GSFK    G +  
Sbjct: 200 RVEKSL-NERKREREASKNLRTFSSSMHRNRPGKERSGRFVPRVSSRGSFKSQYSGSSFS 259

Query: 121 RTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTC 168
           ++ +GG    QR   SS  I S   S   + +  V+ S +  +C  C
Sbjct: 260 KSRSGGGA--QRSSDSSHTISSTGGSHVARSNRVVSESGKSSVCYNC 303

BLAST of Tan0019286 vs. ExPASy TrEMBL
Match: A0A5A7UZM6 (Gag protease polyprotein-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00750 PE=4 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 7.7e-28
Identity = 75/168 (44.64%), Postives = 105/168 (62.50%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           MTI EYEKK+TELS YA+ ++  E++RCKRFE+GLR EIRTPVTA  +W +FSKLVE AL
Sbjct: 162 MTIAEYEKKYTELSMYATRVIEDEVERCKRFEEGLREEIRTPVTACADWNDFSKLVEAAL 221

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPGISGKGSFKPLRGGQTTP 120
           RVE+SL ++R  +        T+S  + R+R  +  + RF PG+S +G+FK    G +  
Sbjct: 222 RVEKSL-NERKQERETSKNVCTFSSSMHRNRQGKERSGRFVPGVSSRGNFKSQYNGSSFS 281

Query: 121 RTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
           ++G+ G    QR  GSS  I S   S   +    V+ S++  +C  CG
Sbjct: 282 KSGSSGGA--QRSSGSSHPISSTGGSHIARSDRVVSESSKSSVCYNCG 326

BLAST of Tan0019286 vs. ExPASy TrEMBL
Match: A0A5A7SX06 (DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold57G002230 PE=4 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 1.5e-26
Identity = 81/170 (47.65%), Postives = 106/170 (62.35%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           +++ EYEKK+TELS+YA  IVASE DRC+RFE GLR EIRTPVTA  +W  FS+LVETAL
Sbjct: 208 LSVAEYEKKYTELSRYADVIVASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETAL 267

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPG--ISGKGSFKPLRGGQT 120
           RVE+S+ +++       G  TT       + F+  + RRFTPG  IS +  FK   GGQ 
Sbjct: 268 RVEQSITEEKSAVELSRGTSTT-------NGFRGREQRRFTPGINISSRQDFKNRSGGQA 327

Query: 121 TPRTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
           +     G   +RQ Q+  S  I S   S+ GQ  ES+AS+ R+  C +CG
Sbjct: 328 SRNVSYGSVFQRQSQRIPSQPIRSTVRSQPGQ--ESIASTVRRTPCTSCG 368

BLAST of Tan0019286 vs. ExPASy TrEMBL
Match: A0A5A7TBS0 (CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold64G002900 PE=4 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 5.5e-26
Identity = 73/168 (43.45%), Postives = 101/168 (60.12%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           MT+ EYEKK+TELSKYA+ ++  E++R KRFE+GLR EIRT VTA  +W +FSKLVE AL
Sbjct: 256 MTVAEYEKKYTELSKYATRVIEDEVERYKRFEEGLREEIRTSVTACADWNDFSKLVEAAL 315

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPGISGKGSFKPLRGGQTTP 120
           RV +SL ++R  +        T+S  + R+R  +  + RF PG+  +G+FK    G    
Sbjct: 316 RVGKSL-NERKRERETSKNVRTFSSSMHRNRLGKERSGRFVPGVPSRGNFKSQYNGSYFS 375

Query: 121 RTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
            +G+G  GE QR  GSS  I S   S   +    V+ S +  +C  CG
Sbjct: 376 NSGSG--GEAQRSSGSSHPISSIGGSHIARSDRVVSESCKSSVCYNCG 420

BLAST of Tan0019286 vs. ExPASy TrEMBL
Match: A0A5D3CTK6 (CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold892G00030 PE=4 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 5.5e-26
Identity = 73/168 (43.45%), Postives = 101/168 (60.12%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           MT+ EYEKK+TELSKYA+ ++  E++R KRFE+GLR EIRT VTA  +W +FSKLVE AL
Sbjct: 292 MTVAEYEKKYTELSKYATRVIEDEVERYKRFEEGLREEIRTSVTACADWNDFSKLVEAAL 351

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRGDNRRFTPGISGKGSFKPLRGGQTTP 120
           RV +SL ++R  +        T+S  + R+R  +  + RF PG+  +G+FK    G    
Sbjct: 352 RVGKSL-NERKRERETSKNVRTFSSSMHRNRLGKERSGRFVPGVPSRGNFKSQYNGSYFS 411

Query: 121 RTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
            +G+G  GE QR  GSS  I S   S   +    V+ S +  +C  CG
Sbjct: 412 NSGSG--GEAQRSSGSSHPISSIGGSHIARSDRVVSESCKSSVCYNCG 456

BLAST of Tan0019286 vs. ExPASy TrEMBL
Match: A0A5D3BTP3 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold374G00020 PE=4 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 9.4e-26
Identity = 80/171 (46.78%), Postives = 108/171 (63.16%), Query Frame = 0

Query: 1   MTIIEYEKKFTELSKYASTIVASEIDRCKRFEDGLRAEIRTPVTASFEWAEFSKLVETAL 60
           +++ +YE+K+TELS+YA  IVASE DRC RFE GLR EIRTPVTA  +W  FS+LVETAL
Sbjct: 382 LSVAKYERKYTELSRYAEMIVASESDRCHRFERGLRFEIRTPVTAIAKWMNFSQLVETAL 441

Query: 61  RVERSLVDDRMGKGAVGGGHTTYSVGLPRDRFQRG-DNRRFTPG--ISGKGSFKPLRGGQ 120
           RV++S+V+++       G  TT  +        RG + RRFTPG  +SG   FK   GG+
Sbjct: 442 RVKQSIVEEKSAMELSRGVSTTSGI--------RGREQRRFTPGVNVSGCQDFKRRSGGK 501

Query: 121 TTPRTGTGGRGERQRQQGSSFLIGSARDSRTGQPSESVASSARKPLCNTCG 169
              +  +G   +RQ ++ SS    S   SRTGQ  ESVAS +++  C +CG
Sbjct: 502 PLRQMSSGSAYQRQSRRASSQPANSVARSRTGQ--ESVASESKRTPCVSCG 542

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAA0060484.11.6e-2744.64Gag protease polyprotein-like protein [Cucumis melo var. makuwa] >TYK18569.1 Gag... [more]
KAA0035138.13.0e-2647.65DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa][more]
TYK15233.11.1e-2543.45uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa][more]
KAA0039476.11.1e-2543.45uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa][more]
KAA0036813.11.9e-2543.71DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
A0A5A7UZM67.7e-2844.64Gag protease polyprotein-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=... [more]
A0A5A7SX061.5e-2647.65DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G... [more]
A0A5A7TBS05.5e-2643.45CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6... [more]
A0A5D3CTK65.5e-2643.45CCHC-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A5D3BTP39.4e-2646.78Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold37... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 79..159
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 115..159

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0019286.1Tan0019286.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0043170 macromolecule metabolic process
biological_process GO:0006807 nitrogen compound metabolic process
biological_process GO:0044238 primary metabolic process
molecular_function GO:0016787 hydrolase activity