Tan0018288 (gene) Snake gourd v1

Overview
Name: Tan0018288
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: LG01: 20552520 .. 20555427 (-)
RNA-Seq Expression: Tan0018288
Synteny: Tan0018288
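
The location value can be split into its parts and turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this record does not state); the function names are hypothetical.

```python
import re

def parse_location(text: str) -> tuple[str, int, int, str]:
    """Parse a value such as 'LG01: 20552520 .. 20555427 (-)'."""
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("LG01: 20552520 .. 20555427 (-)")
print(seqid, strand, span_length(start, end))  # LG01 - 2908
```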
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTGTTTTAACTTCCTCCTTAATAAATGCTTGCATCAATCGCAAATTCATATCTATAACTTTTTTTGATATAATTTTCGTAATCCTTTCAAAATCCTCCAATAAAGGATCTGGATCAGATGTAGAGAAAATATTCCCAAAATATTGAATGAAGATATTTGAGAACAAAACTATTTTGTTTTTCTCACTCCACTGGATTAAATCTTCTCTTGGACTCTCAATGTATGAGAAATTATGTATGAGAAATTATTCAATGAGGCCGCCACAATGTCATTATCTTATTATGACCGTTATGGATATGTGGATCCAATAATTTCCATGCCCTCTAACCGTTGCATATAAAATTGAAATACTTCTCTCTTTTCATCCAATGGTGAAAAACTGTCGGTCATATTGTGACGGCCTCATCAAATAATTTCTTCTTATGTAGAGGTGTACCAACCTTGTCTAATGTGTCCCACCAACGAGCAACCTTGACCAATTCTTTACAATTTGGATTTGCATGCCAACTAGAATTCTTCAACCTTTAAAGGTCTTTTTGTGAAGACATAATTCTATGGGTCGATCATCCAATTTAGCCCAATCCAAGTTTCTAACGTTTTTTAGGAAATTTTGTTATTTAGTGAAACAAATTAAACTCTATTATAACAAGTAGGCGGGTTAAAAAGTTATTGTTATAATTATAGTTAATTAATATCTAAGTTAATTTATTAACCATAAAATAAAAATGTGTAACACTATTATTTAAGTTAATAAATTATTTTTAAATTTTAAAAGAAATATATTATATTTATAATTAATCTTTTTTAAAAAATATTCTAAAAACAATGCTATTCATTCAAATGGAGCACTAAGATTGTTCCATTCCATATTGGTTATTTTATTCTTTTATCAAATGCATTGTTTATGTTTGATTAGGATATTCCGTTAATGAATAAAAGTTTAGTTAGGATAGGATAAAGATTATTAACAAGAATTCTTGTTATTTAACTATTTTATCCCTAATTAATATTTCCTCAATGAATAATATTTTTTAATTAGGATAGAATAATGACCGTCTCTCGTTATTAGTCCTTATCTTATCATAACTAAATTTTTCTTCATTCTAGATAAGATATTCTAATCTAGATAAGCTATAATTAAATATATTTAACTCGAGGATTGTTCCATATCCCTACCTCAAATGATATTGAACTAAAAAAAACTATATAAAATGTTTTGATATTGATATTGACTTAAAATTAAAAATCTCAAATGTATAGAAATGTTGTAAAATAATGATGCAGATGAGTATTCTAAAATAATCATAAAATTAAAGTTAAGGAAGTTTTAAAATTTATTTCTAACAAATAATAATAACACTAACAATAATAAGTATTTCCGCTATTTTTAAATAAGATAAAAATTCTTATAATCAGTATTTCAATTAACATAATGGTATTTTATTGACATATTTTTTATATTTCATCAATCTTTTTGCTATAAGATAATGATCTATCAATCCTTTATTGATTAATGAAAACATTGGTGAGTTGGATTTTTTTTTTTTTTTAGCTCAAAAGAGTTGAGAATGCAAAAAAGGAAAGAGAGAGACATGGAGCGATGATTTGAGTAATCTAAATAATATATAAACAAAAAAAAAAAAGGTTAAAATTGTGTAATTGGCCTAGAGCCCATCTTATATAGATGTATATAAGGGATTCCTAGTTGTAAATTAACACAAATGAGTTAATAGTCCTTAATTAATTTAAAGATAATTAGGATTACGATCTATTAAATAACCTTCGAAAATTAAACCTAGATTTTTTTTTTCTTTTATCTAGTTTTCTTGTCAAATAACTTGAACATAAATGATGATATAGCTAAGATAAAATGGGGTTTCTATCAAGGATAAAGGCCCAACAAGACATGCCAGTGAGCCCAACAAAGGTGGCCCATCAAAACCATTGTCAAGAGCAGATCAAGGCCCACAAGATCACAACTCTCGGTTAATAGAAA
TAGAAGGATAGAGCTGGACTTTACATTCTTGGATCCTCTCTGTATTTTATTATTCTTTCATTCTTTACTCTACCGTTGGTTGCCCTAATTCTTAGTGTGTATAAACTCTATAAATTAACAGTGTTTTATCACTGAAATTGTGCAAGTTGAATGAAATAGAGGTTTGCCCTCAAAAGCTGTCTCTCTGCCTTTACTTTGATTTGGTATCAGAGCCATTCATGGCAAACGCCTCAATGACTCCTGGGATGTCTCAATCGGTAGGAAGCAACAATTTCGGAACTCCACCATTGAATCAACTTCTCAATCAAGTAACATCAATCAAGTTGGATCGAAGTAACTTTCTCCTCTGGAAAAACCTTGCTCTCCCAATCCTCCGGAGCTATAAACTGGAAGGCCATCTCCTCGGGTTAAAACCGCGCCCTCCAATGTTTCTACAGAATGATGATGGGTCTGGAACAACAAGCGATGTAACATCATCCTCGTCTGTTGAAGGTACTGTGAATCCTCTATATGAAGCTTGGCTAACAGTTGATCAACTGCTTCTAGGTTGGCTCTATAATTCAATGGCACCTGAAATAGCAACACAGGTAATGGGTTATGATAATTCAAAAGATCTTTGGGATGTTATTCAACTTTTATATGGCATTCAATCCAGAGTAGAGGAGGATTTTCTTCGCCAAGTATTTCAGCAAACCAGAAAAGGCAATCAGAAGATGATGGACTATCTTCGCGTAATGAAATGCCACGCCGACAACTTAGGACAAGCTGGAAGTCCAGTGTGGAATAGAGCCCTAATTTCTCAAGTTCTTCTTGGCCTAGATGAGGAATACAACCCAGTGGTGGCTACCCTTCAAGGTAAGCCTGATGTTCAATGGTCTGATGTTCATAATGAACTCATTGTTTTTTAA

mRNA sequence

ATGGTTGTTTTAACTTCCTCCTTAATAAATGCTTGCATCAATCGCAAATTCATATCTATAACTTTTTTTGATATAATTTTCGTAATCCTTTCAAAATCCTCCAATAAAGGATCTGGATCAGATGATAAAGGCCCAACAAGACATGCCAGTGAGCCCAACAAAGGTGGCCCATCAAAACCATTCTATAAACTGGAAGGCCATCTCCTCGGGTTAAAACCGCGCCCTCCAATGTTTCTACAGAATGATGATGGGTCTGGAACAACAAGCGATGTAACATCATCCTCGTCTGTTGAAGTATTTCAGCAAACCAGAAAAGGCAATCAGAAGATGATGGACTATCTTCGCGTAATGAAATGCCACGCCGACAACTTAGGACAAGCTGGAAGTCCAGTGTGGAATAGAGCCCTAATTTCTCAAGTTCTTCTTGGCCTAGATGAGGAATACAACCCAGTGGTGGCTACCCTTCAAGGTAAGCCTGATGTTCAATGGTCTGATGTTCATAATGAACTCATTGTTTTTTAA

Coding sequence (CDS)

ATGGTTGTTTTAACTTCCTCCTTAATAAATGCTTGCATCAATCGCAAATTCATATCTATAACTTTTTTTGATATAATTTTCGTAATCCTTTCAAAATCCTCCAATAAAGGATCTGGATCAGATGATAAAGGCCCAACAAGACATGCCAGTGAGCCCAACAAAGGTGGCCCATCAAAACCATTCTATAAACTGGAAGGCCATCTCCTCGGGTTAAAACCGCGCCCTCCAATGTTTCTACAGAATGATGATGGGTCTGGAACAACAAGCGATGTAACATCATCCTCGTCTGTTGAAGTATTTCAGCAAACCAGAAAAGGCAATCAGAAGATGATGGACTATCTTCGCGTAATGAAATGCCACGCCGACAACTTAGGACAAGCTGGAAGTCCAGTGTGGAATAGAGCCCTAATTTCTCAAGTTCTTCTTGGCCTAGATGAGGAATACAACCCAGTGGTGGCTACCCTTCAAGGTAAGCCTGATGTTCAATGGTCTGATGTTCATAATGAACTCATTGTTTTTTAA

Protein sequence

MVVLTSSLINACINRKFISITFFDIIFVILSKSSNKGSGSDDKGPTRHASEPNKGGPSKPFYKLEGHLLGLKPRPPMFLQNDDGSGTTSDVTSSSSVEVFQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVWNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELIVF
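
The relationship between the CDS and the protein sequence can be spot-checked by translating codons. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the record does not say which table the annotation pipeline used, and `translate` is a hypothetical helper. Only the first 18 and last 12 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGTTGTTTTAACTTCC"))  # start of the CDS -> MVVLTS
print(translate("ATTGTTTTTTAA"))        # end of the CDS   -> IVF
```

Both fragments agree with the start (MVVLTS) and end (IVF) of the protein sequence listed above.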
Homology
BLAST of Tan0018288 vs. NCBI nr
Match: KAA0057475.1 (uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa] >TYK30171.1 uncharacterized protein E5676_scaffold216G001590 [Cucumis melo var. makuwa])

HSP 1 Score: 113.6 bits (283), Expect = 1.7e-21
Identity = 52/74 (70.27%), Positives = 62/74 (83.78%), Query Frame = 0

Query: 100 FQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVWNRALISQVLLGLDEEYNPVVATLQGKP 159
           FQ TR+GN KM DYLR+MK +ADNLGQAGSPV +R LISQVLLGLDE YNPV A +QGKP
Sbjct: 127 FQTTREGNYKMEDYLRIMKINADNLGQAGSPVPHRYLISQVLLGLDEVYNPVTAVIQGKP 186

Query: 160 DVQWSDVHNELIVF 174
           D+ W D+ +EL++F
Sbjct: 187 DISWLDMQSELLIF 200
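
The identity figure reported for this HSP can be reproduced by counting matching columns in the alignment. The sketch below assumes the two alignment blocks above are simply concatenated and that the HSP contains no gaps; `identity` is a hypothetical helper. The positives count is not reproduced, since it depends on the scoring matrix used for the search.

```python
def identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    # A column counts as identical when both residues match and are not gaps.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return matches, len(query), round(100 * matches / len(query), 2)

# Query and Sbjct rows of the HSP above, joined across the two blocks.
QUERY = (
    "FQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVWNRALISQVLLGLDEEYNPVVATLQGKP"
    "DVQWSDVHNELIVF"
)
SBJCT = (
    "FQTTREGNYKMEDYLRIMKINADNLGQAGSPVPHRYLISQVLLGLDEVYNPVTAVIQGKP"
    "DISWLDMQSELLIF"
)

print(identity(QUERY, SBJCT))  # (52, 74, 70.27)
```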

BLAST of Tan0018288 vs. NCBI nr
Match: XP_022148963.1 (uncharacterized protein LOC111017501 [Momordica charantia])

HSP 1 Score: 111.7 bits (278), Expect = 6.4e-21
Identity = 55/84 (65.48%), Positives = 63/84 (75.00%), Query Frame = 0

Query: 86  GTTSDVTSSSSVEVFQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVWNRALISQVLLGLD 145
           G  S        +VFQQTRKG+ KM D+LRVMK HADNLGQAGSPV  R+LISQVLLGLD
Sbjct: 79  GVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLD 138

Query: 146 EEYNPVVATLQGKPDVQWSDVHNE 170
           EEYNPVVAT+QGK  + W ++  E
Sbjct: 139 EEYNPVVATIQGKRGISWPEMQAE 162

BLAST of Tan0018288 vs. NCBI nr
Match: KGN65684.1 (hypothetical protein Csa_019689 [Cucumis sativus])

HSP 1 Score: 111.7 bits (278), Expect = 6.4e-21
Identity = 54/106 (50.94%), Positives = 69/106 (65.09%), Query Frame = 0

Query: 68  LLGLKPRPPMFLQNDDGSGTTSDVTSSSSVEVFQQTRKGNQKMMDYLRVMKCHADNLGQA 127
           L+G      M+    D  G  S        + FQ TRKGN  M DYLR+MK +ADNLGQA
Sbjct: 34  LIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQA 93

Query: 128 GSPVWNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELIVF 174
            SP+  RALISQVLLGLDE YNPV+  +QGKP++ W D+ ++L++F
Sbjct: 94  ESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISWLDMQSKLLIF 139

BLAST of Tan0018288 vs. NCBI nr
Match: XP_038902487.1 (uncharacterized protein LOC120089143 [Benincasa hispida])

HSP 1 Score: 106.3 bits (264), Expect = 2.7e-19
Identity = 48/76 (63.16%), Positives = 61/76 (80.26%), Query Frame = 0

Query: 98  EVFQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVWNRALISQVLLGLDEEYNPVVATLQG 157
           +VFQQT KG  KM +YLRVMK H+DNLG  GSPV  RAL+SQVLLGLDEE+NP VAT+QG
Sbjct: 189 QVFQQTCKGAMKMPEYLRVMKTHSDNLGLTGSPVPTRALVSQVLLGLDEEFNPFVATIQG 248

Query: 158 KPDVQWSDVHNELIVF 174
           + ++ W+++  EL+ F
Sbjct: 249 RSEISWTNMQTELLAF 264

BLAST of Tan0018288 vs. NCBI nr
Match: XP_038896600.1 (uncharacterized protein LOC120084860 [Benincasa hispida])

HSP 1 Score: 103.6 bits (257), Expect = 1.7e-18
Identity = 47/76 (61.84%), Positives = 58/76 (76.32%), Query Frame = 0

Query: 98  EVFQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVWNRALISQVLLGLDEEYNPVVATLQG 157
           ++FQQTRKG QKM  YL++MK H+DNL Q  SPV  R LISQVLLGLDEEYN VV  +QG
Sbjct: 39  QLFQQTRKGGQKMSGYLKLMKLHSDNLAQTSSPVSTRTLISQVLLGLDEEYNLVVVGIQG 98

Query: 158 KPDVQWSDVHNELIVF 174
           KP + W D+ +EL+ +
Sbjct: 99  KPGISWLDMQSELLTY 114

BLAST of Tan0018288 vs. ExPASy TrEMBL
Match: A0A5D3E3L7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold216G001590 PE=4 SV=1)

HSP 1 Score: 113.6 bits (283), Expect = 8.2e-22
Identity = 52/74 (70.27%), Positives = 62/74 (83.78%), Query Frame = 0

Query: 100 FQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVWNRALISQVLLGLDEEYNPVVATLQGKP 159
           FQ TR+GN KM DYLR+MK +ADNLGQAGSPV +R LISQVLLGLDE YNPV A +QGKP
Sbjct: 127 FQTTREGNYKMEDYLRIMKINADNLGQAGSPVPHRYLISQVLLGLDEVYNPVTAVIQGKP 186

Query: 160 DVQWSDVHNELIVF 174
           D+ W D+ +EL++F
Sbjct: 187 DISWLDMQSELLIF 200

BLAST of Tan0018288 vs. ExPASy TrEMBL
Match: A0A0A0LXB7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G496800 PE=4 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.1e-21
Identity = 54/106 (50.94%), Positives = 69/106 (65.09%), Query Frame = 0

Query: 68  LLGLKPRPPMFLQNDDGSGTTSDVTSSSSVEVFQQTRKGNQKMMDYLRVMKCHADNLGQA 127
           L+G      M+    D  G  S        + FQ TRKGN  M DYLR+MK +ADNLGQA
Sbjct: 34  LIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQA 93

Query: 128 GSPVWNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELIVF 174
            SP+  RALISQVLLGLDE YNPV+  +QGKP++ W D+ ++L++F
Sbjct: 94  ESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISWLDMQSKLLIF 139

BLAST of Tan0018288 vs. ExPASy TrEMBL
Match: A0A6J1D5J0 (uncharacterized protein LOC111017501 OS=Momordica charantia OX=3673 GN=LOC111017501 PE=4 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.1e-21
Identity = 55/84 (65.48%), Positives = 63/84 (75.00%), Query Frame = 0

Query: 86  GTTSDVTSSSSVEVFQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVWNRALISQVLLGLD 145
           G  S        +VFQQTRKG+ KM D+LRVMK HADNLGQAGSPV  R+LISQVLLGLD
Sbjct: 79  GVQSQAEEDYLRQVFQQTRKGSLKMTDFLRVMKSHADNLGQAGSPVPTRSLISQVLLGLD 138

Query: 146 EEYNPVVATLQGKPDVQWSDVHNE 170
           EEYNPVVAT+QGK  + W ++  E
Sbjct: 139 EEYNPVVATIQGKRGISWPEMQAE 162

BLAST of Tan0018288 vs. ExPASy TrEMBL
Match: A0A5A7SIT7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold19G00360 PE=4 SV=1)

HSP 1 Score: 103.6 bits (257), Expect = 8.4e-19
Identity = 52/106 (49.06%), Positives = 67/106 (63.21%), Query Frame = 0

Query: 68  LLGLKPRPPMFLQNDDGSGTTSDVTSSSSVEVFQQTRKGNQKMMDYLRVMKCHADNLGQA 127
           L+G      ++    D  G  S        ++ Q TRKGN KM +YL VMK + DNLGQ 
Sbjct: 131 LMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQTTRKGNTKMEEYLLVMKTNVDNLGQV 190

Query: 128 GSPVWNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELIVF 174
           GSPV  RALISQVLLGLDE YN V+  +QGKPD+ W D+ ++L++F
Sbjct: 191 GSPVPRRALISQVLLGLDEVYNLVIVVIQGKPDISWLDMQSKLLIF 236

BLAST of Tan0018288 vs. ExPASy TrEMBL
Match: A0A5D3C373 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G002430 PE=4 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 4.2e-18
Identity = 50/106 (47.17%), Positives = 65/106 (61.32%), Query Frame = 0

Query: 68  LLGLKPRPPMFLQNDDGSGTTSDVTSSSSVEVFQQTRKGNQKMMDYLRVMKCHADNLGQA 127
           L+G      ++    D  G  S        ++FQ TRK      DYLR+MK ++D LGQA
Sbjct: 49  LMGFTNAKDLWEATQDLFGVQSRAEEDFLRQMFQTTRKVRASYEDYLRIMKTNSDKLGQA 108

Query: 128 GSPVWNRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELIVF 174
           GSPV  RA ISQ LLGLDE YNPV+A +QGKP++ W D+ +EL+ F
Sbjct: 109 GSPVPKRAFISQALLGLDEVYNPVIAVIQGKPEISWIDMQSELLTF 154

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)   Description
KAA0057475.1    1.7e-21   70.27          uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa] >TYK...
XP_022148963.1  6.4e-21   65.48          uncharacterized protein LOC111017501 [Momordica charantia]
KGN65684.1      6.4e-21   50.94          hypothetical protein Csa_019689 [Cucumis sativus]
XP_038902487.1  2.7e-19   63.16          uncharacterized protein LOC120089143 [Benincasa hispida]
XP_038896600.1  1.7e-18   61.84          uncharacterized protein LOC120084860 [Benincasa hispida]

ExPASy TrEMBL
Match Name      E-value   Identity (%)   Description
A0A5D3E3L7      8.2e-22   70.27          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0LXB7      3.1e-21   50.94          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G496800 PE=4 SV=1
A0A6J1D5J0      3.1e-21   65.48          uncharacterized protein LOC111017501 OS=Momordica charantia OX=3673 GN=LOC111017...
A0A5A7SIT7      8.4e-19   49.06          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3C373      4.2e-18   47.17          Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 75..95
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 35..59
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 81..95
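
The residues covered by each predicted disordered region can be pulled out of the protein sequence by coordinate. This is a minimal sketch, assuming the `coord` values are 1-based and inclusive positions on the protein sequence listed earlier in this record (the record itself does not state the convention); `extract_region` is a hypothetical helper.

```python
PROTEIN = (
    "MVVLTSSLINACINRKFISITFFDIIFVILSKSSNKGSGSDDKGPTRHASEPNKGGPSKPFYKLEG"
    "HLLGLKPRPPMFLQNDDGSGTTSDVTSSSSVEVFQQTRKGNQKMMDYLRVMKCHADNLGQAGSPVW"
    "NRALISQVLLGLDEEYNPVVATLQGKPDVQWSDVHNELIVF"
)

# Coordinates as listed in the InterPro table above.
REGIONS = [(75, 95), (35, 59), (81, 95)]

def extract_region(protein: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"region {start}..{end} is outside the sequence")
    return protein[start - 1:end]

for start, end in REGIONS:
    print(f"{start}..{end}: {extract_region(PROTEIN, start, end)}")
```

Under that assumption, the three regions are rich in Ser, Gly, Asp, Thr, and Pro residues.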

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0018288.1  Tan0018288.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding