Tan0013001 (gene) Snake gourd v1

Overview
Name: Tan0013001
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Location: LG04: 38620072 .. 38620395 (+)
RNA-Seq Expression: Tan0013001
Synteny: Tan0013001
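
As a quick consistency check on the Location field, the feature length can be derived from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page); the regular expression is a hypothetical pattern written for this one string, not a documented format.

```python
import re

# Location string copied from the Overview above.
location = "LG04: 38620072 .. 38620395 (+)"

# Hypothetical pattern for "<linkage group>: <start> .. <end> (<strand>)".
m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", location)
chrom, strand = m.group(1), m.group(4)
start, end = int(m.group(2)), int(m.group(3))

# Assumption: coordinates are 1-based and inclusive, hence the +1.
length = end - start + 1

print(chrom, strand, length)          # LG04 + 324
print(length % 3 == 0, length // 3)   # True 108
```

Under that assumption the gene spans 324 bp, that is 108 codons, which fits a 107-residue protein plus a stop codon.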
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGAGCTCTTTTATTCAATTACTCGCCTCCGAAAAACTTAACGGTGATAACTTTGAAACTTGGAAATCAAACCTGAATACGATTCTTGTGATTGATGATCTAAGGTTCGTCTTGACGGATGAATGTCCTCCCCTTCCCAGTTCGTCTGCAATTCGAACAGTTCGGGAAGCATTTGAAAAATGGACTAGGGCTAATGATAAAGCTCGGGTCTACATCTTAGCAAGCCTATCTGATGTGTTGTCTAAGAAACATGAGAGCATGATTACCGCAAAGGAGATCATGGGATCATTACAAGCCATGTTTGGACAACTGTCCTTGTAG

mRNA sequence

ATGTCGAGCTCTTTTATTCAATTACTCGCCTCCGAAAAACTTAACGGTGATAACTTTGAAACTTGGAAATCAAACCTGAATACGATTCTTGTGATTGATGATCTAAGGTTCGTCTTGACGGATGAATGTCCTCCCCTTCCCAGTTCGTCTGCAATTCGAACAGTTCGGGAAGCATTTGAAAAATGGACTAGGGCTAATGATAAAGCTCGGGTCTACATCTTAGCAAGCCTATCTGATGTGTTGTCTAAGAAACATGAGAGCATGATTACCGCAAAGGAGATCATGGGATCATTACAAGCCATGTTTGGACAACTGTCCTTGTAG

Coding sequence (CDS)

ATGTCGAGCTCTTTTATTCAATTACTCGCCTCCGAAAAACTTAACGGTGATAACTTTGAAACTTGGAAATCAAACCTGAATACGATTCTTGTGATTGATGATCTAAGGTTCGTCTTGACGGATGAATGTCCTCCCCTTCCCAGTTCGTCTGCAATTCGAACAGTTCGGGAAGCATTTGAAAAATGGACTAGGGCTAATGATAAAGCTCGGGTCTACATCTTAGCAAGCCTATCTGATGTGTTGTCTAAGAAACATGAGAGCATGATTACCGCAAAGGAGATCATGGGATCATTACAAGCCATGTTTGGACAACTGTCCTTGTAG

Protein sequence

MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLSL
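
The protein sequence can be cross-checked against the CDS by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the `translate` helper and the constant names are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Sequences copied from the record above.
CDS = (
    "ATGTCGAGCTCTTTTATTCAATTACTCGCCTCCGAAAAACTTAACGGTGATAACTTTGAAACTTGGAAATCA"
    "AACCTGAATACGATTCTTGTGATTGATGATCTAAGGTTCGTCTTGACGGATGAATGTCCTCCCCTTCCCAGT"
    "TCGTCTGCAATTCGAACAGTTCGGGAAGCATTTGAAAAATGGACTAGGGCTAATGATAAAGCTCGGGTCTAC"
    "ATCTTAGCAAGCCTATCTGATGTGTTGTCTAAGAAACATGAGAGCATGATTACCGCAAAGGAGATCATGGGA"
    "TCATTACAAGCCATGTTTGGACAACTGTCCTTGTAG"
)
PROTEIN = (
    "MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE"
    "KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLSL"
)

# Compare the translation with the listed protein sequence.
print(len(CDS), len(PROTEIN))
print(translate(CDS) == PROTEIN)
```

Because the gene, mRNA, and CDS sequences listed above are identical, the same check applies to all three.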
Homology
BLAST of Tan0013001 vs. NCBI nr
Match: XP_022157095.1 (uncharacterized protein LOC111023904 [Momordica charantia])

HSP 1 Score: 162.2 bits (409), Expect = 2.6e-36
Identity = 77/106 (72.64%), Positives = 95/106 (89.62%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           M+SS +QLLASEKLNG N+ TWK+NLNTILV+DDLRFVLT+ECP  P+ +A R VREAF+
Sbjct: 1   MNSSIVQLLASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAVNANRNVREAFD 60

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLS 106
           +W +ANDKARVYILAS++DVL+KKHE ++TAKEIM SL+AMFG+LS
Sbjct: 61  RWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKAMFGELS 106
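
The identity figure reported for this HSP can be reproduced directly from the aligned rows. The sketch below assumes the alignment is ungapped (no `-` characters appear in either row) and simply counts matching columns; the variable names are illustrative. The positives count is not reproduced here, since it depends on the substitution matrix used by the search, which the page does not state.

```python
# Aligned rows copied from the HSP above (both alignment lines concatenated).
query = (
    "MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE"
    "KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLS"
)
sbjct = (
    "MNSSIVQLLASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAVNANRNVREAFD"
    "RWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKAMFGELS"
)

# Assumption: ungapped alignment, so rows have equal length and align column by column.
assert len(query) == len(sbjct)

aligned_length = len(query)
identities = sum(q == s for q, s in zip(query, sbjct))
percent_identity = 100.0 * identities / aligned_length

print(f"{identities}/{aligned_length} ({percent_identity:.2f}%)")  # 77/106 (72.64%)
```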

BLAST of Tan0013001 vs. NCBI nr
Match: XP_038882358.1 (uncharacterized protein LOC120073622 [Benincasa hispida])

HSP 1 Score: 161.0 bits (406), Expect = 5.7e-36
Identity = 76/106 (71.70%), Positives = 94/106 (88.68%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           M+SS IQLL SEKLNGDN+  WKSNLNTILV+DDLRFVLT+ECP  P+S+A RTVREA++
Sbjct: 1   MNSSIIQLLTSEKLNGDNYSAWKSNLNTILVVDDLRFVLTEECPQAPTSNANRTVREAYD 60

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLS 106
           +W +AN+KAR+YILAS+SDVL+KKHES+ TAKEI+ SL+ +FGQ S
Sbjct: 61  RWVKANEKARIYILASMSDVLAKKHESLATAKEIIDSLRELFGQPS 106

BLAST of Tan0013001 vs. NCBI nr
Match: XP_022157844.1 (uncharacterized protein LOC111024457 [Momordica charantia])

HSP 1 Score: 160.6 bits (405), Expect = 7.4e-36
Identity = 76/106 (71.70%), Positives = 95/106 (89.62%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           M+SS +QLLASEKLNG N+ TWK+NLNTILV+DDLRFVLT+ECP  P+++A R VREAF+
Sbjct: 1   MNSSIVQLLASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPATNANRNVREAFD 60

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLS 106
           +W +ANDKARVYILAS++DVL+KKHE ++TAKEIM SL+AMFG+ S
Sbjct: 61  RWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKAMFGEPS 106

BLAST of Tan0013001 vs. NCBI nr
Match: XP_022158568.1 (uncharacterized protein LOC111025021 [Momordica charantia])

HSP 1 Score: 160.2 bits (404), Expect = 9.7e-36
Identity = 77/106 (72.64%), Positives = 94/106 (88.68%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           MS+S IQLLAS+KLNGDN+  WKSNLNTILVIDDLRFVLT+ECPP P+ +A RTVR+A++
Sbjct: 7   MSTSTIQLLASDKLNGDNYGIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYD 66

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLS 106
           +W +AN+KARVYILAS+S+VLSKKHE + T +EIM SLQA+FGQ S
Sbjct: 67  RWVKANEKARVYILASISEVLSKKHERLATTREIMDSLQALFGQPS 112

BLAST of Tan0013001 vs. NCBI nr
Match: KAA0032529.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 159.5 bits (402), Expect = 1.7e-35
Identity = 73/105 (69.52%), Positives = 95/105 (90.48%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           M+SS +QLLASEKLNGDN+E WKSNLNTILV+DDLRF+LT+ECP  P+S+A RT R+A++
Sbjct: 1   MNSSIVQLLASEKLNGDNYEAWKSNLNTILVVDDLRFILTEECPQTPASNANRTSRKAYD 60

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQL 105
           +W +AN+KARVYILA+++DVL+KKHES+ T K+IM +L+AMFGQL
Sbjct: 61  QWVKANEKARVYILANMTDVLAKKHESLATPKDIMNALKAMFGQL 105

BLAST of Tan0013001 vs. ExPASy TrEMBL
Match: A0A6J1DS54 (uncharacterized protein LOC111023904 OS=Momordica charantia OX=3673 GN=LOC111023904 PE=4 SV=1)

HSP 1 Score: 162.2 bits (409), Expect = 1.2e-36
Identity = 77/106 (72.64%), Positives = 95/106 (89.62%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           M+SS +QLLASEKLNG N+ TWK+NLNTILV+DDLRFVLT+ECP  P+ +A R VREAF+
Sbjct: 1   MNSSIVQLLASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPAVNANRNVREAFD 60

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLS 106
           +W +ANDKARVYILAS++DVL+KKHE ++TAKEIM SL+AMFG+LS
Sbjct: 61  RWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKAMFGELS 106
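
This Momordica charantia hit (LOC111023904) has the same bit score, 162.2, in both the NCBI nr and the ExPASy TrEMBL reports, yet a different E-value (2.6e-36 versus 1.2e-36). A likely explanation is database size: under the standard relation between E-value and bit score, E = m·n·2^(−S), the E-value scales with the effective search space m·n. The sketch below assumes both reports follow that relation and solves it for the implied search space; the function name is illustrative, and the absolute sizes it yields are only as precise as the rounded values printed on the page.

```python
# Values copied from the two reports for the same Momordica charantia hit.
bit_score = 162.2
evalue_nr = 2.6e-36       # vs. NCBI nr (XP_022157095.1)
evalue_trembl = 1.2e-36   # vs. ExPASy TrEMBL (A0A6J1DS54)

def implied_search_space(evalue, bits):
    """Solve E = m*n * 2**(-S) for the effective search space m*n (assumed relation)."""
    return evalue * 2.0 ** bits

space_nr = implied_search_space(evalue_nr, bit_score)
space_trembl = implied_search_space(evalue_trembl, bit_score)

# With equal bit scores the 2**S factor cancels, so the ratio is just E_nr / E_trembl.
ratio = space_nr / space_trembl
print(f"{ratio:.2f}")  # 2.17
```

On this reading, the nr search covered roughly twice the effective search space of the TrEMBL search, so E-values from the two reports are not directly comparable even though the alignments are identical.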

BLAST of Tan0013001 vs. ExPASy TrEMBL
Match: A0A6J1DXQ5 (uncharacterized protein LOC111024457 OS=Momordica charantia OX=3673 GN=LOC111024457 PE=4 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 3.6e-36
Identity = 76/106 (71.70%), Positives = 95/106 (89.62%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           M+SS +QLLASEKLNG N+ TWK+NLNTILV+DDLRFVLT+ECP  P+++A R VREAF+
Sbjct: 1   MNSSIVQLLASEKLNGVNYSTWKNNLNTILVVDDLRFVLTEECPQTPATNANRNVREAFD 60

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLS 106
           +W +ANDKARVYILAS++DVL+KKHE ++TAKEIM SL+AMFG+ S
Sbjct: 61  RWVKANDKARVYILASMTDVLAKKHEPLMTAKEIMDSLKAMFGEPS 106

BLAST of Tan0013001 vs. ExPASy TrEMBL
Match: A0A6J1DWG6 (uncharacterized protein LOC111025021 OS=Momordica charantia OX=3673 GN=LOC111025021 PE=4 SV=1)

HSP 1 Score: 160.2 bits (404), Expect = 4.7e-36
Identity = 77/106 (72.64%), Positives = 94/106 (88.68%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           MS+S IQLLAS+KLNGDN+  WKSNLNTILVIDDLRFVLT+ECPP P+ +A RTVR+A++
Sbjct: 7   MSTSTIQLLASDKLNGDNYGIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYD 66

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLS 106
           +W +AN+KARVYILAS+S+VLSKKHE + T +EIM SLQA+FGQ S
Sbjct: 67  RWVKANEKARVYILASISEVLSKKHERLATTREIMDSLQALFGQPS 112

BLAST of Tan0013001 vs. ExPASy TrEMBL
Match: A0A5A7STL5 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold465G00400 PE=4 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 8.0e-36
Identity = 73/105 (69.52%), Positives = 95/105 (90.48%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           M+SS +QLLASEKLNGDN+E WKSNLNTILV+DDLRF+LT+ECP  P+S+A RT R+A++
Sbjct: 1   MNSSIVQLLASEKLNGDNYEAWKSNLNTILVVDDLRFILTEECPQTPASNANRTSRKAYD 60

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQL 105
           +W +AN+KARVYILA+++DVL+KKHES+ T K+IM +L+AMFGQL
Sbjct: 61  QWVKANEKARVYILANMTDVLAKKHESLATPKDIMNALKAMFGQL 105

BLAST of Tan0013001 vs. ExPASy TrEMBL
Match: A0A6J1DXP1 (uncharacterized protein LOC111025468 OS=Momordica charantia OX=3673 GN=LOC111025468 PE=4 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 8.0e-36
Identity = 77/104 (74.04%), Positives = 92/104 (88.46%), Query Frame = 0

Query: 1   MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFE 60
           MS+S IQLLAS+KLNGDN+  WKSNLNTILVIDDLR VLT+ECPP P+ +A RTVREA++
Sbjct: 1   MSTSIIQLLASDKLNGDNYGIWKSNLNTILVIDDLRSVLTEECPPAPAPNANRTVREAYD 60

Query: 61  KWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQ 104
           +W +ANDKARVYILAS+SDVLSKKHE + TA+E+M SLQA+ GQ
Sbjct: 61  RWVKANDKARVYILASISDVLSKKHERLATAREMMDSLQALSGQ 104

The following BLAST results are available for this feature:
NCBI nr
Match Name       E-value   Identity (%)   Description
XP_022157095.1   2.6e-36   72.64          uncharacterized protein LOC111023904 [Momordica charantia]
XP_038882358.1   5.7e-36   71.70          uncharacterized protein LOC120073622 [Benincasa hispida]
XP_022157844.1   7.4e-36   71.70          uncharacterized protein LOC111024457 [Momordica charantia]
XP_022158568.1   9.7e-36   72.64          uncharacterized protein LOC111025021 [Momordica charantia]
KAA0032529.1     1.7e-35   69.52          gag/pol protein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name       E-value   Identity (%)   Description
A0A6J1DS54       1.2e-36   72.64          uncharacterized protein LOC111023904 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DXQ5       3.6e-36   71.70          uncharacterized protein LOC111024457 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J1DWG6       4.7e-36   72.64          uncharacterized protein LOC111025021 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A5A7STL5       8.0e-36   69.52          Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold465G0040...
A0A6J1DXP1       8.0e-36   74.04          uncharacterized protein LOC111025468 OS=Momordica charantia OX=3673 GN=LOC111025...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source    Source Term      Source Description               Alignment
None       No IPR available   PANTHER   PTHR35317        OS04G0629600 PROTEIN             coord: 4..104
None       No IPR available   PANTHER   PTHR35317:SF16   ZINC FINGER, CCHC-TYPE-RELATED   coord: 4..104

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Tan0013001.1   Tan0013001.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0015074       DNA integration
molecular_function   GO:0003676       nucleic acid binding
molecular_function   GO:0008270       zinc ion binding