Tan0009391 (gene) Snake gourd v1

Overview
NameTan0009391
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontype IV secretion system protein virB10-like
LocationLG02: 9978988 .. 9979525 (-)
RNA-Seq ExpressionTan0009391
SyntenyTan0009391
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTCACTTGTCGTCCACTTCCTTCCGACATCTCTGAACGTCTTGTTTTTAAGTAACCCCAAAAACCCAAAAAAAAAAAAAAAAATCCCAAATCCCCTTCCGCTCCTCCTTCCTCAGCCGCCGATTCAATTTCACCACACAAATTTAAATCCTTCCGATGTCCATTCCCGTCGATCAGCTCCAGCCCCCGCCGGCCTCCGCCGCCTCCCATCCCGGTCAGGGTTCCGTCGGCCCAGTCATCGCCGTCCTCGCCGTGATTTCCATCCTCGGCGTCATTGCCGGCATGATCGGCCGCCTCTGCTCCGGCCGCCCCGTCTTCGGTTACGGCGCCCACTACGACGTCGAAGATTGGGTCGAGAAGAAATGCGCTTCCTGCCTCGATGGGTCCCTCGATCCTCCTCCCCCTCCGCCGCATCTCCGCCGCCCGCCGCCGCTCGAGGCTGTTCCGGTGGCGGAGCCGCCTGATATCAAGGAAGGCGGTGATGGCGAACGCGAGAATTTGCAGTCGGCGCCTCCCGGAAGCGGCGGTGAGTCGTGA

mRNA sequence

GTTTCACTTGTCGTCCACTTCCTTCCGACATCTCTGAACGTCTTGTTTTTAAGTAACCCCAAAAACCCAAAAAAAAAAAAAAAAATCCCAAATCCCCTTCCGCTCCTCCTTCCTCAGCCGCCGATTCAATTTCACCACACAAATTTAAATCCTTCCGATGTCCATTCCCGTCGATCAGCTCCAGCCCCCGCCGGCCTCCGCCGCCTCCCATCCCGGTCAGGGTTCCGTCGGCCCAGTCATCGCCGTCCTCGCCGTGATTTCCATCCTCGGCGTCATTGCCGGCATGATCGGCCGCCTCTGCTCCGGCCGCCCCGTCTTCGGTTACGGCGCCCACTACGACGTCGAAGATTGGGTCGAGAAGAAATGCGCTTCCTGCCTCGATGGGTCCCTCGATCCTCCTCCCCCTCCGCCGCATCTCCGCCGCCCGCCGCCGCTCGAGGCTGTTCCGGTGGCGGAGCCGCCTGATATCAAGGAAGGCGGTGATGGCGAACGCGAGAATTTGCAGTCGGCGCCTCCCGGAAGCGGCGGTGAGTCGTGA

Coding sequence (CDS)

ATGTCCATTCCCGTCGATCAGCTCCAGCCCCCGCCGGCCTCCGCCGCCTCCCATCCCGGTCAGGGTTCCGTCGGCCCAGTCATCGCCGTCCTCGCCGTGATTTCCATCCTCGGCGTCATTGCCGGCATGATCGGCCGCCTCTGCTCCGGCCGCCCCGTCTTCGGTTACGGCGCCCACTACGACGTCGAAGATTGGGTCGAGAAGAAATGCGCTTCCTGCCTCGATGGGTCCCTCGATCCTCCTCCCCCTCCGCCGCATCTCCGCCGCCCGCCGCCGCTCGAGGCTGTTCCGGTGGCGGAGCCGCCTGATATCAAGGAAGGCGGTGATGGCGAACGCGAGAATTTGCAGTCGGCGCCTCCCGGAAGCGGCGGTGAGTCGTGA

Protein sequence

MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLEAVPVAEPPDIKEGGDGERENLQSAPPGSGGES
Homology
BLAST of Tan0009391 vs. NCBI nr
Match: XP_023530142.1 (uncharacterized protein LOC111792788 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 203.4 bits (516), Expect = 1.2e-48
Identity = 106/132 (80.30%), Postives = 114/132 (86.36%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS+P+DQLQPPP++AA HP QGSVGPVIAVLAVISILG IAGMIGR+C GRPVFGY AHY
Sbjct: 1   MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60

Query: 61  DVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLEAVPVAE----PPDIKEGGDG----ER 120
           DVEDWVEKKCASCLDGSLD   PPPHLR PPP+EA+PVAE    PP+IKEGGDG    ER
Sbjct: 61  DVEDWVEKKCASCLDGSLD---PPPHLRPPPPIEAIPVAEPLGGPPNIKEGGDGDGDRER 120

Query: 121 ENLQSAPPGSGG 125
           ENLQS PPGSGG
Sbjct: 121 ENLQSVPPGSGG 129

BLAST of Tan0009391 vs. NCBI nr
Match: KAG6588645.1 (hypothetical protein SDJN03_17210, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 203.0 bits (515), Expect = 1.5e-48
Identity = 106/132 (80.30%), Postives = 115/132 (87.12%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS+PVDQLQPPP++AA HP QGSVGPVIAVLAVISILG IAGMIGR+C GRPVFGY AHY
Sbjct: 1   MSVPVDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60

Query: 61  DVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLEAVPVA----EPPDIKEGGDG--EREN 120
           DVEDWVEKKCA+CLDGSLD   PPPHLR PPP+EA+PV+    EPP+IKEGGDG  EREN
Sbjct: 61  DVEDWVEKKCATCLDGSLD---PPPHLRPPPPIEAIPVSEPLGEPPNIKEGGDGDREREN 120

Query: 121 LQSAPPGSGGES 127
           LQS  PGSGGES
Sbjct: 121 LQSVAPGSGGES 129

BLAST of Tan0009391 vs. NCBI nr
Match: KAG6571994.1 (hypothetical protein SDJN03_28722, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 201.4 bits (511), Expect = 4.5e-48
Identity = 106/131 (80.92%), Postives = 114/131 (87.02%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS PVDQLQPPP    +H G GSVGPVIAVLAVISILGVIAG+IGRLCSGRPVFGYGAHY
Sbjct: 1   MSTPVDQLQPPP---TAHSGYGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHY 60

Query: 61  DVEDWVEKKCASCLDGSLD-PPPPPPHLRRPPPLEAVPVAE----PPDIKEGGDGERENL 120
           DVE+WVEKKCASCLDGSLD PPPPPPHLR PPPL+AVPV E    PP+IK+G D +RENL
Sbjct: 61  DVEEWVEKKCASCLDGSLDPPPPPPPHLRHPPPLDAVPVVEPLGGPPEIKQGADDKRENL 120

Query: 121 QSAPPGSGGES 127
           QSA PG+GGES
Sbjct: 121 QSAAPGTGGES 128

BLAST of Tan0009391 vs. NCBI nr
Match: KAG7011671.1 (hypothetical protein SDJN02_26577, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 201.1 bits (510), Expect = 5.8e-48
Identity = 106/133 (79.70%), Postives = 114/133 (85.71%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS PVDQLQPPP    +H G GSVGPVIAVLAVISILGVIAG+IGRLCSGRPVFGYGAHY
Sbjct: 1   MSTPVDQLQPPP---TAHSGHGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHY 60

Query: 61  DVEDWVEKKCASCLDGSLD---PPPPPPHLRRPPPLEAVPVAE----PPDIKEGGDGERE 120
           DVE+WVEKKCASCLDGSLD   PPPPPPHLR PPPL+AVPV E    PP+IK+G D +RE
Sbjct: 61  DVEEWVEKKCASCLDGSLDPPPPPPPPPHLRHPPPLDAVPVVEPLGGPPEIKQGADDKRE 120

Query: 121 NLQSAPPGSGGES 127
           NLQSA PG+GGES
Sbjct: 121 NLQSAAPGTGGES 130

BLAST of Tan0009391 vs. NCBI nr
Match: XP_022989510.1 (uncharacterized protein LOC111486567 [Cucurbita maxima])

HSP 1 Score: 200.3 bits (508), Expect = 1.0e-47
Identity = 106/132 (80.30%), Postives = 114/132 (86.36%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS+P+DQLQPPPA+AA HP QGSVGPVIAVLAVISILG IAGMIGR+C GRPVFGY AHY
Sbjct: 1   MSVPLDQLQPPPAAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60

Query: 61  DVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLEAVPVAE----PPDIKEGGDG--EREN 120
           DVEDWVEKKCA+CLDGSLD   PPP LR PPP+EA+PVAE    PP+IKE GDG  EREN
Sbjct: 61  DVEDWVEKKCATCLDGSLD---PPPLLRPPPPIEAIPVAEPLGGPPNIKEDGDGDKEREN 120

Query: 121 LQSAPPGSGGES 127
           LQS PPGSGGES
Sbjct: 121 LQSVPPGSGGES 129

BLAST of Tan0009391 vs. ExPASy TrEMBL
Match: A0A6J1JMK0 (uncharacterized protein LOC111486567 OS=Cucurbita maxima OX=3661 GN=LOC111486567 PE=4 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 4.8e-48
Identity = 106/132 (80.30%), Postives = 114/132 (86.36%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS+P+DQLQPPPA+AA HP QGSVGPVIAVLAVISILG IAGMIGR+C GRPVFGY AHY
Sbjct: 1   MSVPLDQLQPPPAAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAHY 60

Query: 61  DVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLEAVPVAE----PPDIKEGGDG--EREN 120
           DVEDWVEKKCA+CLDGSLD   PPP LR PPP+EA+PVAE    PP+IKE GDG  EREN
Sbjct: 61  DVEDWVEKKCATCLDGSLD---PPPLLRPPPPIEAIPVAEPLGGPPNIKEDGDGDKEREN 120

Query: 121 LQSAPPGSGGES 127
           LQS PPGSGGES
Sbjct: 121 LQSVPPGSGGES 129

BLAST of Tan0009391 vs. ExPASy TrEMBL
Match: A0A6J1ELS6 (uncharacterized protein LOC111434531 OS=Cucurbita moschata OX=3662 GN=LOC111434531 PE=4 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 6.3e-48
Identity = 105/132 (79.55%), Postives = 114/132 (86.36%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS+P+DQLQPPP++AA HP QGSVGPVIAVLAVISILG IAGMIGR+C GRPVFGY A Y
Sbjct: 1   MSVPLDQLQPPPSAAAIHPAQGSVGPVIAVLAVISILGFIAGMIGRVCFGRPVFGYNAQY 60

Query: 61  DVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLEAVPVAE----PPDIKEGGDG--EREN 120
           DVEDWVEKKCASCLDGSLD   PPPHLR PPP+EA+PV+E    PP+IKEGGDG  EREN
Sbjct: 61  DVEDWVEKKCASCLDGSLD---PPPHLRPPPPIEAIPVSEPLGGPPNIKEGGDGDREREN 120

Query: 121 LQSAPPGSGGES 127
           LQS  PGSGGES
Sbjct: 121 LQSVAPGSGGES 129

BLAST of Tan0009391 vs. ExPASy TrEMBL
Match: A0A6J1GL66 (uncharacterized protein LOC111455383 OS=Cucurbita moschata OX=3662 GN=LOC111455383 PE=4 SV=1)

HSP 1 Score: 198.7 bits (504), Expect = 1.4e-47
Identity = 104/129 (80.62%), Postives = 112/129 (86.82%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS PVDQLQPPP    +H G GSVGPVIAVLAVISILGVIAG+IGRLCSGRPVFGYGAHY
Sbjct: 1   MSTPVDQLQPPP---TAHSGYGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHY 60

Query: 61  DVEDWVEKKCASCLDGSLD-PPPPPPHLRRPPPLEAVPVAE----PPDIKEGGDGERENL 120
           DVE+WVEKKCASCLDGSLD PPPPPPHLR PPPL+AVPV E    PP+IK+G D +RENL
Sbjct: 61  DVEEWVEKKCASCLDGSLDPPPPPPPHLRHPPPLDAVPVVEPLGGPPEIKQGADDKRENL 120

Query: 121 QSAPPGSGG 125
           QSA PG+GG
Sbjct: 121 QSAAPGTGG 126

BLAST of Tan0009391 vs. ExPASy TrEMBL
Match: A0A6J1IBA8 (uncharacterized protein LOC111470940 OS=Cucurbita maxima OX=3661 GN=LOC111470940 PE=4 SV=1)

HSP 1 Score: 198.4 bits (503), Expect = 1.8e-47
Identity = 102/128 (79.69%), Postives = 110/128 (85.94%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS P DQLQPPP    +H G GSVGPVIAVLAVISILGVIAG+IGRLCSGRPVFGYGAHY
Sbjct: 1   MSTPFDQLQPPP---TAHSGHGSVGPVIAVLAVISILGVIAGIIGRLCSGRPVFGYGAHY 60

Query: 61  DVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLEAVPVAE----PPDIKEGGDGERENLQ 120
           DVE+WVEKKCASCLDGSLDPPPPP HLR PPPL+AVPV E    PP+IK+G D +RENLQ
Sbjct: 61  DVEEWVEKKCASCLDGSLDPPPPPAHLRHPPPLDAVPVVEPLGGPPEIKQGADEKRENLQ 120

Query: 121 SAPPGSGG 125
           SA PG+GG
Sbjct: 121 SAAPGTGG 125

BLAST of Tan0009391 vs. ExPASy TrEMBL
Match: A0A0A0K6N4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G074850 PE=4 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 1.7e-45
Identity = 102/135 (75.56%), Postives = 112/135 (82.96%), Query Frame = 0

Query: 1   MSIPVDQLQPPPASAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60
           MS P+DQLQPPP   +SH    SVGP+IAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY
Sbjct: 1   MSTPIDQLQPPPPLHSSH---ASVGPLIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHY 60

Query: 61  DVEDWVEKKCASCLDGSLDPPPPPPHLRRPPPLEAVPVAE-----PPDIKEG----GDGE 120
           D+EDWVEKKCASCLDGSLDPPPPPPHLR PPPL++VPVAE     PP+IK+      D +
Sbjct: 61  DLEDWVEKKCASCLDGSLDPPPPPPHLRHPPPLDSVPVAEPLGGPPPEIKQSAHADADAK 120

Query: 121 RENLQSAPPGSGGES 127
            ENLQSA PG+GGES
Sbjct: 121 GENLQSAAPGTGGES 132

BLAST of Tan0009391 vs. TAIR 10
Match: AT2G26520.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G57500.1); Has 51 Blast hits to 51 proteins in 11 species: Archae - 0; Bacteria - 1; Metazoa - 0; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 79.0 bits (193), Expect = 3.1e-15
Identity = 52/129 (40.31%), Postives = 70/129 (54.26%), Query Frame = 0

Query: 9   QPPPA-------SAASHPGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYD 68
           QPPPA       S+ S  G  ++GP IAV  V+++L V+A +IGRLCSG+ + GYG  YD
Sbjct: 9   QPPPATEVSQDSSSVSSAGNSTIGPFIAVFIVVTVLCVLASVIGRLCSGKTILGYG-DYD 68

Query: 69  VEDWVEKKCASCLDGSLDPPPPPPHLRRPP--PL-------EAVPVAEPPDIKEGGDGER 121
           +E W E +C SC+DG + P  P P    PP  PL        A       D+    DGE+
Sbjct: 69  MERWAESRCGSCIDGHIHPHRPSPSPTPPPRQPLHHTSSGVSAESEGHVADLDHETDGEK 128

BLAST of Tan0009391 vs. TAIR 10
Match: AT3G57500.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G26520.1); Has 51 Blast hits to 51 proteins in 11 species: Archae - 0; Bacteria - 1; Metazoa - 0; Fungi - 0; Plants - 50; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 67.0 bits (162), Expect = 1.2e-11
Identity = 42/100 (42.00%), Postives = 57/100 (57.00%), Query Frame = 0

Query: 8   LQPPPASAASH-PGQGSVGPVIAVLAVISILGVIAGMIGRLCSGRPVFGYGAHYDVEDWV 67
           + PP     SH     S+  ++ VLAVI+IL V+AG+  RLC GR +  +G  +D+E WV
Sbjct: 21  IDPPSQDQPSHNSDHRSIETLVVVLAVITILSVLAGVFARLCGGRHL-SHGGDHDIEGWV 80

Query: 68  EKKCASCLDG-----SLDPPPPPPHLRRPPPLEAVPVAEP 102
           E+KC SC+D      S  P PPPP    PPP  A   ++P
Sbjct: 81  ERKCRSCIDAGIPAVSAAPSPPPP----PPPATAEERSKP 115

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023530142.11.2e-4880.30uncharacterized protein LOC111792788 [Cucurbita pepo subsp. pepo][more]
KAG6588645.11.5e-4880.30hypothetical protein SDJN03_17210, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG6571994.14.5e-4880.92hypothetical protein SDJN03_28722, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7011671.15.8e-4879.70hypothetical protein SDJN02_26577, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022989510.11.0e-4780.30uncharacterized protein LOC111486567 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1JMK04.8e-4880.30uncharacterized protein LOC111486567 OS=Cucurbita maxima OX=3661 GN=LOC111486567... [more]
A0A6J1ELS66.3e-4879.55uncharacterized protein LOC111434531 OS=Cucurbita moschata OX=3662 GN=LOC1114345... [more]
A0A6J1GL661.4e-4780.62uncharacterized protein LOC111455383 OS=Cucurbita moschata OX=3662 GN=LOC1114553... [more]
A0A6J1IBA81.8e-4779.69uncharacterized protein LOC111470940 OS=Cucurbita maxima OX=3661 GN=LOC111470940... [more]
A0A0A0K6N41.7e-4575.56Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G074850 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G26520.13.1e-1540.31unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G57500.11.2e-1142.00unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 77..98
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 75..126
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..21
NoneNo IPR availablePANTHERPTHR33429OS02G0708000 PROTEIN-RELATEDcoord: 1..103
NoneNo IPR availablePANTHERPTHR33429:SF2OS01G0888850 PROTEINcoord: 1..103

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0009391.1Tan0009391.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane