Tan0009541 (gene) Snake gourd v1

Overview
NameTan0009541
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTranscription factor bHLH14
LocationLG05: 4150399 .. 4151600 (-)
RNA-Seq ExpressionTan0009541
SyntenyTan0009541
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCATTTCTGTGGGAGAAGAGCGAGTCATGGAGATGGATAGTGAGGAAAACCAGAGACTCAAAGCCATTCTTCTTGGCCTTCGCCACCGTCTGCGGCGTCGTTCCCGGCGTCATCGGCTACTGCGTCATGCAGGCCACCAATTCCCGCAACGAGCAACTCGAAGCCCGCCTCCGCCAGAATGCTAGACCTGAATCTCTCGTATGTTCCTTTCCCTCTTCTTCAGTTCCGTTCCGTTCCTAAAACCCTATTATTGTCGCAATTTTGTTCCACATATCCCTGATTGAGTTTCTTTAATTGAGAATTTCTTGTTCCGTCGGACATTTCGCGACTTGGCGATCGCGTAAATTTCGATGTGCTGCTCTACTGCGATGATTATAATCTGTTTATCCCATGGACTTACCTGGTTATTCGATTGAAAACTTGGGATTTTTTTTTTTTAAAAAAATTGTTCTTGAGTTGACTGCACCATTCGAATCTGTTTTCCCTGATCAGACCTCTAATTTCCATGGTGTTAGTGTTAACACTGAGAAGGAATTTTTCCACATAAGCACTCCTTTACACTTGTTTTCATACAAAAAGATATATCAGACGGAGTAAAGCCCTTTGCTTGTGACATAAATTTCACTTGCATTCTATTTTGCCATTTGAAATTTCGCCTCATTCCATACAAGAAGAATACAACATTCAGTTCTTTTATTCAGGCCAGTTAATCTCTTCTGCACGTTGTCATTTGAGCTGGCATTGAACTTTCTGGCTGGTGCTTTGCTTTCATGTGTTTTCACCTCTTTTCGTACCATTTTTTTATTAACTGCTTCCCTCTTTTGAGATTTTGCTCGTCTGACCACCAATGCTGCTCTTATAAAGCTATATTTTGTGAATTCCTGGGACTGTCAGAACTTGTATGGTTCCATATTTGTCTGAAGCAGTAGGTTGAATCTTACAATCAAGTTCATCAGGTAATTTGCTTCTCAATGTTTGTGGTTTTGTTGTTACTTGATATTGAAATTCAGATGATGGGGAAAGTAAATCGAGAGAGACTGGCAGAATATCTAGGTGAGTTGCAGAGGAAAGAGGACACAAACGATCGGTATGTTGCTGCTTTGAGAGGAGAGACATTGACAAGGAAGCCGTATGTGAGAATTCAACCAATCCCAAATCAAAGCAACGATGTTGAACAGATCAAGAAGGAAAATAAATAA

mRNA sequence

ATGTCATTTCTGTGGGAGAAGAGCGAGTCATGGAGATGGATAGTGAGGAAAACCAGAGACTCAAAGCCATTCTTCTTGGCCTTCGCCACCGTCTGCGGCGTCGTTCCCGGCGTCATCGGCTACTGCGTCATGCAGGCCACCAATTCCCGCAACGAGCAACTCGAAGCCCGCCTCCGCCAGAATGCTAGACCTGAATCTCTCATGATGGGGAAAGTAAATCGAGAGAGACTGGCAGAATATCTAGGTGAGTTGCAGAGGAAAGAGGACACAAACGATCGGTATGTTGCTGCTTTGAGAGGAGAGACATTGACAAGGAAGCCGTATGTGAGAATTCAACCAATCCCAAATCAAAGCAACGATGTTGAACAGATCAAGAAGGAAAATAAATAA

Coding sequence (CDS)

ATGTCATTTCTGTGGGAGAAGAGCGAGTCATGGAGATGGATAGTGAGGAAAACCAGAGACTCAAAGCCATTCTTCTTGGCCTTCGCCACCGTCTGCGGCGTCGTTCCCGGCGTCATCGGCTACTGCGTCATGCAGGCCACCAATTCCCGCAACGAGCAACTCGAAGCCCGCCTCCGCCAGAATGCTAGACCTGAATCTCTCATGATGGGGAAAGTAAATCGAGAGAGACTGGCAGAATATCTAGGTGAGTTGCAGAGGAAAGAGGACACAAACGATCGGTATGTTGCTGCTTTGAGAGGAGAGACATTGACAAGGAAGCCGTATGTGAGAATTCAACCAATCCCAAATCAAAGCAACGATGTTGAACAGATCAAGAAGGAAAATAAATAA

Protein sequence

MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQNARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPNQSNDVEQIKKENK
Homology
BLAST of Tan0009541 vs. NCBI nr
Match: XP_023007266.1 (uncharacterized protein LOC111499804 [Cucurbita maxima])

HSP 1 Score: 248.1 bits (632), Expect = 4.3e-62
Identity = 124/133 (93.23%), Postives = 129/133 (96.99%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ
Sbjct: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPNQSND 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIP+QSN+
Sbjct: 61  NARPESLMMGRVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPSQSNE 120

Query: 121 V----EQIKKENK 130
           V    +Q+KKENK
Sbjct: 121 VADKNQQVKKENK 133

BLAST of Tan0009541 vs. NCBI nr
Match: XP_022947349.1 (uncharacterized protein LOC111451237 [Cucurbita moschata])

HSP 1 Score: 245.4 bits (625), Expect = 2.8e-61
Identity = 125/134 (93.28%), Postives = 129/134 (96.27%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ
Sbjct: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPI-PNQSN 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPI PNQSN
Sbjct: 61  NARPESLMMGRVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPPNQSN 120

Query: 121 DV----EQIKKENK 130
           +V    +Q+KKENK
Sbjct: 121 EVADKNQQVKKENK 134

BLAST of Tan0009541 vs. NCBI nr
Match: KAG7035319.1 (hypothetical protein SDJN02_02114, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 245.4 bits (625), Expect = 2.8e-61
Identity = 125/134 (93.28%), Postives = 129/134 (96.27%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ
Sbjct: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIP-NQSN 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIP NQSN
Sbjct: 61  NARPESLMMGRVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPTNQSN 120

Query: 121 DV----EQIKKENK 130
           +V    +Q+KKENK
Sbjct: 121 EVADKNQQVKKENK 134

BLAST of Tan0009541 vs. NCBI nr
Match: XP_023533278.1 (uncharacterized protein LOC111795217 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 243.4 bits (620), Expect = 1.1e-60
Identity = 124/134 (92.54%), Postives = 128/134 (95.52%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ
Sbjct: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPI-PNQSN 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPI PNQSN
Sbjct: 61  NARPESLMMGRVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPPNQSN 120

Query: 121 DV----EQIKKENK 130
           +V    +Q+K ENK
Sbjct: 121 EVADKNQQVKNENK 134

BLAST of Tan0009541 vs. NCBI nr
Match: XP_022149659.1 (uncharacterized protein LOC111018036 [Momordica charantia])

HSP 1 Score: 239.2 bits (609), Expect = 2.0e-59
Identity = 118/133 (88.72%), Postives = 125/133 (93.98%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSFLWEKSE+WRW+VRKTRDSKPFF AFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ
Sbjct: 1   MSFLWEKSETWRWVVRKTRDSKPFFFAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPNQSND 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAAL+GETLTRKPYVRIQPIP + ND
Sbjct: 61  NARPESLMMGQVNRERLAEYLGELQRKEDTNDRYVAALKGETLTRKPYVRIQPIPTEGND 120

Query: 121 V----EQIKKENK 130
                ++IKKENK
Sbjct: 121 AADKEQKIKKENK 133

BLAST of Tan0009541 vs. ExPASy TrEMBL
Match: A0A6J1L027 (uncharacterized protein LOC111499804 OS=Cucurbita maxima OX=3661 GN=LOC111499804 PE=4 SV=1)

HSP 1 Score: 248.1 bits (632), Expect = 2.1e-62
Identity = 124/133 (93.23%), Postives = 129/133 (96.99%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ
Sbjct: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPNQSND 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIP+QSN+
Sbjct: 61  NARPESLMMGRVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPSQSNE 120

Query: 121 V----EQIKKENK 130
           V    +Q+KKENK
Sbjct: 121 VADKNQQVKKENK 133

BLAST of Tan0009541 vs. ExPASy TrEMBL
Match: A0A6J1G6J1 (uncharacterized protein LOC111451237 OS=Cucurbita moschata OX=3662 GN=LOC111451237 PE=4 SV=1)

HSP 1 Score: 245.4 bits (625), Expect = 1.3e-61
Identity = 125/134 (93.28%), Postives = 129/134 (96.27%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ
Sbjct: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPI-PNQSN 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPI PNQSN
Sbjct: 61  NARPESLMMGRVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPPNQSN 120

Query: 121 DV----EQIKKENK 130
           +V    +Q+KKENK
Sbjct: 121 EVADKNQQVKKENK 134

BLAST of Tan0009541 vs. ExPASy TrEMBL
Match: A0A6J1D7P4 (uncharacterized protein LOC111018036 OS=Momordica charantia OX=3673 GN=LOC111018036 PE=4 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 9.6e-60
Identity = 118/133 (88.72%), Postives = 125/133 (93.98%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSFLWEKSE+WRW+VRKTRDSKPFF AFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ
Sbjct: 1   MSFLWEKSETWRWVVRKTRDSKPFFFAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPNQSND 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAAL+GETLTRKPYVRIQPIP + ND
Sbjct: 61  NARPESLMMGQVNRERLAEYLGELQRKEDTNDRYVAALKGETLTRKPYVRIQPIPTEGND 120

Query: 121 V----EQIKKENK 130
                ++IKKENK
Sbjct: 121 AADKEQKIKKENK 133

BLAST of Tan0009541 vs. ExPASy TrEMBL
Match: A0A0A0KEE2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G135430 PE=4 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 1.5e-57
Identity = 115/133 (86.47%), Postives = 122/133 (91.73%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MS LWEKSE+WRW+VRKTRDSK FF  FATVCG+VPG+IGYCVMQATNS NEQLEARLRQ
Sbjct: 1   MSILWEKSETWRWVVRKTRDSKSFFFTFATVCGLVPGLIGYCVMQATNSTNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPNQSND 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAAL G+TLTRKPYVRIQPIPNQSND
Sbjct: 61  NARPESLMMGQVNRERLAEYLGELQRKEDTNDRYVAALEGKTLTRKPYVRIQPIPNQSND 120

Query: 121 V----EQIKKENK 130
                +QIKKENK
Sbjct: 121 ATVKEQQIKKENK 133

BLAST of Tan0009541 vs. ExPASy TrEMBL
Match: A0A5A7UJD3 (Transcription factor bHLH14 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold24G005450 PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 6.4e-56
Identity = 113/133 (84.96%), Postives = 120/133 (90.23%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MS LWEKSE+WRW+VRKTRDSK FF  FAT CG+VP +IGYCVMQATNS NEQLEARLRQ
Sbjct: 1   MSILWEKSETWRWLVRKTRDSKSFFFTFATACGLVPCLIGYCVMQATNSTNEQLEARLRQ 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPNQSND 120
           NARPESLMMG+VNRERLAEYLGELQRKEDTNDRYVAAL G+TLTRKPYVRIQPIPNQSND
Sbjct: 61  NARPESLMMGQVNRERLAEYLGELQRKEDTNDRYVAALEGKTLTRKPYVRIQPIPNQSND 120

Query: 121 V----EQIKKENK 130
                +QIKKENK
Sbjct: 121 ATVKEQQIKKENK 133

BLAST of Tan0009541 vs. TAIR 10
Match: AT1G79390.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 187.2 bits (474), Expect = 8.3e-48
Identity = 88/121 (72.73%), Postives = 106/121 (87.60%), Query Frame = 0

Query: 1   MSFLWEKSESWRWIVRKTRDSKPFFLAFATVCGVVPGVIGYCVMQATNSRNEQLEARLRQ 60
           MSF++EKS +WRW+V KTRDS+ FF  FA +CGV+PGVIGY VMQ TNS N +LEARLR+
Sbjct: 1   MSFMYEKSNTWRWLVMKTRDSRSFFFTFAALCGVIPGVIGYGVMQVTNSSNPELEARLRK 60

Query: 61  NARPESLMMGKVNRERLAEYLGELQRKEDTNDRYVAALRGETLTRKPYVRIQPIPNQSND 120
           +ARP++LMMGKVN+ERLAEYLGEL++K+DTNDRYVAALRGETLTRKPY RIQP+P   + 
Sbjct: 61  SARPDTLMMGKVNQERLAEYLGELKQKQDTNDRYVAALRGETLTRKPYQRIQPMPKPDDT 120

Query: 121 V 122
           V
Sbjct: 121 V 121

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023007266.14.3e-6293.23uncharacterized protein LOC111499804 [Cucurbita maxima][more]
XP_022947349.12.8e-6193.28uncharacterized protein LOC111451237 [Cucurbita moschata][more]
KAG7035319.12.8e-6193.28hypothetical protein SDJN02_02114, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_023533278.11.1e-6092.54uncharacterized protein LOC111795217 [Cucurbita pepo subsp. pepo][more]
XP_022149659.12.0e-5988.72uncharacterized protein LOC111018036 [Momordica charantia][more]
Match NameE-valueIdentityDescription
A0A6J1L0272.1e-6293.23uncharacterized protein LOC111499804 OS=Cucurbita maxima OX=3661 GN=LOC111499804... [more]
A0A6J1G6J11.3e-6193.28uncharacterized protein LOC111451237 OS=Cucurbita moschata OX=3662 GN=LOC1114512... [more]
A0A6J1D7P49.6e-6088.72uncharacterized protein LOC111018036 OS=Momordica charantia OX=3673 GN=LOC111018... [more]
A0A0A0KEE21.5e-5786.47Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G135430 PE=4 SV=1[more]
A0A5A7UJD36.4e-5684.96Transcription factor bHLH14 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaf... [more]
Match NameE-valueIdentityDescription
AT1G79390.18.3e-4872.73unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR36338OS02G0495900 PROTEINcoord: 1..126

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0009541.1Tan0009541.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane