Tan0001449 (gene) Snake gourd v1

Overview
NameTan0001449
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG11: 21630081 .. 21630485 (+)
RNA-Seq ExpressionTan0001449
SyntenyTan0001449
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATATCATTGCAACCCCATCAGCTCAAAGCATGAGGAGGCTATGGAGAAGAAGAAGATACCAAAGGCTAGGAAGTAGCACTGGCCCTGTAAGATCAAGGAGCTTCAGGCTTGGGCGACTGTGGCGGATGAGACGTCGAGCTTCAAAATTGCAGTTGAAAATGGCGTCGCCGTTGAAGCTTCTGGCGAAGTTTCACGACGCATATGTCGAGATGATGATGAGGGTGGCCAACAGTGTTGGGAACATGTATGCCATTGGAGCCTTTGGCAATGGGAAGAGGATACCAAAACCTCATAATCAAGTCTCTCTGGTTTCTTGTGGTGGGGAACAAGTTGATGCCAAGTTGGTCTTGGAAGTCTATAACAAACTTGCTGCTTCCAAGAACTTGGGTGCTGCTTTTTGA

mRNA sequence

ATGGATATCATTGCAACCCCATCAGCTCAAAGCATGAGGAGGCTATGGAGAAGAAGAAGATACCAAAGGCTAGGAAGTAGCACTGGCCCTGTAAGATCAAGGAGCTTCAGGCTTGGGCGACTGTGGCGGATGAGACGTCGAGCTTCAAAATTGCAGTTGAAAATGGCGTCGCCGTTGAAGCTTCTGGCGAAGTTTCACGACGCATATGTCGAGATGATGATGAGGGTGGCCAACAGTGTTGGGAACATGTATGCCATTGGAGCCTTTGGCAATGGGAAGAGGATACCAAAACCTCATAATCAAGTCTCTCTGGTTTCTTGTGGTGGGGAACAAGTTGATGCCAAGTTGGTCTTGGAAGTCTATAACAAACTTGCTGCTTCCAAGAACTTGGGTGCTGCTTTTTGA

Coding sequence (CDS)

ATGGATATCATTGCAACCCCATCAGCTCAAAGCATGAGGAGGCTATGGAGAAGAAGAAGATACCAAAGGCTAGGAAGTAGCACTGGCCCTGTAAGATCAAGGAGCTTCAGGCTTGGGCGACTGTGGCGGATGAGACGTCGAGCTTCAAAATTGCAGTTGAAAATGGCGTCGCCGTTGAAGCTTCTGGCGAAGTTTCACGACGCATATGTCGAGATGATGATGAGGGTGGCCAACAGTGTTGGGAACATGTATGCCATTGGAGCCTTTGGCAATGGGAAGAGGATACCAAAACCTCATAATCAAGTCTCTCTGGTTTCTTGTGGTGGGGAACAAGTTGATGCCAAGTTGGTCTTGGAAGTCTATAACAAACTTGCTGCTTCCAAGAACTTGGGTGCTGCTTTTTGA

Protein sequence

MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLKLLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEVYNKLAASKNLGAAF
Homology
BLAST of Tan0001449 vs. NCBI nr
Match: KAG7016737.1 (hypothetical protein SDJN02_21847, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 225.3 bits (573), Expect = 3.1e-55
Identity = 112/130 (86.15%), Postives = 121/130 (93.08%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           MDIIATPSA S++RLWRRRRYQRLGS TG  RSRSFRLGRLWRMRRRA KL+LKMASPLK
Sbjct: 1   MDIIATPSALSLKRLWRRRRYQRLGSGTGAARSRSFRLGRLWRMRRRAPKLRLKMASPLK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEV 120
           +LA+FHDAYV+MM RVAN+VGNMYAIG FGNGKRIPKP NQ+SLVSCGGEQVDAKLVLE+
Sbjct: 61  VLARFHDAYVKMMTRVANNVGNMYAIGGFGNGKRIPKPRNQISLVSCGGEQVDAKLVLEI 120

Query: 121 YNKLAASKNL 131
           YNKLA SKNL
Sbjct: 121 YNKLAVSKNL 130

BLAST of Tan0001449 vs. NCBI nr
Match: KAG6579224.1 (hypothetical protein SDJN03_23672, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 223.0 bits (567), Expect = 1.5e-54
Identity = 111/130 (85.38%), Postives = 120/130 (92.31%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           MDIIATPSA S++RLWRRRRYQRLGS  G  RSRSFRLGRLWRMRRRA KL+LKMASPLK
Sbjct: 1   MDIIATPSALSLKRLWRRRRYQRLGSGMGAARSRSFRLGRLWRMRRRAPKLRLKMASPLK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEV 120
           +LA+FHDAYV+MM RVAN+VGNMYAIG FGNGKRIPKP NQ+SLVSCGGEQVDAKLVLE+
Sbjct: 61  VLARFHDAYVKMMTRVANNVGNMYAIGGFGNGKRIPKPRNQISLVSCGGEQVDAKLVLEI 120

Query: 121 YNKLAASKNL 131
           YNKLA SKNL
Sbjct: 121 YNKLAVSKNL 130

BLAST of Tan0001449 vs. NCBI nr
Match: XP_022939174.1 (uncharacterized protein LOC111445168 [Cucurbita moschata])

HSP 1 Score: 221.1 bits (562), Expect = 5.8e-54
Identity = 112/133 (84.21%), Postives = 118/133 (88.72%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           MDIIATPSA S++RLWRRRRYQRLGS TG  RSRSF LGRLWRMRRRA KL+LKMA P K
Sbjct: 1   MDIIATPSALSLKRLWRRRRYQRLGSGTGAARSRSFHLGRLWRMRRRAPKLRLKMALPFK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEV 120
            LA+FHDAYV+MM RVAN VGNMYAIG FGNGKRIPKP NQVSLVSCGGEQVDAKLVLE+
Sbjct: 61  ALARFHDAYVKMMTRVANKVGNMYAIGGFGNGKRIPKPRNQVSLVSCGGEQVDAKLVLEM 120

Query: 121 YNKLAASKNLGAA 134
           YNKLAASKNL  A
Sbjct: 121 YNKLAASKNLSVA 133

BLAST of Tan0001449 vs. NCBI nr
Match: XP_022993314.1 (uncharacterized protein LOC111489358 [Cucurbita maxima])

HSP 1 Score: 221.1 bits (562), Expect = 5.8e-54
Identity = 111/133 (83.46%), Postives = 120/133 (90.23%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           MDIIATPSA S++RLWRRRRYQRLGS TG  RSRSFRLGRLWR+RRRA KL+LKM SP K
Sbjct: 1   MDIIATPSALSLKRLWRRRRYQRLGSGTGAARSRSFRLGRLWRIRRRAPKLRLKMVSPFK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEV 120
            LA+FHDAYV+MMMRVANSVG+MYAIG +GNGKRIPKP NQVSLVSCGGEQVD KLVLE+
Sbjct: 61  ALARFHDAYVKMMMRVANSVGSMYAIGGYGNGKRIPKPSNQVSLVSCGGEQVDTKLVLEM 120

Query: 121 YNKLAASKNLGAA 134
           YNKLAASKNL  A
Sbjct: 121 YNKLAASKNLSVA 133

BLAST of Tan0001449 vs. NCBI nr
Match: XP_022152306.1 (uncharacterized protein LOC111020054 [Momordica charantia])

HSP 1 Score: 220.7 bits (561), Expect = 7.6e-54
Identity = 113/137 (82.48%), Postives = 123/137 (89.78%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           M+IIATPS++S+RR WRRR+YQRLGSS GP RSRSFRLGRLWRMRR A +L+ KMASPLK
Sbjct: 1   MEIIATPSSESIRRTWRRRKYQRLGSSNGPTRSRSFRLGRLWRMRRGAPRLRFKMASPLK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNG---KRIPKPHNQVSLVSCGGEQVDAKLV 120
           LLA+FHDAYVEMMMRVAN VGNMYAIGAFG G   + IPKP NQVSLVSCGGEQVDAKLV
Sbjct: 61  LLARFHDAYVEMMMRVANGVGNMYAIGAFGGGGKRRMIPKPGNQVSLVSCGGEQVDAKLV 120

Query: 121 LEVYNKLAASKNLGAAF 135
           LE+YNKLAASKNL AAF
Sbjct: 121 LEIYNKLAASKNLAAAF 137

BLAST of Tan0001449 vs. ExPASy TrEMBL
Match: A0A6J1JZV2 (uncharacterized protein LOC111489358 OS=Cucurbita maxima OX=3661 GN=LOC111489358 PE=4 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 2.8e-54
Identity = 111/133 (83.46%), Postives = 120/133 (90.23%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           MDIIATPSA S++RLWRRRRYQRLGS TG  RSRSFRLGRLWR+RRRA KL+LKM SP K
Sbjct: 1   MDIIATPSALSLKRLWRRRRYQRLGSGTGAARSRSFRLGRLWRIRRRAPKLRLKMVSPFK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEV 120
            LA+FHDAYV+MMMRVANSVG+MYAIG +GNGKRIPKP NQVSLVSCGGEQVD KLVLE+
Sbjct: 61  ALARFHDAYVKMMMRVANSVGSMYAIGGYGNGKRIPKPSNQVSLVSCGGEQVDTKLVLEM 120

Query: 121 YNKLAASKNLGAA 134
           YNKLAASKNL  A
Sbjct: 121 YNKLAASKNLSVA 133

BLAST of Tan0001449 vs. ExPASy TrEMBL
Match: A0A6J1FGC7 (uncharacterized protein LOC111445168 OS=Cucurbita moschata OX=3662 GN=LOC111445168 PE=4 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 2.8e-54
Identity = 112/133 (84.21%), Postives = 118/133 (88.72%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           MDIIATPSA S++RLWRRRRYQRLGS TG  RSRSF LGRLWRMRRRA KL+LKMA P K
Sbjct: 1   MDIIATPSALSLKRLWRRRRYQRLGSGTGAARSRSFHLGRLWRMRRRAPKLRLKMALPFK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEV 120
            LA+FHDAYV+MM RVAN VGNMYAIG FGNGKRIPKP NQVSLVSCGGEQVDAKLVLE+
Sbjct: 61  ALARFHDAYVKMMTRVANKVGNMYAIGGFGNGKRIPKPRNQVSLVSCGGEQVDAKLVLEM 120

Query: 121 YNKLAASKNLGAA 134
           YNKLAASKNL  A
Sbjct: 121 YNKLAASKNLSVA 133

BLAST of Tan0001449 vs. ExPASy TrEMBL
Match: A0A6J1DHB4 (uncharacterized protein LOC111020054 OS=Momordica charantia OX=3673 GN=LOC111020054 PE=4 SV=1)

HSP 1 Score: 220.7 bits (561), Expect = 3.7e-54
Identity = 113/137 (82.48%), Postives = 123/137 (89.78%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           M+IIATPS++S+RR WRRR+YQRLGSS GP RSRSFRLGRLWRMRR A +L+ KMASPLK
Sbjct: 1   MEIIATPSSESIRRTWRRRKYQRLGSSNGPTRSRSFRLGRLWRMRRGAPRLRFKMASPLK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNG---KRIPKPHNQVSLVSCGGEQVDAKLV 120
           LLA+FHDAYVEMMMRVAN VGNMYAIGAFG G   + IPKP NQVSLVSCGGEQVDAKLV
Sbjct: 61  LLARFHDAYVEMMMRVANGVGNMYAIGAFGGGGKRRMIPKPGNQVSLVSCGGEQVDAKLV 120

Query: 121 LEVYNKLAASKNLGAAF 135
           LE+YNKLAASKNL AAF
Sbjct: 121 LEIYNKLAASKNLAAAF 137

BLAST of Tan0001449 vs. ExPASy TrEMBL
Match: A0A6J1H4C2 (uncharacterized protein LOC111460357 OS=Cucurbita moschata OX=3662 GN=LOC111460357 PE=4 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 6.9e-53
Identity = 109/134 (81.34%), Postives = 117/134 (87.31%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRLGSSTGPVRSRSFRLGRLWRMRRRASKLQLKMASPLK 60
           MDIIATPSAQ MRRLWRRR YQ+LGS T P RSRSFRL RLW MRR+ASK+QLK+ASPLK
Sbjct: 1   MDIIATPSAQGMRRLWRRRGYQKLGSDTRPTRSRSFRLRRLWHMRRQASKVQLKIASPLK 60

Query: 61  LLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEV 120
           LLAKFHD YVEMMM+VANSVGN+YAIGAFGNGKRIPKP NQVSL +C GE VDAK VLE+
Sbjct: 61  LLAKFHDTYVEMMMKVANSVGNIYAIGAFGNGKRIPKPRNQVSLATCDGEHVDAKFVLEI 120

Query: 121 YNKLAASKNLGAAF 135
           YNKLA SKNL   F
Sbjct: 121 YNKLAPSKNLANGF 134

BLAST of Tan0001449 vs. ExPASy TrEMBL
Match: A0A5D3BKB9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728G00020 PE=4 SV=1)

HSP 1 Score: 183.0 bits (463), Expect = 8.5e-43
Identity = 102/133 (76.69%), Postives = 115/133 (86.47%), Query Frame = 0

Query: 1   MDIIATPSAQSMRRLWRRRRYQRL--GSSTGPVRS-RSFRLGRLWRMRRRAS-KLQLKMA 60
           MDIIAT SAQ++RRLWRRR YQRL  GS+T P RS RSFR+GR+  MRRRAS KL+LKM+
Sbjct: 1   MDIIATSSAQNLRRLWRRRTYQRLRSGSNTVPTRSLRSFRVGRMMIMRRRASPKLRLKMS 60

Query: 61  SPLKLLAKFHDAYVEMMMRVANSVGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKL 120
           SPLK++AK HDAYVEMMMR+ANSVGNMYAIG FGN KRIPKP NQV L   GGEQ+DAKL
Sbjct: 61  SPLKVVAKIHDAYVEMMMRLANSVGNMYAIGGFGNRKRIPKPQNQVPL---GGEQIDAKL 120

Query: 121 VLEVYNKLAASKN 130
           VLE+YNKLA+SKN
Sbjct: 121 VLEIYNKLASSKN 130

BLAST of Tan0001449 vs. TAIR 10
Match: AT2G47485.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G62650.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 52.4 bits (124), Expect = 3.3e-07
Identity = 47/131 (35.88%), Postives = 60/131 (45.80%), Query Frame = 0

Query: 14  RLWRR-RRYQRLGSSTGP----------VRSRSFRLGRLWRMRRRASKLQLKMASPLKLL 73
           R WRR R YQRL  S             V+    R  R WR++       LK ASP KLL
Sbjct: 4   RYWRRLRGYQRLDGSAKKASSGRRNVKRVKMDPTRKRRFWRIKIVPKLRILKKASPKKLL 63

Query: 74  AKFHDAYVEMMMRVANS--VGNMYAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLEV 131
               DAYV MM+R+ANS  VG+ Y  G +G G  +              ++ D K ++E+
Sbjct: 64  TWLRDAYVNMMLRLANSRVVGSSYGYGEYGYGSGL------------ASKEYDEKKLVEI 122

BLAST of Tan0001449 vs. TAIR 10
Match: AT3G62650.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G47485.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 45.1 bits (105), Expect = 5.3e-05
Identity = 40/125 (32.00%), Postives = 59/125 (47.20%), Query Frame = 0

Query: 13  RRLWRR-RRYQRLGSSTGPVRSR---------SFRLGRLWRMRRRASKLQLKMASPLKLL 72
           +R WRR R Y++L  S+     R           R  R WR++       LK ASP K L
Sbjct: 6   KRYWRRWRGYEKLDGSSETTSGRRKGKRVKMDPTRKKRFWRIKIVPKLRILKTASPKKFL 65

Query: 73  AKFHDAYVEMMMRVANS--VGNM-YAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLE 125
               D+YV+MMMR+ANS  VG+  Y    FG+G+                ++ D K+++E
Sbjct: 66  VWLRDSYVKMMMRLANSRVVGSSGYGGSGFGSGQM---------------KEYDEKMLVE 115

BLAST of Tan0001449 vs. TAIR 10
Match: AT3G62650.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G47485.1); Has 57 Blast hits to 57 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 57; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 45.1 bits (105), Expect = 5.3e-05
Identity = 40/125 (32.00%), Postives = 59/125 (47.20%), Query Frame = 0

Query: 13  RRLWRR-RRYQRLGSSTGPVRSR---------SFRLGRLWRMRRRASKLQLKMASPLKLL 72
           +R WRR R Y++L  S+     R           R  R WR++       LK ASP K L
Sbjct: 6   KRYWRRWRGYEKLDGSSETTSGRRKGKRVKMDPTRKKRFWRIKIVPKLRILKTASPKKFL 65

Query: 73  AKFHDAYVEMMMRVANS--VGNM-YAIGAFGNGKRIPKPHNQVSLVSCGGEQVDAKLVLE 125
               D+YV+MMMR+ANS  VG+  Y    FG+G+                ++ D K+++E
Sbjct: 66  VWLRDSYVKMMMRLANSRVVGSSGYGGSGFGSGQM---------------KEYDEKMLVE 115

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG7016737.13.1e-5586.15hypothetical protein SDJN02_21847, partial [Cucurbita argyrosperma subsp. argyro... [more]
KAG6579224.11.5e-5485.38hypothetical protein SDJN03_23672, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022939174.15.8e-5484.21uncharacterized protein LOC111445168 [Cucurbita moschata][more]
XP_022993314.15.8e-5483.46uncharacterized protein LOC111489358 [Cucurbita maxima][more]
XP_022152306.17.6e-5482.48uncharacterized protein LOC111020054 [Momordica charantia][more]
Match NameE-valueIdentityDescription
A0A6J1JZV22.8e-5483.46uncharacterized protein LOC111489358 OS=Cucurbita maxima OX=3661 GN=LOC111489358... [more]
A0A6J1FGC72.8e-5484.21uncharacterized protein LOC111445168 OS=Cucurbita moschata OX=3662 GN=LOC1114451... [more]
A0A6J1DHB43.7e-5482.48uncharacterized protein LOC111020054 OS=Momordica charantia OX=3673 GN=LOC111020... [more]
A0A6J1H4C26.9e-5381.34uncharacterized protein LOC111460357 OS=Cucurbita moschata OX=3662 GN=LOC1114603... [more]
A0A5D3BKB98.5e-4376.69Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
Match NameE-valueIdentityDescription
AT2G47485.13.3e-0735.88unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G62650.25.3e-0532.00unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G62650.15.3e-0532.00unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33702:SF2SUBFAMILY NOT NAMEDcoord: 1..131
NoneNo IPR availablePANTHERPTHR33702BNAA09G40010D PROTEINcoord: 1..131

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0001449.1Tan0001449.1mRNA