Tan0014923 (gene) Snake gourd v1

Overview
NameTan0014923
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtochlorophyllide reductase
LocationLG02: 3766109 .. 3766854 (+)
RNA-Seq ExpressionTan0014923
SyntenyTan0014923
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATATATACACATAAAATAAAAATAAAAATAAAAATAAAAAGTCCTTAAAAAGGCAACCATAAACCTACATGATAAACACCACCAACTACAATATCAAAACCTCCCATTATTAAACTTCCATCTCCATAGCTTTCTTCTCCAAATTATCCTCTCTCTTTCTCAAGAATTCTCAAGAATCAGATGTGGCGGTTGGCGGCGGCGGTGAGAAGAAACCTCCACAACATAAGGAAAAGCCCACGAGTGGCAGATGAAAGCATGTACGGCGGAGGCGGCGGAGTGGACGGCGGCGGAAGGGTCGTCGACGGCGACCGGAGAGAGGGGAATTCATATGGGTTTTCGATGATTTACAGTGTTCTTCGAACCCCATTTTCTCTTCTCTCTTGCTTCTCTCAACCTCACGTTAATGGCGTCGACGGGATGTGGGTCTCCGGCGAGTTCGCCAGAATATCGGAAGTGAATCATCTTATGGTAAGCGACAGCATGCGATATGCAATCTTAATGTAAATGGGTTTCTGTGGTTTCGATTGGATCTGGTTTTCAAAGAGACAAATCGATGGTACAGATTATAATTATTATACAGCCATTGGGACATACTTTGGATTGGAAATTTTCTCTTTTTCCCCTTTTTCATTATATTGTATTACAATTATCGGGTAGGAATTCGAATCTTCAATCGAAATGTTACTCCTGCTTTATCGTTAAGTTATGTTCAGATTGGTGTTTGAATTAGACTTGAGTGCCCA

mRNA sequence

AAAATATATACACATAAAATAAAAATAAAAATAAAAATAAAAAGTCCTTAAAAAGGCAACCATAAACCTACATGATAAACACCACCAACTACAATATCAAAACCTCCCATTATTAAACTTCCATCTCCATAGCTTTCTTCTCCAAATTATCCTCTCTCTTTCTCAAGAATTCTCAAGAATCAGATGTGGCGGTTGGCGGCGGCGGTGAGAAGAAACCTCCACAACATAAGGAAAAGCCCACGAGTGGCAGATGAAAGCATGTACGGCGGAGGCGGCGGAGTGGACGGCGGCGGAAGGGTCGTCGACGGCGACCGGAGAGAGGGGAATTCATATGGGTTTTCGATGATTTACAGTGTTCTTCGAACCCCATTTTCTCTTCTCTCTTGCTTCTCTCAACCTCACGTTAATGGCGTCGACGGGATGTGGGTCTCCGGCGAGTTCGCCAGAATATCGGAAGTGAATCATCTTATGGTAAGCGACAGCATGCGATATGCAATCTTAATGTAAATGGGTTTCTGTGGTTTCGATTGGATCTGGTTTTCAAAGAGACAAATCGATGGTACAGATTATAATTATTATACAGCCATTGGGACATACTTTGGATTGGAAATTTTCTCTTTTTCCCCTTTTTCATTATATTGTATTACAATTATCGGGTAGGAATTCGAATCTTCAATCGAAATGTTACTCCTGCTTTATCGTTAAGTTATGTTCAGATTGGTGTTTGAATTAGACTTGAGTGCCCA

Coding sequence (CDS)

ATGTGGCGGTTGGCGGCGGCGGTGAGAAGAAACCTCCACAACATAAGGAAAAGCCCACGAGTGGCAGATGAAAGCATGTACGGCGGAGGCGGCGGAGTGGACGGCGGCGGAAGGGTCGTCGACGGCGACCGGAGAGAGGGGAATTCATATGGGTTTTCGATGATTTACAGTGTTCTTCGAACCCCATTTTCTCTTCTCTCTTGCTTCTCTCAACCTCACGTTAATGGCGTCGACGGGATGTGGGTCTCCGGCGAGTTCGCCAGAATATCGGAAGTGAATCATCTTATGGTAAGCGACAGCATGCGATATGCAATCTTAATGTAA

Protein sequence

MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGRVVDGDRREGNSYGFSMIYSVLRTPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM
Homology
BLAST of Tan0014923 vs. NCBI nr
Match: KAG6572248.1 (hypothetical protein SDJN03_28976, partial [Cucurbita argyrosperma subsp. sororia] >KAG7011878.1 hypothetical protein SDJN02_26785, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 205.7 bits (522), Expect = 2.0e-49
Identity = 98/107 (91.59%), Postives = 103/107 (96.26%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGRVVDGDRREGNSYGFSMIYSVLR 60
           MWRLAAAVRRNLHNIRKSPRVADESMY  GGG DGGGRV+DG+RR+GNSYGFSMIY++LR
Sbjct: 1   MWRLAAAVRRNLHNIRKSPRVADESMYSAGGGADGGGRVIDGNRRDGNSYGFSMIYTLLR 60

Query: 61  TPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           TPFSLLSCFSQPHVNGVDGMWVSGEF RISEVNHLMVSDSMRYAILM
Sbjct: 61  TPFSLLSCFSQPHVNGVDGMWVSGEFPRISEVNHLMVSDSMRYAILM 107

BLAST of Tan0014923 vs. NCBI nr
Match: XP_038888385.1 (uncharacterized protein LOC120078235 [Benincasa hispida])

HSP 1 Score: 205.3 bits (521), Expect = 2.6e-49
Identity = 104/112 (92.86%), Postives = 106/112 (94.64%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYG--GGGGVDGGGR---VVDGDRREGNSYGFSMI 60
           MWRLAAAVRRNLHNIRKSPRVADESMYG  GGGGVDGGGR   VVDGDRREGNSYGFSMI
Sbjct: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGAGGGGVDGGGRVAAVVDGDRREGNSYGFSMI 60

Query: 61  YSVLRTPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           YS+LRTPFS+LSCFSQPHVNGVDGMWVSGEF RISEVNHLMVSDSMRYAILM
Sbjct: 61  YSLLRTPFSILSCFSQPHVNGVDGMWVSGEFPRISEVNHLMVSDSMRYAILM 112

BLAST of Tan0014923 vs. NCBI nr
Match: XP_022969305.1 (uncharacterized protein LOC111468352 [Cucurbita maxima])

HSP 1 Score: 202.2 bits (513), Expect = 2.2e-48
Identity = 97/107 (90.65%), Postives = 101/107 (94.39%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGRVVDGDRREGNSYGFSMIYSVLR 60
           MWRLAAAVRRNLHNIRKSPRVADESMY  GGG DGG  V+DG+RREGNSYGFSMIY++LR
Sbjct: 1   MWRLAAAVRRNLHNIRKSPRVADESMYSAGGGADGGATVIDGNRREGNSYGFSMIYTLLR 60

Query: 61  TPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           TPFSLLSCFSQPHVNGVDGMWVSGEF RISEVNHLMVSDSMRYAILM
Sbjct: 61  TPFSLLSCFSQPHVNGVDGMWVSGEFPRISEVNHLMVSDSMRYAILM 107

BLAST of Tan0014923 vs. NCBI nr
Match: XP_031744820.1 (uncharacterized protein LOC101211016 [Cucumis sativus] >KAE8645720.1 hypothetical protein Csa_020583 [Cucumis sativus])

HSP 1 Score: 201.8 bits (512), Expect = 2.9e-48
Identity = 102/110 (92.73%), Postives = 104/110 (94.55%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGR---VVDGDRREGNSYGFSMIYS 60
           MWRLAAAVRRNLHNIRKSPRVADESMY  GGGVDGGGR   VVDGDRREGNSYGFSMIYS
Sbjct: 1   MWRLAAAVRRNLHNIRKSPRVADESMY--GGGVDGGGRVGAVVDGDRREGNSYGFSMIYS 60

Query: 61  VLRTPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           +LRTPFS+LSCFSQPHVNGVDGMWVSGEF RISEVNHLMVSDSMRYAILM
Sbjct: 61  LLRTPFSILSCFSQPHVNGVDGMWVSGEFPRISEVNHLMVSDSMRYAILM 108

BLAST of Tan0014923 vs. NCBI nr
Match: KAA0059153.1 (uncharacterized protein E6C27_scaffold430G00710 [Cucumis melo var. makuwa] >TYK02535.1 uncharacterized protein E5676_scaffold201G00140 [Cucumis melo var. makuwa])

HSP 1 Score: 201.4 bits (511), Expect = 3.8e-48
Identity = 102/111 (91.89%), Postives = 104/111 (93.69%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGR----VVDGDRREGNSYGFSMIY 60
           MWRLAAAVRRNLHNIRKSPRVADESMY  GGGVDGGGR    VVDGDRREGNSYGFSMIY
Sbjct: 1   MWRLAAAVRRNLHNIRKSPRVADESMY--GGGVDGGGRVAAVVVDGDRREGNSYGFSMIY 60

Query: 61  SVLRTPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           S+LRTPFS+LSCFSQPHVNGVDGMWVSGEF RISEVNHLMVSDSMRYAILM
Sbjct: 61  SLLRTPFSILSCFSQPHVNGVDGMWVSGEFPRISEVNHLMVSDSMRYAILM 109

BLAST of Tan0014923 vs. ExPASy TrEMBL
Match: A0A6J1HZK3 (uncharacterized protein LOC111468352 OS=Cucurbita maxima OX=3661 GN=LOC111468352 PE=4 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 1.1e-48
Identity = 97/107 (90.65%), Postives = 101/107 (94.39%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGRVVDGDRREGNSYGFSMIYSVLR 60
           MWRLAAAVRRNLHNIRKSPRVADESMY  GGG DGG  V+DG+RREGNSYGFSMIY++LR
Sbjct: 1   MWRLAAAVRRNLHNIRKSPRVADESMYSAGGGADGGATVIDGNRREGNSYGFSMIYTLLR 60

Query: 61  TPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           TPFSLLSCFSQPHVNGVDGMWVSGEF RISEVNHLMVSDSMRYAILM
Sbjct: 61  TPFSLLSCFSQPHVNGVDGMWVSGEFPRISEVNHLMVSDSMRYAILM 107

BLAST of Tan0014923 vs. ExPASy TrEMBL
Match: A0A5D3BTM8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold201G00140 PE=4 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 1.8e-48
Identity = 102/111 (91.89%), Postives = 104/111 (93.69%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGR----VVDGDRREGNSYGFSMIY 60
           MWRLAAAVRRNLHNIRKSPRVADESMY  GGGVDGGGR    VVDGDRREGNSYGFSMIY
Sbjct: 1   MWRLAAAVRRNLHNIRKSPRVADESMY--GGGVDGGGRVAAVVVDGDRREGNSYGFSMIY 60

Query: 61  SVLRTPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           S+LRTPFS+LSCFSQPHVNGVDGMWVSGEF RISEVNHLMVSDSMRYAILM
Sbjct: 61  SLLRTPFSILSCFSQPHVNGVDGMWVSGEFPRISEVNHLMVSDSMRYAILM 109

BLAST of Tan0014923 vs. ExPASy TrEMBL
Match: A0A6J1C6U0 (uncharacterized protein LOC111007908 OS=Momordica charantia OX=3673 GN=LOC111007908 PE=4 SV=1)

HSP 1 Score: 192.2 bits (487), Expect = 1.1e-45
Identity = 95/107 (88.79%), Postives = 99/107 (92.52%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGRVVDGDRREGNSYGFSMIYSVLR 60
           MWRLAA +RRNLHNIRKSPRVADESMYGGGGG DGGG V + DRR GNSYGFSMIYSVLR
Sbjct: 1   MWRLAAELRRNLHNIRKSPRVADESMYGGGGG-DGGGIVAERDRRGGNSYGFSMIYSVLR 60

Query: 61  TPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           TPFS+L CFSQPHVNG DG+WVSGEFARISEVNHLMVSDSMRYAILM
Sbjct: 61  TPFSVLCCFSQPHVNGADGLWVSGEFARISEVNHLMVSDSMRYAILM 106

BLAST of Tan0014923 vs. ExPASy TrEMBL
Match: A0A1S3C134 (uncharacterized protein LOC103495667 OS=Cucumis melo OX=3656 GN=LOC103495667 PE=4 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 3.0e-43
Identity = 93/106 (87.74%), Postives = 96/106 (90.57%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGR----VVDGDRREGNSYGFSMIY 60
           MWRLAAAVRRNLHNIRKSPRVADESMY  GGGVDGGGR    VVDGDRREGNSYGFSMIY
Sbjct: 41  MWRLAAAVRRNLHNIRKSPRVADESMY--GGGVDGGGRVAAVVVDGDRREGNSYGFSMIY 100

Query: 61  SVLRTPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMR 103
           S+LRTPFS+LSCFSQPHVNGVDGMWVSGEF RISEVNHLM S+  R
Sbjct: 101 SLLRTPFSILSCFSQPHVNGVDGMWVSGEFPRISEVNHLMQSNLWR 144

BLAST of Tan0014923 vs. ExPASy TrEMBL
Match: A0A6J1ER34 (uncharacterized protein LOC111437033 OS=Cucurbita moschata OX=3662 GN=LOC111437033 PE=4 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 4.0e-43
Identity = 94/108 (87.04%), Postives = 98/108 (90.74%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGRVVDGD-RREGNSYGFSMIYSVL 60
           MWRLAAAVR  LHNIRKSPRVADESM+  GGGVDGGGRVVDG  RRE NS+G S+IY+VL
Sbjct: 1   MWRLAAAVRSKLHNIRKSPRVADESMF-SGGGVDGGGRVVDGGRRRERNSHGVSIIYNVL 60

Query: 61  RTPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
           RTPFS LSCFS PHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM
Sbjct: 61  RTPFSFLSCFSHPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 107

BLAST of Tan0014923 vs. TAIR 10
Match: AT5G35732.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G04795.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 107.8 bits (268), Expect = 5.3e-24
Identity = 57/107 (53.27%), Postives = 77/107 (71.96%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESMYGGGGGVDGGGRVVDGDRREGNSYGFSMIYSVLR 60
           M ++ + +RRNL N+RKSPRVAD++         G G V +G RR+    GF+ +  ++R
Sbjct: 1   MTQMLSVLRRNLQNLRKSPRVADDTELPSSTSGAGPGVVANG-RRD----GFNSV--IMR 60

Query: 61  TPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
            PFS++SCF+ P V+G DG+WVSG++  ISEVNHLMVSDSMRYAILM
Sbjct: 61  FPFSIISCFAVPRVSGTDGLWVSGDYGSISEVNHLMVSDSMRYAILM 100

BLAST of Tan0014923 vs. TAIR 10
Match: AT2G04795.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G35732.1); Has 18 Blast hits to 18 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 18; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 98.6 bits (244), Expect = 3.2e-21
Identity = 52/116 (44.83%), Postives = 70/116 (60.34%), Query Frame = 0

Query: 1   MWRLAAAVRRNLHNIRKSPRVADESM---------YGGGGGVDGGGRVVDGDRREGNSYG 60
           M ++ + +RRNL N+RKSPRVADES          +GGG G +GG               
Sbjct: 1   MLKMLSILRRNLQNLRKSPRVADESALPSTTVNGDHGGGNGSNGG--------------- 60

Query: 61  FSMIYSVLRTPFSLLSCFSQPHVNGVDGMWVSGEFARISEVNHLMVSDSMRYAILM 108
                 +++ P S++SCFS P V+  DG+WVSG++ R+SEVNHLMV D MRYA+LM
Sbjct: 61  ------IMKFPLSIMSCFSVPRVSRADGVWVSGDYGRVSEVNHLMVCDGMRYALLM 95

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG6572248.12.0e-4991.59hypothetical protein SDJN03_28976, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_038888385.12.6e-4992.86uncharacterized protein LOC120078235 [Benincasa hispida][more]
XP_022969305.12.2e-4890.65uncharacterized protein LOC111468352 [Cucurbita maxima][more]
XP_031744820.12.9e-4892.73uncharacterized protein LOC101211016 [Cucumis sativus] >KAE8645720.1 hypothetica... [more]
KAA0059153.13.8e-4891.89uncharacterized protein E6C27_scaffold430G00710 [Cucumis melo var. makuwa] >TYK0... [more]
Match NameE-valueIdentityDescription
A0A6J1HZK31.1e-4890.65uncharacterized protein LOC111468352 OS=Cucurbita maxima OX=3661 GN=LOC111468352... [more]
A0A5D3BTM81.8e-4891.89Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A6J1C6U01.1e-4588.79uncharacterized protein LOC111007908 OS=Momordica charantia OX=3673 GN=LOC111007... [more]
A0A1S3C1343.0e-4387.74uncharacterized protein LOC103495667 OS=Cucumis melo OX=3656 GN=LOC103495667 PE=... [more]
A0A6J1ER344.0e-4387.04uncharacterized protein LOC111437033 OS=Cucurbita moschata OX=3662 GN=LOC1114370... [more]
Match NameE-valueIdentityDescription
AT5G35732.15.3e-2453.27unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G04795.13.2e-2144.83unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR48165BNAC03G44900D PROTEINcoord: 1..107

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0014923.1Tan0014923.1mRNA