Tan0001630 (gene) Snake gourd v1

Overview
Name: Tan0001630
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: classical arabinogalactan protein 9-like
Location: LG01: 23939280 .. 23940166 (-)
RNA-Seq Expression: Tan0001630
Synteny: Tan0001630
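
The Location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF3 convention; the page itself does not state this), and the function names `parse_location` and `span_length` are illustrative, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("LG01: 23939280 .. 23940166 (-)")
print(seqid, strand, span_length(start, end))  # LG01 - 887
```

Under that assumption the gene spans 887 bp on the minus strand of LG01, a figure that can be checked against the length of the gene sequence listed under Sequences.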
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACCATAATAATACTAAAAAAAATCTGGTTAACATACTTTTGTGTAGGTGGAAGCGTACAATAGCAGTAGCATGGAAAAATCAATGCACTCATAAGTAAGATTAGCCCCCCAAAAACTTCTCTCTTCTTTCCCTTTATATATATCATCCTCTCCCATCTTCTCCCTCGCTTCAATCTTCCTCTCTCTCTCTCTTTTTTTCTTCGAGCTATGGCCGCCGCACGTGCCATGCTCGTGCTCGCCCTCTCTTTTCTTGCTCTTTCATCCTCCGCCATTGCTCAAGCTCCCTCCCCTGCTCCTGTTCCCTCCGCCGCCGCACCGCCGGTTTCAGTACCACCGACTACACCACCGCCCGCCACCCCCCCACCGGCAGCCTCGCCTCCACCAAGCATCTCCACTCCACCGATGGCGGCTCCTCCTCCCTCCATCCCGCCGATGATGTCTTCTCCGCCGCCTATGGCAATGAGTCCAGGCCCAATAATGGCCGACAGCCCACCATCGCCTCCAGGCCCAAGCCCAGGCCCAGAACCAATGACGCCTCCGGGCCCAGCTCCATCCCCGCCACCGCCCCCACCGCGGGGAGGGGGATTTTCCATCCAGCACGGCGGCTACAAGGCGGGTCTTGCGGCGGTGTTGGGTGGCCTAGCCGTCGTCCTGGTTTAACAGCATAATACTTAAATTTGCTGTAGATTAATTATGAGCTCCATATCCATACTCATTTGTGTATTCTTTTTTCTTTTGTTTTTCATTTCCATCTCTTTCTTTCTCTCTGATCTCTTTCCCTGATGATTCTTGAGAGCAGCATGAGTGATTTGTGGGATATGTAACAGTTTATTTGTTAGCATTTTCATTTTTTTTCTAGCTGAATAAGAGAGACTTGTTTCTTCTT

mRNA sequence

CACCATAATAATACTAAAAAAAATCTGGTTAACATACTTTTGTGTAGGTGGAAGCGTACAATAGCAGTAGCATGGAAAAATCAATGCACTCATAAGTAAGATTAGCCCCCCAAAAACTTCTCTCTTCTTTCCCTTTATATATATCATCCTCTCCCATCTTCTCCCTCGCTTCAATCTTCCTCTCTCTCTCTCTTTTTTTCTTCGAGCTATGGCCGCCGCACGTGCCATGCTCGTGCTCGCCCTCTCTTTTCTTGCTCTTTCATCCTCCGCCATTGCTCAAGCTCCCTCCCCTGCTCCTGTTCCCTCCGCCGCCGCACCGCCGGTTTCAGTACCACCGACTACACCACCGCCCGCCACCCCCCCACCGGCAGCCTCGCCTCCACCAAGCATCTCCACTCCACCGATGGCGGCTCCTCCTCCCTCCATCCCGCCGATGATGTCTTCTCCGCCGCCTATGGCAATGAGTCCAGGCCCAATAATGGCCGACAGCCCACCATCGCCTCCAGGCCCAAGCCCAGGCCCAGAACCAATGACGCCTCCGGGCCCAGCTCCATCCCCGCCACCGCCCCCACCGCGGGGAGGGGGATTTTCCATCCAGCACGGCGGCTACAAGGCGGGTCTTGCGGCGGTGTTGGGTGGCCTAGCCGTCGTCCTGGTTTAACAGCATAATACTTAAATTTGCTGTAGATTAATTATGAGCTCCATATCCATACTCATTTGTGTATTCTTTTTTCTTTTGTTTTTCATTTCCATCTCTTTCTTTCTCTCTGATCTCTTTCCCTGATGATTCTTGAGAGCAGCATGAGTGATTTGTGGGATATGTAACAGTTTATTTGTTAGCATTTTCATTTTTTTTCTAGCTGAATAAGAGAGACTTGTTTCTTCTT

Coding sequence (CDS)

ATGGCCGCCGCACGTGCCATGCTCGTGCTCGCCCTCTCTTTTCTTGCTCTTTCATCCTCCGCCATTGCTCAAGCTCCCTCCCCTGCTCCTGTTCCCTCCGCCGCCGCACCGCCGGTTTCAGTACCACCGACTACACCACCGCCCGCCACCCCCCCACCGGCAGCCTCGCCTCCACCAAGCATCTCCACTCCACCGATGGCGGCTCCTCCTCCCTCCATCCCGCCGATGATGTCTTCTCCGCCGCCTATGGCAATGAGTCCAGGCCCAATAATGGCCGACAGCCCACCATCGCCTCCAGGCCCAAGCCCAGGCCCAGAACCAATGACGCCTCCGGGCCCAGCTCCATCCCCGCCACCGCCCCCACCGCGGGGAGGGGGATTTTCCATCCAGCACGGCGGCTACAAGGCGGGTCTTGCGGCGGTGTTGGGTGGCCTAGCCGTCGTCCTGGTTTAA

Protein sequence

MAAARAMLVLALSFLALSSSAIAQAPSPAPVPSAAAPPVSVPPTTPPPATPPPAASPPPSISTPPMAAPPPSIPPMMSSPPPMAMSPGPIMADSPPSPPGPSPGPEPMTPPGPAPSPPPPPPRGGGFSIQHGGYKAGLAAVLGGLAVVLV
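
The relationship between the CDS and the protein sequence can be sketched with a small translation routine. This assumes the standard genetic code (NCBI translation table 1); the page does not say which table the annotation pipeline used, and `translate` is a hypothetical helper written for illustration.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # first base T
         "LLLLPPPPHHQQRRRR"   # first base C
         "IIIMTTTTNNKKSSRR"   # first base A
         "VVVVAAAADDEEGGGG")  # first base G
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the CDS listed above.
print(translate("ATGGCCGCCGCACGTGCC"))  # MAAARA
```

The first six codons of the CDS give MAAARA, matching the start of the protein sequence; passing the full CDS string to `translate` is a quick way to check the remainder.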
Homology
BLAST of Tan0001630 vs. NCBI nr
Match: XP_022923372.1 (classical arabinogalactan protein 9-like [Cucurbita moschata])

HSP 1 Score: 141.0 bits (354), Expect = 8.5e-30
Identity = 113/161 (70.19%), Positives = 125/161 (77.64%), Query Frame = 0

Query: 1   MAAARAMLVLALSFLALSSSAIAQAPSPAPVPSAAA------PPVSVPPTTPPP-ATPPP 60
           MA ARAMLVLALS L+L++SAIAQ+PSPAP P+ AA      PPV+ PPTTPPP A PP 
Sbjct: 1   MATARAMLVLALSLLSLAASAIAQSPSPAPGPAVAAPPPLTPPPVAAPPTTPPPTAMPPM 60

Query: 61  AASPPPSISTPPMAAPPPSIPPMMSSPPPMAMSPGP---IMADSPPSPPGPSPGPEPMTP 120
           AASPPPS+STPPMAA PP++PP M+SPP M   P P    M DSPPSPPGPSPGP P   
Sbjct: 61  AASPPPSMSTPPMAA-PPTVPPPMASPPSMESGPSPGPSTMPDSPPSPPGPSPGPAPGPA 120

Query: 121 PGPAPSPPPPPPRGGGFSIQHGGYK-AGLAAVLGGLAVVLV 151
           P  APSPPP PP G GFSIQHGGYK AG+AAVLGGLA+VLV
Sbjct: 121 PASAPSPPPSPPPGAGFSIQHGGYKAAGVAAVLGGLAIVLV 160

BLAST of Tan0001630 vs. NCBI nr
Match: XP_038903480.1 (classical arabinogalactan protein 9-like [Benincasa hispida])

HSP 1 Score: 138.3 bits (347), Expect = 5.5e-29
Identity = 112/155 (72.26%), Positives = 121/155 (78.06%), Query Frame = 0

Query: 1   MAAARAMLVLALSFLALSSSAIAQAPSPAPVPSAAA--PPVSVPPTTPPPATPPPAASPP 60
           MA+ARAMLVLALS LAL SS+IAQ+PSPAP P++AA  PP S PP   PP TPPPAASPP
Sbjct: 1   MASARAMLVLALSLLALFSSSIAQSPSPAPGPASAAAPPPSSPPPVAAPPTTPPPAASPP 60

Query: 61  PSISTPPMAAPPPSIPPMMSSPPPMAMSPGP-IMADSPPSPPGPSPGPEPMTP--PGPAP 120
           P+ISTPP AA PP+IPPMM+SPP M  SPGP  M DSPPSPPGPSP P  M+P  P PAP
Sbjct: 61  PTISTPP-AAAPPAIPPMMASPPSMESSPGPGPMPDSPPSPPGPSPSPMSMSPPTPAPAP 120

Query: 121 SPPPPPPRGGGFSIQHGGYKAGLAAVLGGLAVVLV 151
           SPP PPP   GFSI  GGYKA  AAVLGGLAV LV
Sbjct: 121 SPPSPPPPSAGFSIHQGGYKAAAAAVLGGLAVFLV 154

BLAST of Tan0001630 vs. NCBI nr
Match: XP_023553563.1 (classical arabinogalactan protein 9-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 134.4 bits (337), Expect = 8.0e-28
Identity = 113/162 (69.75%), Positives = 124/162 (76.54%), Query Frame = 0

Query: 1   MAAARAMLVLALSFLALSSSAIAQAPSPAPVPSAAA------PPVSVPPTTPPP-ATPPP 60
           MA ARAMLVLALS L+L++SAIAQ+PSPAP P+ AA      PPV+ PPTTPPP A PP 
Sbjct: 1   MATARAMLVLALSLLSLAASAIAQSPSPAPGPAVAAPPPLTPPPVAAPPTTPPPTAMPPM 60

Query: 61  AASPPPSISTPPMAAPPPSIPPMMSSPPPMAMSPGP---IMADSPPSPPGPSPGPEPMTP 120
           AASPPPS+STPPMAAPP  +PP M+SPP M   P P    M DSPPSPPGPSPGP     
Sbjct: 61  AASPPPSMSTPPMAAPPTVLPP-MASPPSMESGPSPGPTTMPDSPPSPPGPSPGPSSEPS 120

Query: 121 PGPAPSPPP-PPPRGGGFSIQHGGYK-AGLAAVLGGLAVVLV 151
           P  APSPPP PPP G GFSIQHGGYK AG+AAVLGGLA+VLV
Sbjct: 121 PASAPSPPPSPPPPGAGFSIQHGGYKAAGVAAVLGGLAIVLV 161

BLAST of Tan0001630 vs. NCBI nr
Match: KAG6596882.1 (hypothetical protein SDJN03_10062, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 81.6 bits (200), Expect = 6.2e-12
Identity = 85/150 (56.67%), Positives = 89/150 (59.33%), Query Frame = 0

Query: 1   MAAARAMLVLALSFLALSSSAIAQAPSPAPVPSAAAPPVSVPPTTPPPATPPPAASPPPS 60
           MA ARAMLVLAL F ALSSSAIAQ PSPA   +AAAP  S P     PA  PPAASPP  
Sbjct: 1   MATARAMLVLALYFFALSSSAIAQGPSPA---AAAAPRPSTPSPVASPAMSPPAASPPLI 60

Query: 61  ISTPPMAAPPPSIPPMMSSPPPMAMSPGPI-MADSPPSPPGPSPGPEPMTPPGPAPSPPP 120
           IS+PPMAA          SPP M MSP  + M DSPPS P P PG               
Sbjct: 61  ISSPPMAA----------SPPSMGMSPVSVPMPDSPPSLPSPGPG--------------- 119

Query: 121 PPPRGGGFSIQHGGYKAGLAAVLGGLAVVL 150
                 GFSI HGGYKAGLAAVLGG+A VL
Sbjct: 121 ---SSAGFSIHHGGYKAGLAAVLGGMAEVL 119

BLAST of Tan0001630 vs. ExPASy TrEMBL
Match: A0A6J1E5X6 (classical arabinogalactan protein 9-like OS=Cucurbita moschata OX=3662 GN=LOC111431090 PE=4 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 4.1e-30
Identity = 113/161 (70.19%), Positives = 125/161 (77.64%), Query Frame = 0

Query: 1   MAAARAMLVLALSFLALSSSAIAQAPSPAPVPSAAA------PPVSVPPTTPPP-ATPPP 60
           MA ARAMLVLALS L+L++SAIAQ+PSPAP P+ AA      PPV+ PPTTPPP A PP 
Sbjct: 1   MATARAMLVLALSLLSLAASAIAQSPSPAPGPAVAAPPPLTPPPVAAPPTTPPPTAMPPM 60

Query: 61  AASPPPSISTPPMAAPPPSIPPMMSSPPPMAMSPGP---IMADSPPSPPGPSPGPEPMTP 120
           AASPPPS+STPPMAA PP++PP M+SPP M   P P    M DSPPSPPGPSPGP P   
Sbjct: 61  AASPPPSMSTPPMAA-PPTVPPPMASPPSMESGPSPGPSTMPDSPPSPPGPSPGPAPGPA 120

Query: 121 PGPAPSPPPPPPRGGGFSIQHGGYK-AGLAAVLGGLAVVLV 151
           P  APSPPP PP G GFSIQHGGYK AG+AAVLGGLA+VLV
Sbjct: 121 PASAPSPPPSPPPGAGFSIQHGGYKAAGVAAVLGGLAIVLV 160

BLAST of Tan0001630 vs. ExPASy TrEMBL
Match: A0A0A0L458 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G017230 PE=4 SV=1)

HSP 1 Score: 123.6 bits (309), Expect = 6.8e-25
Identity = 105/160 (65.62%), Positives = 119/160 (74.38%), Query Frame = 0

Query: 1   MAAARAMLVLALSFLALSSSAIAQAPSPAPVPS--AAAPPVSVPPTTPPPATPPPAASPP 60
           MA+ARAMLVL LS L+LSSS+IAQ+PSP+P P+  AA PP + PP  PPP TPPPAASPP
Sbjct: 1   MASARAMLVLLLSLLSLSSSSIAQSPSPSPGPASPAAPPPSTPPPVAPPPTTPPPAASPP 60

Query: 61  PSISTPPMAAPPPSIPPMMSSPPPM----AMSPGPIMADSPPSPPGPSPGPEPMTPPGPA 120
            SISTPP AA P + PPMM+SPPPM    A +PGP M + PPSPP  S    PM+PPGPA
Sbjct: 61  SSISTPP-AAAPTATPPMMASPPPMEPSTAPAPGPAMPEGPPSPPSQS----PMSPPGPA 120

Query: 121 PSPPPPP----PRGGGFSIQHGGYKAGLAAVLGGLAVVLV 151
           P+P P P    PR  GFSI  GGY AG+AAVLGGLAVVLV
Sbjct: 121 PAPAPTPPSARPRSAGFSIHQGGYMAGVAAVLGGLAVVLV 155

The following BLAST results are available for this feature:

NCBI nr:
Match Name        E-value    Identity    Description
XP_022923372.1    8.5e-30    70.19       classical arabinogalactan protein 9-like [Cucurbita moschata]
XP_038903480.1    5.5e-29    72.26       classical arabinogalactan protein 9-like [Benincasa hispida]
XP_023553563.1    8.0e-28    69.75       classical arabinogalactan protein 9-like [Cucurbita pepo subsp. pepo]
KAG6596882.1      6.2e-12    56.67       hypothetical protein SDJN03_10062, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy TrEMBL:
Match Name        E-value    Identity    Description
A0A6J1E5X6        4.1e-30    70.19       classical arabinogalactan protein 9-like OS=Cucurbita moschata OX=3662 GN=LOC111...
A0A0A0L458        6.8e-25    65.63       Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G017230 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term    IPR Description     Source         Source Term    Source Description     Alignment
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 30..125
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 23..131

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0001630.1    Tan0001630.1    mRNA