Tan0010552 (gene) Snake gourd v1

Overview
NameTan0010552
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLate embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
LocationLG05: 3746090 .. 3746668 (+)
RNA-Seq ExpressionTan0010552
SyntenyTan0010552
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCGCTCCCGATCGGAAACGACGAAACATTTGCATAGCCGTATTGCTTTCTGTAATCTTAATCGTAATTTTAATCCTCATTTTAGCATTTACTGTTTTCAAGCCCAAGAAGCCCACCATCGCCGTCGATTCAGTTTCTTTGCTCGATCTGAGCGTTTCTCTGGACGCCGCGAGGTTCGGCGTCGATCTGAATTTGACTTTGATTGTGAGTCTCTCCGTCGAGAATCCGAATAAGGTGGCTTTCAAATACTCCGATAGCACCGCCGTCGTGAGTTACAGAGGCGAAGTAGTCGGAGAGGCGCCGATTCCGGCAGGTCGGTTGTCGGCCGACGGGACCGAGAAAATGAACCTAACACTGACGATGATGGCGGACCGGCTGCTCGCCAAGTCGGAGCTGTACTCCGACGTGATCGCCGGTGAACTGCCGATCAGCACTTTCGCTCGGCTGGCTGGGAAAGTGACGGTGATGGGTGTTTTCAAGATTCATGTTGTGACCTCCTCGTCTTGTGATCTCACCATCGACATTAAAAACAGAAGCGTTGAGGATCAACGATGCGAATATCGGACTAAGCTTTGA

mRNA sequence

ATGGCCGCTCCCGATCGGAAACGACGAAACATTTGCATAGCCGTATTGCTTTCTGTAATCTTAATCGTAATTTTAATCCTCATTTTAGCATTTACTGTTTTCAAGCCCAAGAAGCCCACCATCGCCGTCGATTCAGTTTCTTTGCTCGATCTGAGCGTTTCTCTGGACGCCGCGAGGTTCGGCGTCGATCTGAATTTGACTTTGATTGTGAGTCTCTCCGTCGAGAATCCGAATAAGGTGGCTTTCAAATACTCCGATAGCACCGCCGTCGTGAGTTACAGAGGCGAAGTAGTCGGAGAGGCGCCGATTCCGGCAGGTCGGTTGTCGGCCGACGGGACCGAGAAAATGAACCTAACACTGACGATGATGGCGGACCGGCTGCTCGCCAAGTCGGAGCTGTACTCCGACGTGATCGCCGGTGAACTGCCGATCAGCACTTTCGCTCGGCTGGCTGGGAAAGTGACGGTGATGGGTGTTTTCAAGATTCATGTTGTGACCTCCTCGTCTTGTGATCTCACCATCGACATTAAAAACAGAAGCGTTGAGGATCAACGATGCGAATATCGGACTAAGCTTTGA

Coding sequence (CDS)

ATGGCCGCTCCCGATCGGAAACGACGAAACATTTGCATAGCCGTATTGCTTTCTGTAATCTTAATCGTAATTTTAATCCTCATTTTAGCATTTACTGTTTTCAAGCCCAAGAAGCCCACCATCGCCGTCGATTCAGTTTCTTTGCTCGATCTGAGCGTTTCTCTGGACGCCGCGAGGTTCGGCGTCGATCTGAATTTGACTTTGATTGTGAGTCTCTCCGTCGAGAATCCGAATAAGGTGGCTTTCAAATACTCCGATAGCACCGCCGTCGTGAGTTACAGAGGCGAAGTAGTCGGAGAGGCGCCGATTCCGGCAGGTCGGTTGTCGGCCGACGGGACCGAGAAAATGAACCTAACACTGACGATGATGGCGGACCGGCTGCTCGCCAAGTCGGAGCTGTACTCCGACGTGATCGCCGGTGAACTGCCGATCAGCACTTTCGCTCGGCTGGCTGGGAAAGTGACGGTGATGGGTGTTTTCAAGATTCATGTTGTGACCTCCTCGTCTTGTGATCTCACCATCGACATTAAAAACAGAAGCGTTGAGGATCAACGATGCGAATATCGGACTAAGCTTTGA

Protein sequence

MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARFGVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTLTMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRSVEDQRCEYRTKL
Homology
BLAST of Tan0010552 vs. ExPASy Swiss-Prot
Match: Q6DST1 (Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana OX=3702 GN=At1g64065 PE=2 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 9.4e-08
Identity = 54/179 (30.17%), Postives = 92/179 (51.40%), Query Frame = 0

Query: 12  CIAVLLSVILIVI-LILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARFGVDLNLTLIV 71
           C+   L++I+I+  L LIL+    +  KP I   S+S  DL    ++       N TL+ 
Sbjct: 39  CLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGNST--NPYFNATLVS 98

Query: 72  SLSVENPNKVAFKYSDSTAVVSYRGE-VVGEAPIPAGRLSADGTEKM-NLTLTMMADRLL 131
            +S+ N N  AF++ DST  V Y    VVGE  I   R+ A  T ++  + + + + RLL
Sbjct: 99  DISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLL 158

Query: 132 AKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRSVEDQRCE 188
              +L  D+  G L + + A + G++ V+G  K   V+  SC + +++  R +++  CE
Sbjct: 159 DTKDLDKDLRLGFLELRSVAEVRGRIKVLG-RKRWKVSVMSCTMRLNLTGRFIQNLLCE 214

BLAST of Tan0010552 vs. NCBI nr
Match: XP_023007254.1 (uncharacterized protein LOC111499794 [Cucurbita maxima])

HSP 1 Score: 310.5 bits (794), Expect = 1.0e-80
Identity = 166/192 (86.46%), Postives = 182/192 (94.79%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           MAA +RKRRNICIAVLLS+I++VILILILAFTVFKPK+PTI VDSVSLLDL++SL+AARF
Sbjct: 1   MAALNRKRRNICIAVLLSLIVLVILILILAFTVFKPKQPTITVDSVSLLDLNISLNAARF 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
           GVDLNLTLIV L+VENPNKVAF++SD TAVVSYRGE V EAPIP+GRLS DGTEKMNLTL
Sbjct: 61  GVDLNLTLIVQLTVENPNKVAFQHSDGTAVVSYRGEEVAEAPIPSGRLSPDGTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
           TMMADRLLAKSEL+SDVIAGELPISTFARLAGK+TV+GVFKI VV  SSCDLTIDI+NRS
Sbjct: 121 TMMADRLLAKSELFSDVIAGELPISTFARLAGKMTVIGVFKIRVVALSSCDLTIDIRNRS 180

Query: 181 VEDQRCEYRTKL 193
           VEDQRCEYRTKL
Sbjct: 181 VEDQRCEYRTKL 192

BLAST of Tan0010552 vs. NCBI nr
Match: XP_023534551.1 (uncharacterized protein LOC111796093 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 304.7 bits (779), Expect = 5.7e-79
Identity = 161/192 (83.85%), Postives = 181/192 (94.27%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           MAA +RKRRNICIAVLLS+IL+VI ILILAFTVFKPK+PTI VDS+SLLDL++SL+AARF
Sbjct: 1   MAALNRKRRNICIAVLLSLILLVIFILILAFTVFKPKQPTITVDSLSLLDLNISLNAARF 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
           GVDLNLTLIV L++ENPNKVAF++SD TAVVSYRGE V EAPIP+GRLSADGTEKMNLTL
Sbjct: 61  GVDLNLTLIVQLTLENPNKVAFQHSDGTAVVSYRGEEVAEAPIPSGRLSADGTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
           TMMADR+LAKSEL+SDV+ GELPISTFARLAGKVTV+GVFKI VV  SSCDLTI+I+NR+
Sbjct: 121 TMMADRMLAKSELFSDVLTGELPISTFARLAGKVTVIGVFKIRVVALSSCDLTINIRNRN 180

Query: 181 VEDQRCEYRTKL 193
           VEDQRCEYRTKL
Sbjct: 181 VEDQRCEYRTKL 192

BLAST of Tan0010552 vs. NCBI nr
Match: XP_022948127.1 (uncharacterized protein LOC111451800 [Cucurbita moschata] >KAG6605388.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7023930.1 hypothetical protein SDJN02_14958, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 298.5 bits (763), Expect = 4.1e-77
Identity = 161/192 (83.85%), Postives = 177/192 (92.19%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           MAA +RKRRNICIAVLLS+IL+VI ILILAFTVFKPK+PTI VDS+SLLDL++SLDAARF
Sbjct: 1   MAALNRKRRNICIAVLLSLILLVIFILILAFTVFKPKQPTITVDSLSLLDLNISLDAARF 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
            VDLNLTLIV L+VENPNKVAF++SD TAVVSYRGE V EAPIP+GRLSADGTEKMNLTL
Sbjct: 61  RVDLNLTLIVLLTVENPNKVAFQHSDGTAVVSYRGEEVAEAPIPSGRLSADGTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
           TMMADRLLAKSEL SDV+AGELPISTFARL GKV V+GVFKI VV  SSCDLTIDI+ R+
Sbjct: 121 TMMADRLLAKSELLSDVLAGELPISTFARLPGKVMVIGVFKIRVVALSSCDLTIDIRKRN 180

Query: 181 VEDQRCEYRTKL 193
           VEDQRC+YRTKL
Sbjct: 181 VEDQRCKYRTKL 192

BLAST of Tan0010552 vs. NCBI nr
Match: XP_023512272.1 (uncharacterized protein LOC111777064 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 273.1 bits (697), Expect = 1.8e-69
Identity = 146/188 (77.66%), Postives = 165/188 (87.77%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           MAAP RK R+ICI VLLSV L+VI ILILAFT FKPK+PTIAVDSVSLLDL++SLDAAR 
Sbjct: 1   MAAPSRKLRSICIPVLLSVTLLVISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARL 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
            VDLNL+L++ LSVENPNKVAF+YS STAVVSYRGE +GEAPIPAGRL AD TEKMNLTL
Sbjct: 61  SVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPIPAGRLPADRTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
           TMMADRLLAKSEL+SD I+GE+PI+ F RL+G V V+GVFKIHVV SSSCD TI I NRS
Sbjct: 121 TMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDFTIGIGNRS 180

Query: 181 VEDQRCEY 189
           ++DQ+C Y
Sbjct: 181 IKDQKCHY 188

BLAST of Tan0010552 vs. NCBI nr
Match: KAG7010478.1 (hypothetical protein SDJN02_27272, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 272.7 bits (696), Expect = 2.4e-69
Identity = 146/188 (77.66%), Postives = 165/188 (87.77%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           MAAP RK R+ICI VLLSV L++I ILILAFT FKPK+PTIAVDSVSLLDL++SLDAAR 
Sbjct: 1   MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARL 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
            VDLNL+L++ LSVENPNKVAF+YS STAVVSYRGE +GEAPIPAG L AD TEKMNLTL
Sbjct: 61  SVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPIPAGWLPADRTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
           TMMADRLLAKSEL+SD I+GE+PI+ F RL+G V V+GVFKIHVV SSSCDLTI I NRS
Sbjct: 121 TMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRS 180

Query: 181 VEDQRCEY 189
           +EDQ+C Y
Sbjct: 181 IEDQKCHY 188

BLAST of Tan0010552 vs. ExPASy TrEMBL
Match: A0A6J1L4F9 (uncharacterized protein LOC111499794 OS=Cucurbita maxima OX=3661 GN=LOC111499794 PE=4 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 5.0e-81
Identity = 166/192 (86.46%), Postives = 182/192 (94.79%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           MAA +RKRRNICIAVLLS+I++VILILILAFTVFKPK+PTI VDSVSLLDL++SL+AARF
Sbjct: 1   MAALNRKRRNICIAVLLSLIVLVILILILAFTVFKPKQPTITVDSVSLLDLNISLNAARF 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
           GVDLNLTLIV L+VENPNKVAF++SD TAVVSYRGE V EAPIP+GRLS DGTEKMNLTL
Sbjct: 61  GVDLNLTLIVQLTVENPNKVAFQHSDGTAVVSYRGEEVAEAPIPSGRLSPDGTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
           TMMADRLLAKSEL+SDVIAGELPISTFARLAGK+TV+GVFKI VV  SSCDLTIDI+NRS
Sbjct: 121 TMMADRLLAKSELFSDVIAGELPISTFARLAGKMTVIGVFKIRVVALSSCDLTIDIRNRS 180

Query: 181 VEDQRCEYRTKL 193
           VEDQRCEYRTKL
Sbjct: 181 VEDQRCEYRTKL 192

BLAST of Tan0010552 vs. ExPASy TrEMBL
Match: A0A6J1G8C1 (uncharacterized protein LOC111451800 OS=Cucurbita moschata OX=3662 GN=LOC111451800 PE=4 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 2.0e-77
Identity = 161/192 (83.85%), Postives = 177/192 (92.19%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           MAA +RKRRNICIAVLLS+IL+VI ILILAFTVFKPK+PTI VDS+SLLDL++SLDAARF
Sbjct: 1   MAALNRKRRNICIAVLLSLILLVIFILILAFTVFKPKQPTITVDSLSLLDLNISLDAARF 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
            VDLNLTLIV L+VENPNKVAF++SD TAVVSYRGE V EAPIP+GRLSADGTEKMNLTL
Sbjct: 61  RVDLNLTLIVLLTVENPNKVAFQHSDGTAVVSYRGEEVAEAPIPSGRLSADGTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
           TMMADRLLAKSEL SDV+AGELPISTFARL GKV V+GVFKI VV  SSCDLTIDI+ R+
Sbjct: 121 TMMADRLLAKSELLSDVLAGELPISTFARLPGKVMVIGVFKIRVVALSSCDLTIDIRKRN 180

Query: 181 VEDQRCEYRTKL 193
           VEDQRC+YRTKL
Sbjct: 181 VEDQRCKYRTKL 192

BLAST of Tan0010552 vs. ExPASy TrEMBL
Match: A0A6J1FYG9 (uncharacterized protein LOC111448649 OS=Cucurbita moschata OX=3662 GN=LOC111448649 PE=4 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 3.4e-69
Identity = 144/188 (76.60%), Postives = 165/188 (87.77%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           MAAP RK R+ICI VLLSV L++I ILILAFT FKPK+PTIAVDSVSLLDL++SLDAAR 
Sbjct: 1   MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARL 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
            VDLNL+L++ LS+ENPNKVAF+YS +TAVVSYRGE +GEAPIPAG L AD TEKMNLTL
Sbjct: 61  SVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPIPAGWLPADRTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
           TMMADRLLAKSEL+SD I+GE+PI+ F RL+G V V+GVFKIHVV SSSCDLTI I NRS
Sbjct: 121 TMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRS 180

Query: 181 VEDQRCEY 189
           +EDQ+C Y
Sbjct: 181 IEDQKCHY 188

BLAST of Tan0010552 vs. ExPASy TrEMBL
Match: A0A6J1JFV2 (uncharacterized protein LOC111484029 OS=Cucurbita maxima OX=3661 GN=LOC111484029 PE=4 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 2.9e-68
Identity = 144/188 (76.60%), Postives = 162/188 (86.17%), Query Frame = 0

Query: 1   MAAPDRKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARF 60
           M AP RK R+ICI VLLSV L+VI ILILAFT FKPK+PTIAVDSVSLLDL++SLDAAR 
Sbjct: 1   MVAPSRKLRSICIPVLLSVTLLVISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARL 60

Query: 61  GVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTL 120
            VDLNL L++ LSVENPNKVAF+YS STAVVSYRGE +GE PIPAGRL AD TEKMNLTL
Sbjct: 61  SVDLNLFLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEVPIPAGRLLADRTEKMNLTL 120

Query: 121 TMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRS 180
            MMADRLLAKSEL+SD ++GE+PI+ F RL+G V V+GVFKIHVV SSSCDLTI I NRS
Sbjct: 121 KMMADRLLAKSELFSDAMSGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRS 180

Query: 181 VEDQRCEY 189
           +EDQ+C Y
Sbjct: 181 IEDQKCHY 188

BLAST of Tan0010552 vs. ExPASy TrEMBL
Match: A0A5A7SSE6 (Putative Harpin-induced 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold260G00330 PE=4 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 7.1e-59
Identity = 128/193 (66.32%), Postives = 159/193 (82.38%), Query Frame = 0

Query: 1   MAAPDRK-RRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAAR 60
           MAAP  K  RN CI ++LS+IL+V+L+L+LAFTVFKP++P I VDSVSLLDL+V+L    
Sbjct: 8   MAAPASKLLRNFCITLVLSLILLVVLVLVLAFTVFKPQRPIIVVDSVSLLDLNVALTD-- 67

Query: 61  FGVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLT 120
            GVDLNL++ V L+VENPNKVAF+YS STAVV YRGE VGEAPIP GRL   GT+KMNLT
Sbjct: 68  -GVDLNLSINVDLTVENPNKVAFEYSKSTAVVIYRGEKVGEAPIPGGRLPGKGTKKMNLT 127

Query: 121 LTMMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNR 180
           LT+M +R+L +SE++SDV++G+L IST ARLAGKV VMGV KIHVV S+SCDL ID+KN 
Sbjct: 128 LTIMGERMLGRSEVFSDVVSGQLSISTLARLAGKVKVMGVVKIHVVASTSCDLIIDVKNG 187

Query: 181 SVEDQRCEYRTKL 193
           S  DQ C++RT++
Sbjct: 188 SFGDQLCQFRTRV 197

BLAST of Tan0010552 vs. TAIR 10
Match: AT3G54200.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 190.7 bits (483), Expect = 1.1e-48
Identity = 99/191 (51.83%), Postives = 147/191 (76.96%), Query Frame = 0

Query: 6   RKRRN----ICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARFG 65
           R++RN    IC  +LL ++LI I+I+ILAFT+FKPK+PT  +DSV++  L  S++     
Sbjct: 46  RRKRNCKICICFTILL-ILLIAIVIVILAFTLFKPKRPTTTIDSVTVDRLQASVNPLLLK 105

Query: 66  VDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTLT 125
           V LNLTL V LS++NPN++ F Y  S+A+++YRG+V+GEAP+PA R++A  T  +N+TLT
Sbjct: 106 VLLNLTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVIGEAPLPANRIAARKTVPLNITLT 165

Query: 126 MMADRLLAKSELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRSV 185
           +MADRLL++++L SDV+AG +P++TF ++ GKVTV+ +FKI V +SSSCDL+I + +R+V
Sbjct: 166 LMADRLLSETQLLSDVMAGVIPLNTFVKVTGKVTVLKIFKIKVQSSSSCDLSISVSDRNV 225

Query: 186 EDQRCEYRTKL 193
             Q C+Y TKL
Sbjct: 226 TSQHCKYSTKL 235

BLAST of Tan0010552 vs. TAIR 10
Match: AT2G46150.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 100.5 bits (249), Expect = 1.5e-21
Identity = 60/190 (31.58%), Postives = 111/190 (58.42%), Query Frame = 0

Query: 6   RKRRNICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSL--LDLSVSLDAARFGVD 65
           R R    I V  + +++  ++L L FTVF+ K P I ++ V +  LD     +  +  + 
Sbjct: 33  RNRIKCSICVTATSLILTTIVLTLVFTVFRVKDPIIKMNGVMVNGLDSVTGTNQVQL-LG 92

Query: 66  LNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTLTMM 125
            N+++IV +SV+NPN  +FKYS++T  + Y+G +VGEA    G+     T +MN+T+ +M
Sbjct: 93  TNISMIVDVSVKNPNTASFKYSNTTTDIYYKGTLVGEAHGLPGKARPHRTSRMNVTVDIM 152

Query: 126 ADRLLAKSELYSDVI-AGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRSVE 185
            DR+L+   L  ++  +G + + ++ R+ GKV +MG+ K HV    +C + ++I  ++++
Sbjct: 153 LDRILSDPGLGREISRSGLVNVWSYTRVGGKVKIMGIVKKHVTVKMNCTMAVNITGQAIQ 212

Query: 186 DQRCEYRTKL 193
           D  C+ +  L
Sbjct: 213 DVDCKKKIDL 221

BLAST of Tan0010552 vs. TAIR 10
Match: AT3G05975.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 97.1 bits (240), Expect = 1.7e-20
Identity = 59/188 (31.38%), Postives = 109/188 (57.98%), Query Frame = 0

Query: 7   KRRNICI-AVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARFGVDLN 66
           KRR  CI + ++ V+ ++ +  ++   VFKPK P +   S ++  +S ++ +  + V LN
Sbjct: 3   KRRICCIVSGIIFVLFVIFMTALILAQVFKPKHPILQTVSSTVDGISTNI-SLPYEVQLN 62

Query: 67  LTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTLTMMAD 126
            TL + + ++NPN   F+Y     +V YR  +VG   +P+  L A G+  +   L +  D
Sbjct: 63  FTLTLEMLLKNPNVADFEYKTVENLVYYRDTLVGNLTLPSSTLPAKGSVLLPCPLFLQLD 122

Query: 127 RLLAK-SELYSDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRSVEDQ 186
           + +A   ++  DV+ G++ + T A++ GK+T++G+FKI + + S C+L +   +  VEDQ
Sbjct: 123 KFVANLGDIVQDVLHGKIVMETRAKMPGKITLLGIFKIPLDSISHCNLVLGFPSMVVEDQ 182

Query: 187 RCEYRTKL 193
            C+ +TKL
Sbjct: 183 VCDLKTKL 189

BLAST of Tan0010552 vs. TAIR 10
Match: AT4G23930.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 67.4 bits (163), Expect = 1.4e-11
Identity = 49/183 (26.78%), Postives = 86/183 (46.99%), Query Frame = 0

Query: 12  CIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSVSLDAARFGVDLNLTLIVS 71
           C    L ++ ++I  L +  TVF+P+ P I+V SV +   SV+  +  F      T    
Sbjct: 11  CAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSVSF------TFSQF 70

Query: 72  LSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGTEKMNLTLTMMADRLLAKS 131
            +V NPN+ AF + ++   + Y G  +G   +PAG + +  T++M  T ++ +  L A S
Sbjct: 71  SAVRNPNRAAFSHYNNVIQLFYYGNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAAAS 130

Query: 132 ELY--------SDVIAGELPISTFARLAGKVTVMGVFKIHVVTSSSCDLTIDIKNRSVED 187
                      SD     + I +   +AG+V V+G+F   +    +C + I   + S+  
Sbjct: 131 SSQISAAQFQNSDRSGSTVEIESKLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDGSIVA 187

BLAST of Tan0010552 vs. TAIR 10
Match: AT1G64450.1 (Glycine-rich protein family )

HSP 1 Score: 62.4 bits (150), Expect = 4.6e-10
Identity = 41/129 (31.78%), Postives = 71/129 (55.04%), Query Frame = 0

Query: 1   MAAPDRKRRN-------ICIAVLLSVILIVILILILAFTVFKPKKPTIAVDSVSLLDLSV 60
           MA P  +RR+        C    + ++++++++L++ FTVFKPK P I+V++V L   +V
Sbjct: 1   MAKPHDRRRSSGRTNLASCAVATVFLLILLVVLLVVYFTVFKPKDPKISVNAVQLPSFAV 60

Query: 61  SLDAARFGVDLNLTLIVSLSVENPNKVAFKYSDSTAVVSYRGEVVGEAPIPAGRLSADGT 120
           S + A      N +    ++V NPN+  F + DS+  + Y G  VG   IPAG++ +   
Sbjct: 61  SNNTA------NFSFSQYVAVRNPNRAVFSHYDSSIQLLYSGNQVGFMFIPAGKIDSGRI 120

Query: 121 EKMNLTLTM 123
           + M  T T+
Sbjct: 121 QYMAATFTV 123

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q6DST19.4e-0830.17Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana OX=3702 GN... [more]
Match NameE-valueIdentityDescription
XP_023007254.11.0e-8086.46uncharacterized protein LOC111499794 [Cucurbita maxima][more]
XP_023534551.15.7e-7983.85uncharacterized protein LOC111796093 [Cucurbita pepo subsp. pepo][more]
XP_022948127.14.1e-7783.85uncharacterized protein LOC111451800 [Cucurbita moschata] >KAG6605388.1 Late emb... [more]
XP_023512272.11.8e-6977.66uncharacterized protein LOC111777064 [Cucurbita pepo subsp. pepo][more]
KAG7010478.12.4e-6977.66hypothetical protein SDJN02_27272, partial [Cucurbita argyrosperma subsp. argyro... [more]
Match NameE-valueIdentityDescription
A0A6J1L4F95.0e-8186.46uncharacterized protein LOC111499794 OS=Cucurbita maxima OX=3661 GN=LOC111499794... [more]
A0A6J1G8C12.0e-7783.85uncharacterized protein LOC111451800 OS=Cucurbita moschata OX=3662 GN=LOC1114518... [more]
A0A6J1FYG93.4e-6976.60uncharacterized protein LOC111448649 OS=Cucurbita moschata OX=3662 GN=LOC1114486... [more]
A0A6J1JFV22.9e-6876.60uncharacterized protein LOC111484029 OS=Cucurbita maxima OX=3661 GN=LOC111484029... [more]
A0A5A7SSE67.1e-5966.32Putative Harpin-induced 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffo... [more]
Match NameE-valueIdentityDescription
AT3G54200.11.1e-4851.83Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family [more]
AT2G46150.11.5e-2131.58Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family [more]
AT3G05975.11.7e-2031.38Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family [more]
AT4G23930.11.4e-1126.78Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family [more]
AT1G64450.14.6e-1031.78Glycine-rich protein family [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013990Water stress and hypersensitive response domainSMARTSM00769whycoord: 53..169
e-value: 0.0011
score: 28.2
IPR004864Late embryogenesis abundant protein, LEA_2 subgroupPFAMPF03168LEA_2coord: 72..164
e-value: 9.3E-12
score: 45.4
NoneNo IPR availableGENE3D2.60.40.1820coord: 27..175
e-value: 2.1E-13
score: 52.3
NoneNo IPR availablePANTHERPTHR31852LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILYcoord: 5..191
NoneNo IPR availablePANTHERPTHR31852:SF122LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILYcoord: 5..191
NoneNo IPR availableSUPERFAMILY117070LEA14-likecoord: 21..167

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0010552.1Tan0010552.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009269 response to desiccation
cellular_component GO:0016021 integral component of membrane