Tan0015388 (gene) Snake gourd v1

Overview
Name: Tan0015388
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: homeobox-leucine zipper protein HAT3-like
Location: LG09: 69123874 .. 69125221 (+)
RNA-Seq Expression: Tan0015388
Synteny: Tan0015388
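
The Location field packs a linkage group, a start and end coordinate, and a strand into one string. The following is a minimal Python sketch for splitting it apart. It assumes the coordinates are 1-based and inclusive (a common convention for genome annotation, but not stated on this page), and the names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

# Pattern for strings such as "LG09: 69123874 .. 69125221 (+)".
LOCATION_RE = re.compile(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    m = LOCATION_RE.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("LG09: 69123874 .. 69125221 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1348 bp on the plus strand of LG09.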
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAAAACACAAACTGATCATCATAACTGATATTCAACTTGAAACCTCTTTCAAATAATCCCCTCTTCTTCATTTCTCTCTTAATCCCCACTTACCCCCTTCTCCAAATCCAAATCCAAATTCCCATCTCTCTCCTCTTTCTCTTTCTCTTTCTAAAACCGCCATGGGAGATCAAGATGAGCTCTGCAACATCAGGCTTGGGCTTGGCCTTGGCCTTGGCTTGGGATTTGGTGAATATGTCCCAAAAAAGATGCAAAAAATCAACAACAAAAACAACCCCAAATTCTTCCCTGACCTCTCTTTCACTCTCATTCCAAAGGAAGAATTAGCCATTAACATGGAGGTTGAAGCTAATCATTCAATTAGTAGAAGCAATGAAACTAATTCCCAAGATTCTTCCTTTGGCAACAACAACAACAACAACGCCATTAATGGATCAGAAAGAGAGAGAAAAAAGCTTAGGCTTTCAAAAGAACAGTCCACTTTGCTCGAAGAAAGCTTCAAACTTCACACCACTTTGAATCCGGTACGTAATTTAGCTTTTATTTTACAAACCCATTTCACCCTTTTTTATTGATCTTTTCATTTTTCTGAAATACCCATTTCGTTTTTTTTGTAAAAAATTTGATCTTTTTTTTTTTTTTTTCTTTTTTCTTTGCATGGATTTGTAGGCTCAGAAGCAGGCACTTGCCCAACAATTAAACCTCAAACCTCGACAAGTGGAAGTTTGGTTTCAAAACAGACGTGCAAGGTAATATAAAACAATTTCAACATTATTATTTAGTTTCTTCCAAAATCCAAAAAAAAAAAAAAAGAACCATATCATATAGCTAGCAATTGCCAACCTTAATTTTGCATTAGATTAAAGTTTTATCTCTGAATATTAAAAATTGTGAATATAAAAAGTTAGAAGGTAAAATTTTGTTGGTGAACAGAACGAAATTGAAACAAACGGAAGTAGATTGCGAGTTTTTGAAGAAATGTTGTGAAAGGTTGAATGAAGAGAATCGAAGGTTGAAGAAAGAGTTGCATGAATTAAGATCCATAAAACTTGGAGCTTCACAGTTGTATATTCAGCTGCCAAAGGCGGCGACGCTCACAATTTGTCCCTCATGCGACAAAATTACCAGGACGGCCGCCGCCAACGCCGCCGTGGAGCCCAATTCTCCGCCACAATAATTTAATCACCTTTTTTTTTTTTTTTTTTAAAGGCTGTGTATGTTTAGTACATCAAAAAGATCAATAATTTAATTATTCAACAACAATACAGCCTTTTTAGATTACATGAATCATCTAAATTTTATGTTATGGCAAAATGGTGCTGATGTATAATCCAAATCTATGAAAAG

mRNA sequence

CAAAAACACAAACTGATCATCATAACTGATATTCAACTTGAAACCTCTTTCAAATAATCCCCTCTTCTTCATTTCTCTCTTAATCCCCACTTACCCCCTTCTCCAAATCCAAATCCAAATTCCCATCTCTCTCCTCTTTCTCTTTCTCTTTCTAAAACCGCCATGGGAGATCAAGATGAGCTCTGCAACATCAGGCTTGGGCTTGGCCTTGGCCTTGGCTTGGGATTTGGTGAATATGTCCCAAAAAAGATGCAAAAAATCAACAACAAAAACAACCCCAAATTCTTCCCTGACCTCTCTTTCACTCTCATTCCAAAGGAAGAATTAGCCATTAACATGGAGGTTGAAGCTAATCATTCAATTAGTAGAAGCAATGAAACTAATTCCCAAGATTCTTCCTTTGGCAACAACAACAACAACAACGCCATTAATGGATCAGAAAGAGAGAGAAAAAAGCTTAGGCTTTCAAAAGAACAGTCCACTTTGCTCGAAGAAAGCTTCAAACTTCACACCACTTTGAATCCGGCTCAGAAGCAGGCACTTGCCCAACAATTAAACCTCAAACCTCGACAAGTGGAAGTTTGGTTTCAAAACAGACGTGCAAGAACGAAATTGAAACAAACGGAAGTAGATTGCGAGTTTTTGAAGAAATGTTGTGAAAGGTTGAATGAAGAGAATCGAAGGTTGAAGAAAGAGTTGCATGAATTAAGATCCATAAAACTTGGAGCTTCACAGTTGTATATTCAGCTGCCAAAGGCGGCGACGCTCACAATTTGTCCCTCATGCGACAAAATTACCAGGACGGCCGCCGCCAACGCCGCCGTGGAGCCCAATTCTCCGCCACAATAATTTAATCACCTTTTTTTTTTTTTTTTTTAAAGGCTGTGTATGTTTAGTACATCAAAAAGATCAATAATTTAATTATTCAACAACAATACAGCCTTTTTAGATTACATGAATCATCTAAATTTTATGTTATGGCAAAATGGTGCTGATGTATAATCCAAATCTATGAAAAG

Coding sequence (CDS)

ATGGGAGATCAAGATGAGCTCTGCAACATCAGGCTTGGGCTTGGCCTTGGCCTTGGCTTGGGATTTGGTGAATATGTCCCAAAAAAGATGCAAAAAATCAACAACAAAAACAACCCCAAATTCTTCCCTGACCTCTCTTTCACTCTCATTCCAAAGGAAGAATTAGCCATTAACATGGAGGTTGAAGCTAATCATTCAATTAGTAGAAGCAATGAAACTAATTCCCAAGATTCTTCCTTTGGCAACAACAACAACAACAACGCCATTAATGGATCAGAAAGAGAGAGAAAAAAGCTTAGGCTTTCAAAAGAACAGTCCACTTTGCTCGAAGAAAGCTTCAAACTTCACACCACTTTGAATCCGGCTCAGAAGCAGGCACTTGCCCAACAATTAAACCTCAAACCTCGACAAGTGGAAGTTTGGTTTCAAAACAGACGTGCAAGAACGAAATTGAAACAAACGGAAGTAGATTGCGAGTTTTTGAAGAAATGTTGTGAAAGGTTGAATGAAGAGAATCGAAGGTTGAAGAAAGAGTTGCATGAATTAAGATCCATAAAACTTGGAGCTTCACAGTTGTATATTCAGCTGCCAAAGGCGGCGACGCTCACAATTTGTCCCTCATGCGACAAAATTACCAGGACGGCCGCCGCCAACGCCGCCGTGGAGCCCAATTCTCCGCCACAATAA

Protein sequence

MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKINNKNNPKFFPDLSFTLIPKEELAINMEVEANHSISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAAVEPNSPPQ
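
The protein sequence above should correspond to a translation of the CDS. The sketch below shows one way to check that relationship in plain Python. It assumes the standard nuclear genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated on this page; the function name `translate` is hypothetical.

```python
import itertools

# Standard genetic code (NCBI table 1), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGGGAGATCAAGATGAGCTCTGCAAC"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.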
Homology
BLAST of Tan0015388 vs. ExPASy Swiss-Prot
Match: P46604 (Homeobox-leucine zipper protein HAT22 OS=Arabidopsis thaliana OX=3702 GN=HAT22 PE=1 SV=1)

HSP 1 Score: 166.8 bits (421), Expect = 2.9e-40
Identity = 113/238 (47.48%), Positives = 142/238 (59.66%), Query Frame = 0

Query: 5   DELCNIRLGLGLGLGLGFGEY--VPKKMQKINNKNNPKFFPDLSFTLIPKEELAINMEVE 64
           D+ CN  L LGLGL      Y    KK     +    +  P L+ +L   E   I     
Sbjct: 4   DDSCNTGLVLGLGLSPTPNNYNHAIKKSSSTVDHRFIRLDPSLTLSL-SGESYKIKTGAG 63

Query: 65  ANHSISRSNETNSQDSSF--GNNNNNNAINGSERE------------------------- 124
           A   I R   ++S  SSF  G       I+G + E                         
Sbjct: 64  AGDQICRQTSSHSGISSFSSGRVKREREISGGDGEEEAEETTERVVCSRVSDDHDDEEGV 123

Query: 125 --RKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQ 184
             RKKLRL+K+QS LLE++FKLH+TLNP QKQALA+QLNL+PRQVEVWFQNRRARTKLKQ
Sbjct: 124 SARKKLRLTKQQSALLEDNFKLHSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQ 183

Query: 185 TEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAATLTICPSCDKI 212
           TEVDCEFLKKCCE L +ENRRL+KEL +L+++KL +   Y+ +P AATLT+CPSC+++
Sbjct: 184 TEVDCEFLKKCCETLTDENRRLQKELQDLKALKL-SQPFYMHMP-AATLTMCPSCERL 238
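
Each HSP header reports identity and positive counts as a fraction of the alignment length, with the percentage in parentheses. The sketch below recomputes those percentages from the raw counts. The function name `parse_hsp_stats` is hypothetical, and the regular expression is written against the line layout used on this page only; it also tolerates the page's "Postives" spelling.

```python
import re

# Matches e.g. "Identity = 113/238 (47.48%), Positives = 142/238 (59.66%)".
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return identity and positives percentages recomputed from the counts."""
    m = HSP_STATS_RE.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 113/238 (47.48%), Positives = 142/238 (59.66%), Query Frame = 0"
)
print(stats)
```

Note that the denominator is the alignment length including gaps, not the length of the query protein, so percentages from HSPs that cover different regions are not directly comparable.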

BLAST of Tan0015388 vs. ExPASy Swiss-Prot
Match: P46602 (Homeobox-leucine zipper protein HAT3 OS=Arabidopsis thaliana OX=3702 GN=HAT3 PE=1 SV=2)

HSP 1 Score: 164.5 bits (415), Expect = 1.4e-39
Identity = 85/143 (59.44%), Positives = 116/143 (81.12%), Query Frame = 0

Query: 81  GNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKPRQVEV 140
           G+++ + + NG +  RKKLRLSKEQ+ +LEE+FK H+TLNP QK ALA+QLNL+ RQVEV
Sbjct: 146 GSDDEDGSGNGDDSSRKKLRLSKEQALVLEETFKEHSTLNPKQKMALAKQLNLRTRQVEV 205

Query: 141 WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAA 200
           WFQNRRARTKLKQTEVDCE+LK+CCE L +ENRRL+KE+ ELR++KL +  LY+ +    
Sbjct: 206 WFQNRRARTKLKQTEVDCEYLKRCCENLTDENRRLQKEVSELRALKL-SPHLYMHMKPPT 265

Query: 201 TLTICPSCDKITRTAAANAAVEP 224
           TLT+CPSC+++  T+++++   P
Sbjct: 266 TLTMCPSCERVAVTSSSSSVAPP 287

BLAST of Tan0015388 vs. ExPASy Swiss-Prot
Match: P46665 (Homeobox-leucine zipper protein HAT14 OS=Arabidopsis thaliana OX=3702 GN=HAT14 PE=2 SV=3)

HSP 1 Score: 163.3 bits (412), Expect = 3.2e-39
Identity = 96/153 (62.75%), Positives = 121/153 (79.08%), Query Frame = 0

Query: 69  RSNETNSQD-----SSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLNPAQ 128
           RSN+ +  D     +S  +N +N+  NGS   RKKLRLSK+QS  LE+SFK H+TLNP Q
Sbjct: 159 RSNKRDIDDEVERSASRASNEDNDDENGS--TRKKLRLSKDQSAFLEDSFKEHSTLNPKQ 218

Query: 129 KQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELR 188
           K ALA+QLNL+PRQVEVWFQNRRARTKLKQTEVDCE+LK+CCE L EENRRL+KE+ ELR
Sbjct: 219 KIALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCESLTEENRRLQKEVKELR 278

Query: 189 SIKLGASQLYIQLPKAATLTICPSCDKITRTAA 217
           ++K  ++  Y+QLP A TLT+CPSC+++  +AA
Sbjct: 279 TLKT-STPFYMQLP-ATTLTMCPSCERVATSAA 307

BLAST of Tan0015388 vs. ExPASy Swiss-Prot
Match: Q01I23 (Homeobox-leucine zipper protein HOX17 OS=Oryza sativa subsp. indica OX=39946 GN=HOX17 PE=2 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 5.5e-39
Identity = 84/134 (62.69%), Positives = 106/134 (79.10%), Query Frame = 0

Query: 81  GNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKPRQVEV 140
           G ++  +   G +  RKKLRLSK+QS +LE+SF+ H TLNP QK  LAQQL L+PRQVEV
Sbjct: 66  GGSDEEDGGCGIDGSRKKLRLSKDQSAVLEDSFREHPTLNPRQKATLAQQLGLRPRQVEV 125

Query: 141 WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAA 200
           WFQNRRARTKLKQTEVDCEFLK+CCE L EENRRL+KE+ ELR++KL +  LY+ +    
Sbjct: 126 WFQNRRARTKLKQTEVDCEFLKRCCETLTEENRRLQKEVQELRALKLVSPHLYMNMSPPT 185

Query: 201 TLTICPSCDKITRT 215
           TLT+CPSC++++ T
Sbjct: 186 TLTMCPSCERVSNT 199

BLAST of Tan0015388 vs. ExPASy Swiss-Prot
Match: Q0JB92 (Homeobox-leucine zipper protein HOX17 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX17 PE=2 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 5.5e-39
Identity = 84/134 (62.69%), Positives = 106/134 (79.10%), Query Frame = 0

Query: 81  GNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKPRQVEV 140
           G ++  +   G +  RKKLRLSK+QS +LE+SF+ H TLNP QK  LAQQL L+PRQVEV
Sbjct: 66  GGSDEEDGGCGIDGSRKKLRLSKDQSAVLEDSFREHPTLNPRQKATLAQQLGLRPRQVEV 125

Query: 141 WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAA 200
           WFQNRRARTKLKQTEVDCEFLK+CCE L EENRRL+KE+ ELR++KL +  LY+ +    
Sbjct: 126 WFQNRRARTKLKQTEVDCEFLKRCCETLTEENRRLQKEVQELRALKLVSPHLYMNMSPPT 185

Query: 201 TLTICPSCDKITRT 215
           TLT+CPSC++++ T
Sbjct: 186 TLTMCPSCERVSNT 199

BLAST of Tan0015388 vs. NCBI nr
Match: KAG6587716.1 (Homeobox-leucine zipper protein HAT4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 323.6 bits (828), Expect = 1.4e-84
Identity = 182/228 (79.82%), Positives = 187/228 (82.02%), Query Frame = 0

Query: 1   MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKIN-NKNNPKFFPDLSFTLIPKEELAINM 60
           MGDQD+LCNIR  LGL LG GFGEYVPKKMQKIN N NNPK F DLSFTL+PK+ELAIN 
Sbjct: 1   MGDQDDLCNIR--LGLSLGSGFGEYVPKKMQKINSNHNNPKSFTDLSFTLVPKQELAIN- 60

Query: 61  EVEANHSISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTL 120
                     S  TNS  S              ERERKKLRLSKEQ+TLLEESFKLHTTL
Sbjct: 61  ---------SSTTTNSLGSE------------RERERKKLRLSKEQATLLEESFKLHTTL 120

Query: 121 NPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL 180
           NPAQKQALA QLNLK RQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL
Sbjct: 121 NPAQKQALAHQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL 180

Query: 181 HELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAAVEPNSPP 228
           HELRSIKLGASQLYIQLPKAATLTICPSCDKITRT AANAA +PNSPP
Sbjct: 181 HELRSIKLGASQLYIQLPKAATLTICPSCDKITRTPAANAAADPNSPP 204

BLAST of Tan0015388 vs. NCBI nr
Match: XP_023531587.1 (homeobox-leucine zipper protein HAT3-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 322.8 bits (826), Expect = 2.4e-84
Identity = 182/228 (79.82%), Positives = 186/228 (81.58%), Query Frame = 0

Query: 1   MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKIN-NKNNPKFFPDLSFTLIPKEELAINM 60
           MGDQD+LCNIR  LGL LG GFGEYVPKKMQKIN N NNPK F DLSFTL+PK+EL IN 
Sbjct: 1   MGDQDDLCNIR--LGLSLGSGFGEYVPKKMQKINSNHNNPKSFTDLSFTLVPKQELPIN- 60

Query: 61  EVEANHSISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTL 120
                     S  TNS  S              ERERKKLRLSKEQ+TLLEESFKLHTTL
Sbjct: 61  ---------SSTTTNSLGSE------------RERERKKLRLSKEQATLLEESFKLHTTL 120

Query: 121 NPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL 180
           NPAQKQALA QLNLK RQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL
Sbjct: 121 NPAQKQALAHQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL 180

Query: 181 HELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAAVEPNSPP 228
           HELRSIKLGASQLYIQLPKAATLTICPSCDKITRT AANAA EPNSPP
Sbjct: 181 HELRSIKLGASQLYIQLPKAATLTICPSCDKITRTPAANAAAEPNSPP 204

BLAST of Tan0015388 vs. NCBI nr
Match: XP_022931479.1 (homeobox-leucine zipper protein HAT3-like [Cucurbita moschata])

HSP 1 Score: 321.6 bits (823), Expect = 5.4e-84
Identity = 182/228 (79.82%), Positives = 186/228 (81.58%), Query Frame = 0

Query: 1   MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKIN-NKNNPKFFPDLSFTLIPKEELAINM 60
           MGDQD+LCNIR  LGL LG GFGEYVPKKMQKIN N NNPK   DLSFTL+PK+ELAIN 
Sbjct: 1   MGDQDDLCNIR--LGLSLGSGFGEYVPKKMQKINSNHNNPKSCTDLSFTLVPKQELAIN- 60

Query: 61  EVEANHSISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTL 120
                     S  TNS  S              ERERKKLRLSKEQ+TLLEESFKLHTTL
Sbjct: 61  ---------SSTTTNSLGSE------------RERERKKLRLSKEQATLLEESFKLHTTL 120

Query: 121 NPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL 180
           NPAQKQALA QLNLK RQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL
Sbjct: 121 NPAQKQALAHQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL 180

Query: 181 HELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAAVEPNSPP 228
           HELRSIKLGASQLYIQLPKAATLTICPSCDKITRT AANAA EPNSPP
Sbjct: 181 HELRSIKLGASQLYIQLPKAATLTICPSCDKITRTPAANAAAEPNSPP 204

BLAST of Tan0015388 vs. NCBI nr
Match: XP_022134791.1 (homeobox-leucine zipper protein HAT22-like [Momordica charantia])

HSP 1 Score: 293.5 bits (750), Expect = 1.6e-75
Identity = 172/224 (76.79%), Positives = 188/224 (83.93%), Query Frame = 0

Query: 1   MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKINNKNNPKFFPDLSFTLIPKEELAINME 60
           MG  DE+CNIRLGLGLG G    EYVPKK  KINN +NPKFF DLSFTLIPKEE AIN+E
Sbjct: 1   MGGDDEICNIRLGLGLGFG---EEYVPKK--KINN-HNPKFFSDLSFTLIPKEE-AINVE 60

Query: 61  VEANHS----ISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLH 120
           +EA+ S    + R    N+QD     ++++  INGS  ERKKLRLSKEQS LLEESFKLH
Sbjct: 61  IEASSSDHDHLKRIRSNNNQDQI--RDSSSIVINGSS-ERKKLRLSKEQSNLLEESFKLH 120

Query: 121 TTLNPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLK 180
           TTLNPAQKQALAQQLNLK RQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLK
Sbjct: 121 TTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLK 180

Query: 181 KELHELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAA 221
           KE+ ELRS+K+GASQLYIQLPKAATLTICPSC+K+TR AAA AA
Sbjct: 181 KEVQELRSLKIGASQLYIQLPKAATLTICPSCNKLTRNAAATAA 214

BLAST of Tan0015388 vs. NCBI nr
Match: XP_038879995.1 (homeobox-leucine zipper protein HOX17-like [Benincasa hispida])

HSP 1 Score: 288.9 bits (738), Expect = 3.9e-74
Identity = 186/280 (66.43%), Positives = 199/280 (71.07%), Query Frame = 0

Query: 1   MGD-QDELCNIRLGLGLGLGLGFGEYVPKKMQKINNKNNPKFFPDLSFTLIPKEELA--- 60
           MGD QDE+CNI   L LGLG G  +YVPKKMQKINN         LSFTLIPKEEL    
Sbjct: 1   MGDHQDEVCNIS-WLSLGLGFGDDQYVPKKMQKINNN-------QLSFTLIPKEELGINN 60

Query: 61  -----INMEV---EAN--------HSISRSNETNS------QDSSFG------------N 120
                INME+   EAN        H + RS   N+      QDSSFG             
Sbjct: 61  NNNSNINMEIDDEEANSSEEDHHHHLMKRSRSNNNIVNYDHQDSSFGIRSRSSSDHHHHQ 120

Query: 121 NNNNNAINGSE-------------RERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQ 180
           ++NNN I  +              RERKKLRLSKEQSTLLEESFKL+TTLNPAQKQALAQ
Sbjct: 121 SSNNNIITTNHNHKGISSSGASELRERKKLRLSKEQSTLLEESFKLNTTLNPAQKQALAQ 180

Query: 181 QLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGA 229
           QLNLK RQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHEL+S+KLGA
Sbjct: 181 QLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELKSLKLGA 240

BLAST of Tan0015388 vs. ExPASy TrEMBL
Match: A0A6J1EUD1 (homeobox-leucine zipper protein HAT3-like OS=Cucurbita moschata OX=3662 GN=LOC111437643 PE=4 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 2.6e-84
Identity = 182/228 (79.82%), Positives = 186/228 (81.58%), Query Frame = 0

Query: 1   MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKIN-NKNNPKFFPDLSFTLIPKEELAINM 60
           MGDQD+LCNIR  LGL LG GFGEYVPKKMQKIN N NNPK   DLSFTL+PK+ELAIN 
Sbjct: 1   MGDQDDLCNIR--LGLSLGSGFGEYVPKKMQKINSNHNNPKSCTDLSFTLVPKQELAIN- 60

Query: 61  EVEANHSISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTL 120
                     S  TNS  S              ERERKKLRLSKEQ+TLLEESFKLHTTL
Sbjct: 61  ---------SSTTTNSLGSE------------RERERKKLRLSKEQATLLEESFKLHTTL 120

Query: 121 NPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL 180
           NPAQKQALA QLNLK RQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL
Sbjct: 121 NPAQKQALAHQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL 180

Query: 181 HELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAAVEPNSPP 228
           HELRSIKLGASQLYIQLPKAATLTICPSCDKITRT AANAA EPNSPP
Sbjct: 181 HELRSIKLGASQLYIQLPKAATLTICPSCDKITRTPAANAAAEPNSPP 204

BLAST of Tan0015388 vs. ExPASy TrEMBL
Match: A0A6J1BYS3 (homeobox-leucine zipper protein HAT22-like OS=Momordica charantia OX=3673 GN=LOC111006976 PE=4 SV=1)

HSP 1 Score: 293.5 bits (750), Expect = 7.6e-76
Identity = 172/224 (76.79%), Positives = 188/224 (83.93%), Query Frame = 0

Query: 1   MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKINNKNNPKFFPDLSFTLIPKEELAINME 60
           MG  DE+CNIRLGLGLG G    EYVPKK  KINN +NPKFF DLSFTLIPKEE AIN+E
Sbjct: 1   MGGDDEICNIRLGLGLGFG---EEYVPKK--KINN-HNPKFFSDLSFTLIPKEE-AINVE 60

Query: 61  VEANHS----ISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLH 120
           +EA+ S    + R    N+QD     ++++  INGS  ERKKLRLSKEQS LLEESFKLH
Sbjct: 61  IEASSSDHDHLKRIRSNNNQDQI--RDSSSIVINGSS-ERKKLRLSKEQSNLLEESFKLH 120

Query: 121 TTLNPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLK 180
           TTLNPAQKQALAQQLNLK RQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLK
Sbjct: 121 TTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLK 180

Query: 181 KELHELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAA 221
           KE+ ELRS+K+GASQLYIQLPKAATLTICPSC+K+TR AAA AA
Sbjct: 181 KEVQELRSLKIGASQLYIQLPKAATLTICPSCNKLTRNAAATAA 214

BLAST of Tan0015388 vs. ExPASy TrEMBL
Match: A0A6J1E158 (homeobox-leucine zipper protein ATHB-4-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111429651 PE=4 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 3.2e-74
Identity = 168/228 (73.68%), Positives = 182/228 (79.82%), Query Frame = 0

Query: 1   MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKINNKNNPKFFPDLSFTLIPKEELAINME 60
           MGDQDELCN R    LGL LGFG+YVPKKMQK N +  PKF  DLSF+LIP++E AINM+
Sbjct: 1   MGDQDELCNTR----LGLALGFGDYVPKKMQKANKQ--PKFLSDLSFSLIPRQESAINMQ 60

Query: 61  VEANHSISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLN 120
           ++AN     S+   S+D S  N N N    G ERERKKLRLS+EQ TLLEE+FKLHTTLN
Sbjct: 61  LQANEPSKDSSFGISRDRSSTNYNCNAISGGLERERKKLRLSQEQLTLLEETFKLHTTLN 120

Query: 121 PAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELH 180
            AQK ALAQQLNLK RQVEVWFQNRRAR+KLKQTEVDCEFLKK CERL EEN RLKKEL 
Sbjct: 121 LAQKLALAQQLNLKARQVEVWFQNRRARSKLKQTEVDCEFLKKYCERLKEENGRLKKELQ 180

Query: 181 ELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAAVEPNSPPQ 229
           ELRS KLGASQLYIQLPKAATLTICPSCDK TR A   AAVE +SPPQ
Sbjct: 181 ELRSTKLGASQLYIQLPKAATLTICPSCDKTTRPA---AAVEAHSPPQ 219

BLAST of Tan0015388 vs. ExPASy TrEMBL
Match: A0A6J1JA40 (homeobox-leucine zipper protein ATHB-4-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484923 PE=4 SV=1)

HSP 1 Score: 278.5 bits (711), Expect = 2.5e-71
Identity = 162/228 (71.05%), Positives = 179/228 (78.51%), Query Frame = 0

Query: 1   MGDQDELCNIRLGLGLGLGLGFGEYVPKKMQKINNKNNPKFFPDLSFTLIPKEELAINME 60
           MGDQDELCN R    LGL LGFG+YVPK MQK N +  PKF  DLSF+LIP++E AINM+
Sbjct: 1   MGDQDELCNTR----LGLALGFGDYVPKTMQKANKQ--PKFLSDLSFSLIPRQESAINMQ 60

Query: 61  VEANHSISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLN 120
           ++AN     S+   ++D S  N N N    G ER+RKKLRLS+EQ TLLEE+FKLHTTLN
Sbjct: 61  LQANEPSKDSSFGITRDRSSTNYNCNAISGGLERDRKKLRLSQEQLTLLEETFKLHTTLN 120

Query: 121 PAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELH 180
            AQK ALA QLNLK RQVEVWFQNRRAR+KLKQTEVDCEFLKK CERL EEN RLKKEL 
Sbjct: 121 LAQKLALADQLNLKSRQVEVWFQNRRARSKLKQTEVDCEFLKKYCERLKEENGRLKKELQ 180

Query: 181 ELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAAVEPNSPPQ 229
           ELRS K+GASQLYIQLPKAATLTICPSCDK TR     AAVE +SPPQ
Sbjct: 181 ELRSRKIGASQLYIQLPKAATLTICPSCDKTTRPV---AAVEAHSPPQ 219

BLAST of Tan0015388 vs. ExPASy TrEMBL
Match: A0A5A7USQ4 (Homeobox-leucine zipper protein HOX18-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G005100 PE=4 SV=1)

HSP 1 Score: 273.1 bits (697), Expect = 1.1e-69
Identity = 175/274 (63.87%), Positives = 196/274 (71.53%), Query Frame = 0

Query: 4   QDELCNIRLGLGLGLGLGFG-EYVPKKMQKINNKNNPKFFPDLSFTLIPKEELAI----N 63
           +DE+CNI     L LGLGFG +YVPKK+QK  ++        +SFTLIPKEEL I    N
Sbjct: 6   EDEICNIS---WLSLGLGFGDQYVPKKIQKNQHQQQ-----QVSFTLIPKEELEITNNNN 65

Query: 64  MEVE--------------------ANHSISRSNETNSQDSSFGN---------NNNNNAI 123
           ME++                    +N++I   +  + QDSSFG+          NNN+ +
Sbjct: 66  MEIDDDEVNSSEEDDDHHLMKRIRSNNNIVNYDHHHRQDSSFGSIRRLSSDQYINNNDIV 125

Query: 124 N------------GSE-RERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKPR 183
           N            GSE RERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLK R
Sbjct: 126 NSTNHNYKGISSSGSELRERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKTR 185

Query: 184 QVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQL 229
           QVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKEL+ELRS+KLGASQLYIQL
Sbjct: 186 QVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELNELRSLKLGASQLYIQL 245

BLAST of Tan0015388 vs. TAIR 10
Match: AT4G37790.1 (Homeobox-leucine zipper protein family )

HSP 1 Score: 166.8 bits (421), Expect = 2.1e-41
Identity = 113/238 (47.48%), Positives = 142/238 (59.66%), Query Frame = 0

Query: 5   DELCNIRLGLGLGLGLGFGEY--VPKKMQKINNKNNPKFFPDLSFTLIPKEELAINMEVE 64
           D+ CN  L LGLGL      Y    KK     +    +  P L+ +L   E   I     
Sbjct: 4   DDSCNTGLVLGLGLSPTPNNYNHAIKKSSSTVDHRFIRLDPSLTLSL-SGESYKIKTGAG 63

Query: 65  ANHSISRSNETNSQDSSF--GNNNNNNAINGSERE------------------------- 124
           A   I R   ++S  SSF  G       I+G + E                         
Sbjct: 64  AGDQICRQTSSHSGISSFSSGRVKREREISGGDGEEEAEETTERVVCSRVSDDHDDEEGV 123

Query: 125 --RKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQ 184
             RKKLRL+K+QS LLE++FKLH+TLNP QKQALA+QLNL+PRQVEVWFQNRRARTKLKQ
Sbjct: 124 SARKKLRLTKQQSALLEDNFKLHSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQ 183

Query: 185 TEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAATLTICPSCDKI 212
           TEVDCEFLKKCCE L +ENRRL+KEL +L+++KL +   Y+ +P AATLT+CPSC+++
Sbjct: 184 TEVDCEFLKKCCETLTDENRRLQKELQDLKALKL-SQPFYMHMP-AATLTMCPSCERL 238

BLAST of Tan0015388 vs. TAIR 10
Match: AT3G60390.1 (homeobox-leucine zipper protein 3 )

HSP 1 Score: 164.5 bits (415), Expect = 1.0e-40
Identity = 85/143 (59.44%), Positives = 116/143 (81.12%), Query Frame = 0

Query: 81  GNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLNPAQKQALAQQLNLKPRQVEV 140
           G+++ + + NG +  RKKLRLSKEQ+ +LEE+FK H+TLNP QK ALA+QLNL+ RQVEV
Sbjct: 146 GSDDEDGSGNGDDSSRKKLRLSKEQALVLEETFKEHSTLNPKQKMALAKQLNLRTRQVEV 205

Query: 141 WFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELRSIKLGASQLYIQLPKAA 200
           WFQNRRARTKLKQTEVDCE+LK+CCE L +ENRRL+KE+ ELR++KL +  LY+ +    
Sbjct: 206 WFQNRRARTKLKQTEVDCEYLKRCCENLTDENRRLQKEVSELRALKL-SPHLYMHMKPPT 265

Query: 201 TLTICPSCDKITRTAAANAAVEP 224
           TLT+CPSC+++  T+++++   P
Sbjct: 266 TLTMCPSCERVAVTSSSSSVAPP 287

BLAST of Tan0015388 vs. TAIR 10
Match: AT5G06710.1 (homeobox from Arabidopsis thaliana )

HSP 1 Score: 163.3 bits (412), Expect = 2.3e-40
Identity = 96/153 (62.75%), Positives = 121/153 (79.08%), Query Frame = 0

Query: 69  RSNETNSQD-----SSFGNNNNNNAINGSERERKKLRLSKEQSTLLEESFKLHTTLNPAQ 128
           RSN+ +  D     +S  +N +N+  NGS   RKKLRLSK+QS  LE+SFK H+TLNP Q
Sbjct: 159 RSNKRDIDDEVERSASRASNEDNDDENGS--TRKKLRLSKDQSAFLEDSFKEHSTLNPKQ 218

Query: 129 KQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELHELR 188
           K ALA+QLNL+PRQVEVWFQNRRARTKLKQTEVDCE+LK+CCE L EENRRL+KE+ ELR
Sbjct: 219 KIALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCESLTEENRRLQKEVKELR 278

Query: 189 SIKLGASQLYIQLPKAATLTICPSCDKITRTAA 217
           ++K  ++  Y+QLP A TLT+CPSC+++  +AA
Sbjct: 279 TLKT-STPFYMQLP-ATTLTMCPSCERVATSAA 307

BLAST of Tan0015388 vs. TAIR 10
Match: AT2G44910.1 (homeobox-leucine zipper protein 4 )

HSP 1 Score: 155.2 bits (391), Expect = 6.2e-38
Identity = 92/173 (53.18%), Positives = 121/173 (69.94%), Query Frame = 0

Query: 52  KEELAINMEVEANH----SISRSNETNSQDSSFGNNNNNNAINGSERERKKLRLSKEQST 111
           K +LA+    + N     S SR   +   D   G N + +        RKKLRLSK+Q+ 
Sbjct: 122 KRDLAVARGGDENEAERASCSRGGGSGGSDDEDGGNGDGS--------RKKLRLSKDQAL 181

Query: 112 LLEESFKLHTTLNPAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCER 171
           +LEE+FK H+TLNP QK ALA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCE+LK+CC+ 
Sbjct: 182 VLEETFKEHSTLNPKQKLALAKQLNLRARQVEVWFQNRRARTKLKQTEVDCEYLKRCCDN 241

Query: 172 LNEENRRLKKELHELRSIKLGASQLYIQLPKAATLTICPSCDKITRTAAANAA 221
           L EENRRL+KE+ ELR++KL +  LY+ +    TLT+CPSC++++ +AA   A
Sbjct: 242 LTEENRRLQKEVSELRALKL-SPHLYMHMTPPTTLTMCPSCERVSSSAATVTA 285

BLAST of Tan0015388 vs. TAIR 10
Match: AT4G16780.1 (homeobox protein 2 )

HSP 1 Score: 152.1 bits (383), Expect = 5.3e-37
Identity = 87/152 (57.24%), Positives = 112/152 (73.68%), Query Frame = 0

Query: 64  NHSISRSNETNSQDSSFGNNNNNNAINGSE---RERKKLRLSKEQSTLLEESFKLHTTLN 123
           N ++S S    S+     +   +  I+  E     RKKLRLSK+QS +LEE+FK H+TLN
Sbjct: 93  NSTVSSSTGKRSEREEDTDPQGSRGISDDEDGDNSRKKLRLSKDQSAILEETFKDHSTLN 152

Query: 124 PAQKQALAQQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRRLKKELH 183
           P QKQALA+QL L+ RQVEVWFQNRRARTKLKQTEVDCEFL++CCE L EENRRL+KE+ 
Sbjct: 153 PKQKQALAKQLGLRARQVEVWFQNRRARTKLKQTEVDCEFLRRCCENLTEENRRLQKEVT 212

Query: 184 ELRSIKLGASQLYIQLPKAATLTICPSCDKIT 213
           ELR++KL + Q Y+ +    TLT+CPSC+ ++
Sbjct: 213 ELRALKL-SPQFYMHMSPPTTLTMCPSCEHVS 243

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name	E-value	Identity	Description
P46604	2.9e-40	47.48	Homeobox-leucine zipper protein HAT22 OS=Arabidopsis thaliana OX=3702 GN=HAT22 P...
P46602	1.4e-39	59.44	Homeobox-leucine zipper protein HAT3 OS=Arabidopsis thaliana OX=3702 GN=HAT3 PE=...
P46665	3.2e-39	62.75	Homeobox-leucine zipper protein HAT14 OS=Arabidopsis thaliana OX=3702 GN=HAT14 P...
Q01I23	5.5e-39	62.69	Homeobox-leucine zipper protein HOX17 OS=Oryza sativa subsp. indica OX=39946 GN=...
Q0JB92	5.5e-39	62.69	Homeobox-leucine zipper protein HOX17 OS=Oryza sativa subsp. japonica OX=39947 G...

NCBI nr
Match Name	E-value	Identity	Description
KAG6587716.1	1.4e-84	79.82	Homeobox-leucine zipper protein HAT4, partial [Cucurbita argyrosperma subsp. sor...
XP_023531587.1	2.4e-84	79.82	homeobox-leucine zipper protein HAT3-like [Cucurbita pepo subsp. pepo]
XP_022931479.1	5.4e-84	79.82	homeobox-leucine zipper protein HAT3-like [Cucurbita moschata]
XP_022134791.1	1.6e-75	76.79	homeobox-leucine zipper protein HAT22-like [Momordica charantia]
XP_038879995.1	3.9e-74	66.43	homeobox-leucine zipper protein HOX17-like [Benincasa hispida]

ExPASy TrEMBL
Match Name	E-value	Identity	Description
A0A6J1EUD1	2.6e-84	79.82	homeobox-leucine zipper protein HAT3-like OS=Cucurbita moschata OX=3662 GN=LOC11...
A0A6J1BYS3	7.6e-76	76.79	homeobox-leucine zipper protein HAT22-like OS=Momordica charantia OX=3673 GN=LOC...
A0A6J1E158	3.2e-74	73.68	homeobox-leucine zipper protein ATHB-4-like isoform X1 OS=Cucurbita moschata OX=...
A0A6J1JA40	2.5e-71	71.05	homeobox-leucine zipper protein ATHB-4-like isoform X1 OS=Cucurbita maxima OX=36...
A0A5A7USQ4	1.1e-69	63.87	Homeobox-leucine zipper protein HOX18-like OS=Cucumis melo var. makuwa OX=119469...

TAIR 10
Match Name	E-value	Identity	Description
AT4G37790.1	2.1e-41	47.48	Homeobox-leucine zipper protein family
AT3G60390.1	1.0e-40	59.44	homeobox-leucine zipper protein 3
AT5G06710.1	2.3e-40	62.75	homeobox from Arabidopsis thaliana
AT2G44910.1	6.2e-38	53.18	homeobox-leucine zipper protein 4
AT4G16780.1	5.3e-37	57.24	homeobox protein 2

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 151..185
None	No IPR available	GENE3D	1.10.10.60		coord: 100..150; e-value: 4.1E-19; score: 69.9
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 67..90
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 67..97
None	No IPR available	PANTHER	PTHR45714:SF26	HOMEOBOX ASSOCIATED LEUCINE ZIPPER PROTEIN	coord: 21..215
None	No IPR available	PANTHER	PTHR45714	FAMILY NOT NAMED	coord: 21..215
IPR003106	Leucine zipper, homeobox-associated	SMART	SM00340	halz	coord: 152..195; e-value: 1.6E-18; score: 77.5
IPR003106	Leucine zipper, homeobox-associated	PFAM	PF02183	HALZ	coord: 152..184; e-value: 8.1E-9; score: 35.5
IPR001356	Homeobox domain	SMART	SM00389	HOX_1	coord: 94..156; e-value: 1.1E-15; score: 68.2
IPR001356	Homeobox domain	PFAM	PF00046	Homeodomain	coord: 96..150; e-value: 5.0E-16; score: 58.3
IPR001356	Homeobox domain	PROSITE	PS50071	HOMEOBOX_2	coord: 92..152; score: 17.750473
IPR001356	Homeobox domain	CDD	cd00086	homeodomain	coord: 96..153; e-value: 1.79871E-14; score: 63.8016
IPR017970	Homeobox, conserved site	PROSITE	PS00027	HOMEOBOX_1	coord: 127..150
IPR009057	Homeobox-like domain superfamily	SUPERFAMILY	46689	Homeodomain-like	coord: 76..153
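
The `coord` values in the InterPro table can be used to pull the matching residues out of the protein sequence. The sketch below does this for two of the listed hits. It assumes the coordinates are 1-based, inclusive residue positions on the Tan0015388 protein listed earlier on this page; the helper name `extract_region` is hypothetical.

```python
# Tan0015388 protein sequence, copied from the "Protein sequence" section
# and split into blocks of ten residues for readability.
PROTEIN = (
    "MGDQDELCNI" "RLGLGLGLGL" "GFGEYVPKKM" "QKINNKNNPK" "FFPDLSFTLI"
    "PKEELAINME" "VEANHSISRS" "NETNSQDSSF" "GNNNNNNAIN" "GSERERKKLR"
    "LSKEQSTLLE" "ESFKLHTTLN" "PAQKQALAQQ" "LNLKPRQVEV" "WFQNRRARTK"
    "LKQTEVDCEF" "LKKCCERLNE" "ENRRLKKELH" "ELRSIKLGAS" "QLYIQLPKAA"
    "TLTICPSCDK" "ITRTAAANAA" "VEPNSPPQ"
)

def extract_region(protein, start, end):
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]

# Pfam PF00046 (Homeodomain), coord: 96..150
homeodomain = extract_region(PROTEIN, 96, 150)
# PROSITE PS00027 (HOMEOBOX_1), coord: 127..150
signature = extract_region(PROTEIN, 127, 150)

print(len(homeodomain), homeodomain)
print(len(signature), signature)
```

The same helper applies to any other row of the table, for example the HALZ leucine zipper hits that start at residue 152, directly after the homeodomain.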

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0015388.1	Tan0015388.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006357	regulation of transcription by RNA polymerase II
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005634	nucleus
molecular_function	GO:0000981	DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function	GO:0043565	sequence-specific DNA binding
molecular_function	GO:0003677	DNA binding