CmaCh01G017640 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh01G017640
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Homeobox-leucine zipper protein family
Location: Cma_Chr01: 11857318 .. 11858247 (+)
RNA-Seq Expression: CmaCh01G017640
Synteny: CmaCh01G017640
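
The Location field can be turned into a gene span with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which the page itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'seqid: start .. end (strand)' string into its four parts."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cma_Chr01: 11857318 .. 11858247 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 930 bp on the plus strand of Cma_Chr01.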
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, polypeptide, three_prime_UTR
ATGGCCGGTGTAAAGAAGAAGAAGCAGCAGGTTTTGAAGTTTGATGACATTATTATGCCTTCTCTATCGTTGGGGTTGTCGATTGTTGTCGAATCCCCCAACGAGTTCATCCAGCTGAGTTCTTCGGGAAGTCCAGTGTCGTCGTTTTCAAATTCATCGGGGTATAAGAGGGAGAGAGACGGCTGCGGCGGTGAGGAGGCGGAGGAAGACGAAGATGGAAGTCCGAGGAAGAAACTTAGATTCACTAAAGAACAATCCGCCAATTTAGAAGAAAGCTTCAAAGAACACTCAACTTTCAGTCCTGTAAGTCTGTCTTTTACTTTTTATTTTTGGATTTAAAAAAAATATATATTAATTTTTGTTTAAACAGAAGCAAAAGCAGGAATTGGCAAGAAATTTAAAGCTAAGGGCAAGACAAGTGGAAGTATGGTTTCAAAATAGAAGAGCCAGGTACCTAATCACCCAATTAATTTCTCTAATTTTTAATTATTTAGCATAATTCACCGAGACCATGCAGAACCAAGCTGAAGCAAACAGAAATGGACTGTGAATTAATGAAGAAATGCTGTGAAAAGCTGAAAGAAGAGACCACAAGGCTTCAAAAGGAGCTTCAAGAGCTCAAATCACTCAAATTAACAGCTCCGCCGTTCGCCACCCTCACCGTTTGCCCTTCCTGTTAGAGGTCCATTTGCGGCGGCGGCGGCGGCGGTTGCGATGCATCTCAGGCCACCACCTCCTCGATTAGCCCAAAGCTTGACTTTCTTAAATTCCCGTTTAACCACCCGTCGGCGGCTTGTTAGGCAACCTAATTTTAATTATATCATTAATTAAAAAGCCCATGTACTTGTATGAGGGGGTGAGGAGGTGGGCCGAGGATGGGCATAATTATCCATTTATTTTCGTGGAGATAGAAAATAAAATTTTATACAG

mRNA sequence

ATGGCCGGTGTAAAGAAGAAGAAGCAGCAGGTTTTGAAGTTTGATGACATTATTATGCCTTCTCTATCGTTGGGGTTGTCGATTGTTGTCGAATCCCCCAACGAGTTCATCCAGCTGAGTTCTTCGGGAAGTCCAGTGTCGTCGTTTTCAAATTCATCGGGGTATAAGAGGGAGAGAGACGGCTGCGGCGGTGAGGAGGCGGAGGAAGACGAAGATGGAAGTCCGAGGAAGAAACTTAGATTCACTAAAGAACAATCCGCCAATTTAGAAGAAAGCTTCAAAGAACACTCAACTTTCAGTCCTAAGCAAAAGCAGGAATTGGCAAGAAATTTAAAGCTAAGGGCAAGACAAGTGGAAGTATGGTTTCAAAATAGAAGAGCCAGAACCAAGCTGAAGCAAACAGAAATGGACTGTGAATTAATGAAGAAATGCTGTGAAAAGCTGAAAGAAGAGACCACAAGGCTTCAAAAGGAGCTTCAAGAGCTCAAATCACTCAAATTAACAGCTCCGCCGTTCGCCACCCTCACCGTTTGCCCTTCCTGTTAGAGGTCCATTTGCGGCGGCGGCGGCGGCGGTTGCGATGCATCTCAGGCCACCACCTCCTCGATTAGCCCAAAGCTTGACTTTCTTAAATTCCCGTTTAACCACCCGTCGGCGGCTTGTTAGGCAACCTAATTTTAATTATATCATTAATTAAAAAGCCCATGTACTTGTATGAGGGGGTGAGGAGGTGGGCCGAGGATGGGCATAATTATCCATTTATTTTCGTGGAGATAGAAAATAAAATTTTATACAG

Coding sequence (CDS)

ATGGCCGGTGTAAAGAAGAAGAAGCAGCAGGTTTTGAAGTTTGATGACATTATTATGCCTTCTCTATCGTTGGGGTTGTCGATTGTTGTCGAATCCCCCAACGAGTTCATCCAGCTGAGTTCTTCGGGAAGTCCAGTGTCGTCGTTTTCAAATTCATCGGGGTATAAGAGGGAGAGAGACGGCTGCGGCGGTGAGGAGGCGGAGGAAGACGAAGATGGAAGTCCGAGGAAGAAACTTAGATTCACTAAAGAACAATCCGCCAATTTAGAAGAAAGCTTCAAAGAACACTCAACTTTCAGTCCTAAGCAAAAGCAGGAATTGGCAAGAAATTTAAAGCTAAGGGCAAGACAAGTGGAAGTATGGTTTCAAAATAGAAGAGCCAGAACCAAGCTGAAGCAAACAGAAATGGACTGTGAATTAATGAAGAAATGCTGTGAAAAGCTGAAAGAAGAGACCACAAGGCTTCAAAAGGAGCTTCAAGAGCTCAAATCACTCAAATTAACAGCTCCGCCGTTCGCCACCCTCACCGTTTGCCCTTCCTGTTAG

Protein sequence

MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERDGCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPSC
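
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and only the first ten codons of the CDS are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCCGGTGTAAAGAAGAAGAAGCAGCAG"))  # MAGVKKKKQQ
```

Passing the full CDS string to `translate` should reproduce the listed protein sequence, with the terminal TAG read as the stop codon.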
Homology
BLAST of CmaCh01G017640 vs. ExPASy Swiss-Prot
Match: P46603 (Homeobox-leucine zipper protein HAT9 OS=Arabidopsis thaliana OX=3702 GN=HAT9 PE=1 SV=2)

HSP 1 Score: 169.9 bits (429), Expect = 2.7e-41
Identity = 112/187 (59.89%), Positives = 129/187 (68.98%), Query Frame = 0

Query: 20  PSLSLGLS-----IVVESPNEFIQLSSSGSPVSSFSNSSGYKRERDGCGGEEA------- 79
           PSL+L LS      VV   ++  + +SS S VSSFS+    KRERD  GGEE+       
Sbjct: 38  PSLTLCLSGDPSVTVVTGADQLCRQTSSHSGVSSFSSGRVVKRERD--GGEESPEEEEMT 97

Query: 80  -------EEDEDG-SPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVE 139
                   EDE+G S RKKLR TK+QSA LEESFK+HST +PKQKQ LAR L LR RQVE
Sbjct: 98  ERVISDYHEDEEGISARKKLRLTKQQSALLEESFKDHSTLNPKQKQVLARQLNLRPRQVE 157

Query: 140 VWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAP-----PFAT 182
           VWFQNRRARTKLKQTE+DCE +KKCCE L +E  RLQKE+QELK+LKLT P     P +T
Sbjct: 158 VWFQNRRARTKLKQTEVDCEFLKKCCETLADENIRLQKEIQELKTLKLTQPFYMHMPAST 217
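
The percentages on each HSP statistics line are simply the matched count divided by the alignment length. As a rough illustration, the following sketch parses such a line and recomputes them; `parse_hsp_stats` is a hypothetical helper written for this example, not part of any BLAST tooling.

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Return {label: (matched, aligned_length, percent)} for 'X = a/b' fields."""
    stats = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matched, total = int(matched), int(total)
        stats[label] = (matched, total, round(100 * matched / total, 2))
    return stats


line = "Identity = 112/187 (59.89%), Positives = 129/187 (68.98%), Query Frame = 0"
for label, (matched, total, percent) in parse_hsp_stats(line).items():
    print(f"{label}: {matched}/{total} = {percent}%")
```

For the HAT9 hit this gives 59.89% identity and 68.98% positives over 187 aligned columns, matching the reported values. The denominator is the alignment length including gaps, not the length of the query protein.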

BLAST of CmaCh01G017640 vs. ExPASy Swiss-Prot
Match: P46604 (Homeobox-leucine zipper protein HAT22 OS=Arabidopsis thaliana OX=3702 GN=HAT22 PE=1 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 1.1e-37
Identity = 109/193 (56.48%), Positives = 129/193 (66.84%), Query Frame = 0

Query: 20  PSLSLGLS-------IVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD---GCGGEEAEE 79
           PSL+L LS           + ++  + +SS S +SSFS S   KRER+   G G EEAEE
Sbjct: 44  PSLTLSLSGESYKIKTGAGAGDQICRQTSSHSGISSFS-SGRVKREREISGGDGEEEAEE 103

Query: 80  ---------------DEDG-SPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKL 139
                          DE+G S RKKLR TK+QSA LE++FK HST +PKQKQ LAR L L
Sbjct: 104 TTERVVCSRVSDDHDDEEGVSARKKLRLTKQQSALLEDNFKLHSTLNPKQKQALARQLNL 163

Query: 140 RARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAP--- 182
           R RQVEVWFQNRRARTKLKQTE+DCE +KKCCE L +E  RLQKELQ+LK+LKL+ P   
Sbjct: 164 RPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQDLKALKLSQPFYM 223

BLAST of CmaCh01G017640 vs. ExPASy Swiss-Prot
Match: Q05466 (Homeobox-leucine zipper protein HAT4 OS=Arabidopsis thaliana OX=3702 GN=HAT4 PE=1 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 8.5e-35
Identity = 90/150 (60.00%), Positives = 112/150 (74.67%), Query Frame = 0

Query: 44  SPVSSFSNSSGYKRER----DGCGGEEAEEDEDG-SPRKKLRFTKEQSANLEESFKEHST 103
           SP S+ S+S+G + ER    D  G     +DEDG + RKKLR +K+QSA LEE+FK+HST
Sbjct: 91  SPNSTVSSSTGKRSEREEDTDPQGSRGISDDEDGDNSRKKLRLSKDQSAILEETFKDHST 150

Query: 104 FSPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKE 163
            +PKQKQ LA+ L LRARQVEVWFQNRRARTKLKQTE+DCE +++CCE L EE  RLQKE
Sbjct: 151 LNPKQKQALAKQLGLRARQVEVWFQNRRARTKLKQTEVDCEFLRRCCENLTEENRRLQKE 210

Query: 164 LQELKSLKLT-------APPFATLTVCPSC 182
           + EL++LKL+       +PP  TLT+CPSC
Sbjct: 211 VTELRALKLSPQFYMHMSPP-TTLTMCPSC 239

BLAST of CmaCh01G017640 vs. ExPASy Swiss-Prot
Match: P46665 (Homeobox-leucine zipper protein HAT14 OS=Arabidopsis thaliana OX=3702 GN=HAT14 PE=2 SV=3)

HSP 1 Score: 145.2 bits (365), Expect = 7.2e-34
Identity = 97/188 (51.60%), Positives = 119/188 (63.30%), Query Frame = 0

Query: 22  LSLGLSIVVESPNE------FIQLSSSGSPVSSFS-----NSSGYKR---ERD------- 81
           + LG + VVE   E       + +S   S  SSF       S GY+R   +RD       
Sbjct: 112 MPLGAATVVEEEEEEEEAVPSMSVSPPDSVTSSFQLDFGIKSYGYERRSNKRDIDDEVER 171

Query: 82  --GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQV 141
                  E  +DE+GS RKKLR +K+QSA LE+SFKEHST +PKQK  LA+ L LR RQV
Sbjct: 172 SASRASNEDNDDENGSTRKKLRLSKDQSAFLEDSFKEHSTLNPKQKIALAKQLNLRPRQV 231

Query: 142 EVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAP-----PFA 182
           EVWFQNRRARTKLKQTE+DCE +K+CCE L EE  RLQKE++EL++LK + P     P  
Sbjct: 232 EVWFQNRRARTKLKQTEVDCEYLKRCCESLTEENRRLQKEVKELRTLKTSTPFYMQLPAT 291

BLAST of CmaCh01G017640 vs. ExPASy Swiss-Prot
Match: A2Z1U1 (Homeobox-leucine zipper protein HOX11 OS=Oryza sativa subsp. indica OX=39946 GN=HOX11 PE=2 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 7.2e-34
Identity = 96/179 (53.63%), Positives = 114/179 (63.69%), Query Frame = 0

Query: 15  DDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNS--SGYKRERDGCGGEE-----A 74
           DD+   +LS        SPN     S+   P+  FS     G      G GG+      +
Sbjct: 32  DDVAGAALS-------SSPNN----SAGSFPMDDFSGHGLGGNDAAPGGGGGDRSCSRAS 91

Query: 75  EEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEVWFQNRRA 134
           +ED+ GS RKKLR +KEQSA LEESFKEHST +PKQK  LA+ L LR RQVEVWFQNRRA
Sbjct: 92  DEDDGGSARKKLRLSKEQSAFLEESFKEHSTLNPKQKLALAKQLNLRPRQVEVWFQNRRA 151

Query: 135 RTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAP-----PFATLTVCPSC 182
           RTKLKQTE+DCE +K+CCE L EE  RLQKEL EL++LK   P     P  TL++CPSC
Sbjct: 152 RTKLKQTEVDCEYLKRCCETLTEENRRLQKELAELRALKTVHPFYMHLPATTLSMCPSC 199

BLAST of CmaCh01G017640 vs. ExPASy TrEMBL
Match: A0A6J1IWK3 (homeobox-leucine zipper protein HAT9-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111481247 PE=4 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 1.2e-89
Identity = 181/181 (100.00%), Positives = 181/181 (100.00%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD 60
           MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD
Sbjct: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD 60

Query: 61  GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV 120
           GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV
Sbjct: 61  GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV 120

Query: 121 WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS 180
           WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS
Sbjct: 121 WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS 180

Query: 181 C 182
           C
Sbjct: 181 C 181

BLAST of CmaCh01G017640 vs. ExPASy TrEMBL
Match: A0A6J1J4S4 (homeobox-leucine zipper protein HAT9-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111481247 PE=4 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 4.9e-86
Identity = 177/181 (97.79%), Positives = 177/181 (97.79%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD 60
           MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD
Sbjct: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD 60

Query: 61  GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV 120
           GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSP    ELARNLKLRARQVEV
Sbjct: 61  GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSP----ELARNLKLRARQVEV 120

Query: 121 WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS 180
           WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS
Sbjct: 121 WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS 177

Query: 181 C 182
           C
Sbjct: 181 C 177

BLAST of CmaCh01G017640 vs. ExPASy TrEMBL
Match: A0A6J1F6L5 (homeobox-leucine zipper protein HAT9-like OS=Cucurbita moschata OX=3662 GN=LOC111442602 PE=4 SV=1)

HSP 1 Score: 246.5 bits (628), Expect = 8.4e-62
Identity = 149/211 (70.62%), Positives = 165/211 (78.20%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVE---------SPNEFIQLSSSGSPVSSFSN 60
           +AGV KKK QVLKFDD I+PSL+LGLS+VV+         + +E IQ  SSGSPVSSFS+
Sbjct: 30  VAGV-KKKLQVLKFDD-ILPSLTLGLSVVVDKSGGESAATAADELIQQGSSGSPVSSFSH 89

Query: 61  SSGYKRERDGCGGEE---------------AEEDEDGSPRKKLRFTKEQSANLEESFKEH 120
           SSG+KRERDG  GEE               AEE+EDGSPRKKLR TKEQSA LE++FKEH
Sbjct: 90  SSGFKRERDGGAGEEPAEAEVFMERISMKVAEEEEDGSPRKKLRLTKEQSAVLEDNFKEH 149

Query: 121 STFSPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQ 180
           S+ SPKQKQ+LAR L LR RQVEVWFQNRRARTKLKQTEMDCEL+KKCCEKLKEE T+LQ
Sbjct: 150 SSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTKLQ 209

Query: 181 KELQELKSLKLTAPPF------ATLTVCPSC 182
           KELQELKSLKLTAPPF      ATLTVCPSC
Sbjct: 210 KELQELKSLKLTAPPFCMQLQAATLTVCPSC 238

BLAST of CmaCh01G017640 vs. ExPASy TrEMBL
Match: A0A6J1IHG6 (homeobox-leucine zipper protein HAT9-like OS=Cucurbita maxima OX=3661 GN=LOC111477260 PE=4 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 1.1e-61
Identity = 148/208 (71.15%), Positives = 163/208 (78.37%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVE------SPNEFIQLSSSGSPVSSFSNSSG 60
           +AGV KKK QVLKFDD I+PSL+LGLS+VVE      + ++ I   SSGSP SSFSNSSG
Sbjct: 30  VAGV-KKKLQVLKFDD-ILPSLTLGLSVVVEKSGGESAADDLIHQGSSGSPASSFSNSSG 89

Query: 61  YKRERDGCGGEE---------------AEEDEDGSPRKKLRFTKEQSANLEESFKEHSTF 120
           +KRERDG  GEE               AEE+EDGSPRKKLR TKEQSA LE++FKEHS+ 
Sbjct: 90  FKRERDGGAGEEPAEAEVFMERISMKVAEEEEDGSPRKKLRLTKEQSAVLEDNFKEHSSL 149

Query: 121 SPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKEL 180
           SPKQKQ+LAR L LR RQVEVWFQNRRARTKLKQTEMDCEL+KKCCEKLKEE T+LQKEL
Sbjct: 150 SPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTKLQKEL 209

Query: 181 QELKSLKLTAPPF------ATLTVCPSC 182
           QELKSLKLTAPPF      ATLTVCPSC
Sbjct: 210 QELKSLKLTAPPFCMQLQAATLTVCPSC 235

BLAST of CmaCh01G017640 vs. ExPASy TrEMBL
Match: A0A0A0L3X3 (Homeobox domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G645820 PE=4 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 1.5e-58
Identity = 141/190 (74.21%), Positives = 152/190 (80.00%), Query Frame = 0

Query: 5   KKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERDG--- 64
           KKK QQVLKFDD I+PSL+LGLS VV++  E      SGSPVSSFSNSSG+KRER G   
Sbjct: 34  KKKLQQVLKFDDDILPSLTLGLSFVVDTATED---GCSGSPVSSFSNSSGFKRERAGEEV 93

Query: 65  CGGEE----AEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQ 124
              EE     EEDE+GSPRKKLR TK QSA LE++FKEHS+ SPKQKQ+LAR L LR RQ
Sbjct: 94  AETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQ 153

Query: 125 VEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPF----- 182
           VEVWFQNRRARTKLKQTEMDCEL+KKCCEKLKEE TRLQKELQELKSLKLT PPF     
Sbjct: 154 VEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPPPFCMQLQ 213

BLAST of CmaCh01G017640 vs. NCBI nr
Match: XP_022982417.1 (homeobox-leucine zipper protein HAT9-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 339.0 bits (868), Expect = 2.6e-89
Identity = 181/181 (100.00%), Positives = 181/181 (100.00%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD 60
           MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD
Sbjct: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD 60

Query: 61  GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV 120
           GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV
Sbjct: 61  GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV 120

Query: 121 WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS 180
           WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS
Sbjct: 121 WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS 180

Query: 181 C 182
           C
Sbjct: 181 C 181

BLAST of CmaCh01G017640 vs. NCBI nr
Match: XP_022982418.1 (homeobox-leucine zipper protein HAT9-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 327.0 bits (837), Expect = 1.0e-85
Identity = 177/181 (97.79%), Positives = 177/181 (97.79%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD 60
           MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD
Sbjct: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD 60

Query: 61  GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV 120
           GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSP    ELARNLKLRARQVEV
Sbjct: 61  GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSP----ELARNLKLRARQVEV 120

Query: 121 WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS 180
           WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS
Sbjct: 121 WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS 177

Query: 181 C 182
           C
Sbjct: 181 C 177

BLAST of CmaCh01G017640 vs. NCBI nr
Match: XP_023536403.1 (homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 248.4 bits (633), Expect = 4.6e-62
Identity = 150/211 (71.09%), Positives = 165/211 (78.20%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVE---------SPNEFIQLSSSGSPVSSFSN 60
           +AGV KKK QVLKFDD I+PSL+LGLS+VV+         + +E IQ  SSGSPVSSFSN
Sbjct: 30  VAGV-KKKLQVLKFDD-ILPSLTLGLSVVVDKSGGESAATAADELIQQGSSGSPVSSFSN 89

Query: 61  SSGYKRERDGCGGEE---------------AEEDEDGSPRKKLRFTKEQSANLEESFKEH 120
           SSG+KRERDG  GEE               AEE+EDGSPRKKLR TKEQSA LE++FKEH
Sbjct: 90  SSGFKRERDGGAGEEPAETEVFMERISMKVAEEEEDGSPRKKLRLTKEQSAVLEDNFKEH 149

Query: 121 STFSPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQ 180
           S+ SPKQKQ+LAR L LR RQVEVWFQNRRARTKLKQTEMDCEL+KKCCEKLKEE T+LQ
Sbjct: 150 SSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTKLQ 209

Query: 181 KELQELKSLKLTAPPF------ATLTVCPSC 182
           KELQELKSLKLTAPPF      ATLTVCPSC
Sbjct: 210 KELQELKSLKLTAPPFCMQLQAATLTVCPSC 238

BLAST of CmaCh01G017640 vs. NCBI nr
Match: KAG6591449.1 (Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 247.3 bits (630), Expect = 1.0e-61
Identity = 150/211 (71.09%), Positives = 163/211 (77.25%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVE---------SPNEFIQLSSSGSPVSSFSN 60
           +AGV KKK QVLKFDD I+PSL+LGLS+VVE         +  E I   SSGSPVSSFSN
Sbjct: 30  VAGV-KKKLQVLKFDD-ILPSLTLGLSVVVEKSGGESSATAAEELIHQGSSGSPVSSFSN 89

Query: 61  SSGYKRERDGCGGEE---------------AEEDEDGSPRKKLRFTKEQSANLEESFKEH 120
           SSG+KRERDG  GEE               AEE+EDGSPRKKLR TKEQSA LE++FKEH
Sbjct: 90  SSGFKRERDGGAGEEPVEAEVFMERISMKVAEEEEDGSPRKKLRLTKEQSAVLEDNFKEH 149

Query: 121 STFSPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQ 180
           S+ SPKQKQ+LAR L LR RQVEVWFQNRRARTKLKQTEMDCEL+KKCCEKLKEE T+LQ
Sbjct: 150 SSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTKLQ 209

Query: 181 KELQELKSLKLTAPPF------ATLTVCPSC 182
           KELQELKSLKLTAPPF      ATLTVCPSC
Sbjct: 210 KELQELKSLKLTAPPFCMQLQAATLTVCPSC 238

BLAST of CmaCh01G017640 vs. NCBI nr
Match: XP_022935802.1 (homeobox-leucine zipper protein HAT9-like [Cucurbita moschata])

HSP 1 Score: 246.5 bits (628), Expect = 1.7e-61
Identity = 149/211 (70.62%), Positives = 165/211 (78.20%), Query Frame = 0

Query: 1   MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVE---------SPNEFIQLSSSGSPVSSFSN 60
           +AGV KKK QVLKFDD I+PSL+LGLS+VV+         + +E IQ  SSGSPVSSFS+
Sbjct: 30  VAGV-KKKLQVLKFDD-ILPSLTLGLSVVVDKSGGESAATAADELIQQGSSGSPVSSFSH 89

Query: 61  SSGYKRERDGCGGEE---------------AEEDEDGSPRKKLRFTKEQSANLEESFKEH 120
           SSG+KRERDG  GEE               AEE+EDGSPRKKLR TKEQSA LE++FKEH
Sbjct: 90  SSGFKRERDGGAGEEPAEAEVFMERISMKVAEEEEDGSPRKKLRLTKEQSAVLEDNFKEH 149

Query: 121 STFSPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQ 180
           S+ SPKQKQ+LAR L LR RQVEVWFQNRRARTKLKQTEMDCEL+KKCCEKLKEE T+LQ
Sbjct: 150 SSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTKLQ 209

Query: 181 KELQELKSLKLTAPPF------ATLTVCPSC 182
           KELQELKSLKLTAPPF      ATLTVCPSC
Sbjct: 210 KELQELKSLKLTAPPFCMQLQAATLTVCPSC 238

BLAST of CmaCh01G017640 vs. TAIR 10
Match: AT2G22800.1 (Homeobox-leucine zipper protein family )

HSP 1 Score: 169.9 bits (429), Expect = 1.9e-42
Identity = 112/187 (59.89%), Positives = 129/187 (68.98%), Query Frame = 0

Query: 20  PSLSLGLS-----IVVESPNEFIQLSSSGSPVSSFSNSSGYKRERDGCGGEEA------- 79
           PSL+L LS      VV   ++  + +SS S VSSFS+    KRERD  GGEE+       
Sbjct: 38  PSLTLCLSGDPSVTVVTGADQLCRQTSSHSGVSSFSSGRVVKRERD--GGEESPEEEEMT 97

Query: 80  -------EEDEDG-SPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVE 139
                   EDE+G S RKKLR TK+QSA LEESFK+HST +PKQKQ LAR L LR RQVE
Sbjct: 98  ERVISDYHEDEEGISARKKLRLTKQQSALLEESFKDHSTLNPKQKQVLARQLNLRPRQVE 157

Query: 140 VWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAP-----PFAT 182
           VWFQNRRARTKLKQTE+DCE +KKCCE L +E  RLQKE+QELK+LKLT P     P +T
Sbjct: 158 VWFQNRRARTKLKQTEVDCEFLKKCCETLADENIRLQKEIQELKTLKLTQPFYMHMPAST 217

BLAST of CmaCh01G017640 vs. TAIR 10
Match: AT4G37790.1 (Homeobox-leucine zipper protein family )

HSP 1 Score: 157.9 bits (398), Expect = 7.6e-39
Identity = 109/193 (56.48%), Positives = 129/193 (66.84%), Query Frame = 0

Query: 20  PSLSLGLS-------IVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD---GCGGEEAEE 79
           PSL+L LS           + ++  + +SS S +SSFS S   KRER+   G G EEAEE
Sbjct: 44  PSLTLSLSGESYKIKTGAGAGDQICRQTSSHSGISSFS-SGRVKREREISGGDGEEEAEE 103

Query: 80  ---------------DEDG-SPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKL 139
                          DE+G S RKKLR TK+QSA LE++FK HST +PKQKQ LAR L L
Sbjct: 104 TTERVVCSRVSDDHDDEEGVSARKKLRLTKQQSALLEDNFKLHSTLNPKQKQALARQLNL 163

Query: 140 RARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAP--- 182
           R RQVEVWFQNRRARTKLKQTE+DCE +KKCCE L +E  RLQKELQ+LK+LKL+ P   
Sbjct: 164 RPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQDLKALKLSQPFYM 223

BLAST of CmaCh01G017640 vs. TAIR 10
Match: AT4G16780.1 (homeobox protein 2 )

HSP 1 Score: 148.3 bits (373), Expect = 6.0e-36
Identity = 90/150 (60.00%), Positives = 112/150 (74.67%), Query Frame = 0

Query: 44  SPVSSFSNSSGYKRER----DGCGGEEAEEDEDG-SPRKKLRFTKEQSANLEESFKEHST 103
           SP S+ S+S+G + ER    D  G     +DEDG + RKKLR +K+QSA LEE+FK+HST
Sbjct: 91  SPNSTVSSSTGKRSEREEDTDPQGSRGISDDEDGDNSRKKLRLSKDQSAILEETFKDHST 150

Query: 104 FSPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKE 163
            +PKQKQ LA+ L LRARQVEVWFQNRRARTKLKQTE+DCE +++CCE L EE  RLQKE
Sbjct: 151 LNPKQKQALAKQLGLRARQVEVWFQNRRARTKLKQTEVDCEFLRRCCENLTEENRRLQKE 210

Query: 164 LQELKSLKLT-------APPFATLTVCPSC 182
           + EL++LKL+       +PP  TLT+CPSC
Sbjct: 211 VTELRALKLSPQFYMHMSPP-TTLTMCPSC 239

BLAST of CmaCh01G017640 vs. TAIR 10
Match: AT5G06710.1 (homeobox from Arabidopsis thaliana )

HSP 1 Score: 145.2 bits (365), Expect = 5.1e-35
Identity = 97/188 (51.60%), Positives = 119/188 (63.30%), Query Frame = 0

Query: 22  LSLGLSIVVESPNE------FIQLSSSGSPVSSFS-----NSSGYKR---ERD------- 81
           + LG + VVE   E       + +S   S  SSF       S GY+R   +RD       
Sbjct: 112 MPLGAATVVEEEEEEEEAVPSMSVSPPDSVTSSFQLDFGIKSYGYERRSNKRDIDDEVER 171

Query: 82  --GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQV 141
                  E  +DE+GS RKKLR +K+QSA LE+SFKEHST +PKQK  LA+ L LR RQV
Sbjct: 172 SASRASNEDNDDENGSTRKKLRLSKDQSAFLEDSFKEHSTLNPKQKIALAKQLNLRPRQV 231

Query: 142 EVWFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAP-----PFA 182
           EVWFQNRRARTKLKQTE+DCE +K+CCE L EE  RLQKE++EL++LK + P     P  
Sbjct: 232 EVWFQNRRARTKLKQTEVDCEYLKRCCESLTEENRRLQKEVKELRTLKTSTPFYMQLPAT 291

BLAST of CmaCh01G017640 vs. TAIR 10
Match: AT2G44910.1 (homeobox-leucine zipper protein 4 )

HSP 1 Score: 141.0 bits (354), Expect = 9.6e-34
Identity = 89/167 (53.29%), Positives = 110/167 (65.87%), Query Frame = 0

Query: 29  VVESPNEFI-QLSSSGSPVSSFSNSSGYKRERDGC---GGEEAEEDEDG----SPRKKLR 88
           VV SPN  +  LS +   ++        + ER  C   GG    +DEDG      RKKLR
Sbjct: 107 VVSSPNSAVSSLSGNKRDLAVARGGDENEAERASCSRGGGSGGSDDEDGGNGDGSRKKLR 166

Query: 89  FTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEVWFQNRRARTKLKQTEMDCEL 148
            +K+Q+  LEE+FKEHST +PKQK  LA+ L LRARQVEVWFQNRRARTKLKQTE+DCE 
Sbjct: 167 LSKDQALVLEETFKEHSTLNPKQKLALAKQLNLRARQVEVWFQNRRARTKLKQTEVDCEY 226

Query: 149 MKKCCEKLKEETTRLQKELQELKSLKLT------APPFATLTVCPSC 182
           +K+CC+ L EE  RLQKE+ EL++LKL+        P  TLT+CPSC
Sbjct: 227 LKRCCDNLTEENRRLQKEVSELRALKLSPHLYMHMTPPTTLTMCPSC 273

The following BLAST results are available for this feature:

ExPASy Swiss-Prot:
Match Name  E-value  Identity (%)  Description
P46603      2.7e-41  59.89         Homeobox-leucine zipper protein HAT9 OS=Arabidopsis thaliana OX=3702 GN=HAT9 PE=...
P46604      1.1e-37  56.48         Homeobox-leucine zipper protein HAT22 OS=Arabidopsis thaliana OX=3702 GN=HAT22 P...
Q05466      8.5e-35  60.00         Homeobox-leucine zipper protein HAT4 OS=Arabidopsis thaliana OX=3702 GN=HAT4 PE=...
P46665      7.2e-34  51.60         Homeobox-leucine zipper protein HAT14 OS=Arabidopsis thaliana OX=3702 GN=HAT14 P...
A2Z1U1      7.2e-34  53.63         Homeobox-leucine zipper protein HOX11 OS=Oryza sativa subsp. indica OX=39946 GN=...

ExPASy TrEMBL:
Match Name  E-value  Identity (%)  Description
A0A6J1IWK3  1.2e-89  100.00        homeobox-leucine zipper protein HAT9-like isoform X1 OS=Cucurbita maxima OX=3661...
A0A6J1J4S4  4.9e-86  97.79         homeobox-leucine zipper protein HAT9-like isoform X2 OS=Cucurbita maxima OX=3661...
A0A6J1F6L5  8.4e-62  70.62         homeobox-leucine zipper protein HAT9-like OS=Cucurbita moschata OX=3662 GN=LOC11...
A0A6J1IHG6  1.1e-61  71.15         homeobox-leucine zipper protein HAT9-like OS=Cucurbita maxima OX=3661 GN=LOC1114...
A0A0A0L3X3  1.5e-58  74.21         Homeobox domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G645820 PE...

NCBI nr:
Match Name      E-value  Identity (%)  Description
XP_022982417.1  2.6e-89  100.00        homeobox-leucine zipper protein HAT9-like isoform X1 [Cucurbita maxima]
XP_022982418.1  1.0e-85  97.79         homeobox-leucine zipper protein HAT9-like isoform X2 [Cucurbita maxima]
XP_023536403.1  4.6e-62  71.09         homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo]
KAG6591449.1    1.0e-61  71.09         Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sor...
XP_022935802.1  1.7e-61  70.62         homeobox-leucine zipper protein HAT9-like [Cucurbita moschata]

TAIR 10:
Match Name   E-value  Identity (%)  Description
AT2G22800.1  1.9e-42  59.89         Homeobox-leucine zipper protein family
AT4G37790.1  7.6e-39  56.48         Homeobox-leucine zipper protein family
AT4G16780.1  6.0e-36  60.00         homeobox protein 2
AT5G06710.1  5.1e-35  51.60         homeobox from Arabidopsis thaliana
AT2G44910.1  9.6e-34  53.29         homeobox-leucine zipper protein 4

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                      Source       Source Term     Source Description                          Alignment
None       No IPR available                     COILS        Coil            Coil                                        coord: 131..168
None       No IPR available                     GENE3D       1.10.10.60      -                                           coord: 68..138; e-value: 2.0E-19; score: 70.9
None       No IPR available                     MOBIDB_LITE  mobidb-lite     disorder_prediction                         coord: 38..55
None       No IPR available                     MOBIDB_LITE  mobidb-lite     disorder_prediction                         coord: 38..104
None       No IPR available                     MOBIDB_LITE  mobidb-lite     disorder_prediction                         coord: 70..104
None       No IPR available                     PANTHER      PTHR45714:SF24  HOMEOBOX ASSOCIATED LEUCINE ZIPPER PROTEIN  coord: 38..181
None       No IPR available                     PANTHER      PTHR45714       FAMILY NOT NAMED                            coord: 38..181
IPR003106  Leucine zipper, homeobox-associated  SMART        SM00340         halz                                        coord: 132..175; e-value: 3.4E-16; score: 69.8
IPR003106  Leucine zipper, homeobox-associated  PFAM         PF02183         HALZ                                        coord: 132..165; e-value: 8.7E-7; score: 29.0
IPR001356  Homeobox domain                      SMART        SM00389         HOX_1                                       coord: 74..136; e-value: 1.8E-15; score: 67.4
IPR001356  Homeobox domain                      PFAM         PF00046         Homeodomain                                 coord: 76..130; e-value: 1.2E-16; score: 60.3
IPR001356  Homeobox domain                      PROSITE      PS50071         HOMEOBOX_2                                  coord: 72..132; score: 17.620911
IPR001356  Homeobox domain                      CDD          cd00086         homeodomain                                 coord: 75..133; e-value: 1.57794E-13; score: 60.3349
IPR017970  Homeobox, conserved site             PROSITE      PS00027         HOMEOBOX_1                                  coord: 107..130
IPR009057  Homeobox-like domain superfamily     SUPERFAMILY  46689           Homeodomain-like                            coord: 73..140
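
The `coord:` values in the InterPro table are residue positions on the protein sequence listed earlier. The sketch below shows one way to pull out the region covered by a given annotation, here the Pfam homeodomain hit (PF00046, coord 76..130). It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; `parse_coord` and `extract_region` are hypothetical helper names.

```python
import re

# Protein sequence of CmaCh01G017640 as listed on this page (181 residues).
PROTEIN = (
    "MAGVKKKKQQVLKFDDIIMPSLSLGLSIVVESPNEFIQLSSSGSPVSSFSNSSGYKRERD"
    "GCGGEEAEEDEDGSPRKKLRFTKEQSANLEESFKEHSTFSPKQKQELARNLKLRARQVEV"
    "WFQNRRARTKLKQTEMDCELMKKCCEKLKEETTRLQKELQELKSLKLTAPPFATLTVCPS"
    "C"
)


def parse_coord(text: str):
    """Extract (start, end) from a 'coord: start..end' field."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def extract_region(protein: str, start: int, end: int) -> str:
    """Slice a region, ASSUMING 1-based inclusive coordinates."""
    return protein[start - 1:end]


start, end = parse_coord("coord: 76..130; e-value: 1.2E-16; score: 60.3")
homeodomain = extract_region(PROTEIN, start, end)
print(len(homeodomain), homeodomain)
```

The same two helpers can be applied to any other row, for example the HALZ leucine zipper hit at 132..165.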

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh01G017640.1  CmaCh01G017640.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006357      regulation of transcription by RNA polymerase II
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
molecular_function  GO:0000981      DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003677      DNA binding