ClCG09G018360 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG09G018360
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionhomeobox-leucine zipper protein ATHB-20-like
LocationCG_Chr09: 35458682 .. 35461730 (+)
RNA-Seq ExpressionClCG09G018360
SyntenyClCG09G018360
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTCTCTTTACCTAAAATTCGCTGTCTTTCAGACTTCTCTCTCTCTTTGCAAAAATCCATCTCTTCTCACCATGAGAGTGAGAGCCACACCACTCTTTCAACCATTTCTCTCTTCTTCTTCTTCTTCTTCTTTAATCAAATTCTTCTTCTATTTCTTCTCATCTTTCTCTCTTTGATCTCAATCCAAATCCAACACACAATATTGAGATCTAGCATGCATTGCCATTCCTATGGCTTCCCCTCACCATTCCCATAGCTTCATGTTCCAATCCCGCCCCGCCGATCACCACGAATACGTCCCCTCCGCTTCCTTCAACGCCATTCCCTCCTGCCCTCCTCACCTCTACTTTCACGGTCTCTCTCTTCTTCTTTCTTCCTGCAGTGGCAGAACCAGAACTTTCTTTCCCTTCTAATAACATTCTATCTTTTTCACTGCTAAAAACCTTTTTTTTTTTATTAAGATCAATTATGTATATACGATTATATTCTTCTTCTGGATTTAGGTATCGTTACGGATGTATTAACCTAGTTGAAATATTCATATGCACCTATTTATTTGTTCTGGTTTATTTTTTAAAGAAAGCAATAGCATTATCGCTCTCATCCTTGATTTTCCTCTCCTTCTTTCTTTCTCGCTTTGTTTTCACATGTGCATACAAATTAGATGATGGAGTTTTGATTTTCAAGTGATGTGTTTGTGGTTTTGTTTTGTAGATGGAGTGGTTCCAGTGATGATGAAGAGATCGATGTCGTTTTCGGGAGTCGAAAACGGGTGCGAGGAAGTGAATGGCGACGAGGGGTTATCAGACGATGGATTGGCATTGGGAGAGAAGAAGAAGCGATTAAATTTAGAGCAAGTGAAGGCTTTGGAGAAGAGCTTTGAGTTAGGGAATAAGCTTGAGCCAGAGAGGAAAATGCAGCTAGCCAAAGCTTTGGGGTTGCAGCCAAGACAGATTGCTATTTGGTTTCAGAATAGAAGGGCCAGATGGAAAACCAAGCAGTTGGAGAGAGATTATGAAGTGTTGAAGAAACAGTTTGAAGCTCTTAAAGCTGACAATGATGTACTTCAAGCTCAAAATACCAAACTCCATGCAGAGGTAATAAAAACAAAAACAATATATTAAAATCTCTTCCTTCATAATCATTTGATTTTTTAAAATAGTGCTATTTTGTTCTTCTCACCGTCTCTTTACAATTACATCAAATTTCAAACAAAAGAAAAAAAATCTTTTCAATTTCAAAAGATGATCAAAAAACAAATAATTTTTAATAAATTAAATTTTCAAAAACTAAAAACAACATTTCGAAACAAAACTAAATTTTGGTTTTGATTTGTTGTAATTGAATTTATGGGTTATTACATTTGGAACAAGGACAAAATAATAATCCTTGCTATTTACCCTAGCTAGTAGGATAAGAGTTTCCTATTAACCTAATTTTTAATTGTATTTGTAAATTTTGAGTGTGTCCCGGCCGTAATATATGTATATGTATTTTTCTGTCCACTATGAATTAAAAATTAAGTACAAAAGTATGTCTGTTCAAAACAAACAAAGTACACAAAGTATGTAGGCAAGTAAATAATCAAATTTATAACATATTAATTTAATGAGACTTTTTTAACCATTAAACTAATTTAAAATTTTGAGTTGCTTTTAAATATAGTAAAACAAGTCAATTTGTTTACAAATACCTCTTATAGATACAAATAGACTACTATTATTTATTATTATATATTTAAAAATATTTTTAATAATTTTATTACTTAAAATAATTACTAGAAAACTTTTAATTTTAATTCTCTTATTTCATTGTTTTTAAGATTAAGTAAGTAGTTTTTAATACGCAAGTTTTCTAATCTCTTAACTTTAAATAACAAGGTTGAATTTATAAATAATTCCAACAGATCAACTGTTTTAGATTATTGTTAGCATTATTCTCTTAATTAATTTAGTGAGCATGTCTAATCTAATAATTAACATTTTTTTTTTGTAAGAGTTGCTTTCAAACTTCCATTTTGATCAAAACACAACCTTTGCACAACTTGAAAAACTCAAACTATCTGTAAATTTTGATAATAACAAAGGTTTTAAATGTCATCTTCGCAAAATTATCGAAATTTTAATTTGATAAAAATGTCGATGAAAATATTGATAAAACTAACATGAATATGAAAAGATATTTTTTAAAATATTATAAAAACAAAAATTAAAAAATAATTTTAAATTAATAAATAAAAGTTTTATGGTTTTTTAAATGAGTGAATAAATGTTTATATTACTTATATTTTATTTACATATCTAGGTAATATTTGAGCTATTTTCAAATATAGCAAAAGGAACCAAAATATTTACAAATATAGTAAAATTTTATTGTCTATTTGTCATAGATCGCGATAGACTATTATGTGTCACTTGACATTTAATAGTCTATCGCGATCTATCACAATCTATTGTAGATATACGGTGTATTTTGTTATTAGTTGTAAATATTTTCAACAATTTTAACATTTAAGATAATTTTTTGTAATATTTTGTTGCTTATTTTATTATTTATTTTTTAGACTTTTCTACGATATAATGGAAATACCAAATATGTTGACCGAAATTTTAATAATCCATGAATTTTTTTTTTTTTCTAGCTATTAGCATTAAAAACCAAAGACTCCGGCGAGGCAGCGGGGGGCGGCGCCACCATGAACCTAAAGAAAGAAAACGAACGCTGTTGGAGCAGCGACAACAGTTGCGACATTAATCTCGACATCTCAAAGACACAAGCACCAATAAACGGCAGCGGCGGTGGTGGCGGAGGTAGAGCATGTTCTCAACCAGGAATAATCAAAGATCTTTTCCCATCGGCGGCGTTCCGATCCGCCGCCATAACGCAGCTGCTTCAACACGGGTCGTCCAGATCAACGGTGGACCATCCTCAAGTGATTCAAGAAGAAAGCTTCTCTCAAATGTTCAATGGCATTGAAGAACAACAACAAACTGCAGCAGCAGCTGGGTTTTGGCCATGGAGTTCAGATCAAAATTCCCATTTTCATTAA

mRNA sequence

TCTCTCTTTACCTAAAATTCGCTGTCTTTCAGACTTCTCTCTCTCTTTGCAAAAATCCATCTCTTCTCACCATGAGAGTGAGAGCCACACCACTCTTTCAACCATTTCTCTCTTCTTCTTCTTCTTCTTCTTTAATCAAATTCTTCTTCTATTTCTTCTCATCTTTCTCTCTTTGATCTCAATCCAAATCCAACACACAATATTGAGATCTAGCATGCATTGCCATTCCTATGGCTTCCCCTCACCATTCCCATAGCTTCATGTTCCAATCCCGCCCCGCCGATCACCACGAATACGTCCCCTCCGCTTCCTTCAACGCCATTCCCTCCTGCCCTCCTCACCTCTACTTTCACGATGGAGTGGTTCCAGTGATGATGAAGAGATCGATGTCGTTTTCGGGAGTCGAAAACGGGTGCGAGGAAGTGAATGGCGACGAGGGGTTATCAGACGATGGATTGGCATTGGGAGAGAAGAAGAAGCGATTAAATTTAGAGCAAGTGAAGGCTTTGGAGAAGAGCTTTGAGTTAGGGAATAAGCTTGAGCCAGAGAGGAAAATGCAGCTAGCCAAAGCTTTGGGGTTGCAGCCAAGACAGATTGCTATTTGGTTTCAGAATAGAAGGGCCAGATGGAAAACCAAGCAGTTGGAGAGAGATTATGAAGTGTTGAAGAAACAGTTTGAAGCTCTTAAAGCTGACAATGATGTACTTCAAGCTCAAAATACCAAACTCCATGCAGAGGTTGAATTTATAAATAATTCCAACAGATCAACTGTTTTAGATTATTCATTAAAAACCAAAGACTCCGGCGAGGCAGCGGGGGGCGGCGCCACCATGAACCTAAAGAAAGAAAACGAACGCTGTTGGAGCAGCGACAACAGTTGCGACATTAATCTCGACATCTCAAAGACACAAGCACCAATAAACGGCAGCGGCGGTGGTGGCGGAGGTAGAGCATGTTCTCAACCAGGAATAATCAAAGATCTTTTCCCATCGGCGGCGTTCCGATCCGCCGCCATAACGCAGCTGCTTCAACACGGGTCGTCCAGATCAACGGTGGACCATCCTCAAGTGATTCAAGAAGAAAGCTTCTCTCAAATGTTCAATGGCATTGAAGAACAACAACAAACTGCAGCAGCAGCTGGGTTTTGGCCATGGAGTTCAGATCAAAATTCCCATTTTCATTAA

Coding sequence (CDS)

ATGGCTTCCCCTCACCATTCCCATAGCTTCATGTTCCAATCCCGCCCCGCCGATCACCACGAATACGTCCCCTCCGCTTCCTTCAACGCCATTCCCTCCTGCCCTCCTCACCTCTACTTTCACGATGGAGTGGTTCCAGTGATGATGAAGAGATCGATGTCGTTTTCGGGAGTCGAAAACGGGTGCGAGGAAGTGAATGGCGACGAGGGGTTATCAGACGATGGATTGGCATTGGGAGAGAAGAAGAAGCGATTAAATTTAGAGCAAGTGAAGGCTTTGGAGAAGAGCTTTGAGTTAGGGAATAAGCTTGAGCCAGAGAGGAAAATGCAGCTAGCCAAAGCTTTGGGGTTGCAGCCAAGACAGATTGCTATTTGGTTTCAGAATAGAAGGGCCAGATGGAAAACCAAGCAGTTGGAGAGAGATTATGAAGTGTTGAAGAAACAGTTTGAAGCTCTTAAAGCTGACAATGATGTACTTCAAGCTCAAAATACCAAACTCCATGCAGAGGTTGAATTTATAAATAATTCCAACAGATCAACTGTTTTAGATTATTCATTAAAAACCAAAGACTCCGGCGAGGCAGCGGGGGGCGGCGCCACCATGAACCTAAAGAAAGAAAACGAACGCTGTTGGAGCAGCGACAACAGTTGCGACATTAATCTCGACATCTCAAAGACACAAGCACCAATAAACGGCAGCGGCGGTGGTGGCGGAGGTAGAGCATGTTCTCAACCAGGAATAATCAAAGATCTTTTCCCATCGGCGGCGTTCCGATCCGCCGCCATAACGCAGCTGCTTCAACACGGGTCGTCCAGATCAACGGTGGACCATCCTCAAGTGATTCAAGAAGAAAGCTTCTCTCAAATGTTCAATGGCATTGAAGAACAACAACAAACTGCAGCAGCAGCTGGGTTTTGGCCATGGAGTTCAGATCAAAATTCCCATTTTCATTAA

Protein sequence

MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSHFH
Homology
BLAST of ClCG09G018360 vs. NCBI nr
Match: XP_038890842.1 (LOW QUALITY PROTEIN: homeobox-leucine zipper protein ATHB-20-like [Benincasa hispida])

HSP 1 Score: 558.1 bits (1437), Expect = 4.7e-155
Identity = 291/322 (90.37%), Postives = 294/322 (91.30%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN 60
           MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN
Sbjct: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN 60

Query: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPR 120
           GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPR
Sbjct: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPR 120

Query: 121 QIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRST 180
           QIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+          
Sbjct: 121 QIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEL---------- 180

Query: 181 VLDYSLKTKDSGEAA-GGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGG 240
               +LKTKDSGE   GG ATMNLKKENE CWSSDNSCDINLDISKTQA I G G GGGG
Sbjct: 181 ---LALKTKDSGEVVXGGAATMNLKKENEECWSSDNSCDINLDISKTQASI-GGGSGGGG 240

Query: 241 RACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT 300
           R CSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT
Sbjct: 241 RGCSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT 300

Query: 301 ----AAAAGFWPWSSDQNSHFH 318
               AAAAGFWPW+SDQNSHFH
Sbjct: 301 AAAAAAAAGFWPWNSDQNSHFH 308

BLAST of ClCG09G018360 vs. NCBI nr
Match: XP_008459304.1 (PREDICTED: homeobox-leucine zipper protein ATHB-20-like [Cucumis melo])

HSP 1 Score: 548.9 bits (1413), Expect = 2.9e-152
Identity = 289/321 (90.03%), Postives = 294/321 (91.59%), Query Frame = 0

Query: 1   MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGV 60
           MASP HHSHSFMFQSRPA DHHEYVPSASFN IPSCPPHLYFHDGVVPVMMKRSMSFSGV
Sbjct: 1   MASPHHHSHSFMFQSRPAPDHHEYVPSASFNTIPSCPPHLYFHDGVVPVMMKRSMSFSGV 60

Query: 61  ENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQ 120
           ENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE+GNKLEPERKMQLAKALGLQ
Sbjct: 61  ENGCEDVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFEVGNKLEPERKMQLAKALGLQ 120

Query: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNR 180
           PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+        
Sbjct: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEL-------- 180

Query: 181 STVLDYSLKTKDSGE-AAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGG 240
                 +LKTKDSGE   GGGATMNLKKENERCWSSDNSCDINLDIS TQ PI    GGG
Sbjct: 181 -----LALKTKDSGETVGGGGATMNLKKENERCWSSDNSCDINLDISNTQTPI----GGG 240

Query: 241 GGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ 300
           GGRACSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQ
Sbjct: 241 GGRACSQPGMIKDLFPSAAFRSAAITQLLQHGSSRSTVDQHPQVIQEESFSQMFNGIEEQ 300

Query: 301 QQTAAAAGFWPWSSDQNSHFH 318
           QQTAAAAGFWPWSSDQNSHFH
Sbjct: 301 QQTAAAAGFWPWSSDQNSHFH 304

BLAST of ClCG09G018360 vs. NCBI nr
Match: KAG6578593.1 (Homeobox-leucine zipper protein HAT7, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 539.7 bits (1389), Expect = 1.7e-149
Identity = 279/321 (86.92%), Postives = 291/321 (90.65%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN 60
           MASPHHSHSF+FQSRPADHHEY+PSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSG+EN
Sbjct: 1   MASPHHSHSFLFQSRPADHHEYLPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGIEN 60

Query: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPR 120
           GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERK+QLAKALGLQPR
Sbjct: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKIQLAKALGLQPR 120

Query: 121 QIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRST 180
           Q+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADNDVLQAQNTKLHAE+          
Sbjct: 121 QVAIWFQNRRARWKTKQLERDYEVLKKHFESLKADNDVLQAQNTKLHAEL---------- 180

Query: 181 VLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGR 240
               +LKTKDSGE AGGGATMNLKKENERCWSSDNSCDINLDISKTQA IN   GG GGR
Sbjct: 181 ---LALKTKDSGEVAGGGATMNLKKENERCWSSDNSCDINLDISKTQAAIN---GGEGGR 240

Query: 241 ACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT- 300
           AC +PG IKDLFPSAAFRS AITQL+Q GSSRSTVDHPQVIQEESFSQMFNGIEEQQQ+ 
Sbjct: 241 ACCEPG-IKDLFPSAAFRSGAITQLIQRGSSRSTVDHPQVIQEESFSQMFNGIEEQQQSA 300

Query: 301 ---AAAAGFWPWSSDQNSHFH 318
              AAAAGFWPW SDQNSHF+
Sbjct: 301 AAAAAAAGFWPWGSDQNSHFN 304

BLAST of ClCG09G018360 vs. NCBI nr
Match: XP_004148689.1 (homeobox-leucine zipper protein ATHB-20 [Cucumis sativus] >KGN52448.1 hypothetical protein Csa_008273 [Cucumis sativus])

HSP 1 Score: 538.5 bits (1386), Expect = 3.9e-149
Identity = 286/322 (88.82%), Postives = 292/322 (90.68%), Query Frame = 0

Query: 1   MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGV 60
           MASP HHSHSFMFQSRPA DHHEY+PSASFN IPSCPPHLYFHDGVVPVMMKRSMSFS V
Sbjct: 1   MASPHHHSHSFMFQSRPAPDHHEYIPSASFNTIPSCPPHLYFHDGVVPVMMKRSMSFSEV 60

Query: 61  ENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQ 120
           ENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE+GNKLEPERKMQLAKALGLQ
Sbjct: 61  ENGCEDVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFEVGNKLEPERKMQLAKALGLQ 120

Query: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNR 180
           PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+        
Sbjct: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEL-------- 180

Query: 181 STVLDYSLKTKDSGE-AAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGG 240
                 +LKTKDSGE A GGGATMNLKKENERCWSSDNSCDINLDIS TQ PI    GG 
Sbjct: 181 -----LALKTKDSGETAGGGGATMNLKKENERCWSSDNSCDINLDISNTQTPI----GGS 240

Query: 241 GGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ 300
           GGR CSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQ
Sbjct: 241 GGRGCSQPGMIKDLFPSAAFRSAAITQLLQHGSSRSTVDQHPQVIQEESFSQMFNGIEEQ 300

Query: 301 QQTAAAAGFWPWS-SDQNSHFH 318
           QQTAAAAGFWPWS SDQNSHFH
Sbjct: 301 QQTAAAAGFWPWSTSDQNSHFH 305

BLAST of ClCG09G018360 vs. NCBI nr
Match: XP_022993652.1 (homeobox-leucine zipper protein ATHB-20-like [Cucurbita maxima])

HSP 1 Score: 536.6 bits (1381), Expect = 1.5e-148
Identity = 276/318 (86.79%), Postives = 291/318 (91.51%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN 60
           MASPHHSHSF+FQSRPADHHEY+PSASFNAIPSCPPHLYFHDGV+PVMMKRSMSFSGVEN
Sbjct: 1   MASPHHSHSFLFQSRPADHHEYLPSASFNAIPSCPPHLYFHDGVIPVMMKRSMSFSGVEN 60

Query: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPR 120
           GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERK+QLAKALGLQPR
Sbjct: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKIQLAKALGLQPR 120

Query: 121 QIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRST 180
           Q+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADNDVLQAQN+KLHA++          
Sbjct: 121 QVAIWFQNRRARWKTKQLERDYEVLKKHFESLKADNDVLQAQNSKLHAQL---------- 180

Query: 181 VLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGR 240
               +LKTKD+GE AGGGATMNLKKENERCWSSDNSCDINLDISKTQA IN   GG GGR
Sbjct: 181 ---LALKTKDTGEVAGGGATMNLKKENERCWSSDNSCDINLDISKTQAAIN---GGEGGR 240

Query: 241 ACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT- 300
           AC +PG IKDLFPSAAFRS AITQL+Q GSSRSTVDHPQVIQEESFSQMFNGIEEQQQT 
Sbjct: 241 ACCEPG-IKDLFPSAAFRSGAITQLIQRGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTA 300

Query: 301 AAAAGFWPWSSDQNSHFH 318
           AAAAGFWPW SDQ+SHF+
Sbjct: 301 AAAAGFWPWGSDQSSHFN 301

BLAST of ClCG09G018360 vs. ExPASy Swiss-Prot
Match: Q00466 (Homeobox-leucine zipper protein HAT7 OS=Arabidopsis thaliana OX=3702 GN=HAT7 PE=2 SV=4)

HSP 1 Score: 234.6 bits (597), Expect = 1.6e-60
Identity = 159/336 (47.32%), Postives = 198/336 (58.93%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVE- 60
           MA P   H FMFQ    D+  ++PS +  ++PSCPPHL F+ G    MM RSMSF+GV  
Sbjct: 22  MAFP--QHGFMFQQLHEDNAHHLPSPT--SLPSCPPHL-FYGGGGNYMMNRSMSFTGVSD 81

Query: 61  ---------------NGCEEVNGDEGLSDDG--LALGEKKKRLNLEQVKALEKSFELGNK 120
                          N  ++V  ++ LSDDG  + LGEKKKRLNLEQV+ALEKSFELGNK
Sbjct: 82  HHHLTQKSPTTTNNMNDQDQVGEEDNLSDDGSHMMLGEKKKRLNLEQVRALEKSFELGNK 141

Query: 121 LEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQ 180
           LEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+ LKKQF+ LK+DND L A 
Sbjct: 142 LEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYDSLKKQFDVLKSDNDSLLAH 201

Query: 181 NTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKE-NERCWSSDNSCDINL 240
           N KLHAE+              +LK  D  E+A       +K+E  E  WS++ S + N 
Sbjct: 202 NKKLHAEL-------------VALKKHDRKESA------KIKREFAEASWSNNGSTENNH 261

Query: 241 DISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVI 300
           + + + A              +   +IKDLFPS + RSA  T    H      +DH Q++
Sbjct: 262 NNNSSDA--------------NHVSMIKDLFPS-SIRSATATTTSTH------IDH-QIV 307

Query: 301 --QEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSH 316
             Q++ F  MFNGI+E      +A +W W   Q  H
Sbjct: 322 QDQDQGFCNMFNGIDE----TTSASYWAWPDQQQQH 307

BLAST of ClCG09G018360 vs. ExPASy Swiss-Prot
Match: Q8LAT0 (Homeobox-leucine zipper protein ATHB-20 OS=Arabidopsis thaliana OX=3702 GN=ATHB-20 PE=1 SV=2)

HSP 1 Score: 224.2 bits (570), Expect = 2.1e-57
Identity = 153/323 (47.37%), Postives = 187/323 (57.89%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN 60
           MA P   H FMFQ    D+       S + +PSCPPHL+  +G    MM RSMS   V+ 
Sbjct: 16  MAFP--QHGFMFQQLHEDN-------SQDQLPSCPPHLF--NGGGNYMMNRSMSLMNVQE 75

Query: 61  GCEEVNGDEGLSDDG--LALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQ 120
              +   +E LSDDG    LGEKKKRL LEQVKALEKSFELGNKLEPERK+QLAKALG+Q
Sbjct: 76  DHNQTLDEENLSDDGAHTMLGEKKKRLQLEQVKALEKSFELGNKLEPERKIQLAKALGMQ 135

Query: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNR 180
           PRQIAIWFQNRRARWKT+QLERDY+ LKKQFE+LK+DN  L A N KL AEV        
Sbjct: 136 PRQIAIWFQNRRARWKTRQLERDYDSLKKQFESLKSDNASLLAYNKKLLAEV-------- 195

Query: 181 STVLDYSLKTKDSGEAAGGGATMNLKKENERCW----SSDNSCDINLDISKTQAPINGSG 240
                 +LK K+  E         +K+E E  W    S++NS DINL++ +         
Sbjct: 196 -----MALKNKECNEG------NIVKREAEASWSNNGSTENSSDINLEMPRE-------- 255

Query: 241 GGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIE 300
                   +    IKDLFPS + RS+A      H        + +++QEES   MFNGI+
Sbjct: 256 -----TITTHVNTIKDLFPS-SIRSSA------HDDDHH--QNHEIVQEESLCNMFNGID 282

Query: 301 EQQQTAAAAGFWPWSSDQNSHFH 318
           E       AG+W WS   ++H H
Sbjct: 316 E----TTPAGYWAWSDPNHNHHH 282

BLAST of ClCG09G018360 vs. ExPASy Swiss-Prot
Match: A2XD08 (Homeobox-leucine zipper protein HOX21 OS=Oryza sativa subsp. indica OX=39946 GN=HOX21 PE=2 SV=2)

HSP 1 Score: 200.7 bits (509), Expect = 2.5e-50
Identity = 137/332 (41.27%), Postives = 180/332 (54.22%), Query Frame = 0

Query: 16  PADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVN--GDEGLSD 75
           P  H+ ++PS++      CP    F  G+ P++ KR MS+     G +EVN  G++ LSD
Sbjct: 63  PHPHNPFLPSSA-----QCPSLQEFR-GMAPMLGKRPMSYGDGGGGGDEVNGGGEDELSD 122

Query: 76  DGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNRRARW 135
           DG   GEKK+RLN+EQV+ LEK+FELGNKLEPERKMQLA+ALGLQPRQ+AIWFQNRRARW
Sbjct: 123 DGSQAGEKKRRLNVEQVRTLEKNFELGNKLEPERKMQLARALGLQPRQVAIWFQNRRARW 182

Query: 136 KTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTKDSGE 195
           KTKQLE+DY+ LK+Q +A+KA+ND L   N KL AE+  +     ++ L           
Sbjct: 183 KTKQLEKDYDALKRQLDAVKAENDALLNHNKKLQAEIVALKGREAASEL----------- 242

Query: 196 AAGGGATMNLKKENERCWS--SDNSCDINLDISKTQAP--------------INGSGGGG 255
                  +NL KE E   S  S+NS +INLDIS+T  P               +G GGGG
Sbjct: 243 -------INLNKETEASCSNRSENSSEINLDISRTPPPDAAALDAAPTAHHHHHGGGGGG 302

Query: 256 GG------------RACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEES 315
           GG            R  S  G+  D    ++   A   ++  HG   +       +   S
Sbjct: 303 GGGGGMIPFYTSIARPASGGGVDIDQLLHSSSGGAGGPKMEHHGGGGNV--QAASVDTAS 360

Query: 316 FSQMFNGIEEQQQTAAAAGFWPWSSDQNSHFH 318
           F  +  G++E         FWPW   Q  HFH
Sbjct: 363 FGNLLCGVDEPPP------FWPWPDHQ--HFH 360

BLAST of ClCG09G018360 vs. ExPASy Swiss-Prot
Match: Q8S7W9 (Homeobox-leucine zipper protein HOX21 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX21 PE=2 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 2.5e-50
Identity = 141/351 (40.17%), Postives = 180/351 (51.28%), Query Frame = 0

Query: 5   HHSHSFMFQSRPADHHEYVPSASFNAIPSCP--------PHLYFHDGVVPVMMKRSMSFS 64
           HH H    Q +   HH   P       P  P        P L    G+ P++ KR MS+ 
Sbjct: 44  HHGHHHEQQQQQQHHHHLGPPPPPPPHPHNPFLPSSAQCPSLQEFRGMAPMLGKRPMSYG 103

Query: 65  GVENGCEEVN--GDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKA 124
               G +EVN  G++ LSDDG   GEKK+RLN+EQV+ LEK+FELGNKLEPERKMQLA+A
Sbjct: 104 DGGGGGDEVNGGGEDELSDDGSQAGEKKRRLNVEQVRTLEKNFELGNKLEPERKMQLARA 163

Query: 125 LGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFIN 184
           LGLQPRQ+AIWFQNRRARWKTKQLE+DY+ LK+Q +A+KA+ND L   N KL AE+  + 
Sbjct: 164 LGLQPRQVAIWFQNRRARWKTKQLEKDYDALKRQLDAVKAENDALLNHNKKLQAEIVALK 223

Query: 185 NSNRSTVLDYSLKTKDSGEAAGGGATMNLKKENERCWS--SDNSCDINLDISKTQAP--- 244
               ++ L                  +NL KE E   S  S+NS +INLDIS+T  P   
Sbjct: 224 GREAASEL------------------INLNKETEASCSNRSENSSEINLDISRTPPPDAA 283

Query: 245 -----------INGSGGGGGG------------RACSQPGIIKDLFPSAAFRSAAITQLL 304
                       +G GGGGGG            R  S  G+  D    ++   A   ++ 
Sbjct: 284 ALDTAPTAHHHHHGGGGGGGGGGGMIPFYTSIARPASGGGVDIDQLLHSSSGGAGGPKME 343

Query: 305 QHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSHFH 318
            HG   +       +   SF  +  G++E         FWPW   Q  HFH
Sbjct: 344 HHGGGGNV--QAASVDTASFGNLLCGVDEPPP------FWPWPDHQ--HFH 366

BLAST of ClCG09G018360 vs. ExPASy Swiss-Prot
Match: Q8LC03 (Homeobox-leucine zipper protein ATHB-13 OS=Arabidopsis thaliana OX=3702 GN=ATHB-13 PE=2 SV=2)

HSP 1 Score: 196.8 bits (499), Expect = 3.6e-49
Identity = 138/309 (44.66%), Postives = 176/309 (56.96%), Query Frame = 0

Query: 9   SFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRS--MSFSGVENGCEEVN 68
           +FM Q+   D H +   +    +PSC      H G    + KRS       +E G   +N
Sbjct: 13  NFMIQTSYEDDHPHQSPSLAPLLPSCSLPQDLH-GFASFLGKRSPMEGCCDLETG-NNMN 72

Query: 69  GDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWF 128
           G+E  SDDG  +GEKK+RLN+EQVK LEK+FELGNKLEPERKMQLA+ALGLQPRQIAIWF
Sbjct: 73  GEEDYSDDGSQMGEKKRRLNMEQVKTLEKNFELGNKLEPERKMQLARALGLQPRQIAIWF 132

Query: 129 QNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSL 188
           QNRRARWKTKQLE+DY+ LK+QF+ LKA+ND+LQ  N KL AE+               L
Sbjct: 133 QNRRARWKTKQLEKDYDTLKRQFDTLKAENDLLQTHNQKLQAEI-------------MGL 192

Query: 189 KTKDSGEAAGGGATMNLKKENERCWS--SDNSCD-INLDISKTQAPINGSGGGGGGRACS 248
           K ++  E      ++NL KE E   S  SDNS D + LDIS T  P N S   GG     
Sbjct: 193 KNREQTE------SINLNKETEGSCSNRSDNSSDNLRLDIS-TAPPSNDSTLTGGHPPPP 252

Query: 249 QPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAA 308
           Q  + +  FP +   +   T  +Q   + S+     V +E S S MF  +++       +
Sbjct: 253 QT-VGRHFFPPSPATATTTTTTMQFFQNSSS-GQSMVKEENSISNMFCAMDDH------S 291

Query: 309 GFWPWSSDQ 313
           GFWPW   Q
Sbjct: 313 GFWPWLDQQ 291

BLAST of ClCG09G018360 vs. ExPASy TrEMBL
Match: A0A1S3C9V4 (homeobox-leucine zipper protein ATHB-20-like OS=Cucumis melo OX=3656 GN=LOC103498474 PE=4 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 1.4e-152
Identity = 289/321 (90.03%), Postives = 294/321 (91.59%), Query Frame = 0

Query: 1   MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGV 60
           MASP HHSHSFMFQSRPA DHHEYVPSASFN IPSCPPHLYFHDGVVPVMMKRSMSFSGV
Sbjct: 1   MASPHHHSHSFMFQSRPAPDHHEYVPSASFNTIPSCPPHLYFHDGVVPVMMKRSMSFSGV 60

Query: 61  ENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQ 120
           ENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE+GNKLEPERKMQLAKALGLQ
Sbjct: 61  ENGCEDVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFEVGNKLEPERKMQLAKALGLQ 120

Query: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNR 180
           PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+        
Sbjct: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEL-------- 180

Query: 181 STVLDYSLKTKDSGE-AAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGG 240
                 +LKTKDSGE   GGGATMNLKKENERCWSSDNSCDINLDIS TQ PI    GGG
Sbjct: 181 -----LALKTKDSGETVGGGGATMNLKKENERCWSSDNSCDINLDISNTQTPI----GGG 240

Query: 241 GGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ 300
           GGRACSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQ
Sbjct: 241 GGRACSQPGMIKDLFPSAAFRSAAITQLLQHGSSRSTVDQHPQVIQEESFSQMFNGIEEQ 300

Query: 301 QQTAAAAGFWPWSSDQNSHFH 318
           QQTAAAAGFWPWSSDQNSHFH
Sbjct: 301 QQTAAAAGFWPWSSDQNSHFH 304

BLAST of ClCG09G018360 vs. ExPASy TrEMBL
Match: A0A0A0KS90 (Homeobox domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G635430 PE=4 SV=1)

HSP 1 Score: 538.5 bits (1386), Expect = 1.9e-149
Identity = 286/322 (88.82%), Postives = 292/322 (90.68%), Query Frame = 0

Query: 1   MASP-HHSHSFMFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGV 60
           MASP HHSHSFMFQSRPA DHHEY+PSASFN IPSCPPHLYFHDGVVPVMMKRSMSFS V
Sbjct: 1   MASPHHHSHSFMFQSRPAPDHHEYIPSASFNTIPSCPPHLYFHDGVVPVMMKRSMSFSEV 60

Query: 61  ENGCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQ 120
           ENGCE+VNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFE+GNKLEPERKMQLAKALGLQ
Sbjct: 61  ENGCEDVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFEVGNKLEPERKMQLAKALGLQ 120

Query: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNR 180
           PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+        
Sbjct: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEL-------- 180

Query: 181 STVLDYSLKTKDSGE-AAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGG 240
                 +LKTKDSGE A GGGATMNLKKENERCWSSDNSCDINLDIS TQ PI    GG 
Sbjct: 181 -----LALKTKDSGETAGGGGATMNLKKENERCWSSDNSCDINLDISNTQTPI----GGS 240

Query: 241 GGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQ 300
           GGR CSQPG+IKDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQ
Sbjct: 241 GGRGCSQPGMIKDLFPSAAFRSAAITQLLQHGSSRSTVDQHPQVIQEESFSQMFNGIEEQ 300

Query: 301 QQTAAAAGFWPWS-SDQNSHFH 318
           QQTAAAAGFWPWS SDQNSHFH
Sbjct: 301 QQTAAAAGFWPWSTSDQNSHFH 305

BLAST of ClCG09G018360 vs. ExPASy TrEMBL
Match: A0A6J1JTG1 (homeobox-leucine zipper protein ATHB-20-like OS=Cucurbita maxima OX=3661 GN=LOC111489580 PE=4 SV=1)

HSP 1 Score: 536.6 bits (1381), Expect = 7.2e-149
Identity = 276/318 (86.79%), Postives = 291/318 (91.51%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN 60
           MASPHHSHSF+FQSRPADHHEY+PSASFNAIPSCPPHLYFHDGV+PVMMKRSMSFSGVEN
Sbjct: 1   MASPHHSHSFLFQSRPADHHEYLPSASFNAIPSCPPHLYFHDGVIPVMMKRSMSFSGVEN 60

Query: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPR 120
           GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERK+QLAKALGLQPR
Sbjct: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKIQLAKALGLQPR 120

Query: 121 QIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRST 180
           Q+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADNDVLQAQN+KLHA++          
Sbjct: 121 QVAIWFQNRRARWKTKQLERDYEVLKKHFESLKADNDVLQAQNSKLHAQL---------- 180

Query: 181 VLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGR 240
               +LKTKD+GE AGGGATMNLKKENERCWSSDNSCDINLDISKTQA IN   GG GGR
Sbjct: 181 ---LALKTKDTGEVAGGGATMNLKKENERCWSSDNSCDINLDISKTQAAIN---GGEGGR 240

Query: 241 ACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT- 300
           AC +PG IKDLFPSAAFRS AITQL+Q GSSRSTVDHPQVIQEESFSQMFNGIEEQQQT 
Sbjct: 241 ACCEPG-IKDLFPSAAFRSGAITQLIQRGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTA 300

Query: 301 AAAAGFWPWSSDQNSHFH 318
           AAAAGFWPW SDQ+SHF+
Sbjct: 301 AAAAGFWPWGSDQSSHFN 301

BLAST of ClCG09G018360 vs. ExPASy TrEMBL
Match: A0A6J1FFJ1 (homeobox-leucine zipper protein ATHB-20-like OS=Cucurbita moschata OX=3662 GN=LOC111445023 PE=4 SV=1)

HSP 1 Score: 535.8 bits (1379), Expect = 1.2e-148
Identity = 277/321 (86.29%), Postives = 290/321 (90.34%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN 60
           MASPHHSHSF+FQSRPADHHEY+PSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSG+EN
Sbjct: 1   MASPHHSHSFLFQSRPADHHEYLPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGIEN 60

Query: 61  GCEEVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPR 120
           GCEE+NGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERK+QLAKALGLQPR
Sbjct: 61  GCEEMNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKIQLAKALGLQPR 120

Query: 121 QIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRST 180
           Q+AIWFQNRRARWKTKQLERDYEVLKK FE+LKADNDVLQAQNTKLHAE+          
Sbjct: 121 QVAIWFQNRRARWKTKQLERDYEVLKKHFESLKADNDVLQAQNTKLHAEL---------- 180

Query: 181 VLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGR 240
               +LKTKDSGE AGGGATMNLKKENERCWSSDNSCDINLDISKTQA IN   GG GGR
Sbjct: 181 ---LALKTKDSGEVAGGGATMNLKKENERCWSSDNSCDINLDISKTQAAIN---GGEGGR 240

Query: 241 ACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQT- 300
           AC +PG IKDLFPSAAF S AITQL+Q GSSRSTVDHPQVIQEESFSQMFNGIEEQQQ+ 
Sbjct: 241 ACCEPG-IKDLFPSAAFGSGAITQLIQRGSSRSTVDHPQVIQEESFSQMFNGIEEQQQSA 300

Query: 301 ---AAAAGFWPWSSDQNSHFH 318
              AAAAGFWPW SDQNSHF+
Sbjct: 301 AAAAAAAGFWPWGSDQNSHFN 304

BLAST of ClCG09G018360 vs. ExPASy TrEMBL
Match: A0A5D3CVK8 (Homeobox-leucine zipper protein ATHB-20-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold892G00450 PE=4 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 3.0e-147
Identity = 278/310 (89.68%), Postives = 283/310 (91.29%), Query Frame = 0

Query: 11  MFQSRPA-DHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEEVNGDE 70
           MFQSRPA DHHEYVPSASFN IPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCE+VNGDE
Sbjct: 1   MFQSRPAPDHHEYVPSASFNTIPSCPPHLYFHDGVVPVMMKRSMSFSGVENGCEDVNGDE 60

Query: 71  GLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNR 130
           GLSDDGLALGEKKKRLNLEQVKALEKSFE+GNKLEPERKMQLAKALGLQPRQIAIWFQNR
Sbjct: 61  GLSDDGLALGEKKKRLNLEQVKALEKSFEVGNKLEPERKMQLAKALGLQPRQIAIWFQNR 120

Query: 131 RARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSLKTK 190
           RARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAE+              +LKTK
Sbjct: 121 RARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEL-------------LALKTK 180

Query: 191 DSGE-AAGGGATMNLKKENERCWSSDNSCDINLDISKTQAPINGSGGGGGGRACSQPGII 250
           DSGE   GGGATMNLKKENERCWSSDNSCDINLDIS TQ PI    GG GGRACSQPG+I
Sbjct: 181 DSGETVGGGGATMNLKKENERCWSSDNSCDINLDISNTQTPI----GGSGGRACSQPGMI 240

Query: 251 KDLFPSAAFRSAAITQLLQHGSSRSTVD-HPQVIQEESFSQMFNGIEEQQQTAAAAGFWP 310
           KDLFPSAAFRSAAITQLLQHGSSRSTVD HPQVIQEESFSQMFNGIEEQQQTAAAAGFWP
Sbjct: 241 KDLFPSAAFRSAAITQLLQHGSSRSTVDQHPQVIQEESFSQMFNGIEEQQQTAAAAGFWP 293

Query: 311 WSSDQNSHFH 318
           WSSDQNSHFH
Sbjct: 301 WSSDQNSHFH 293

BLAST of ClCG09G018360 vs. TAIR 10
Match: AT5G15150.1 (homeobox 3 )

HSP 1 Score: 234.6 bits (597), Expect = 1.1e-61
Identity = 159/336 (47.32%), Postives = 198/336 (58.93%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVE- 60
           MA P   H FMFQ    D+  ++PS +  ++PSCPPHL F+ G    MM RSMSF+GV  
Sbjct: 22  MAFP--QHGFMFQQLHEDNAHHLPSPT--SLPSCPPHL-FYGGGGNYMMNRSMSFTGVSD 81

Query: 61  ---------------NGCEEVNGDEGLSDDG--LALGEKKKRLNLEQVKALEKSFELGNK 120
                          N  ++V  ++ LSDDG  + LGEKKKRLNLEQV+ALEKSFELGNK
Sbjct: 82  HHHLTQKSPTTTNNMNDQDQVGEEDNLSDDGSHMMLGEKKKRLNLEQVRALEKSFELGNK 141

Query: 121 LEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQ 180
           LEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDY+ LKKQF+ LK+DND L A 
Sbjct: 142 LEPERKMQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYDSLKKQFDVLKSDNDSLLAH 201

Query: 181 NTKLHAEVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKE-NERCWSSDNSCDINL 240
           N KLHAE+              +LK  D  E+A       +K+E  E  WS++ S + N 
Sbjct: 202 NKKLHAEL-------------VALKKHDRKESA------KIKREFAEASWSNNGSTENNH 261

Query: 241 DISKTQAPINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVI 300
           + + + A              +   +IKDLFPS + RSA  T    H      +DH Q++
Sbjct: 262 NNNSSDA--------------NHVSMIKDLFPS-SIRSATATTTSTH------IDH-QIV 307

Query: 301 --QEESFSQMFNGIEEQQQTAAAAGFWPWSSDQNSH 316
             Q++ F  MFNGI+E      +A +W W   Q  H
Sbjct: 322 QDQDQGFCNMFNGIDE----TTSASYWAWPDQQQQH 307

BLAST of ClCG09G018360 vs. TAIR 10
Match: AT3G01220.1 (homeobox protein 20 )

HSP 1 Score: 224.2 bits (570), Expect = 1.5e-58
Identity = 153/323 (47.37%), Postives = 187/323 (57.89%), Query Frame = 0

Query: 1   MASPHHSHSFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRSMSFSGVEN 60
           MA P   H FMFQ    D+       S + +PSCPPHL+  +G    MM RSMS   V+ 
Sbjct: 16  MAFP--QHGFMFQQLHEDN-------SQDQLPSCPPHLF--NGGGNYMMNRSMSLMNVQE 75

Query: 61  GCEEVNGDEGLSDDG--LALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQ 120
              +   +E LSDDG    LGEKKKRL LEQVKALEKSFELGNKLEPERK+QLAKALG+Q
Sbjct: 76  DHNQTLDEENLSDDGAHTMLGEKKKRLQLEQVKALEKSFELGNKLEPERKIQLAKALGMQ 135

Query: 121 PRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNR 180
           PRQIAIWFQNRRARWKT+QLERDY+ LKKQFE+LK+DN  L A N KL AEV        
Sbjct: 136 PRQIAIWFQNRRARWKTRQLERDYDSLKKQFESLKSDNASLLAYNKKLLAEV-------- 195

Query: 181 STVLDYSLKTKDSGEAAGGGATMNLKKENERCW----SSDNSCDINLDISKTQAPINGSG 240
                 +LK K+  E         +K+E E  W    S++NS DINL++ +         
Sbjct: 196 -----MALKNKECNEG------NIVKREAEASWSNNGSTENSSDINLEMPRE-------- 255

Query: 241 GGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIE 300
                   +    IKDLFPS + RS+A      H        + +++QEES   MFNGI+
Sbjct: 256 -----TITTHVNTIKDLFPS-SIRSSA------HDDDHH--QNHEIVQEESLCNMFNGID 282

Query: 301 EQQQTAAAAGFWPWSSDQNSHFH 318
           E       AG+W WS   ++H H
Sbjct: 316 E----TTPAGYWAWSDPNHNHHH 282

BLAST of ClCG09G018360 vs. TAIR 10
Match: AT1G69780.1 (Homeobox-leucine zipper protein family )

HSP 1 Score: 196.8 bits (499), Expect = 2.6e-50
Identity = 138/309 (44.66%), Postives = 176/309 (56.96%), Query Frame = 0

Query: 9   SFMFQSRPADHHEYVPSASFNAIPSCPPHLYFHDGVVPVMMKRS--MSFSGVENGCEEVN 68
           +FM Q+   D H +   +    +PSC      H G    + KRS       +E G   +N
Sbjct: 13  NFMIQTSYEDDHPHQSPSLAPLLPSCSLPQDLH-GFASFLGKRSPMEGCCDLETG-NNMN 72

Query: 69  GDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWF 128
           G+E  SDDG  +GEKK+RLN+EQVK LEK+FELGNKLEPERKMQLA+ALGLQPRQIAIWF
Sbjct: 73  GEEDYSDDGSQMGEKKRRLNMEQVKTLEKNFELGNKLEPERKMQLARALGLQPRQIAIWF 132

Query: 129 QNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVEFINNSNRSTVLDYSL 188
           QNRRARWKTKQLE+DY+ LK+QF+ LKA+ND+LQ  N KL AE+               L
Sbjct: 133 QNRRARWKTKQLEKDYDTLKRQFDTLKAENDLLQTHNQKLQAEI-------------MGL 192

Query: 189 KTKDSGEAAGGGATMNLKKENERCWS--SDNSCD-INLDISKTQAPINGSGGGGGGRACS 248
           K ++  E      ++NL KE E   S  SDNS D + LDIS T  P N S   GG     
Sbjct: 193 KNREQTE------SINLNKETEGSCSNRSDNSSDNLRLDIS-TAPPSNDSTLTGGHPPPP 252

Query: 249 QPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQMFNGIEEQQQTAAAA 308
           Q  + +  FP +   +   T  +Q   + S+     V +E S S MF  +++       +
Sbjct: 253 QT-VGRHFFPPSPATATTTTTTMQFFQNSSS-GQSMVKEENSISNMFCAMDDH------S 291

Query: 309 GFWPWSSDQ 313
           GFWPW   Q
Sbjct: 313 GFWPWLDQQ 291

BLAST of ClCG09G018360 vs. TAIR 10
Match: AT1G26960.1 (homeobox protein 23 )

HSP 1 Score: 172.2 bits (435), Expect = 6.8e-43
Identity = 118/264 (44.70%), Postives = 155/264 (58.71%), Query Frame = 0

Query: 50  KRSMSFSGVENGCE-EVNGDEGLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERK 109
           KRS   + V+  C  ++NGDE  SDDG  +GEKK+RLN+EQ+KALEK FELGNKLE +RK
Sbjct: 40  KRS-PMNNVQGFCNLDMNGDEEYSDDGSKMGEKKRRLNMEQLKALEKDFELGNKLESDRK 99

Query: 110 MQLAKALGLQPRQIAIWFQNRRARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHA 169
           ++LA+ALGLQPRQIAIWFQNRRAR KTKQLE+DY++LK+QFE+L+ +N+VLQ QN KL A
Sbjct: 100 LELARALGLQPRQIAIWFQNRRARSKTKQLEKDYDMLKRQFESLRDENEVLQTQNQKLQA 159

Query: 170 EVEFINNSNRSTVLDYSLKTKDSGEAAGGGATMNLKKENERCWSSDNSCDINLDISKTQA 229
           +V              +LK+++  E      ++NL KE E    SD S +I+ DI     
Sbjct: 160 QV-------------MALKSREPIE------SINLNKETEGS-CSDRSENISGDI----- 219

Query: 230 PINGSGGGGGGRACSQPGIIKDLFPSAAFRSAAITQLLQHGSSRSTVDHPQVIQEESFSQ 289
                          +P  I   F      +    Q  Q+ SS    +   V +E S S 
Sbjct: 220 ---------------RPPEIDSQFALGHPPTTTTMQFFQNSSS----EQRMVKEENSISN 252

Query: 290 MFNGIEEQQQTAAAAGFWPWSSDQ 313
           MF GI++Q      +GFWPW   Q
Sbjct: 280 MFCGIDDQ------SGFWPWLDQQ 252

BLAST of ClCG09G018360 vs. TAIR 10
Match: AT5G65310.1 (homeobox protein 5 )

HSP 1 Score: 128.6 bits (322), Expect = 8.6e-30
Identity = 65/102 (63.73%), Postives = 79/102 (77.45%), Query Frame = 0

Query: 70  GLSDDGLALGEKKKRLNLEQVKALEKSFELGNKLEPERKMQLAKALGLQPRQIAIWFQNR 129
           G+        EKK+RL +EQVKALEK+FE+ NKLEPERK++LA+ LGLQPRQ+AIWFQNR
Sbjct: 61  GVGHASSTAAEKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNR 120

Query: 130 RARWKTKQLERDYEVLKKQFEALKADNDVLQAQNTKLHAEVE 172
           RARWKTKQLERDY VLK  F+ALK + D LQ  N  L  +++
Sbjct: 121 RARWKTKQLERDYGVLKSNFDALKRNRDSLQRDNDSLLGQIK 162

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038890842.14.7e-15590.37LOW QUALITY PROTEIN: homeobox-leucine zipper protein ATHB-20-like [Benincasa his... [more]
XP_008459304.12.9e-15290.03PREDICTED: homeobox-leucine zipper protein ATHB-20-like [Cucumis melo][more]
KAG6578593.11.7e-14986.92Homeobox-leucine zipper protein HAT7, partial [Cucurbita argyrosperma subsp. sor... [more]
XP_004148689.13.9e-14988.82homeobox-leucine zipper protein ATHB-20 [Cucumis sativus] >KGN52448.1 hypothetic... [more]
XP_022993652.11.5e-14886.79homeobox-leucine zipper protein ATHB-20-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Q004661.6e-6047.32Homeobox-leucine zipper protein HAT7 OS=Arabidopsis thaliana OX=3702 GN=HAT7 PE=... [more]
Q8LAT02.1e-5747.37Homeobox-leucine zipper protein ATHB-20 OS=Arabidopsis thaliana OX=3702 GN=ATHB-... [more]
A2XD082.5e-5041.27Homeobox-leucine zipper protein HOX21 OS=Oryza sativa subsp. indica OX=39946 GN=... [more]
Q8S7W92.5e-5040.17Homeobox-leucine zipper protein HOX21 OS=Oryza sativa subsp. japonica OX=39947 G... [more]
Q8LC033.6e-4944.66Homeobox-leucine zipper protein ATHB-13 OS=Arabidopsis thaliana OX=3702 GN=ATHB-... [more]
Match NameE-valueIdentityDescription
A0A1S3C9V41.4e-15290.03homeobox-leucine zipper protein ATHB-20-like OS=Cucumis melo OX=3656 GN=LOC10349... [more]
A0A0A0KS901.9e-14988.82Homeobox domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G635430 PE... [more]
A0A6J1JTG17.2e-14986.79homeobox-leucine zipper protein ATHB-20-like OS=Cucurbita maxima OX=3661 GN=LOC1... [more]
A0A6J1FFJ11.2e-14886.29homeobox-leucine zipper protein ATHB-20-like OS=Cucurbita moschata OX=3662 GN=LO... [more]
A0A5D3CVK83.0e-14789.68Homeobox-leucine zipper protein ATHB-20-like OS=Cucumis melo var. makuwa OX=1194... [more]
Match NameE-valueIdentityDescription
AT5G15150.11.1e-6147.32homeobox 3 [more]
AT3G01220.11.5e-5847.37homeobox protein 20 [more]
AT1G69780.12.6e-5044.66Homeobox-leucine zipper protein family [more]
AT1G26960.16.8e-4344.70homeobox protein 23 [more]
AT5G65310.18.6e-3063.73homeobox protein 5 [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 128..169
NoneNo IPR availableGENE3D1.10.10.60coord: 77..143
e-value: 3.3E-19
score: 70.1
NoneNo IPR availablePANTHERPTHR24326:SF538HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT7coord: 1..317
NoneNo IPR availablePANTHERPTHR24326HOMEOBOX-LEUCINE ZIPPER PROTEINcoord: 1..317
IPR000047Helix-turn-helix motifPRINTSPR00031HTHREPRESSRcoord: 116..132
score: 58.02
coord: 107..116
score: 47.28
IPR001356Homeobox domainSMARTSM00389HOX_1coord: 79..140
e-value: 3.3E-17
score: 73.1
IPR001356Homeobox domainPFAMPF00046Homeodomaincoord: 82..134
e-value: 3.0E-15
score: 55.8
IPR001356Homeobox domainPROSITEPS50071HOMEOBOX_2coord: 80..136
score: 16.535826
IPR001356Homeobox domainCDDcd00086homeodomaincoord: 81..137
e-value: 6.97E-17
score: 71.5056
IPR003106Leucine zipper, homeobox-associatedPFAMPF02183HALZcoord: 136..171
e-value: 9.7E-15
score: 54.5
IPR017970Homeobox, conserved sitePROSITEPS00027HOMEOBOX_1coord: 111..134
IPR009057Homeobox-like domain superfamilySUPERFAMILY46689Homeodomain-likecoord: 79..138

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG09G018360.2ClCG09G018360.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006357 regulation of transcription by RNA polymerase II
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
molecular_function GO:0000981 DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003677 DNA binding