ClCG01G006660 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G006660
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: B box-type domain-containing protein
Location: CG_Chr01: 7536696 .. 7539334 (+)
RNA-Seq Expression: ClCG01G006660
Synteny: ClCG01G006660
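The Location field can be read programmatically. The sketch below parses the location string from the Overview and computes the gene span. It assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page. The helper name `parse_location` is hypothetical.

```python
import re

# Location string copied from the Overview above.
# Assumption: coordinates are 1-based and inclusive.
location = "CG_Chr01: 7536696 .. 7539334 (+)"


def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location(location)
span = end - start + 1  # inclusive span in base pairs
print(seqid, strand, span)
```

Under that assumption the gene spans 2639 bp on the forward strand of CG_Chr01.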
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACTCTAATGGGCGCAAATGTAAGCAGCAGCTTCCAAAAACATTTCACCTTCCATTTTCTCACTTTCCCCTTTTCTTTTTATTATAATAATTTTCATTCTTCTTCTTCTTATTATTATTATTCATTTTTTCACTTTGCAGCCGCACATACAGTCTCTATTGCCTATTCATGTTTCTCTCTCTCTGATTTCTCTTCTCCCCTTCTCTTTTTCCCAATATACTTGAAGAAATTTCCAACAATTTCAGTCTTTCTGATCGATTTTTCATCGACCCAGAACTTGGGGTTCCTTCAAATTACATTTGGATATGGTGAGCATTGAAATTCCCTTTTTTTTATTTATTTTTTTTTTATTTTTTTTTTTTTGCTGTTTTGAGGTGTTTTTTCTACGTTTCTTCTTTTTAATTTCTTGTTTGTTTTGTCCAATCATGTAGAGTTTTTCCACCCTGATTGTTTCTCTCCTTTGTTAATCATGGAATGTTGTTTTTTGGGTGTTCTTGGATCATTTTGTTTGAGTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAATTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGAATTATAATTTGCTGATCGTTGAGAGCATGGATTGTTTATCATCCAAACATGTTTGGTTGCAATCGTTTGATTTTGAATTTATTGTCTTGTGTTTTAGATTCATGAATTAGTATCTTATAATTTCCCTCTTACTGAGATTAGTTAGTTGATTGTGACTGATTTGAGAATGTATTCTATTTTGATTTTACAAACAATTGCCATAACTTGCATCATGAATTGGTAATTTTGACAAATTATACCTAATTTCTTTTTCACTTTTTTGATGATCCCAATTTGGTATTGTTTGGTTTATGTTAGGATTTTCTAATCATTTGGCAAACTAGTCTTATTCACAAGCAATCATCAATGAATTGAATTGTTTGGATATGGATTTTGCAGGCAGGATGCAAGATTTATCTGAAGAAAAGGAAAGCAGATTGGCTTAGTTCCCTTTTGCAAAGCAAATTTTTTGGTTCTTGTGTTCATCATCAAAACAATAGGAAAAATGAAAAAAATGTGTTTTGTATAGACTGTGGTATTGCCATCTGCAGGCATTGTTTGATATCTCATTGCGTTCATCGGCGGCTACAGATCTGTAAGTATGTTTATCAGTACGTCGTTCGTGTACCCGACTTGCAAAACCACCTTGATTGTTGCAATATTCAGGTAATCCCTATGTTTCCTTCTTAAATTATCTAACCTTCTAATACTTAAGTTGTCTTTGTTACCTGTTGCTGGTGGTTGTAACTTGTAAGTTGTGATCAAATGAAACAAGAAGTTGTATGACCAATTTCATTCAAGAGGAACCATACTTCTAGATGTTTAATCACCATAAACGAAACAACTCTCGGAGGGAATGACGTCAAGAAAGCATTGAATTCAATCATAATAGAGAACTTTTCTTTTGTTGTTATTGATCAATTATGGTAAAGTTTGGGACTAAATTAGTGAATGTTTGATTCGCAAGACTTATAAGA
TAAATGGGGAGAAGGCTGTTCATTTGAGTCCTCGTCCACAATCAAAAGATTCGAAACCATCAACAAAATTGAAGTTTGGAGGTACTTGTGAAGCCTGTGGGAGATACATACAAGACTTGCCTAATCGCTTCTGCTCGATCGCTTGCAAAGTAAATGAAATTCCTTCTTGTTTCTCTCTTAAGTTGTATTTCCTCTCATTCATTTCCTCATACATATGTCTTGACCATGTTCCATTACTCATTTTTTCAGGTCTCTATGGTTCCAATGGAACTTAACAATCAAAGCTTTAGATGCATTGATTCGGAGTCGAAGCTTAAAGACATTCCATGGAAGGAGAACCATAATCTAGAAACCAATACAAGCGAAATGGAATCGTCGTCAATCTCGATGGCGGAATCAACTGAAGAGATTCAAGCATGGCGAGTGAAGACGGTCTTGAATCCAAAGAAACTTTTGCATAAAAGGAAGGGCATTCCTCATCGATCACCTCTAAAGTAATGTCTAATAAAAACTAATCATGAAGATCTCCAATACCATTTTTCTGCCAAACCCACATTTTTTTTCATATATCAAAGTCCTATAAGCATATAAGTATAGATATATTTAATAACATATATTTCACCTCTTGGTTTTTGGCTC

mRNA sequence

ACTCTAATGGGCGCAAATGCAGGATGCAAGATTTATCTGAAGAAAAGGAAAGCAGATTGGCTTAGTTCCCTTTTGCAAAGCAAATTTTTTGGTTCTTGTGTTCATCATCAAAACAATAGGAAAAATGAAAAAAATGTGTTTTGTATAGACTGTGGTATTGCCATCTGCAGGCATTGTTTGATATCTCATTGCGTTCATCGGCGGCTACAGATCTGTAAGTATGTTTATCAGTACGTCGTTCGTGTACCCGACTTGCAAAACCACCTTGATTGTTGCAATATTCAGACTTATAAGATAAATGGGGAGAAGGCTGTTCATTTGAGTCCTCGTCCACAATCAAAAGATTCGAAACCATCAACAAAATTGAAGTTTGGAGGTACTTGTGAAGCCTGTGGGAGATACATACAAGACTTGCCTAATCGCTTCTGCTCGATCGCTTGCAAAGTCTCTATGGTTCCAATGGAACTTAACAATCAAAGCTTTAGATGCATTGATTCGGAGTCGAAGCTTAAAGACATTCCATGGAAGGAGAACCATAATCTAGAAACCAATACAAGCGAAATGGAATCGTCGTCAATCTCGATGGCGGAATCAACTGAAGAGATTCAAGCATGGCGAGTGAAGACGGTCTTGAATCCAAAGAAACTTTTGCATAAAAGGAAGGGCATTCCTCATCGATCACCTCTAAAGTAATGTCTAATAAAAACTAATCATGAAGATCTCCAATACCATTTTTCTGCCAAACCCACATTTTTTTTCATATATCAAAGTCCTATAAGCATATAAGTATAGATATATTTAATAACATATATTTCACCTCTTGGTTTTTGGCTC

Coding sequence (CDS)

ATGGGCGCAAATGCAGGATGCAAGATTTATCTGAAGAAAAGGAAAGCAGATTGGCTTAGTTCCCTTTTGCAAAGCAAATTTTTTGGTTCTTGTGTTCATCATCAAAACAATAGGAAAAATGAAAAAAATGTGTTTTGTATAGACTGTGGTATTGCCATCTGCAGGCATTGTTTGATATCTCATTGCGTTCATCGGCGGCTACAGATCTGTAAGTATGTTTATCAGTACGTCGTTCGTGTACCCGACTTGCAAAACCACCTTGATTGTTGCAATATTCAGACTTATAAGATAAATGGGGAGAAGGCTGTTCATTTGAGTCCTCGTCCACAATCAAAAGATTCGAAACCATCAACAAAATTGAAGTTTGGAGGTACTTGTGAAGCCTGTGGGAGATACATACAAGACTTGCCTAATCGCTTCTGCTCGATCGCTTGCAAAGTCTCTATGGTTCCAATGGAACTTAACAATCAAAGCTTTAGATGCATTGATTCGGAGTCGAAGCTTAAAGACATTCCATGGAAGGAGAACCATAATCTAGAAACCAATACAAGCGAAATGGAATCGTCGTCAATCTCGATGGCGGAATCAACTGAAGAGATTCAAGCATGGCGAGTGAAGACGGTCTTGAATCCAAAGAAACTTTTGCATAAAAGGAAGGGCATTCCTCATCGATCACCTCTAAAGTAA

Protein sequence

MGANAGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVHRRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGGTCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTSEMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK
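The protein sequence follows from the CDS by translation. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the snippet below translates the first and last few codons of the CDS shown above. The function name `translate` and the short test fragments are illustrative only; the page does not say which tool produced the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS above.
print(translate("ATGGGCGCAAATGCAGGATGCAAG"))
print(translate("CCTCTAAAGTAA"))
```

The two fragments correspond to the start (MGANAGCK) and end (PLK) of the protein sequence listed above.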
Homology
BLAST of ClCG01G006660 vs. NCBI nr
Match: XP_008467074.1 (PREDICTED: uncharacterized protein LOC103504510 [Cucumis melo])

HSP 1 Score: 446.0 bits (1146), Expect = 1.9e-121
Identity = 211/224 (94.20%), Positives = 217/224 (96.88%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKI+LKKRK DWL+SLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH
Sbjct: 2   AGCKIFLKKRKTDWLNSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           RRLQICKYVYQYVVRVPDLQ+HLDCCNIQTYKINGEKAVHL PRPQSKDSKPSTKLKFGG
Sbjct: 62  RRLQICKYVYQYVVRVPDLQDHLDCCNIQTYKINGEKAVHLCPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQ  R IDSES LKDIPWKENHNLE NTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQCCRFIDSESNLKDIPWKENHNLEINTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EMESSSIS+AESTEEI+AWRVKT+LNPKKLLHKRKGIPHRSPLK
Sbjct: 182 EMESSSISVAESTEEIKAWRVKTILNPKKLLHKRKGIPHRSPLK 225
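The Identity figure in each HSP is the number of aligned positions with the same residue, divided by the alignment length. The sketch below counts identities for the last segment of the alignment above and reproduces the whole-HSP percentage from the reported counts. The function name `count_identities` is hypothetical, and the snippet only illustrates the arithmetic; it is not how BLAST itself is implemented.

```python
# Last alignment segment of the Cucumis melo hit shown above.
query = "EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK"
sbjct = "EMESSSISVAESTEEIKAWRVKTILNPKKLLHKRKGIPHRSPLK"


def count_identities(a, b):
    """Count aligned columns where both rows carry the same residue."""
    if len(a) != len(b):
        raise ValueError("aligned rows must have equal length")
    return sum(x == y and x != "-" for x, y in zip(a, b))


matches = count_identities(query, sbjct)
print(f"segment: {matches}/{len(query)} identical")

# Whole-HSP figure, using the counts reported in the HSP header.
print(f"HSP identity: {100 * 211 / 224:.2f}%")
```

Positives are counted the same way, except that substitutions with a positive score in the scoring matrix (marked `+` in the midline) are also included.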

BLAST of ClCG01G006660 vs. NCBI nr
Match: XP_004142823.1 (uncharacterized protein LOC101210438 [Cucumis sativus] >KGN51458.1 hypothetical protein Csa_009445 [Cucumis sativus])

HSP 1 Score: 444.5 bits (1142), Expect = 5.5e-121
Identity = 211/224 (94.20%), Positives = 216/224 (96.43%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKIYLKKRK DWL+SLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH
Sbjct: 2   AGCKIYLKKRKTDWLNSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           RRLQICKYVYQYVVRVPDLQ+HLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG
Sbjct: 62  RRLQICKYVYQYVVRVPDLQDHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQ  R +DSE  LKDIPWKENHNLE NTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQCCRFMDSEPNLKDIPWKENHNLEINTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EMESSSIS+AESTEEI+AWRVK VLNPKKLLHKRKGIPHRSPLK
Sbjct: 182 EMESSSISVAESTEEIKAWRVKMVLNPKKLLHKRKGIPHRSPLK 225

BLAST of ClCG01G006660 vs. NCBI nr
Match: XP_038875225.1 (uncharacterized protein LOC120067735 [Benincasa hispida])

HSP 1 Score: 443.4 bits (1139), Expect = 1.2e-120
Identity = 211/224 (94.20%), Positives = 217/224 (96.88%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKIYLKKRK DWL+SLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH
Sbjct: 2   AGCKIYLKKRKTDWLNSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           +RLQICKYVYQYVVRVPDLQN LDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG
Sbjct: 62  QRLQICKYVYQYVVRVPDLQNLLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMVP+ELNNQ+ RCIDSES LK IPW ENHNLETNTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVPVELNNQNCRCIDSESNLKGIPWTENHNLETNTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EMESSSIS+AESTEEI+AWRVKTVLNPKKLLHKRKGIPHRS LK
Sbjct: 182 EMESSSISVAESTEEIKAWRVKTVLNPKKLLHKRKGIPHRSHLK 225

BLAST of ClCG01G006660 vs. NCBI nr
Match: XP_022963423.1 (uncharacterized protein LOC111463631 [Cucurbita moschata] >XP_023545157.1 uncharacterized protein LOC111804538 [Cucurbita pepo subsp. pepo] >KAG6601888.1 hypothetical protein SDJN03_07121, partial [Cucurbita argyrosperma subsp. sororia] >KAG7032586.1 hypothetical protein SDJN02_06635 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 430.6 bits (1106), Expect = 8.2e-117
Identity = 205/224 (91.52%), Positives = 215/224 (95.98%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHC+H
Sbjct: 2   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCIH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG
Sbjct: 62  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMVP+ELNNQS RCI S+S   +I WKENHN+E NTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVPVELNNQSCRCITSKSDPNEISWKENHNVEPNTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EM+ SSISMAESTEEI+AW+VK+ LNP+KLLHKRKGIPHRSPLK
Sbjct: 182 EMK-SSISMAESTEEIKAWQVKSGLNPRKLLHKRKGIPHRSPLK 224

BLAST of ClCG01G006660 vs. NCBI nr
Match: XP_022990994.1 (uncharacterized protein LOC111487719 [Cucurbita maxima])

HSP 1 Score: 427.6 bits (1098), Expect = 6.9e-116
Identity = 204/224 (91.07%), Positives = 214/224 (95.54%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHC+H
Sbjct: 2   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCIH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG
Sbjct: 62  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMV +ELNNQS RCI S+S   +I WKENHN+E NTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVSVELNNQSCRCITSKSDPNEISWKENHNVEPNTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EM+ SSISMAESTEEI+AW+VK+ LNP+KLLHKRKGIPHRSPLK
Sbjct: 182 EMK-SSISMAESTEEIKAWQVKSGLNPRKLLHKRKGIPHRSPLK 224

BLAST of ClCG01G006660 vs. ExPASy Swiss-Prot
Match: Q1G3Q4 (Protein RGF1 INDUCIBLE TRANSCRIPTION FACTOR 1 OS=Arabidopsis thaliana OX=3702 GN=RITF1 PE=1 SV=1)

HSP 1 Score: 126.3 bits (316), Expect = 4.3e-28
Identity = 60/137 (43.80%), Positives = 83/137 (60.58%), Query Frame = 0

Query: 14  RKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVHRRLQICKYV 73
           +K  WL +L   KFF  C +H+  +KNE+NV C+DC  ++C HC+ SH  HR LQ+ +YV
Sbjct: 4   QKPAWLDALYAEKFFVGCPYHETAKKNERNVCCLDCCTSLCPHCVPSHRFHRLLQVRRYV 63

Query: 74  YQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGGTCEACGRYI 133
           Y  VVR+ DLQ  +DC N+Q Y IN  K V +  RPQ++  K +     G  C +C R +
Sbjct: 64  YHDVVRLEDLQKLIDCSNVQAYTINSAKVVFIKKRPQNRQFKGA-----GNYCTSCDRSL 123

Query: 134 QDLPNRFCSIACKVSMV 151
           Q+ P   CS+ CKV  V
Sbjct: 124 QE-PYIHCSLGCKVDFV 134

BLAST of ClCG01G006660 vs. ExPASy TrEMBL
Match: A0A1S3CSV3 (uncharacterized protein LOC103504510 OS=Cucumis melo OX=3656 GN=LOC103504510 PE=4 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 9.1e-122
Identity = 211/224 (94.20%), Positives = 217/224 (96.88%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKI+LKKRK DWL+SLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH
Sbjct: 2   AGCKIFLKKRKTDWLNSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           RRLQICKYVYQYVVRVPDLQ+HLDCCNIQTYKINGEKAVHL PRPQSKDSKPSTKLKFGG
Sbjct: 62  RRLQICKYVYQYVVRVPDLQDHLDCCNIQTYKINGEKAVHLCPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQ  R IDSES LKDIPWKENHNLE NTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQCCRFIDSESNLKDIPWKENHNLEINTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EMESSSIS+AESTEEI+AWRVKT+LNPKKLLHKRKGIPHRSPLK
Sbjct: 182 EMESSSISVAESTEEIKAWRVKTILNPKKLLHKRKGIPHRSPLK 225

BLAST of ClCG01G006660 vs. ExPASy TrEMBL
Match: A0A0A0KPD0 (B box-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G550220 PE=4 SV=1)

HSP 1 Score: 444.5 bits (1142), Expect = 2.7e-121
Identity = 211/224 (94.20%), Positives = 216/224 (96.43%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKIYLKKRK DWL+SLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH
Sbjct: 2   AGCKIYLKKRKTDWLNSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           RRLQICKYVYQYVVRVPDLQ+HLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG
Sbjct: 62  RRLQICKYVYQYVVRVPDLQDHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQ  R +DSE  LKDIPWKENHNLE NTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQCCRFMDSEPNLKDIPWKENHNLEINTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EMESSSIS+AESTEEI+AWRVK VLNPKKLLHKRKGIPHRSPLK
Sbjct: 182 EMESSSISVAESTEEIKAWRVKMVLNPKKLLHKRKGIPHRSPLK 225

BLAST of ClCG01G006660 vs. ExPASy TrEMBL
Match: A0A6J1HHR7 (uncharacterized protein LOC111463631 OS=Cucurbita moschata OX=3662 GN=LOC111463631 PE=4 SV=1)

HSP 1 Score: 430.6 bits (1106), Expect = 4.0e-117
Identity = 205/224 (91.52%), Positives = 215/224 (95.98%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHC+H
Sbjct: 2   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCIH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG
Sbjct: 62  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMVP+ELNNQS RCI S+S   +I WKENHN+E NTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVPVELNNQSCRCITSKSDPNEISWKENHNVEPNTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EM+ SSISMAESTEEI+AW+VK+ LNP+KLLHKRKGIPHRSPLK
Sbjct: 182 EMK-SSISMAESTEEIKAWQVKSGLNPRKLLHKRKGIPHRSPLK 224

BLAST of ClCG01G006660 vs. ExPASy TrEMBL
Match: A0A6J1JTJ5 (uncharacterized protein LOC111487719 OS=Cucurbita maxima OX=3661 GN=LOC111487719 PE=4 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 3.4e-116
Identity = 204/224 (91.07%), Positives = 214/224 (95.54%), Query Frame = 0

Query: 5   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVH 64
           AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHC+H
Sbjct: 2   AGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCIH 61

Query: 65  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 124
           RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG
Sbjct: 62  RRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGG 121

Query: 125 TCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTS 184
           TCEACGRYIQDLPNRFCSIACKVSMV +ELNNQS RCI S+S   +I WKENHN+E NTS
Sbjct: 122 TCEACGRYIQDLPNRFCSIACKVSMVSVELNNQSCRCITSKSDPNEISWKENHNVEPNTS 181

Query: 185 EMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           EM+ SSISMAESTEEI+AW+VK+ LNP+KLLHKRKGIPHRSPLK
Sbjct: 182 EMK-SSISMAESTEEIKAWQVKSGLNPRKLLHKRKGIPHRSPLK 224

BLAST of ClCG01G006660 vs. ExPASy TrEMBL
Match: A0A6J1CY39 (uncharacterized protein LOC111015643 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111015643 PE=4 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 2.9e-112
Identity = 199/228 (87.28%), Positives = 210/228 (92.11%), Query Frame = 0

Query: 1   MGANAGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLIS 60
           +   AGCKIYLKKRKADWLSSLLQSKFFGSC HHQNNRKNEKNVFC+DCG+AICRHCLIS
Sbjct: 5   LSKQAGCKIYLKKRKADWLSSLLQSKFFGSCGHHQNNRKNEKNVFCLDCGVAICRHCLIS 64

Query: 61  HCVHRRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKL 120
           HC+HRRLQICKYVYQYVVRV DLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKL
Sbjct: 65  HCIHRRLQICKYVYQYVVRVHDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKL 124

Query: 121 KFGGTCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLE 180
           KFGGTCEACGRYIQDLPNRFCSIACKVSM P+ELN+QS R + SE  LKD+  KENHN E
Sbjct: 125 KFGGTCEACGRYIQDLPNRFCSIACKVSMAPVELNDQSSRFMASEPNLKDLTCKENHNPE 184

Query: 181 TNTSEMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK 229
           TNTSE+E SSISMAESTEEI+AWR K+ LN KKLLHKRKGIPHRSPLK
Sbjct: 185 TNTSEIE-SSISMAESTEEIKAWRTKSSLNAKKLLHKRKGIPHRSPLK 231

BLAST of ClCG01G006660 vs. TAIR 10
Match: AT2G01818.1 (PLATZ transcription factor family protein)

HSP 1 Score: 204.9 bits (520), Expect = 6.8e-53
Identity = 108/221 (48.87%), Positives = 147/221 (66.52%), Query Frame = 0

Query: 12  KKRKAD--WLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHC----LISHCVHR 71
           +KR+++  W+ +LL S+FFG C++H+  RKNEKNVFCIDC + ICRHC      SH +HR
Sbjct: 5   EKRRSEEVWIETLLNSEFFGICMNHKYLRKNEKNVFCIDCNVEICRHCCNTVTDSHFLHR 64

Query: 72  RLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGGT 131
           RLQICKYVYQ V+R+ ++QN+ DC  IQTYKINGEKA+HL+ RPQ+KD++PSTK K G +
Sbjct: 65  RLQICKYVYQDVIRLLEIQNYFDCSEIQTYKINGEKAIHLNSRPQAKDARPSTKAKNGAS 124

Query: 132 CEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTSE 191
           C  C RYIQD PN FCSI+CK+S  P + +   F     +S L+    KE+   E +  E
Sbjct: 125 CVTCKRYIQDHPNLFCSISCKIS-TPSKKHKFCFSPKLEQSVLE----KEHSTQEGSLEE 184

Query: 192 MESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSP 227
            +S + S+ + +E+ +         P   + KRKGI  RSP
Sbjct: 185 KKSCTSSLTDVSEDSEVLLSDFSFRPLLRILKRKGISRRSP 220

BLAST of ClCG01G006660 vs. TAIR 10
Match: AT3G60670.1 (PLATZ transcription factor family protein)

HSP 1 Score: 128.6 bits (322), Expect = 6.2e-30
Identity = 63/135 (46.67%), Positives = 84/135 (62.22%), Query Frame = 0

Query: 18  WLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVHRRLQICKYVYQYV 77
           WL  LL+ KFF +C+ H++++KNEKN+ CIDC + IC HCL SH  HR LQI +YVY+ V
Sbjct: 9   WLEVLLKDKFFNACLDHEDDKKNEKNILCIDCCLTICPHCLSSHTSHRLLQIRRYVYRDV 68

Query: 78  VRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGGTCEACGRYIQDLP 137
           +RV D    +DC  IQ Y  N  K V ++ RPQS+  + S     G  C  C R +Q  P
Sbjct: 69  LRVEDGSKLMDCSLIQPYTTNSSKVVFINERPQSRQFRGS-----GNICITCDRSLQS-P 128

Query: 138 NRFCSIACKVSMVPM 153
             FC ++CK+S V M
Sbjct: 129 YLFCCLSCKISDVIM 137

BLAST of ClCG01G006660 vs. TAIR 10
Match: AT2G12646.1 (PLATZ transcription factor family protein)

HSP 1 Score: 126.3 bits (316), Expect = 3.1e-29
Identity = 60/137 (43.80%), Positives = 83/137 (60.58%), Query Frame = 0

Query: 14  RKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVHRRLQICKYV 73
           +K  WL +L   KFF  C +H+  +KNE+NV C+DC  ++C HC+ SH  HR LQ+ +YV
Sbjct: 4   QKPAWLDALYAEKFFVGCPYHETAKKNERNVCCLDCCTSLCPHCVPSHRFHRLLQVRRYV 63

Query: 74  YQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGGTCEACGRYI 133
           Y  VVR+ DLQ  +DC N+Q Y IN  K V +  RPQ++  K +     G  C +C R +
Sbjct: 64  YHDVVRLEDLQKLIDCSNVQAYTINSAKVVFIKKRPQNRQFKGA-----GNYCTSCDRSL 123

Query: 134 QDLPNRFCSIACKVSMV 151
           Q+ P   CS+ CKV  V
Sbjct: 124 QE-PYIHCSLGCKVDFV 134

BLAST of ClCG01G006660 vs. TAIR 10
Match: AT1G31040.1 (PLATZ transcription factor family protein)

HSP 1 Score: 123.2 bits (308), Expect = 2.6e-28
Identity = 78/231 (33.77%), Positives = 118/231 (51.08%), Query Frame = 0

Query: 15  KADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLISHCVHRRLQICKYVY 74
           K  WL  L+   FF SC  H+  RK+EKNVFC+ C +++C HCL SH  H  LQ+ +YVY
Sbjct: 18  KPAWLEGLMAETFFSSCGIHETRRKSEKNVFCLLCCLSVCPHCLPSHRSHPLLQVRRYVY 77

Query: 75  QYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGGTCEACGRYIQ 134
             VVR+ DL+  +DC  +Q Y ING K + L+ R QS+    S        C  C R +Q
Sbjct: 78  HDVVRLSDLEKLIDCSYVQPYTINGAKVIFLNQRQQSRAKVSS------NVCFTCDRILQ 137

Query: 135 DLPNRFCSIACKVSMVPM---ELNNQSFRCIDSESKLKDIPWKENHNL-ETNTSEMESSS 194
           + P  FCS++CKV  +     +L++  +R  +S+   + +    +  L E +T E     
Sbjct: 138 E-PFHFCSLSCKVDYLSYQGDDLSSILYRIDESDFTFEGLRMDGHDQLGEISTMEDGEDI 197

Query: 195 ISMAESTEEIQAWRVKTVLNPKK---------------LLHKRKGIPHRSP 227
           + +++ +E+      K     KK               L ++RKG PHR+P
Sbjct: 198 LVISDESEQGNNSHKKEKKKSKKKKPESNYLPGMVLSSLGNRRKGAPHRAP 241

BLAST of ClCG01G006660 vs. TAIR 10
Match: AT4G17900.1 (PLATZ transcription factor family protein)

HSP 1 Score: 115.9 bits (289), Expect = 4.2e-26
Identity = 76/221 (34.39%), Positives = 116/221 (52.49%), Query Frame = 0

Query: 12  KKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDC-GIAICRHCLISHCVHRRLQIC 71
           + R   WL  LL+ +FF  C  H ++ K+E N++C+DC    +C  CL  H  HR +QI 
Sbjct: 30  ENRWPPWLKPLLKEQFFVHCKFHGDSHKSECNMYCLDCTNGPLCSLCLAHHKDHRTIQIR 89

Query: 72  KYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKLKFGGTCEACG 131
           +  Y  V+RV ++Q +LD   IQTY IN  K V L+ RPQ +  K  T      TC+ C 
Sbjct: 90  RSSYHDVIRVNEIQKYLDIGGIQTYVINSAKVVFLNERPQPRPGKGVT-----NTCKVCY 149

Query: 132 RYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLETNTSEMESSS 191
           R + D   RFCS+ CK++       ++ F              +EN  +ET  S   SSS
Sbjct: 150 RSLVDDSFRFCSLGCKIAG-----TSRGFE-----------KGRENLLMETEDS---SSS 209

Query: 192 ISMAESTEEIQAWRVK----TVLNPKKLLHKRKGIPHRSPL 228
           I++ ++   +Q++       T  +  +++ +RKGIPHRSP+
Sbjct: 210 IAIGKNITNLQSFSPSTPPLTTSSNCRIVKRRKGIPHRSPM 226

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_008467074.1  1.9e-121  94.20         PREDICTED: uncharacterized protein LOC103504510 [Cucumis melo]
XP_004142823.1  5.5e-121  94.20         uncharacterized protein LOC101210438 [Cucumis sativus] >KGN51458.1 hypothetical ...
XP_038875225.1  1.2e-120  94.20         uncharacterized protein LOC120067735 [Benincasa hispida]
XP_022963423.1  8.2e-117  91.52         uncharacterized protein LOC111463631 [Cucurbita moschata] >XP_023545157.1 unchar...
XP_022990994.1  6.9e-116  91.07         uncharacterized protein LOC111487719 [Cucurbita maxima]

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q1G3Q4          4.3e-28   43.80         Protein RGF1 INDUCIBLE TRANSCRIPTION FACTOR 1 OS=Arabidopsis thaliana OX=3702 GN...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A1S3CSV3      9.1e-122  94.20         uncharacterized protein LOC103504510 OS=Cucumis melo OX=3656 GN=LOC103504510 PE=...
A0A0A0KPD0      2.7e-121  94.20         B box-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G550220 ...
A0A6J1HHR7      4.0e-117  91.52         uncharacterized protein LOC111463631 OS=Cucurbita moschata OX=3662 GN=LOC1114636...
A0A6J1JTJ5      3.4e-116  91.07         uncharacterized protein LOC111487719 OS=Cucurbita maxima OX=3661 GN=LOC111487719...
A0A6J1CY39      2.9e-112  87.28         uncharacterized protein LOC111015643 isoform X2 OS=Momordica charantia OX=3673 G...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT2G01818.1     6.8e-53   48.87         PLATZ transcription factor family protein
AT3G60670.1     6.2e-30   46.67         PLATZ transcription factor family protein
AT2G12646.1     3.1e-29   43.80         PLATZ transcription factor family protein
AT1G31040.1     2.6e-28   33.77         PLATZ transcription factor family protein
AT4G17900.1     4.2e-26   34.39         PLATZ transcription factor family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description             Source   Source Term     Source Description                         Alignment
IPR006734  PLATZ transcription factor  PFAM     PF04640         PLATZ                                      coord: 71..147; e-value: 5.3E-24; score: 84.7
None       No IPR available            PANTHER  PTHR31065:SF41  PLATZ TRANSCRIPTION FACTOR FAMILY PROTEIN  coord: 6..227
None       No IPR available            PANTHER  PTHR31065       PLATZ TRANSCRIPTION FACTOR FAMILY PROTEIN  coord: 6..227
IPR000315  B-box-type zinc finger      PROSITE  PS50119         ZF_BBOX                                    coord: 31..69; score: 9.015373
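The `coord` values locate each annotated domain on the protein sequence. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for InterPro-style output, though not stated on this page), the snippet below extracts the B-box and PLATZ regions from the protein sequence given earlier. The function name `region` is hypothetical.

```python
# Protein sequence of ClCG01G006660.1, copied from the Sequences section.
PROTEIN = (
    "MGANAGCKIYLKKRKADWLSSLLQSKFFGSCVHHQNNRKNEKNVFCIDCGIAICRHCLIS"
    "HCVHRRLQICKYVYQYVVRVPDLQNHLDCCNIQTYKINGEKAVHLSPRPQSKDSKPSTKL"
    "KFGGTCEACGRYIQDLPNRFCSIACKVSMVPMELNNQSFRCIDSESKLKDIPWKENHNLE"
    "TNTSEMESSSISMAESTEEIQAWRVKTVLNPKKLLHKRKGIPHRSPLK"
)


def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


bbox = region(PROTEIN, 31, 69)    # PROSITE PS50119 (ZF_BBOX)
platz = region(PROTEIN, 71, 147)  # PFAM PF04640 (PLATZ)
print(len(bbox), bbox)
print(len(platz), platz)
```

The B-box region is 39 residues long and the PLATZ region 77; both are rich in cysteine and histidine, in line with the zinc ion binding GO term assigned to this gene.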

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G006660.1  ClCG01G006660.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0008270      zinc ion binding