Clc01G04070 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G04070
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: B box-type domain-containing protein
Location: ClcChr01: 3894735 .. 3896742 (+)
RNA-Seq Expression: Clc01G04070
Synteny: Clc01G04070
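
The Location field packs chromosome, coordinates, and strand into one string. A minimal sketch of unpacking it follows; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'ClcChr01: 3894735 .. 3896742 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("ClcChr01: 3894735 .. 3896742 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2008 bp on the forward strand of ClcChr01.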
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACATTGATGCCCTTCAACCTCATCATCAATCTCTGCTTCCAATTAGTACTTTTTTACAGCATCATCTTTGTTTCACATTCTCTGTTCCTTCTACAACGACCGTACATACACAAATTACAACAAATGGTATATTTCTTTTTTAAATACCATTTTCTTTTGTGGGTTTAATTTAAAAGTTGTTTATATAAGTGGGTTTGTTTTTGTGTTAATTTTACACAGGTGAGTATGGTGAGAGAAGGTGAGAATAATTCAGTTCCCCAATGGCTTGAAATTCTTTTGGGAGAGAAGTTCTTTACTCCATGTTCAATTCACAAGTCTTGTAAGAAGAATGACAAGACTTTCTTTTGCTTGTTCTGCCGCTCCGCCATTTGCTTCTCTTGCTTCTCTTCTCATCGCACTCATGCTCTTCTTCAGGTATTGCCGTAACTTATCAGTTTAAGTTTTTTTTCGTACGAGACTAATATTTAGAATGTATGCATCTTGTTCTAATCACTCGCAAACACAAAAATTTTAAAAATAAGTAAAGAGAGTTTCTCACGTGACAAACTAAACTGAACAATTGAAAAGGATGCACAAATATGAATGTAGCCTAGAGTTTAATTATTTACGTATAAATGATACCCTAACGAGAAGTTTTATATTAATTTTGTCTCTCTGTATATGTCATTTTGAGTCTAATTGACATCCGTGTAATTACTTGTGAGATGTTCTTTCTTTTTTTTTTCCCTAATTTTCAAGTCAACATGATCTCAATTAGTTAACTAAGCTATGTGTCATTGTCAAAAAGTCATAAATTTGAATTCTCCACCTTTCATTTATATTGTAAAACTCAAAATTAATACTTAGGGGCAATTGTAGTATGACAGTCAAACTCAAAATATTAGCGGATATAGCACAATGAGAAAGATGGTAGGAGTTTATCAATAATAGAGACTACATTGCTTGTATGAGTTTATACCAATAAATTTTGCTATATTTGCAAAATTCTTTAATGATTGTGTTTTACACTTAATTATTATCCTAAAAGTACTACCATTGCAATTATATCCTAATAATAATAAAATTAGAATGCTTAAATTTAGGCTAAGTTTTGAATTTTTTTTTCCTTTCATTCTGGTGGTTGACAGATAAGAAGATATGTGTACCATGAGGTTATAAGGCTAGGAGATGCTGAGAAGCTGCTGAATTGTTCTCTTGTACAAGTAATTAATAATTATGGTTTGGCATGAAAATTAATAATTAGCGTTACATAAAATGTATTTACATCATATTTAATTGAACACAGCCATACACAACAAACAGAGCAAAAGTGGTGTTTCTAAAACATAGAAGGGGAAAGAGAAGAGGAGGATCGGGTAATCCGTGCATCACTTGTTTTCGCAACCTTCAATATCCATATCTCTTTTGCTCTCTTTCATGCAAGGTATCTTTCTTTCTTATTCCCTTTTAACATTACTCTCCACTTTTCCAAACTACAACTAATTAAATAATTATTACTATACTTATACTTATAATTTTTTTAGTCAACCAATAATTTAATATAATATGAAAGCAGATACTCCAAGCATTCACGCATTCAAAGATTTAAGTTGTAATTTCATTATAGTTATTCCCAAAATATTAATATCATAGATACAAGGGTGAAGTTTCCACTTTCTTACCAATAATTTAGGGTTAATGTTTAATTATTGTTTTTTTTTTTTTTTTTTTAATCAATCAAAAGTTGAATGGCAAAATTGAAATAATCAACAAACAAAAAAGGAATTATGATCATCTTCCGCCGCGAAGCACAACACAAACCCAAACCCCAACCTCCGTTCTGGACCTCGATTACCCGTCCGCCGTCTCCAAAACCGCCGCCGCCGTCAATTTAGTCAAAAAGAACCGTAGCGGTGTGAAGTCGTTAGCGGCGGTTCTATGCCGGCCGAGATGCTTTCCCATCTCCGATATCGCCACCGCCGTGAACCGCCGAAAGGGCGTTCCTCAAAGATCGCCGT
TAACTTGA

mRNA sequence

ATGACATTGATGCCCTTCAACCTCATCATCAATCTCTGCTTCCAATTAGTACTTTTTTACAGCATCATCTTTGTTTCACATTCTCTGTTCCTTCTACAACGACCGTACATACACAAATTACAACAAATGGTGAGTATGGTGAGAGAAGGTGAGAATAATTCAGTTCCCCAATGGCTTGAAATTCTTTTGGGAGAGAAGTTCTTTACTCCATGTTCAATTCACAAGTCTTGTAAGAAGAATGACAAGACTTTCTTTTGCTTGTTCTGCCGCTCCGCCATTTGCTTCTCTTGCTTCTCTTCTCATCGCACTCATGCTCTTCTTCAGATAAGAAGATATGTGTACCATGAGGTTATAAGGCTAGGAGATGCTGAGAAGCTGCTGAATTGTTCTCTTGTACAACCATACACAACAAACAGAGCAAAAGTGGTGTTTCTAAAACATAGAAGGGGAAAGAGAAGAGGAGGATCGGGTAATCCGTGCATCACTTGTTTTCGCAACCTTCAATATCCATATCTCTTTTGCTCTCTTTCATGCAAGTTGAATGGCAAAATTGAAATAATCAACAAACAAAAAAGGAATTATGATCATCTTCCGCCGCGAAGCACAACACAAACCCAAACCCCAACCTCCGTTCTGGACCTCGATTACCCGTCCGCCGTCTCCAAAACCGCCGCCGCCGTCAATTTAGTCAAAAAGAACCGTAGCGGTGTGAAGTCGTTAGCGGCGGTTCTATGCCGGCCGAGATGCTTTCCCATCTCCGATATCGCCACCGCCGTGAACCGCCGAAAGGGCGTTCCTCAAAGATCGCCGTTAACTTGA

Coding sequence (CDS)

ATGACATTGATGCCCTTCAACCTCATCATCAATCTCTGCTTCCAATTAGTACTTTTTTACAGCATCATCTTTGTTTCACATTCTCTGTTCCTTCTACAACGACCGTACATACACAAATTACAACAAATGGTGAGTATGGTGAGAGAAGGTGAGAATAATTCAGTTCCCCAATGGCTTGAAATTCTTTTGGGAGAGAAGTTCTTTACTCCATGTTCAATTCACAAGTCTTGTAAGAAGAATGACAAGACTTTCTTTTGCTTGTTCTGCCGCTCCGCCATTTGCTTCTCTTGCTTCTCTTCTCATCGCACTCATGCTCTTCTTCAGATAAGAAGATATGTGTACCATGAGGTTATAAGGCTAGGAGATGCTGAGAAGCTGCTGAATTGTTCTCTTGTACAACCATACACAACAAACAGAGCAAAAGTGGTGTTTCTAAAACATAGAAGGGGAAAGAGAAGAGGAGGATCGGGTAATCCGTGCATCACTTGTTTTCGCAACCTTCAATATCCATATCTCTTTTGCTCTCTTTCATGCAAGTTGAATGGCAAAATTGAAATAATCAACAAACAAAAAAGGAATTATGATCATCTTCCGCCGCGAAGCACAACACAAACCCAAACCCCAACCTCCGTTCTGGACCTCGATTACCCGTCCGCCGTCTCCAAAACCGCCGCCGCCGTCAATTTAGTCAAAAAGAACCGTAGCGGTGTGAAGTCGTTAGCGGCGGTTCTATGCCGGCCGAGATGCTTTCCCATCTCCGATATCGCCACCGCCGTGAACCGCCGAAAGGGCGTTCCTCAAAGATCGCCGTTAACTTGA

Protein sequence

MTLMPFNLIINLCFQLVLFYSIIFVSHSLFLLQRPYIHKLQQMVSMVREGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHALLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRRGGSGNPCITCFRNLQYPYLFCSLSCKLNGKIEIINKQKRNYDHLPPRSTTQTQTPTSVLDLDYPSAVSKTAAAVNLVKKNRSGVKSLAAVLCRPRCFPISDIATAVNRRKGVPQRSPLT
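
The protein sequence should follow from the CDS by codon-wise translation. The sketch below checks that on two short fragments copied from the CDS above (its first 21 and last 18 bases). It assumes the standard nuclear genetic code; the helper name `translate` is hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGACATTGATGCCCTTCAAC"))  # start of the CDS
print(translate("AGATCGCCGTTAACTTGA"))     # end of the CDS, including the stop codon
```

The two fragments translate to MTLMPFN and RSPLT, matching the first and last residues of the protein sequence listed above.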
Homology
BLAST of Clc01G04070 vs. NCBI nr
Match: XP_038876089.1 (uncharacterized protein LOC120068405 [Benincasa hispida])

HSP 1 Score: 389.8 bits (1000), Expect = 1.9e-104
Identity = 196/235 (83.40%), Positives = 209/235 (88.94%), Query Frame = 0

Query: 46  MVREG-ENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTH 105
           MVR+  ENNSVP+WLEILL EKFFTPCS+HKSCKKNDKTFFCL C SAICFSCFSSH TH
Sbjct: 1   MVRKADENNSVPKWLEILLVEKFFTPCSLHKSCKKNDKTFFCLSCCSAICFSCFSSHPTH 60

Query: 106 ALLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKR---RGGSGNPCI 165
             LQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLK RR KR   RGGSGN CI
Sbjct: 61  TFLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKQRRAKRRGLRGGSGNLCI 120

Query: 166 TCFRNLQYPYLFCSLSCKL----NGKIEIINKQKRNYDHLPPRSTTQTQTPTSVLDLDYP 225
           TC R+LQYPYLFCSLSCK+    N KIEI+ KQ+RNYD+LP RSTT+ QTPTSVL+LD+P
Sbjct: 121 TCLRSLQYPYLFCSLSCKMNQKVNDKIEIMKKQQRNYDYLPTRSTTENQTPTSVLELDFP 180

Query: 226 SAVSKTAAAVNLVKKNRSGVKSLAAVLCRPRCFPISDIATAVNRRKGVPQRSPLT 273
           SAVS+TAAA+ LVKKNRS VKSLAAVLCRPRCFPISDI TAVNRRKGVPQRSPLT
Sbjct: 181 SAVSETAAAIKLVKKNRSCVKSLAAVLCRPRCFPISDIGTAVNRRKGVPQRSPLT 235

BLAST of Clc01G04070 vs. NCBI nr
Match: XP_008437715.1 (PREDICTED: uncharacterized protein LOC103483060 [Cucumis melo])

HSP 1 Score: 372.5 bits (955), Expect = 3.2e-99
Identity = 194/245 (79.18%), Positives = 207/245 (84.49%), Query Frame = 0

Query: 46  MVREGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHA 105
           MVRE ENNSVP+W+EILLGEKFFTPCS+H SCKKNDKTFFCLFCRSAICFSCFSSHRTHA
Sbjct: 1   MVREVENNSVPEWVEILLGEKFFTPCSLHISCKKNDKTFFCLFCRSAICFSCFSSHRTHA 60

Query: 106 LLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLK-HRRGKRRG---------- 165
           LLQIRRYVYHEVIRL DAEKL+NCSLVQPYTTNRAKVVFLK  RRGKRRG          
Sbjct: 61  LLQIRRYVYHEVIRLRDAEKLMNCSLVQPYTTNRAKVVFLKERRRGKRRGLRSSSGGSRS 120

Query: 166 --GSGNPCITCFRNLQYPYLFCSLSCKL----NGKIEIINKQKRNYDHLPPRSTTQTQTP 225
              + N CITCFRNLQYPYLFCSLSCK+    N KIEII+KQKRNY++LPPR+TT+TQTP
Sbjct: 121 NSNNTNLCITCFRNLQYPYLFCSLSCKINQKVNEKIEIISKQKRNYENLPPRTTTETQTP 180

Query: 226 TSVLDLDYPSAVSKTAAAVNLVKK--NRSGVKSLAAVLCRPRCFPISDIATAVNRRKGVP 272
           +SVLD D+ SA    AAAV  VKK  NRS VKSLAAVLCRPRCFPISD ATAVNRRKGVP
Sbjct: 181 SSVLDRDFSSA----AAAVKSVKKKNNRSCVKSLAAVLCRPRCFPISDFATAVNRRKGVP 240

BLAST of Clc01G04070 vs. NCBI nr
Match: XP_011654791.1 (uncharacterized protein LOC101221644 [Cucumis sativus] >KGN50223.1 hypothetical protein Csa_000124 [Cucumis sativus])

HSP 1 Score: 372.5 bits (955), Expect = 3.2e-99
Identity = 194/252 (76.98%), Positives = 206/252 (81.75%), Query Frame = 0

Query: 46  MVREGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHA 105
           MVRE ENNSVP+W+EILLGEKFFTPCS+H SCKKNDKTFFCLFCRSAICFSCFSSHRTHA
Sbjct: 1   MVREVENNSVPEWVEILLGEKFFTPCSLHISCKKNDKTFFCLFCRSAICFSCFSSHRTHA 60

Query: 106 LLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLK-HRRGKRRG---------- 165
           LLQIRRYVYHEV+ LGDAEKL+NCSLVQPYTTNRAKVVFLK  RRGKRRG          
Sbjct: 61  LLQIRRYVYHEVVLLGDAEKLMNCSLVQPYTTNRAKVVFLKERRRGKRRGLRSSSSSSSS 120

Query: 166 ---------GSGNPCITCFRNLQYPYLFCSLSCKL----NGKIEIINKQKRNYDHLPPRS 225
                     +GN CITCFRNLQYPYLFCSLSCK+    N KIEIINKQKR Y++LPPR+
Sbjct: 121 GGGGWRSNNNNGNLCITCFRNLQYPYLFCSLSCKINQKVNEKIEIINKQKRKYENLPPRT 180

Query: 226 TTQTQTPTSVLDLDYPSAVSKTAAAVNLVKK-NRSGVKSLAAVLCRPRCFPISDIATAVN 273
           TT+ QTPTSVLD D+ S     AAAV L KK NRS VKSLAAVLCRPRCFPIS  ATAVN
Sbjct: 181 TTENQTPTSVLDRDFSS-----AAAVKLAKKNNRSCVKSLAAVLCRPRCFPISGFATAVN 240

BLAST of Clc01G04070 vs. NCBI nr
Match: KAG6579424.1 (hypothetical protein SDJN03_23872, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 323.9 bits (829), Expect = 1.3e-84
Identity = 166/235 (70.64%), Positives = 183/235 (77.87%), Query Frame = 0

Query: 43  MVSMVREGENNSVPQWLEILLGEKFFTPCSIHKS-CKKNDKTFFCLFCRSAICFSCFSSH 102
           MV +V E  ++S+PQWL ILLGEKFFTPC +H S  K+N+KTFFCL C SAICFSCFSSH
Sbjct: 4   MVGVVGEDNSSSLPQWLPILLGEKFFTPCLLHSSNSKRNEKTFFCLRCCSAICFSCFSSH 63

Query: 103 RTHALLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRR----GGSG 162
           R+HALLQIRRYVYHEVIRLGDA+KLLNCSLVQPYTTN AKV+FL+HRRGKRR    GGSG
Sbjct: 64  RSHALLQIRRYVYHEVIRLGDAQKLLNCSLVQPYTTNSAKVIFLQHRRGKRRALRGGGSG 123

Query: 163 NPCITCFRNLQYPYLFCSLSCKLNGKIEIINKQKRNYDHLPPRSTTQTQTPTSVLDLDYP 222
           N CITCFR LQ PYLFCSLSCK+N K EI NK+  N DHLPP ST              P
Sbjct: 124 NLCITCFRTLQCPYLFCSLSCKINHKTEITNKKHNNSDHLPPPST--------------P 183

Query: 223 SAVSKTAAAVNLVKKNRSGVKSLAAVLCRPRCFP-ISDIATAVNRRKGVPQRSPL 272
              + T AA+ LVKK RS VKSLA VLC+PRCFP +SDI T VNRRKGVPQRSPL
Sbjct: 184 EETAATPAAIKLVKKKRSCVKSLATVLCQPRCFPVVSDITTVVNRRKGVPQRSPL 224

BLAST of Clc01G04070 vs. NCBI nr
Match: KAA0039689.1 (putative PLATZ transcription factor family protein [Cucumis melo var. makuwa])

HSP 1 Score: 241.5 bits (615), Expect = 8.4e-60
Identity = 139/232 (59.91%), Positives = 158/232 (68.10%), Query Frame = 0

Query: 46  MVREGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHA 105
           MVRE ENNSVP+W+EILLGEKFFTPCS+H SCKKNDKTFFCLFCRSAICFSCFSSHRTHA
Sbjct: 1   MVREVENNSVPEWVEILLGEKFFTPCSLHISCKKNDKTFFCLFCRSAICFSCFSSHRTHA 60

Query: 106 LLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRRGGSGNPCITCFR 165
           LLQI   VY   I      K+    +   Y  +        H             +  + 
Sbjct: 61  LLQILIKVYKCFI-----SKINFFIIFYKYFQHVHVCFHFDHLYPNN--------MIFYE 120

Query: 166 NLQYPYLFCS----LSCKLNGKIEIINKQKRNYDHLPPRSTTQTQTPTSVLDLDYPSAVS 225
                +  C     ++ K+N KIEII+KQKRNY++LPPR+TT+TQTP+SVLD D+ SA  
Sbjct: 121 RKILKFKLCMETIVINQKVNEKIEIISKQKRNYENLPPRTTTETQTPSSVLDRDFSSA-- 180

Query: 226 KTAAAVNLVKK--NRSGVKSLAAVLCRPRCFPISDIATAVNRRKGVPQRSPL 272
             AAAV  VKK  NRS VKSLAAVLCRPRCFPISD ATAVNRRKGVP RSPL
Sbjct: 181 --AAAVKSVKKKNNRSCVKSLAAVLCRPRCFPISDFATAVNRRKGVPHRSPL 215

BLAST of Clc01G04070 vs. ExPASy Swiss-Prot
Match: Q1G3Q4 (Protein RGF1 INDUCIBLE TRANSCRIPTION FACTOR 1 OS=Arabidopsis thaliana OX=3702 GN=RITF1 PE=1 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 9.4e-38
Identity = 98/254 (38.58%), Positives = 140/254 (55.12%), Query Frame = 0

Query: 56  PQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHALLQIRRYVYH 115
           P WL+ L  EKFF  C  H++ KKN++   CL C +++C  C  SHR H LLQ+RRYVYH
Sbjct: 6   PAWLDALYAEKFFVGCPYHETAKKNERNVCCLDCCTSLCPHCVPSHRFHRLLQVRRYVYH 65

Query: 116 EVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRR-GGSGNPCITCFRNLQYPYLFC 175
           +V+RL D +KL++CS VQ YT N AKVVF+K R   R+  G+GN C +C R+LQ PY+ C
Sbjct: 66  DVVRLEDLQKLIDCSNVQAYTINSAKVVFIKKRPQNRQFKGAGNYCTSCDRSLQEPYIHC 125

Query: 176 SLSCKLNGKIEIINKQKRNY--------------DHLPPR------STTQTQTPTS-VLD 235
           SL C    K++ + K+ R+               D++ P+           +TP S V+D
Sbjct: 126 SLGC----KVDFVMKRYRDITPFLKPCHTLTLGPDYIIPQDLLTDDEVAAYETPRSTVVD 185

Query: 236 LDYP--------------SAVSKTAAAVNLVKKNRSG--VKSLAAVLCRPRCFPISDIAT 272
            D                +A +  A   ++V+K R+G  + + +A   +       DI+ 
Sbjct: 186 GDESMSWSSASSDNNNAGAAAAYAATTTHVVRKKRTGFCLCAKSANSYKEVSEDPDDISA 245

BLAST of Clc01G04070 vs. ExPASy TrEMBL
Match: A0A0A0KKT3 (B box-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G160750 PE=4 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 1.5e-99
Identity = 194/252 (76.98%), Positives = 206/252 (81.75%), Query Frame = 0

Query: 46  MVREGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHA 105
           MVRE ENNSVP+W+EILLGEKFFTPCS+H SCKKNDKTFFCLFCRSAICFSCFSSHRTHA
Sbjct: 1   MVREVENNSVPEWVEILLGEKFFTPCSLHISCKKNDKTFFCLFCRSAICFSCFSSHRTHA 60

Query: 106 LLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLK-HRRGKRRG---------- 165
           LLQIRRYVYHEV+ LGDAEKL+NCSLVQPYTTNRAKVVFLK  RRGKRRG          
Sbjct: 61  LLQIRRYVYHEVVLLGDAEKLMNCSLVQPYTTNRAKVVFLKERRRGKRRGLRSSSSSSSS 120

Query: 166 ---------GSGNPCITCFRNLQYPYLFCSLSCKL----NGKIEIINKQKRNYDHLPPRS 225
                     +GN CITCFRNLQYPYLFCSLSCK+    N KIEIINKQKR Y++LPPR+
Sbjct: 121 GGGGWRSNNNNGNLCITCFRNLQYPYLFCSLSCKINQKVNEKIEIINKQKRKYENLPPRT 180

Query: 226 TTQTQTPTSVLDLDYPSAVSKTAAAVNLVKK-NRSGVKSLAAVLCRPRCFPISDIATAVN 273
           TT+ QTPTSVLD D+ S     AAAV L KK NRS VKSLAAVLCRPRCFPIS  ATAVN
Sbjct: 181 TTENQTPTSVLDRDFSS-----AAAVKLAKKNNRSCVKSLAAVLCRPRCFPISGFATAVN 240

BLAST of Clc01G04070 vs. ExPASy TrEMBL
Match: A0A1S3AUS9 (uncharacterized protein LOC103483060 OS=Cucumis melo OX=3656 GN=LOC103483060 PE=4 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 1.5e-99
Identity = 194/245 (79.18%), Positives = 207/245 (84.49%), Query Frame = 0

Query: 46  MVREGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHA 105
           MVRE ENNSVP+W+EILLGEKFFTPCS+H SCKKNDKTFFCLFCRSAICFSCFSSHRTHA
Sbjct: 1   MVREVENNSVPEWVEILLGEKFFTPCSLHISCKKNDKTFFCLFCRSAICFSCFSSHRTHA 60

Query: 106 LLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLK-HRRGKRRG---------- 165
           LLQIRRYVYHEVIRL DAEKL+NCSLVQPYTTNRAKVVFLK  RRGKRRG          
Sbjct: 61  LLQIRRYVYHEVIRLRDAEKLMNCSLVQPYTTNRAKVVFLKERRRGKRRGLRSSSGGSRS 120

Query: 166 --GSGNPCITCFRNLQYPYLFCSLSCKL----NGKIEIINKQKRNYDHLPPRSTTQTQTP 225
              + N CITCFRNLQYPYLFCSLSCK+    N KIEII+KQKRNY++LPPR+TT+TQTP
Sbjct: 121 NSNNTNLCITCFRNLQYPYLFCSLSCKINQKVNEKIEIISKQKRNYENLPPRTTTETQTP 180

Query: 226 TSVLDLDYPSAVSKTAAAVNLVKK--NRSGVKSLAAVLCRPRCFPISDIATAVNRRKGVP 272
           +SVLD D+ SA    AAAV  VKK  NRS VKSLAAVLCRPRCFPISD ATAVNRRKGVP
Sbjct: 181 SSVLDRDFSSA----AAAVKSVKKKNNRSCVKSLAAVLCRPRCFPISDFATAVNRRKGVP 240

BLAST of Clc01G04070 vs. ExPASy TrEMBL
Match: A0A5A7TEM7 (Putative PLATZ transcription factor family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold558G00200 PE=4 SV=1)

HSP 1 Score: 241.5 bits (615), Expect = 4.1e-60
Identity = 139/232 (59.91%), Positives = 158/232 (68.10%), Query Frame = 0

Query: 46  MVREGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHA 105
           MVRE ENNSVP+W+EILLGEKFFTPCS+H SCKKNDKTFFCLFCRSAICFSCFSSHRTHA
Sbjct: 1   MVREVENNSVPEWVEILLGEKFFTPCSLHISCKKNDKTFFCLFCRSAICFSCFSSHRTHA 60

Query: 106 LLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRRGGSGNPCITCFR 165
           LLQI   VY   I      K+    +   Y  +        H             +  + 
Sbjct: 61  LLQILIKVYKCFI-----SKINFFIIFYKYFQHVHVCFHFDHLYPNN--------MIFYE 120

Query: 166 NLQYPYLFCS----LSCKLNGKIEIINKQKRNYDHLPPRSTTQTQTPTSVLDLDYPSAVS 225
                +  C     ++ K+N KIEII+KQKRNY++LPPR+TT+TQTP+SVLD D+ SA  
Sbjct: 121 RKILKFKLCMETIVINQKVNEKIEIISKQKRNYENLPPRTTTETQTPSSVLDRDFSSA-- 180

Query: 226 KTAAAVNLVKK--NRSGVKSLAAVLCRPRCFPISDIATAVNRRKGVPQRSPL 272
             AAAV  VKK  NRS VKSLAAVLCRPRCFPISD ATAVNRRKGVP RSPL
Sbjct: 181 --AAAVKSVKKKNNRSCVKSLAAVLCRPRCFPISDFATAVNRRKGVPHRSPL 215

BLAST of Clc01G04070 vs. ExPASy TrEMBL
Match: A0A2N9EQ66 (B box-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS4621 PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 4.5e-51
Identity = 120/253 (47.43%), Positives = 150/253 (59.29%), Query Frame = 0

Query: 55  VPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHALLQIRRYVY 114
           +P WLEILL +KFF PC IH+  KKN+K  FCL C  +IC  C  SHR+H LLQIRRYVY
Sbjct: 7   IPHWLEILLRDKFFNPCIIHEFAKKNEKNIFCLDCSISICPHCVPSHRSHRLLQIRRYVY 66

Query: 115 HEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRR-GGSGNPCITCFRNLQYPYLF 174
           H+VIRL DA+KL+NCSLVQPYTTN AKVVFL  R   R   GSGN CI C R+LQ PY F
Sbjct: 67  HDVIRLSDAQKLMNCSLVQPYTTNSAKVVFLNQRPMSRPFRGSGNFCIACDRSLQDPYFF 126

Query: 175 CSLSCKLNGKIEIINK--QKRNYDHLPPRSTTQT-----------QTPTSVLDLDYPSAV 234
           CSLSCK++ ++   N    +     LP ++  ++            TP SVLD     + 
Sbjct: 127 CSLSCKVHQQLMTKNSSGNREFLPILPDKARIESLSVSENEEDGQMTPDSVLDSHVSLSG 186

Query: 235 SKTAAA----------------------VNLVKKNRSGVKSLAAVLCRPRCFPISDIATA 272
           S++ A+                      +  VKK RS + S+  V C+PR  P ++IA A
Sbjct: 187 SRSTASASGCVGGSVSVVSCKTLACTANLEFVKKKRSSLNSVPRVSCQPRYSPAAEIAVA 246

BLAST of Clc01G04070 vs. ExPASy TrEMBL
Match: A0A7N2N1T9 (B box-type domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 4.5e-51
Identity = 122/252 (48.41%), Positives = 150/252 (59.52%), Query Frame = 0

Query: 55  VPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHALLQIRRYVY 114
           +PQWLEILL +KFF PC IH+  KKN+K  FCL C  +IC  C  SHR+H LLQIRRYVY
Sbjct: 16  IPQWLEILLRDKFFNPCIIHEFAKKNEKNIFCLDCSISICPHCVPSHRSHRLLQIRRYVY 75

Query: 115 HEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRR-GGSGNPCITCFRNLQYPYLF 174
           H+VIRL DA+KL+NCSLVQPYTTN AKVVFL  R   R   GSGN CITC R+LQ PY F
Sbjct: 76  HDVIRLSDAQKLMNCSLVQPYTTNSAKVVFLNQRPMSRPFRGSGNFCITCDRSLQDPYYF 135

Query: 175 CSLSCKLNGKIEIINKQKRNYDHLP--------PRSTTQTQ-------TPTSVLDLDYPS 234
           CSL CK++ ++   N    N + LP         RS T ++       TP SVLD     
Sbjct: 136 CSLYCKVHHQLVTAN-SSGNCEFLPILPAKARRDRSLTVSENEEDVQITPDSVLDSGVSL 195

Query: 235 AVSK-------------------TAAAVNLVKKNRSGVKSLAAVLCRPRCFPISDIATAV 272
           + S+                   TA     VKK RS +  +  V  + +C P +++A A+
Sbjct: 196 SGSRSTATASGCGGAVSSKTLACTATTTEFVKKKRSSLNLVPRVSFQMKCSPTAEVAVAL 255

BLAST of Clc01G04070 vs. TAIR 10
Match: AT3G60670.1 (PLATZ transcription factor family protein )

HSP 1 Score: 169.1 bits (427), Expect = 5.0e-42
Identity = 103/253 (40.71%), Positives = 140/253 (55.34%), Query Frame = 0

Query: 51  ENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHALLQIR 110
           E+   P WLE+LL +KFF  C  H+  KKN+K   C+ C   IC  C SSH +H LLQIR
Sbjct: 2   ESGEFPAWLEVLLKDKFFNACLDHEDDKKNEKNILCIDCCLTICPHCLSSHTSHRLLQIR 61

Query: 111 RYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRR-GGSGNPCITCFRNLQY 170
           RYVY +V+R+ D  KL++CSL+QPYTTN +KVVF+  R   R+  GSGN CITC R+LQ 
Sbjct: 62  RYVYRDVLRVEDGSKLMDCSLIQPYTTNSSKVVFINERPQSRQFRGSGNICITCDRSLQS 121

Query: 171 PYLFCSLSCKLNGKIEIINKQK------RNYDHLPPRSTTQTQTPTSVLDLDYPSAVSKT 230
           PYLFC LSCK++   ++I +Q+      R  + L       T TP+S L+   P+  ++T
Sbjct: 122 PYLFCCLSCKIS---DVIMRQRGLSGFLRVCNVLDLTDEVTTTTPSSTLE---PTGSNRT 181

Query: 231 A--------------------AAVNLVKKNRSGVKSLAAVLCRPRCFPISDIATA----- 272
           +                    A   +V+K RS +    +  CR     +S   T      
Sbjct: 182 SSESSGNEGEDMFWCQALACTATTEIVRKKRSSL----STTCRRVTEVVSTTNTEAPVNF 241

BLAST of Clc01G04070 vs. TAIR 10
Match: AT1G31040.1 (PLATZ transcription factor family protein )

HSP 1 Score: 159.1 bits (401), Expect = 5.1e-39
Identity = 95/246 (38.62%), Positives = 133/246 (54.07%), Query Frame = 0

Query: 46  MVREGENN--------SVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSC 105
           MVREGE          + P WLE L+ E FF+ C IH++ +K++K  FCL C  ++C  C
Sbjct: 1   MVREGEEEEEMMMMMATKPAWLEGLMAETFFSSCGIHETRRKSEKNVFCLLCCLSVCPHC 60

Query: 106 FSSHRTHALLQIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRRGGSG 165
             SHR+H LLQ+RRYVYH+V+RL D EKL++CS VQPYT N AKV+FL  R+  R   S 
Sbjct: 61  LPSHRSHPLLQVRRYVYHDVVRLSDLEKLIDCSYVQPYTINGAKVIFLNQRQQSRAKVSS 120

Query: 166 NPCITCFRNLQYPYLFCSLSCK----------LNGKIEIINKQKRNYDHLPPRSTTQTQT 225
           N C TC R LQ P+ FCSLSCK          L+  +  I++    ++ L      Q   
Sbjct: 121 NVCFTCDRILQEPFHFCSLSCKVDYLSYQGDDLSSILYRIDESDFTFEGLRMDGHDQLGE 180

Query: 226 PTSVLDLDYPSAVS-KTAAAVNLVKKNRSGVKSLAAVLCRPRCFPISDIATAVNRRKGVP 273
            +++ D +    +S ++    N  KK +   K            P   +++  NRRKG P
Sbjct: 181 ISTMEDGEDILVISDESEQGNNSHKKEKKKSKKKKP---ESNYLPGMVLSSLGNRRKGAP 240

BLAST of Clc01G04070 vs. TAIR 10
Match: AT2G12646.1 (PLATZ transcription factor family protein )

HSP 1 Score: 158.7 bits (400), Expect = 6.7e-39
Identity = 98/254 (38.58%), Positives = 140/254 (55.12%), Query Frame = 0

Query: 56  PQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSAICFSCFSSHRTHALLQIRRYVYH 115
           P WL+ L  EKFF  C  H++ KKN++   CL C +++C  C  SHR H LLQ+RRYVYH
Sbjct: 6   PAWLDALYAEKFFVGCPYHETAKKNERNVCCLDCCTSLCPHCVPSHRFHRLLQVRRYVYH 65

Query: 116 EVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRR-GGSGNPCITCFRNLQYPYLFC 175
           +V+RL D +KL++CS VQ YT N AKVVF+K R   R+  G+GN C +C R+LQ PY+ C
Sbjct: 66  DVVRLEDLQKLIDCSNVQAYTINSAKVVFIKKRPQNRQFKGAGNYCTSCDRSLQEPYIHC 125

Query: 176 SLSCKLNGKIEIINKQKRNY--------------DHLPPR------STTQTQTPTS-VLD 235
           SL C    K++ + K+ R+               D++ P+           +TP S V+D
Sbjct: 126 SLGC----KVDFVMKRYRDITPFLKPCHTLTLGPDYIIPQDLLTDDEVAAYETPRSTVVD 185

Query: 236 LDYP--------------SAVSKTAAAVNLVKKNRSG--VKSLAAVLCRPRCFPISDIAT 272
            D                +A +  A   ++V+K R+G  + + +A   +       DI+ 
Sbjct: 186 GDESMSWSSASSDNNNAGAAAAYAATTTHVVRKKRTGFCLCAKSANSYKEVSEDPDDISA 245

BLAST of Clc01G04070 vs. TAIR 10
Match: AT1G32700.1 (PLATZ transcription factor family protein )

HSP 1 Score: 124.0 bits (310), Expect = 1.8e-28
Identity = 79/225 (35.11%), Positives = 117/225 (52.00%), Query Frame = 0

Query: 49  EGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCRSA-ICFSCFSSHRTHALL 108
           E  N + P WL+ LL EKFF  C +H    K++   +CL C +  +C  C S H+ H  +
Sbjct: 5   EETNKTYPHWLKPLLREKFFVQCKLHADSHKSECNMYCLDCTNGPLCSLCLSFHKDHHAI 64

Query: 109 QIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRRG-GSGNPCITCFRN 168
           QIRR  YH+VIR+ + +K L+ + VQ Y  N AKVVFL  R   R G G  N C  C+R+
Sbjct: 65  QIRRSSYHDVIRVSEIQKFLDITGVQTYVINSAKVVFLNERPQPRPGKGVINTCEVCYRS 124

Query: 169 LQYPYLFCSLSCKLNGKIEIINKQKRNYDHLPPRSTTQTQTPTSVLDLDYPSAVSKTAAA 228
           L   + FCSL CK++G    I+K+KR             +   ++ D D     S ++ +
Sbjct: 125 LVDSFRFCSLGCKISG----ISKKKRK------------EWTNNLSDSD----DSYSSTS 184

Query: 229 VNLVKKNRSGVKSLAAVLCRPRCFPISDIATAV-NRRKGVPQRSP 271
           +  +KKN   + +       P   P+S +   +  RRKG+P R+P
Sbjct: 185 IGRLKKNDDIMNNSFT----PSTPPLSAVNRRIAKRRKGIPHRAP 205

BLAST of Clc01G04070 vs. TAIR 10
Match: AT1G43000.1 (PLATZ transcription factor family protein )

HSP 1 Score: 117.9 bits (294), Expect = 1.3e-26
Identity = 78/226 (34.51%), Positives = 117/226 (51.77%), Query Frame = 0

Query: 49  EGENNSVPQWLEILLGEKFFTPCSIHKSCKKNDKTFFCLFCR-SAICFSCFSSHRTHALL 108
           E ++   P WL  +L   +F  CSIH    K++   FCL C  +A C SC + HRTH ++
Sbjct: 2   ENDDVMTPPWLTPMLRADYFVTCSIHSQSSKSECNLFCLDCSGNAFCSSCLAHHRTHRVI 61

Query: 109 QIRRYVYHEVIRLGDAEKLLNCSLVQPYTTNRAKVVFLKHRRGKRRGGSGN-PCITCFRN 168
           QIRR  YH V+R+ + +K ++ S +Q Y  N AK+ FL  R   R G S N  C  C RN
Sbjct: 62  QIRRSSYHNVVRVSEIQKHIDISCIQTYVINSAKIFFLNARPQCRTGKSLNKTCQICSRN 121

Query: 169 LQYPYLFCSLSCKLNGKIEIINKQKRNYDHLPPRSTTQTQTPTSVLDLDYPS-AVSKTAA 228
           L   +LFCSL+CKL G   + N +  N   L    + ++   + +++    S  +   + 
Sbjct: 122 LLDSFLFCSLACKLEG---VKNGEDPN---LTLFHSGKSDDSSKIINTGICSRLIDGISI 181

Query: 229 AVNLVKKNRSGVKSLAAVLCRP-RCFPISDIATAVNRRKGVPQRSP 271
           AV+  +   +GV S         R +P+       +RRKG+PQR+P
Sbjct: 182 AVDDQRSETAGVLSPETPSIESHRNYPMK------SRRKGIPQRAP 215
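
Each HSP header above reports identity and positives as a count over the alignment length, followed by a percentage. A minimal sketch of recomputing those percentages from the counts follows, using the first hit (XP_038876089.1) as input. The function name `hsp_percentages` is hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that some BLAST reports emit.

```python
import re

# Matches e.g. "Identity = 196/235 (83.40%), Positives = 209/235 (88.94%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 196/235 (83.40%), Positives = 209/235 (88.94%), Query Frame = 0"
print(hsp_percentages(line))
```

The denominator is the alignment length including gap columns, not the length of either sequence, which is why hits with the same identity count can report different percentages.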

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038876089.1 | 1.9e-104 | 83.40 | uncharacterized protein LOC120068405 [Benincasa hispida]
XP_008437715.1 | 3.2e-99 | 79.18 | PREDICTED: uncharacterized protein LOC103483060 [Cucumis melo]
XP_011654791.1 | 3.2e-99 | 76.98 | uncharacterized protein LOC101221644 [Cucumis sativus] >KGN50223.1 hypothetical ...
KAG6579424.1 | 1.3e-84 | 70.64 | hypothetical protein SDJN03_23872, partial [Cucurbita argyrosperma subsp. sorori...
KAA0039689.1 | 8.4e-60 | 59.91 | putative PLATZ transcription factor family protein [Cucumis melo var. makuwa]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q1G3Q4 | 9.4e-38 | 38.58 | Protein RGF1 INDUCIBLE TRANSCRIPTION FACTOR 1 OS=Arabidopsis thaliana OX=3702 GN...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KKT3 | 1.5e-99 | 76.98 | B box-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G160750 ...
A0A1S3AUS9 | 1.5e-99 | 79.18 | uncharacterized protein LOC103483060 OS=Cucumis melo OX=3656 GN=LOC103483060 PE=...
A0A5A7TEM7 | 4.1e-60 | 59.91 | Putative PLATZ transcription factor family protein OS=Cucumis melo var. makuwa O...
A0A2N9EQ66 | 4.5e-51 | 47.43 | B box-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS462...
A0A7N2N1T9 | 4.5e-51 | 48.41 | B box-type domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1

TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G60670.1 | 5.0e-42 | 40.71 | PLATZ transcription factor family protein
AT1G31040.1 | 5.1e-39 | 38.62 | PLATZ transcription factor family protein
AT2G12646.1 | 6.7e-39 | 38.58 | PLATZ transcription factor family protein
AT1G32700.1 | 1.8e-28 | 35.11 | PLATZ transcription factor family protein
AT1G43000.1 | 1.3e-26 | 34.51 | PLATZ transcription factor family protein
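
Summary rows like these are often screened by E-value and identity before further analysis. The sketch below does that for the five TAIR 10 hits listed above. The cutoffs (`max_evalue=1e-30`, `min_identity=0.0`) and the function name `filter_hits` are illustrative assumptions, not thresholds taken from this record.

```python
# (match name, E-value as printed, identity %) for the TAIR 10 hits in this record.
TAIR_HITS = [
    ("AT3G60670.1", "5.0e-42", 40.71),
    ("AT1G31040.1", "5.1e-39", 38.62),
    ("AT2G12646.1", "6.7e-39", 38.58),
    ("AT1G32700.1", "1.8e-28", 35.11),
    ("AT1G43000.1", "1.3e-26", 34.51),
]

def filter_hits(hits, max_evalue=1e-30, min_identity=0.0):
    """Keep the names of hits at or below the E-value cutoff and at or above the identity cutoff."""
    return [
        name
        for name, evalue, identity in hits
        if float(evalue) <= max_evalue and identity >= min_identity
    ]

print(filter_hits(TAIR_HITS))
print(filter_hits(TAIR_HITS, max_evalue=1e-20, min_identity=35.0))
```

With the default cutoff only the three strongest hits remain; relaxing the E-value cutoff while requiring at least 35% identity keeps four.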
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR006734 | PLATZ transcription factor | PFAM | PF04640 | PLATZ | coord: 110..180; e-value: 4.2E-21; score: 75.4
None | No IPR available | PANTHER | PTHR31065 | PLATZ TRANSCRIPTION FACTOR FAMILY PROTEIN | coord: 53..272
None | No IPR available | PANTHER | PTHR31065:SF52 | BNAA09G38660D PROTEIN | coord: 53..272
IPR000315 | B-box-type zinc finger | PROSITE | PS50119 | ZF_BBOX | coord: 71..120; score: 9.157618
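
The `coord` values locate each domain on the protein sequence. A minimal sketch of turning a coordinate pair into a length and a subsequence follows. It assumes the coordinates are 1-based and inclusive residue positions, which is the usual convention for such annotations but is not stated in this record; the names `feature_span` and `slice_feature` are hypothetical.

```python
def feature_span(coord):
    """Parse 'start..end' into (start, end, length), assuming 1-based inclusive positions."""
    start, end = (int(part) for part in coord.split(".."))
    return start, end, end - start + 1

def slice_feature(protein, coord):
    """Return the residues covered by a 'start..end' coordinate string."""
    start, end, _ = feature_span(coord)
    # Convert the 1-based inclusive range to a 0-based half-open Python slice.
    return protein[start - 1:end]

print(feature_span("110..180"))  # PFAM PLATZ match
print(feature_span("71..120"))   # PROSITE ZF_BBOX match
print(slice_feature("ABCDEFGHIJ", "3..5"))  # toy sequence, not the real protein
```

Under that assumption the PLATZ match covers 71 residues and the B-box match 50, and the two overlap at positions 110 to 120.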

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Clc01G04070.2 | Clc01G04070.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0008270 | zinc ion binding