ClCG04G006910 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG04G006910
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: late embryogenesis abundant protein B19.4
Location: CG_Chr04: 21818940 .. 21822954 (+)
RNA-Seq Expression: ClCG04G006910
Synteny: ClCG04G006910
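
As a quick check on the Location field, the length of the gene span can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF-style annotations; that convention is an assumption rather than something stated in this record, and the function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed coordinate convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: CG_Chr04: 21818940 .. 21822954 (+)
gene_length = span_length(21818940, 21822954)
print(gene_length)
```

If the coordinates were instead 0-based and half-open, the `+ 1` would have to be dropped.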
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAGTTTCTTTAGAAGAATTAGAAGGTCGGGCAACACAAATGGCATCGCAACAGGAAAGATCAGAGCTGGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTTCTTAACTGTCTCTTAATTGTTAATGTACTATGTATTTCTGCTTTAATAACACAATACGATAGAAAACATTTTTAAGTATTTCTTGAGAAAAACTTAGATTTTATCTTTGTCCAACAAAGAAAAAAACTCTAATGTGATAAATCTTTTTTCAAAAAAATGATTTTTTTATAGTTTTGTTAACTCGTTTGTTTCTTGGGTCGTGGCGACATGGAGGTCGGAGTCGGGGTGGGCAGACGAGGAAGGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCTGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCGTTGATCAGAGCTTATGGCTCTGTGTGCATACAAGATGATGATCCAGCTGACGTTGTTTTGAAAGTAGCCGTGTGGCGTGTTCTTGTCGTGTTTTTTTTTTTTTTTTTTTCCTTCATTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTGGCGTGTTCTTGTCGTTTTTTTTTTTTTTTTTTTTTTCCTTCATGTTTATGTATTTCGGTGAGGTTGCTTTGTAATAGTTAATGTCTTTTTAAGTCTGTTTTTTTTTTCCGTTTCTGGCCACATGTACGTCGACACGTCAGGGTTTTGGTTATTGGACTTGAACAAGGACGAACTTTGCTTTATTATGTGAATTTTAGATGTCTTATTTTATTTGCATGGAAAAAGGAAAGGATCTTGGCCAACTTTAATTCGCATTATCATGAACACTCACTCCTATTCTCAATGGGTTAAGATGGAAATTTCATGTTTCTCTTTCCAGATCGAATTTGAATTTCTGCTTAGCTAAGTTCACTACGTGAAGATAAAAGAAACTGGAACAAACTAACTCATAATTGTTTCAATTAAATGTTAGAACTAGTGTGAGTTTGATTTTGAAATTGAAATTTCGGGCGGAGTGTAAATTGATATATATCTAAATATAAATTGATTAGACTATTGGAGATTATGGAAGATTTTTTTAGAGTTTGATCCACGCAAATTAAAAGTTTTATCAACGAAATTAAGGAAATACACAAAAAAAAATTGAAGGAATTTAGAAAATAATTAGAAGTTCATATTTAGGTTAAAGTTAATATTTTTTCTAAAAATATGAAGAATAAAGTACCACGTCAATTTTCATCTTTCATTTAGTGAAAAATAAAGTTCAAATAAGAATTTTGTACGATTTAATTTTAAAAAAAGAAAACTAAATTACAGATGCGCTTCCAACTTATATGTGGTTTTCTTTTTAAAATATTTCAATATTTTGTAAGATATTAAGGAAGAGCTAATGCTTTGTTTGACATTTTACCATGATACCTTACTGGATAAAAACTTGAAGTTGGACTACTATGATGACATCAAAATCCACTCCATTTGATCCCAACTTCTAATTAGATTTAGGTAAACACTTTTGGTCCAATAAAAGTAAAGATTAAATTAAACCATTATCAGCCAAAGGTTTTAAGCCTAAATTGTTATTTCAAATAGAAAGATAAAAACTACTTTT
TAACTGATTTAAAAATAGCTATGAGCCAATAATCTAATGACAAAATTTTAGCATTTTATTCCCACTATAAATTTGTGTTAGTAAATTGAGCACTTAAATATTTATCGTGCTGAATATTTAAGGGTTCTTGATTTTTGTTTTTCATGGTTTAAAAAACTAAATCTAAAGCTGGGGTTTAAATTTCTAGTTAAAATATTATTTTGGTATCTAACTGGTCAAAATTTTAATCCCAACGTCGGATTTTAAATCTATATCCTTAAACTTATTTTAAAACAACTTTGTCATACATATTTATTTATTTTATTTTCTGAAAATTATTGTTATCATTATATTATTTAATCAATTTTGAAACTGGACTAAAATGGAATAAACTGGGACTGAAATGGTATTTGACCTAAATCTTTCTACCTTATCGCAGTACTGACCTAAATTTTTCTACCTTATCGTAGTACTGTTGATAGAAGGGAATCAGATCTTAGAACCGCTATCTCGTCTCAACACGTGTCATCCGTGGGACACTAGAAGAAACTTTTTGACTGAGTAAGCACCTACGTGTCAACCACGCAATGTTCTGCAATTCGTTATCAGCCACGTAGGCACTCCAGAGTTGGGAATAAGAGTTCTCAAATTCTATATAAACCCAAAGGGACAAGAAGAGACAGGCATAAACTCTTATCTTCTCTTGTCGAAGCTAAGTATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCCGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGTGTGACGACTTGTCGACTTTTCTAGTTTCCAACCCTATTGCTCCATTTTGAACTTACCTTTTCTTAGTTAACAAGCACTTTTTATCTAAATTTAACCATTTGAATCCTTTAGCTCTGGTACTTAATTATTTAAGTCAATATACTTTTTAACTTGATATCAAAATGAAGTGTCGCTACTATAAAACAAGGTTTATTATTCTACAAATTTAATAAATTTCATTTGGTATCTAAAATTTTTAAAATGGTTAAATTAAAATTCTAGTCTCTAATTATTTTTATCTCGTTGATTCGAGACCTGACCGAACTTCAAATCTTGTGGCTAAAGCCTCTATACTTCTAATTTTCTATATTTAATAAGGCTATGATATATGCAACTTTTTTCCTTTATAAAAAAAAATTATAGGAAAGGGATAGTTTTCAAAAAAAAACCTTTTCAAATATAACAAAATCCGTGTAGCAAAATGTTGAATAAATATAGTTAAATTATACTATCATTCATGACCGGACTACTACGGATAATCGACTACTATTTATTTAGAATTATCGCCATCACTAAAAAAAGAATTGCTGTATTTAAAAATATTTTAAGCAATTTTGCTATTTAAAATAATTATTTAAAAAAATATATTATATTTGATAATTTTTTAAGTAAATTTTAGTTAATAATTGATCTATTAAACATGCTCTAAAAATTTTATGTCTTATTAGATGAAAATTTTATAAGACTGAAAATTGAAACTAATTTATAGGCTGAACTTTTAAGAAAATTATTATAAATTGAAAAAATATTAGAACTTCTATCGCTAACATCCCAAAACTTTTAATTTAATCTTTAAAAATATCTCTAAACAGCATAGAGACCACGCAAAACACTCTTGAAAGTACTAATGTTGAGGCACATTTTAAACCAATGCCAACCGCCAAAGTGAAAAGTGGCGACTCATTCTGATGGGTGATGCAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAACTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAGAAGAAAGCCTTTCACAATGTCGTTGAGTTTCAAGTTCTAAGT
TCCATTTTCAGCTTT

mRNA sequence

AAGTTTCTTTAGAAGAATTAGAAGGTCGGGCAACACAAATGGCATCGCAACAGGAAAGATCAGAGCTGGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTCGGAGTCGGGGTGGGCAGACGAGGAAGGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCTGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCCTAAGTATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCCGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAACTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAGAAGAAAGCCTTTCACAATGTCGTTGAGTTTCAAGTTCTAAGTTCCATTTTCAGCTTT

Coding sequence (CDS)

ATGGCATCGCAACAGGAAAGATCAGAGCTGGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTCGGAGTCGGGGTGGGCAGACGAGGAAGGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCTGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCCTAAGTATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCCGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAACTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAG
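
The CDS is contained within the mRNA, so the position at which it starts gives the length of the 5' untranslated region. A minimal sketch, using only the first bases of the two records above (truncated for brevity); the variable names are illustrative:

```python
# First bases of the mRNA and CDS records above (truncated for brevity).
mrna_prefix = "AAGTTTCTTTAGAAGAATTAGAAGGTCGGGCAACACAAATGGCATCGCAACAGGAAAGATCAGAG"
cds_prefix = "ATGGCATCGCAACAGGAA"

# str.find returns the 0-based offset of the first match, or -1 if absent.
# That offset equals the number of bases upstream of the start codon.
utr5_length = mrna_prefix.find(cds_prefix)
print(utr5_length)
```

The same approach applies to the full-length sequences; searching with the complete CDS rather than a prefix avoids matching an unrelated upstream ATG.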

Protein sequence

MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
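
The protein sequence follows from the CDS by translation with the standard genetic code. The sketch below is one way to reproduce that step; it assumes the standard code applies to this nuclear gene, reads in frame 0, and stops at the first stop codon. The helper name `translate` is illustrative and not part of any tool used to build this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, ordered by codon (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS above.
print(translate("ATGGCATCGCAACAGGAA"))  # MASQQE
print(translate("AGGACTAAGTAG"))        # RTK
```

Both fragments agree with the start (MASQQE) and end (RTK) of the protein sequence listed above.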
Homology
BLAST of ClCG04G006910 vs. NCBI nr
Match: KAG7033764.1 (Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 326.6 bits (836), Expect = 1.6e-85
Identity = 181/213 (84.98%), Positives = 189/213 (88.73%), Query Frame = 0

Query: 1   MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           MA+QQERSEL+AKAKQGETVVPGGTGGKS EAQER    RSRGGQTRKEQLGHEGYQE+G
Sbjct: 1   MAAQQERSELEAKAKQGETVVPGGTGGKSLEAQER----RSRGGQTRKEQLGHEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELK 120
           H+GGE RREQMG EGYQEMG+KGGLSTMDKS  ER  EEGIEIDESK            +
Sbjct: 61  HRGGETRREQMGQEGYQEMGKKGGLSTMDKSAAERVEEEGIEIDESK------------E 120

Query: 121 KEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE 180
           +EMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE
Sbjct: 121 REMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE 180

Query: 181 MGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           MGRKGGLSN+GMPGGERAAEEGVEIDESKFR K
Sbjct: 181 MGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK 197
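
The percentages in each HSP header are simply the identical (or positive-scoring) positions divided by the alignment length. A minimal sketch that parses one header line from the reports in this section and recomputes both values; the function name and regular expression are illustrative, and the pattern tolerates the "Postives" misspelling in case it occurs:

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)"; "Posi?tives" also
# accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


stats = hsp_percentages(
    "Identity = 181/213 (84.98%), Positives = 189/213 (88.73%), Query Frame = 0"
)
print(stats)  # (84.98, 88.73)
```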

BLAST of ClCG04G006910 vs. NCBI nr
Match: KAF4401425.1 (hypothetical protein G4B88_001619 [Cannabis sativa])

HSP 1 Score: 266.5 bits (680), Expect = 1.9e-67
Identity = 160/271 (59.04%), Positives = 178/271 (65.68%), Query Frame = 0

Query: 3   SQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH- 62
           SQ++R ELDAKAKQGETVVPGGTGG+S EAQE LAEGRSRGGQTR EQLGHEGYQE+G  
Sbjct: 89  SQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMGRK 148

Query: 63  -----------------------------------------------------------Q 122
                                                                      +
Sbjct: 149 GGLSTTDKSGGERAEEEGIQIDESKQELDAKARQGETVIPGGTGGKSLEAQEHLAEGRSR 208

Query: 123 GGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKE 182
           GG+ R EQ+GHEGYQEMGRKGGLST DKSGG+RA EEGI+IDES                
Sbjct: 209 GGQTRSEQLGHEGYQEMGRKGGLSTTDKSGGDRAEEEGIQIDES---------------- 268

Query: 183 MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 214
            +S+++R ELDA+A+QGETVVPGGTGG+SLEAQEHLAEGRSRGGQTR EQLGHEGYQEMG
Sbjct: 269 -NSQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMG 328

BLAST of ClCG04G006910 vs. NCBI nr
Match: XP_016183975.2 (LOW QUALITY PROTEIN: late embryogenesis abundant protein B19.4 [Arachis ipaensis])

HSP 1 Score: 250.4 bits (638), Expect = 1.4e-62
Identity = 153/233 (65.67%), Positives = 167/233 (71.67%), Query Frame = 0

Query: 1   MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           MAS+Q++ ELD +AKQGETVVPGGTGGKS EAQE LAEGRS+GGQT              
Sbjct: 1   MASKQQKQELDERAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQT-------------- 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKD----PKYL 120
                 RREQ+G EGYQEMGRKGG STM+KSGGERA EEG+EIDESKF TK+    P+Y 
Sbjct: 61  ------RREQLGTEGYQEMGRKGGFSTMEKSGGERAEEEGVEIDESKFVTKNLNKYPEYQ 120

Query: 121 E-ELKKEM---------------SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAE 180
             E K  M               S +Q R ELD RA+QGETVVPGGTGGKSLEAQEHLAE
Sbjct: 121 HIESKYNMLSLLSSNSIQVISMASKQQNRQELDERAKQGETVVPGGTGGKSLEAQEHLAE 180

Query: 181 GRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           GRS+GGQTR+EQLG EGYQEMGRKGG S     GGERA EEGVEIDESKF TK
Sbjct: 181 GRSKGGQTRREQLGTEGYQEMGRKGGFSTMEKSGGERAEEEGVEIDESKFTTK 213

BLAST of ClCG04G006910 vs. NCBI nr
Match: XP_007206180.2 (late embryogenesis abundant protein B19.3 [Prunus persica])

HSP 1 Score: 235.7 bits (600), Expect = 3.6e-58
Identity = 143/235 (60.85%), Positives = 161/235 (68.51%), Query Frame = 0

Query: 5   QERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGG 64
           Q+R ELD KA++GE V+PGGTGGKS EAQE LAEGRSRGGQTRK ++             
Sbjct: 9   QKRRELDEKARKGEVVIPGGTGGKSLEAQEHLAEGRSRGGQTRKNEI------------- 68

Query: 65  EARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMS 124
                  GHEGY EMG+KGGLST DKSGGERAAEEGI +DESK++T          KEM+
Sbjct: 69  -------GHEGYHEMGKKGGLSTTDKSGGERAAEEGIPLDESKYKTNG---RSNDSKEMA 128

Query: 125 SEQERC------ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQ----------- 184
           SEQER       ELD +ARQGE VVPGGTGGKSLEAQEHLAEGRSRGGQ           
Sbjct: 129 SEQERSDPSRRKELDEKARQGEVVVPGGTGGKSLEAQEHLAEGRSRGGQTRREQVGHEGY 188

Query: 185 ---------TRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
                    TRKEQ+GHEGY+EMG+KGGLS     GGERAAEEG+ IDESK++TK
Sbjct: 189 RELGHRGGETRKEQIGHEGYREMGKKGGLSTKDKSGGERAAEEGIPIDESKYKTK 220

BLAST of ClCG04G006910 vs. NCBI nr
Match: XP_021296037.1 (late embryogenesis abundant protein B19.4 [Herrania umbratica])

HSP 1 Score: 216.9 bits (551), Expect = 1.7e-52
Identity = 135/225 (60.00%), Positives = 157/225 (69.78%), Query Frame = 0

Query: 1   MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           M+ QQ+R ELD +A++ E V+PGGTGGKS EA+E LAEGRSRGGQTRKEQ+  EGYQE+G
Sbjct: 1   MSYQQQREELDHRAREVEIVIPGGTGGKSLEAEEHLAEGRSRGGQTRKEQIRTEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR-TKDPKYLEEL 120
           HQ                    GGLST DKSGGERA EEG++I++SK+R ++  K    +
Sbjct: 61  HQ--------------------GGLSTGDKSGGERAEEEGVQIEKSKYRASQRQKRRSSV 120

Query: 121 KK------EMSSEQ-------ERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGG 180
           KK       M+SEQ       ER ELDARARQGE VVPGGT GKSLEAQE LAEGR  GG
Sbjct: 121 KKPRQKGTRMASEQVKNASDEERAELDARARQGEVVVPGGTSGKSLEAQERLAEGRHPGG 180

Query: 181 QTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFR 212
           +  K+Q+G EGYQEMGRKGGLS T    GERAAEEG+ IDESK R
Sbjct: 181 EAGKQQIGREGYQEMGRKGGLSTTDKSDGERAAEEGMPIDESKHR 205

BLAST of ClCG04G006910 vs. ExPASy Swiss-Prot
Match: Q07187 (Em-like protein GEA1 OS=Arabidopsis thaliana OX=3702 GN=EM1 PE=2 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 4.5e-43
Identity = 112/214 (52.34%), Positives = 125/214 (58.41%), Query Frame = 0

Query: 1   MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MAS+Q  R ELD KAKQGETVVPGGTGG S EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVPGGTGGHSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEEL 120
           GH+GGEAR+EQ+GHEGYQEMG KGG +  ++ G E   E G                   
Sbjct: 61  GHKGGEARKEQLGHEGYQEMGHKGGEARKEQLGHEGYQEMG------------------- 120

Query: 121 KKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ 180
                                                       +GG+ RKEQLGHEGY+
Sbjct: 121 -------------------------------------------HKGGEARKEQLGHEGYK 152

Query: 181 EMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           EMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 EMGRKGGLSTMEKSGGERAEEEGIEIDESKFTNK 152

BLAST of ClCG04G006910 vs. ExPASy Swiss-Prot
Match: Q05191 (Late embryogenesis abundant protein B19.4 OS=Hordeum vulgare OX=4513 GN=B19.4 PE=2 SV=1)

HSP 1 Score: 168.3 bits (425), Expect = 9.3e-41
Identity = 104/212 (49.06%), Positives = 124/212 (58.49%), Query Frame = 0

Query: 2   ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH 61
           + QQERSELD  A++GETVVPGGTGGK+ EAQE LAEGRSRGGQTRKEQLG EGY+E+GH
Sbjct: 3   SGQQERSELDRMAREGETVVPGGTGGKTLEAQEHLAEGRSRGGQTRKEQLGEEGYREMGH 62

Query: 62  QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKK 121
           +GGE R+EQ+G EGY+EMG KGG +  ++ G E   E G                     
Sbjct: 63  KGGETRKEQLGEEGYREMGHKGGETRKEQLGEEGYREMG--------------------- 122

Query: 122 EMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEM 181
                                                     +GG+TRKEQ+G EGY+EM
Sbjct: 123 -----------------------------------------HKGGETRKEQMGEEGYREM 152

Query: 182 GRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           GRKGGLS     GGERAA EG++IDESKF+TK
Sbjct: 183 GRKGGLSTMNESGGERAAREGIDIDESKFKTK 152

BLAST of ClCG04G006910 vs. ExPASy Swiss-Prot
Match: I1N2Z5 (Protein SLE1 OS=Glycine max OX=3847 GN=SLE1 PE=2 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 2.3e-39
Identity = 90/110 (81.82%), Positives = 97/110 (88.18%), Query Frame = 0

Query: 1   MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           M SQQ  R ELD KA+QGETVVPGGTGGKS EAQE LAEGRSRGGQTRK+QLG EGY E+
Sbjct: 1   MESQQANREELDEKARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKQQLGSEGYHEM 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR 110
           G +GG+ R+EQMG EGYQEMGRKGGLSTMDKSGGERA EEGIEIDESKF+
Sbjct: 61  GTKGGQTRKEQMGREGYQEMGRKGGLSTMDKSGGERAEEEGIEIDESKFK 110

BLAST of ClCG04G006910 vs. ExPASy Swiss-Prot
Match: Q5KTS7 (Carrot ABA-induced in somatic embryos 3 OS=Daucus carota OX=4039 GN=CAISE3 PE=2 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 8.7e-39
Identity = 86/110 (78.18%), Positives = 98/110 (89.09%), Query Frame = 0

Query: 2   ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH 61
           + Q++RSELDA+AKQGETVVPGGTGGKS EAQE LAEGRS+GG TRKEQLG EGYQE+G 
Sbjct: 3   SGQEKRSELDARAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGHTRKEQLGTEGYQEIGT 62

Query: 62  QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTK 112
           +GGE RREQMG EGY++MGR GGL+T DKSG ERA EEGI+ID+SKFRTK
Sbjct: 63  KGGETRREQMGKEGYEQMGRMGGLATKDKSGAERAEEEGIDIDQSKFRTK 112

BLAST of ClCG04G006910 vs. ExPASy Swiss-Prot
Match: Q02400 (Late embryogenesis abundant protein B19.3 OS=Hordeum vulgare OX=4513 GN=B19.3 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 2.8e-37
Identity = 88/130 (67.69%), Positives = 100/130 (76.92%), Query Frame = 0

Query: 2   ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQ---------------- 61
           + QQERSELD  A++GETVVPGGTGGK+ EAQE LAEGRSRGGQ                
Sbjct: 3   SGQQERSELDRMAREGETVVPGGTGGKTLEAQEHLAEGRSRGGQTRKDQLGEEGYREMGH 62

Query: 62  ----TRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGI 112
               TRKEQLG EGY+E+GH+GGE R+EQMG EGY EMGRKGGLSTM++SGGERAA EGI
Sbjct: 63  KGGETRKEQLGEEGYREMGHKGGETRKEQMGEEGYHEMGRKGGLSTMEESGGERAAREGI 122

BLAST of ClCG04G006910 vs. ExPASy TrEMBL
Match: A0A7J6I2G2 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_001619 PE=3 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 9.3e-68
Identity = 160/271 (59.04%), Positives = 178/271 (65.68%), Query Frame = 0

Query: 3   SQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH- 62
           SQ++R ELDAKAKQGETVVPGGTGG+S EAQE LAEGRSRGGQTR EQLGHEGYQE+G  
Sbjct: 89  SQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMGRK 148

Query: 63  -----------------------------------------------------------Q 122
                                                                      +
Sbjct: 149 GGLSTTDKSGGERAEEEGIQIDESKQELDAKARQGETVIPGGTGGKSLEAQEHLAEGRSR 208

Query: 123 GGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKE 182
           GG+ R EQ+GHEGYQEMGRKGGLST DKSGG+RA EEGI+IDES                
Sbjct: 209 GGQTRSEQLGHEGYQEMGRKGGLSTTDKSGGDRAEEEGIQIDES---------------- 268

Query: 183 MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 214
            +S+++R ELDA+A+QGETVVPGGTGG+SLEAQEHLAEGRSRGGQTR EQLGHEGYQEMG
Sbjct: 269 -NSQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMG 328

BLAST of ClCG04G006910 vs. ExPASy TrEMBL
Match: A0A6J1B9A4 (late embryogenesis abundant protein B19.4 OS=Herrania umbratica OX=108875 GN=LOC110425442 PE=4 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 8.4e-53
Identity = 135/225 (60.00%), Positives = 157/225 (69.78%), Query Frame = 0

Query: 1   MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           M+ QQ+R ELD +A++ E V+PGGTGGKS EA+E LAEGRSRGGQTRKEQ+  EGYQE+G
Sbjct: 1   MSYQQQREELDHRAREVEIVIPGGTGGKSLEAEEHLAEGRSRGGQTRKEQIRTEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR-TKDPKYLEEL 120
           HQ                    GGLST DKSGGERA EEG++I++SK+R ++  K    +
Sbjct: 61  HQ--------------------GGLSTGDKSGGERAEEEGVQIEKSKYRASQRQKRRSSV 120

Query: 121 KK------EMSSEQ-------ERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGG 180
           KK       M+SEQ       ER ELDARARQGE VVPGGT GKSLEAQE LAEGR  GG
Sbjct: 121 KKPRQKGTRMASEQVKNASDEERAELDARARQGEVVVPGGTSGKSLEAQERLAEGRHPGG 180

Query: 181 QTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFR 212
           +  K+Q+G EGYQEMGRKGGLS T    GERAAEEG+ IDESK R
Sbjct: 181 EAGKQQIGREGYQEMGRKGGLSTTDKSDGERAAEEGMPIDESKHR 205

BLAST of ClCG04G006910 vs. ExPASy TrEMBL
Match: A0A498K5A7 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_014793 PE=4 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 5.3e-47
Identity = 136/279 (48.75%), Positives = 154/279 (55.20%), Query Frame = 0

Query: 1   MASQQE------RSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHE 60
           MAS+QE      R+ELD KA++GET+VPGGTGG S EAQE LAEGRSRGGQTRK Q+   
Sbjct: 1   MASEQEKQDPQKRNELDEKARRGETIVPGGTGGHSLEAQEHLAEGRSRGGQTRKGQI--- 60

Query: 61  GYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPK 120
                            G EGY EMG+KGGLST DK GGERAAEEGI+IDESK   +DP+
Sbjct: 61  -----------------GEEGYHEMGKKGGLSTTDKPGGERAAEEGIKIDESK---RDPR 120

Query: 121 YLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQL- 180
                         R ELD +ARQGE VVPGGTG K+L AQEHLAEGR RGG+ RKEQL 
Sbjct: 121 -------------RRQELDQKARQGENVVPGGTGSKTLNAQEHLAEGRHRGGEARKEQLG 180

Query: 181 -----------------------------------------------------------G 214
                                                                      G
Sbjct: 181 SEGYGEIGHRGGEARKEQLGHEGYRDMGHRRCEASKKQLGHEGYQEMGRHGGEMRKEQIG 240

BLAST of ClCG04G006910 vs. ExPASy TrEMBL
Match: A0A446J0E8 (Uncharacterized protein OS=Triticum turgidum subsp. durum OX=4567 GN=TRITD_1Av1G143760 PE=3 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 2.6e-46
Identity = 116/212 (54.72%), Positives = 134/212 (63.21%), Query Frame = 0

Query: 2   ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH 61
           + QQERSELD  A++GETVVPGGTGGKS EAQE LA+GRSRGG+TRKEQLG EGY+E+GH
Sbjct: 3   SGQQERSELDRMAREGETVVPGGTGGKSLEAQEHLADGRSRGGETRKEQLGEEGYREMGH 62

Query: 62  QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKK 121
           +GGE R+EQ+G EGY+EMGRKGGLSTM++SGGERAA                        
Sbjct: 63  KGGETRKEQLGEEGYREMGRKGGLSTMEESGGERAAR----------------------- 122

Query: 122 EMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEM 181
                                                 EGRSRGGQTR+EQ+G EGY EM
Sbjct: 123 --------------------------------------EGRSRGGQTRREQMGEEGYSEM 153

Query: 182 GRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           GRKGGLS     GGERAA EG++IDESKF+TK
Sbjct: 183 GRKGGLSTNDESGGERAAREGIDIDESKFKTK 153

BLAST of ClCG04G006910 vs. ExPASy TrEMBL
Match: R0HE60 (Uncharacterized protein OS=Capsella rubella OX=81985 GN=CARUB_v10019332mg PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 1.7e-45
Identity = 124/215 (57.67%), Positives = 145/215 (67.44%), Query Frame = 0

Query: 1   MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MAS+Q  R ELD KAKQGETVV GGTGGKS EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVQGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDES-KFRTKDPKYLEE 120
           G +GGE R+EQ+GHEGYQEMGRKGG +  ++ G E   E G +  E+ K +     Y E 
Sbjct: 61  GSKGGETRKEQLGHEGYQEMGRKGGETRREQLGHEGYQEMGRKGGETRKEQLGHEGYQEM 120

Query: 121 LKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGY 180
            +K   + +E+   +     G+    GG   K     E   E   +GG+ RKEQLGHEGY
Sbjct: 121 GRKGGETRKEQLGHEGYQEMGQ---KGGEARKEQLGHEGYQEMGRKGGEARKEQLGHEGY 180

Query: 181 QEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           QEMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 QEMGRKGGLSTMDKSGGERAEEEGIEIDESKFTNK 212

BLAST of ClCG04G006910 vs. TAIR 10
Match: AT3G51810.1 (Stress induced protein )

HSP 1 Score: 176.0 bits (445), Expect = 3.2e-44
Identity = 112/214 (52.34%), Positives = 125/214 (58.41%), Query Frame = 0

Query: 1   MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MAS+Q  R ELD KAKQGETVVPGGTGG S EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVPGGTGGHSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEEL 120
           GH+GGEAR+EQ+GHEGYQEMG KGG +  ++ G E   E G                   
Sbjct: 61  GHKGGEARKEQLGHEGYQEMGHKGGEARKEQLGHEGYQEMG------------------- 120

Query: 121 KKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ 180
                                                       +GG+ RKEQLGHEGY+
Sbjct: 121 -------------------------------------------HKGGEARKEQLGHEGYK 152

Query: 181 EMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           EMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 EMGRKGGLSTMEKSGGERAEEEGIEIDESKFTNK 152

BLAST of ClCG04G006910 vs. TAIR 10
Match: AT2G40170.1 (Stress induced protein )

HSP 1 Score: 141.0 bits (354), Expect = 1.1e-33
Identity = 73/91 (80.22%), Positives = 81/91 (89.01%), Query Frame = 0

Query: 123 MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 182
           M+S+QE+ +LD RA++GETVVPGGTGGKS EAQ+HLAEGRSRGGQTRKEQLG EGYQ+MG
Sbjct: 1   MASQQEKKQLDERAKKGETVVPGGTGGKSFEAQQHLAEGRSRGGQTRKEQLGTEGYQQMG 60

Query: 183 RKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           RKGGLS    PGGE A EEGVEIDESKFRTK
Sbjct: 61  RKGGLSTGDKPGGEHAEEEGVEIDESKFRTK 91

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
KAG7033764.1 | 1.6e-85 | 84.98 | Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma]
KAF4401425.1 | 1.9e-67 | 59.04 | hypothetical protein G4B88_001619 [Cannabis sativa]
XP_016183975.2 | 1.4e-62 | 65.67 | LOW QUALITY PROTEIN: late embryogenesis abundant protein B19.4 [Arachis ipaensis...
XP_007206180.2 | 3.6e-58 | 60.85 | late embryogenesis abundant protein B19.3 [Prunus persica]
XP_021296037.1 | 1.7e-52 | 60.00 | late embryogenesis abundant protein B19.4 [Herrania umbratica]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q07187 | 4.5e-43 | 52.34 | Em-like protein GEA1 OS=Arabidopsis thaliana OX=3702 GN=EM1 PE=2 SV=1
Q05191 | 9.3e-41 | 49.06 | Late embryogenesis abundant protein B19.4 OS=Hordeum vulgare OX=4513 GN=B19.4 PE...
I1N2Z5 | 2.3e-39 | 81.82 | Protein SLE1 OS=Glycine max OX=3847 GN=SLE1 PE=2 SV=1
Q5KTS7 | 8.7e-39 | 78.18 | Carrot ABA-induced in somatic embryos 3 OS=Daucus carota OX=4039 GN=CAISE3 PE=2 ...
Q02400 | 2.8e-37 | 67.69 | Late embryogenesis abundant protein B19.3 OS=Hordeum vulgare OX=4513 GN=B19.3 PE...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A7J6I2G2 | 9.3e-68 | 59.04 | Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_001619 PE=3 SV=1
A0A6J1B9A4 | 8.4e-53 | 60.00 | late embryogenesis abundant protein B19.4 OS=Herrania umbratica OX=108875 GN=LOC...
A0A498K5A7 | 5.3e-47 | 48.75 | Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_014793 PE=4 SV=1
A0A446J0E8 | 2.6e-46 | 54.72 | Uncharacterized protein OS=Triticum turgidum subsp. durum OX=4567 GN=TRITD_1Av1G...
R0HE60 | 1.7e-45 | 57.67 | Uncharacterized protein OS=Capsella rubella OX=81985 GN=CARUB_v10019332mg PE=4 S...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G51810.1 | 3.2e-44 | 52.34 | Stress induced protein
AT2G40170.1 | 1.1e-33 | 80.22 | Stress induced protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 116..136
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 199..213
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 155..178
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 120..213
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 120..136
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 33..76
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..105
None | No IPR available | PANTHER | PTHR34671:SF13 | SUBFAMILY NOT NAMED | coord: 123..213; coord: 62..112
None | No IPR available | PANTHER | PTHR34671:SF13 | SUBFAMILY NOT NAMED | coord: 1..64
IPR038956 | Late embryogenesis abundant protein, LEA_5 subgroup | PFAM | PF00477 | LEA_5 | coord: 124..178, e-value: 8.8E-25, score: 87.3; coord: 2..108, e-value: 4.4E-56, score: 188.0
IPR000389 | Small hydrophilic plant seed protein | PANTHER | PTHR34671 | EM-LIKE PROTEIN GEA1 | coord: 1..64; coord: 123..213
IPR000389 | Small hydrophilic plant seed protein | PANTHER | PTHR34671 | EM-LIKE PROTEIN GEA1 | coord: 62..112
IPR022377 | Small hydrophilic plant seed protein, conserved site | PROSITE | PS00431 | SMALL_HYDR_PLANT_SEED | coord: 139..147
IPR022377 | Small hydrophilic plant seed protein, conserved site | PROSITE | PS00431 | SMALL_HYDR_PLANT_SEED | coord: 17..25
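
The alignment coordinates above can be mapped back onto the protein sequence to see which residues each annotation covers. The sketch below does this for the two PROSITE PS00431 hits; it assumes the coordinates are 1-based and inclusive (an assumption, not stated in this record), and the helper name `region` is illustrative.

```python
# Protein sequence of ClCG04G006910 as listed under "Protein sequence",
# split into chunks of 60 residues for readability.
PROTEIN = (
    "MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG"
    "HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELK"
    "KEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE"
    "MGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK"
)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]


# Coordinates of the two PROSITE PS00431 hits from the table above.
for start, end in [(17, 25), (139, 147)]:
    print(start, end, region(PROTEIN, start, end))
```

Under this convention both hits cover the same nine-residue stretch, GETVVPGGT, once in each half of the protein.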

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG04G006910.1 | ClCG04G006910.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009737 | response to abscisic acid