ClCG01G013835 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G013835
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotransposon protein
Location: CG_Chr01: 27756884 .. 27758361 (-)
RNA-Seq Expression: ClCG01G013835
Synteny: ClCG01G013835
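
The Location field packs the sequence ID, the start and end coordinates, and the strand into one string. The sketch below shows one way to split it and derive the span length. It assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a 'SEQID: START .. END (STRAND)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr01: 27756884 .. 27758361 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # CG_Chr01 - 1478
```

Under that assumption the gene spans 1478 bp on the minus strand of CG_Chr01.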
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGAAAGGTCGATGGGCAATCATTCGGGGAAAATCATACTACCTAGTTGATGTGCAGTGCAAGACCATACATGTCTGTTGTCTCTTGCACAATCTAATAATTCAGGAAATGGGCACAAATTCATTACTTGACGAAGGAAAGGGAAGTCAAGCTGGACCAATTCAAAAAGACACTGAGAATATTGAATTTGTAGAGACATCTAATGTCTTTACTGCATGGAGGGATGATTTCGCAAATCAAATGTAAGCTGAATGGAATGCAAATTAAAGACGCCCTAGGGAAACTTTGTATAGTCTTTTTTGTTATGTGTACGGTATTTCGTTGATGCTTGACGTGTACGTTTTGTCATTTTTTGGATAACATATGTGAATATTTCAATGTATAGTGTGAATGGATTGATTCTTATTATATACAGTCTCAATGGGTTAATGCATGTTCACCGCACTACACCTTCATGTTCAAGCCTGATTGTGTTAACATGTTTAATTTTAGATGGAAGCATTTGAATCACGCGTAAGAGCCTCAAAATATATTTGGATGGATGAAGAGGACAGAATCCTAGTGGAGTGTTTAGTTTAGTGTGTGCAGTCTGGACACTGGCGAGCTGATAACGGGACTTTCCGACTTGGATTCCTAGCAAACGTACTACGAATGATGCAGCAAAGGATTCCAGGGTGTTCCATACAGGTAAGCCCGAATTTGGAGTCCAGGGTCAGGATGTTGAAGAGACAGTACAGCGTGATCGTTGAAATGTTGGGCCTAGGATATAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTAATGACTGTGAGCCGAAGATATTTGACGCATGGGTCAAGGTAATTTTATTATTTTTCTATTTATTCTAGGTTGTTTTAAATATGTTTTAGCCATTACATAACACATTTAATCTTTAACAGAGTCATCCGAGTGCAAAAGGACTGCGCCATAATTCATTTCCGTTTTATGACGACTTGGCCATTATATTTGGCAAAGACAAAGCAACAGGGAGTCGTGCCACTACCATTGCAGAGGTCGGATCTAAACCTGTTGTGGAAGAGGAGAACGAGGACATCTTGAATAACCAACCCCCGGACTTTGAGAACTTCTATATTCCCGATCCACCGTTCACCAGCTCGACCACATTAGAGGACCTTCCAACTACCCTCGGCGATAGAGGGTCTGGGAGTAGCATGTCAACAAGAAGTAGGAGGTCCCGAAGTTCCTCAATTGGAGAGTATAGCGAGGTGGTTCGAGATGGCTTCCAACTTCTGATGAAGTCCATTGACGGCATTGCATAGTGGCCTATTGTGAACAATGACCTGGCAAGGCGTCATCGTCGAGAACTGTACGCCGAGCTGTAATCAATTCCTGGTCTGTTGATGCAAGATGACTTGACTGTTGCACGGTCATTGCTTGCAGATCCAATGCTGTTAAGCCACTTCGTGGACTTCCCACCGTAGTGGAAGTATGA

mRNA sequence

ATGTTGAAAGGTCGATGGGCAATCATTCGGGGAAAATCATACTACCTAGTTGATGTGCAGTGCAAGACCATACATGTCTGTTGTCTCTTGCACAATCTAATAATTCAGGAAATGGGCACAAATTCATTACTTGACGAAGGAAAGGGAAGTCAAGCTGGACCAATTCAAAAAGACACTGAGAATATTGAATTTGTAGAGACATCTAATGTCTTTACTGCATGGAGGGATGATTTCGCAAATCAAATTCTCAATGGGTTAATGCATGTTCACCGCACTACACCTTCATGTTCAAGCCTGATTTCTGGACACTGGCGAGCTGATAACGGGACTTTCCGACTTGGATTCCTAGCAAACGTACTACGAATGATGCAGCAAAGGATTCCAGGGTGTTCCATACAGGTAAGCCCGAATTTGGAGTCCAGGGTCAGGATGTTGAAGAGACAGTACAGCGTGATCGTTGAAATGTTGGGCCTAGGATATAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTAATGACTGTGAGCCGAAGATATTTGACGCATGGGTCAAGAGTCATCCGAGTGCAAAAGGACTGCGCCATAATTCATTTCCGTTTTATGACGACTTGGCCATTATATTTGGCAAAGACAAAGCAACAGGGAGTCGTGCCACTACCATTGCAGAGGTCGGATCTAAACCTGTTGTGGAAGAGGAGAACGAGGACATCTTGAATAACCAACCCCCGGACTTTGAGAACTTCTATATTCCCGATCCACCGTTCACCAGCTCGACCACATTAGAGGACCTTCCAACTACCCTCGGCGATAGAGGGTCTGGGAGTAGCATGTCAACAAGAAGTAGGAGGTCCCGAAGTTCCTCAATTGGAGAGTATAGCGAGGTGGTTCGAGATGGCTTCCAACTTCTGATGAAATCCAATGCTGTTAAGCCACTTCGTGGACTTCCCACCGTAGTGGAAGTATGA

Coding sequence (CDS)

ATGTTGAAAGGTCGATGGGCAATCATTCGGGGAAAATCATACTACCTAGTTGATGTGCAGTGCAAGACCATACATGTCTGTTGTCTCTTGCACAATCTAATAATTCAGGAAATGGGCACAAATTCATTACTTGACGAAGGAAAGGGAAGTCAAGCTGGACCAATTCAAAAAGACACTGAGAATATTGAATTTGTAGAGACATCTAATGTCTTTACTGCATGGAGGGATGATTTCGCAAATCAAATTCTCAATGGGTTAATGCATGTTCACCGCACTACACCTTCATGTTCAAGCCTGATTTCTGGACACTGGCGAGCTGATAACGGGACTTTCCGACTTGGATTCCTAGCAAACGTACTACGAATGATGCAGCAAAGGATTCCAGGGTGTTCCATACAGGTAAGCCCGAATTTGGAGTCCAGGGTCAGGATGTTGAAGAGACAGTACAGCGTGATCGTTGAAATGTTGGGCCTAGGATATAGTGGGTTTGGTTGGAATGCGGAGCGCAAATGTAATGACTGTGAGCCGAAGATATTTGACGCATGGGTCAAGAGTCATCCGAGTGCAAAAGGACTGCGCCATAATTCATTTCCGTTTTATGACGACTTGGCCATTATATTTGGCAAAGACAAAGCAACAGGGAGTCGTGCCACTACCATTGCAGAGGTCGGATCTAAACCTGTTGTGGAAGAGGAGAACGAGGACATCTTGAATAACCAACCCCCGGACTTTGAGAACTTCTATATTCCCGATCCACCGTTCACCAGCTCGACCACATTAGAGGACCTTCCAACTACCCTCGGCGATAGAGGGTCTGGGAGTAGCATGTCAACAAGAAGTAGGAGGTCCCGAAGTTCCTCAATTGGAGAGTATAGCGAGGTGGTTCGAGATGGCTTCCAACTTCTGATGAAATCCAATGCTGTTAAGCCACTTCGTGGACTTCCCACCGTAGTGGAAGTATGA

Protein sequence

MLKGRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMGTNSLLDEGKGSQAGPIQKDTENIEFVETSNVFTAWRDDFANQILNGLMHVHRTTPSCSSLISGHWRADNGTFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDILNNQPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSRSSSIGEYSEVVRDGFQLLMKSNAVKPLRGLPTVVEV
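
The protein sequence above is the translation of the CDS. As a minimal sketch, assuming the standard genetic code (the page does not name the translation table used), the first and last codons of the CDS can be checked against the protein like this. The helper `translate` is illustrative and is not part of any pipeline described here.

```python
from itertools import product

# Standard genetic code laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGTTGAAAGGTCGATGGGCA"))  # MLKGRWA
# Last six codons of the CDS, including the TGA stop.
print(translate("ACCGTAGTGGAAGTATGA"))     # TVVEV
```

Both fragments agree with the start (MLKGRWA) and end (TVVEV) of the protein sequence shown above.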
Homology
BLAST of ClCG01G013835 vs. NCBI nr
Match: ADN33754.1 (retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 225.3 bits (573), Expect = 7.4e-55
Identity = 125/317 (39.43%), Positives = 179/317 (56.47%), Query Frame = 0

Query: 1   MLKGRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMG-TNSLLDEGKGSQAGPIQKDT 60
           +LKGRW I+RGKSYY + VQC+TI  C LLHNLI +EM   N + DE +G         +
Sbjct: 246 VLKGRWTILRGKSYYPLQVQCRTILACTLLHNLINREMTYCNDVEDEDEGDSTYATTTAS 305

Query: 61  ENIEFVETSNVFTAWRDDFANQILNGLMHVHRTTPSCS-SLIS-GHWRADNGTFRLGFLA 120
           E+I+++ET+N ++ WRDD A  +        R   SC   L+S G W++DNGTFR G+LA
Sbjct: 306 EDIQYIETTNEWSQWRDDLATSMFTDWQ--FRGGDSCGMELVSMGGWKSDNGTFRPGYLA 365

Query: 121 NVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPK 180
            ++RMM +++ GC ++ +  ++ R++ LKR +  I EMLG   SGFGWN E KC   E +
Sbjct: 366 QLVRMMAEKLSGCQVRATTVIDCRIKTLKRTFQAIAEMLGPACSGFGWNDEEKCIVAEKE 425

Query: 181 IFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDIL 240
           +FD WV+S P+AKGL +N FP+YD+L  +FG+D+ATG  A T A+VGS       +   +
Sbjct: 426 LFDNWVRSPPAAKGLLNNPFPYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDM 485

Query: 241 NNQPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSRSSSIGEYSEVVRD 300
            +   DF   Y               P+   +  +GSS S R R S+     E   +  D
Sbjct: 486 GDGNEDFPPVYSRGVDILQDDVRASRPSRASEGKTGSSGSKRKRGSQRDFDVEAIHLALD 545

Query: 301 GFQLLMKSNAVKPLRGL 315
                ++  A  P R L
Sbjct: 546 QTNEQLRQIAEWPARNL 560

BLAST of ClCG01G013835 vs. NCBI nr
Match: ADN34114.1 (retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 213.4 bits (542), Expect = 2.9e-51
Identity = 119/303 (39.27%), Positives = 176/303 (58.09%), Query Frame = 0

Query: 1   MLKGRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMGTNSLLDEGKGSQAGPIQKDTE 60
           +LKGRWAI+RGKSYY V+VQC+TI  CCLLHNLI +EM    + D      +       +
Sbjct: 274 VLKGRWAILRGKSYYPVEVQCRTILACCLLHNLINREMTNFDIEDNIDEVDSTHATTAAD 333

Query: 61  NIEFVETSNVFTAWRDDFANQILNGLMHVHR---TTPSCSSLI--------SGHWRADNG 120
           +I ++ETSN ++ WRD+ A +I+     + +   T    + L+        +G WR+DNG
Sbjct: 334 DIHYIETSNEWSQWRDNLAEEIMTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNG 393

Query: 121 TFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAER 180
           TFR G+L  + RMM  +IPG +I  S  ++SR++++KR +  + EM G   SGFGWN E+
Sbjct: 394 TFRPGYLNQLARMMAFKIPGSNIHAS-TIDSRIKLMKRMFHALAEMRGPNCSGFGWNDEK 453

Query: 181 KCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSK--P 240
           KC   E ++FD W  SHP+AKGL + SF  YD+L+ +FGKD+ATG RA + A++GS   P
Sbjct: 454 KCIVAEKEVFDDW--SHPAAKGLLNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPP 513

Query: 241 VVEEENEDILNNQPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSRSSS 291
             +    D + +   DF   Y P    +    +E     + +R + SS S R R   ++ 
Sbjct: 514 GYDAGAADAMPD--TDFPPMYSPGLNMSPDDLMETRTARVSERRNVSSGSKRKRPGHATD 571

BLAST of ClCG01G013835 vs. NCBI nr
Match: XP_039026053.1 (uncharacterized protein LOC120159549 [Hibiscus syriacus])

HSP 1 Score: 199.9 bits (507), Expect = 3.3e-47
Identity = 98/222 (44.14%), Positives = 136/222 (61.26%), Query Frame = 0

Query: 1   MLKGRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMGTNSL-LDEGKGSQAGPIQKDT 60
           +LK RWAI+R KS+Y V  QC+ I  CCLLHN I  EM  + +  D  + S+      + 
Sbjct: 94  ILKARWAILREKSFYPVKTQCRLISACCLLHNFIRSEMPIDHIESDYTEDSRHVEEINEV 153

Query: 61  ENIEFVETSNVFTAWRDDFANQILNGLMHVHRTTPSCSSLISGHWRADNGTFRLGFLANV 120
           E I   E S+ +T WRD  A  +    +  H            HW+ADNGTFR G+L N+
Sbjct: 154 EMIRHCEPSSAWTEWRDKLAEDMFTSWLASH-----------PHWKADNGTFRSGYLYNL 213

Query: 121 LRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPKIF 180
            +M++ ++P   I+  P++ESRV++LKRQY+ + EML +  SGFGWN E KC      +F
Sbjct: 214 EKMLEIKLPTSQIRAHPHIESRVKLLKRQYNALSEMLNI-ESGFGWNEEEKCLTAPKDVF 273

Query: 181 DAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIA 222
           D WV+SHP+A GLR+ SFPF+DD   IFGK++ATG+ A  +A
Sbjct: 274 DDWVRSHPTAAGLRNKSFPFFDDFVHIFGKERATGTTAEIVA 303

BLAST of ClCG01G013835 vs. NCBI nr
Match: TYK03086.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 194.5 bits (493), Expect = 1.4e-45
Identity = 93/228 (40.79%), Positives = 137/228 (60.09%), Query Frame = 0

Query: 1   MLKGRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEM---GTNSLLDEGKGSQAGPIQK 60
           +LK RW I+ GKSYY + VQC TI  CCLLHNLI +EM        +DEG    A     
Sbjct: 119 VLKDRWTILNGKSYYPLQVQCHTILACCLLHNLINREMTYCDDVDYVDEGDSRYA--TTT 178

Query: 61  DTENIEFVETSNVFTAWRDDFANQILNGLMHVHRTTPSCSSLISGHWRADNGTFRLGFLA 120
            +E+I+++ET+N ++ WRD+ A  +                    +W+  NGTFR G+L 
Sbjct: 179 ASEDIQYIETTNEWSQWRDELAESMFT------------------NWQLYNGTFRPGYLD 238

Query: 121 NVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPK 180
            ++ MM +++P C ++ +  ++ R++ LKR +  I EM G  YSGFGWN E KC   E +
Sbjct: 239 QLVHMMAEKLPECQVRATTVIDYRIKTLKRIFQAIAEMRGPAYSGFGWNNEEKCIIAEKE 298

Query: 181 IFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGS 226
           +FD WV+SHP+AKGL +  FP+Y++L  +F +D+ TG  A T A++GS
Sbjct: 299 LFDNWVRSHPAAKGLLNKLFPYYNELTYVFSRDRTTGRFAETFADLGS 326

BLAST of ClCG01G013835 vs. NCBI nr
Match: KAA0043564.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 191.4 bits (485), Expect = 1.2e-44
Identity = 99/242 (40.91%), Positives = 143/242 (59.09%), Query Frame = 0

Query: 4   GRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMGTNSLLDEGKGSQAGPIQKDTENIE 63
           GRWAI+RGKSYY V+VQC+TI  CCLLHNLI +EM  + ++D+     +       + I 
Sbjct: 84  GRWAILRGKSYYPVNVQCRTIMACCLLHNLINREMTNSEIIDDLDEGDSTYATTGGDEIN 143

Query: 64  FVETSNVFTAWRDDFANQILNGL-MHVHRTTPSCSS-------------------LISGH 123
           ++E SN ++  RD  A  + +   +    T+ S +S                   + +G 
Sbjct: 144 YIEVSNEWSELRDQLAYTMFSDWELRDQMTSSSRASKHTWTKEEEAKLVECLVELVSAGG 203

Query: 124 WRADNGTFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGF 183
           WR++NGTFR G+LA + RMM +++   +IQ S  ++ RV+ LK+       M G   SGF
Sbjct: 204 WRSNNGTFRHGYLAQLQRMMVKKLSDTNIQGSLPMDCRVKSLKKHLPSNFRMRGPSCSGF 263

Query: 184 GWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEV 226
           GWN + +C   E  +FD WVKSHP+A+GL H SFP+YDDL+ +F KD+AT +R  T A+V
Sbjct: 264 GWNEKFQCIIAERDLFDNWVKSHPTAEGLLHKSFPYYDDLSYVFDKDRATEARLETFADV 323

BLAST of ClCG01G013835 vs. ExPASy Swiss-Prot
Match: O82368 (Uncharacterized protein At2g29880 OS=Arabidopsis thaliana OX=3702 GN=At2g29880 PE=2 SV=1)

HSP 1 Score: 56.6 bits (135), Expect = 5.9e-07
Identity = 50/192 (26.04%), Positives = 83/192 (43.23%), Query Frame = 0

Query: 104 WRADNGTFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGF 163
           WR  NGT     +   +  +  +   C+ +   N  SR++ +K++YSV   +     SGF
Sbjct: 42  WRDKNGTISKTTVERKILPLLNKKFKCN-KTYTNYLSRMKSMKKEYSVYAALFWFS-SGF 101

Query: 164 GWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEV 223
           GW+   K       ++ A++  HP+   +R ++F  ++DL +IF    A G+ A  +   
Sbjct: 102 GWDPITKQFTAPDDVWAAYLMGHPNHHHMRTSTFEDFEDLQLIFESAIAKGNNAFGLGGD 161

Query: 224 GSKPVVEEENEDILNNQPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRS 283
            +    EEE++    +     E   I D     +   E LPT        S  +    RS
Sbjct: 162 SNAETFEEEDDLQAGDNVNHME---INDDEVNETLPKEKLPTR-----KRSKTNRNGDRS 221

Query: 284 RSSSIGEYSEVV 296
            S + GE SE V
Sbjct: 222 DSINHGESSEKV 223

BLAST of ClCG01G013835 vs. ExPASy TrEMBL
Match: E5GBB2 (Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1)

HSP 1 Score: 225.3 bits (573), Expect = 3.6e-55
Identity = 125/317 (39.43%), Positives = 179/317 (56.47%), Query Frame = 0

Query: 1   MLKGRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMG-TNSLLDEGKGSQAGPIQKDT 60
           +LKGRW I+RGKSYY + VQC+TI  C LLHNLI +EM   N + DE +G         +
Sbjct: 246 VLKGRWTILRGKSYYPLQVQCRTILACTLLHNLINREMTYCNDVEDEDEGDSTYATTTAS 305

Query: 61  ENIEFVETSNVFTAWRDDFANQILNGLMHVHRTTPSCS-SLIS-GHWRADNGTFRLGFLA 120
           E+I+++ET+N ++ WRDD A  +        R   SC   L+S G W++DNGTFR G+LA
Sbjct: 306 EDIQYIETTNEWSQWRDDLATSMFTDWQ--FRGGDSCGMELVSMGGWKSDNGTFRPGYLA 365

Query: 121 NVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPK 180
            ++RMM +++ GC ++ +  ++ R++ LKR +  I EMLG   SGFGWN E KC   E +
Sbjct: 366 QLVRMMAEKLSGCQVRATTVIDCRIKTLKRTFQAIAEMLGPACSGFGWNDEEKCIVAEKE 425

Query: 181 IFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSKPVVEEENEDIL 240
           +FD WV+S P+AKGL +N FP+YD+L  +FG+D+ATG  A T A+VGS       +   +
Sbjct: 426 LFDNWVRSPPAAKGLLNNPFPYYDELTYVFGRDRATGRFAETFADVGSNEPGGGYDRFDM 485

Query: 241 NNQPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSRSSSIGEYSEVVRD 300
            +   DF   Y               P+   +  +GSS S R R S+     E   +  D
Sbjct: 486 GDGNEDFPPVYSRGVDILQDDVRASRPSRASEGKTGSSGSKRKRGSQRDFDVEAIHLALD 545

Query: 301 GFQLLMKSNAVKPLRGL 315
                ++  A  P R L
Sbjct: 546 QTNEQLRQIAEWPARNL 560

BLAST of ClCG01G013835 vs. ExPASy TrEMBL
Match: E5GCB5 (Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 1.4e-51
Identity = 119/303 (39.27%), Positives = 176/303 (58.09%), Query Frame = 0

Query: 1   MLKGRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMGTNSLLDEGKGSQAGPIQKDTE 60
           +LKGRWAI+RGKSYY V+VQC+TI  CCLLHNLI +EM    + D      +       +
Sbjct: 274 VLKGRWAILRGKSYYPVEVQCRTILACCLLHNLINREMTNFDIEDNIDEVDSTHATTAAD 333

Query: 61  NIEFVETSNVFTAWRDDFANQILNGLMHVHR---TTPSCSSLI--------SGHWRADNG 120
           +I ++ETSN ++ WRD+ A +I+     + +   T    + L+        +G WR+DNG
Sbjct: 334 DIHYIETSNEWSQWRDNLAEEIMTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNG 393

Query: 121 TFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAER 180
           TFR G+L  + RMM  +IPG +I  S  ++SR++++KR +  + EM G   SGFGWN E+
Sbjct: 394 TFRPGYLNQLARMMAFKIPGSNIHAS-TIDSRIKLMKRMFHALAEMRGPNCSGFGWNDEK 453

Query: 181 KCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGSK--P 240
           KC   E ++FD W  SHP+AKGL + SF  YD+L+ +FGKD+ATG RA + A++GS   P
Sbjct: 454 KCIVAEKEVFDDW--SHPAAKGLLNKSFVHYDELSYVFGKDRATGGRAESFADIGSNDPP 513

Query: 241 VVEEENEDILNNQPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRSRSSS 291
             +    D + +   DF   Y P    +    +E     + +R + SS S R R   ++ 
Sbjct: 514 GYDAGAADAMPD--TDFPPMYSPGLNMSPDDLMETRTARVSERRNVSSGSKRKRPGHATD 571

BLAST of ClCG01G013835 vs. ExPASy TrEMBL
Match: A0A5D3BY22 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold46G001930 PE=3 SV=1)

HSP 1 Score: 194.5 bits (493), Expect = 6.7e-46
Identity = 93/228 (40.79%), Positives = 137/228 (60.09%), Query Frame = 0

Query: 1   MLKGRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEM---GTNSLLDEGKGSQAGPIQK 60
           +LK RW I+ GKSYY + VQC TI  CCLLHNLI +EM        +DEG    A     
Sbjct: 119 VLKDRWTILNGKSYYPLQVQCHTILACCLLHNLINREMTYCDDVDYVDEGDSRYA--TTT 178

Query: 61  DTENIEFVETSNVFTAWRDDFANQILNGLMHVHRTTPSCSSLISGHWRADNGTFRLGFLA 120
            +E+I+++ET+N ++ WRD+ A  +                    +W+  NGTFR G+L 
Sbjct: 179 ASEDIQYIETTNEWSQWRDELAESMFT------------------NWQLYNGTFRPGYLD 238

Query: 121 NVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGFGWNAERKCNDCEPK 180
            ++ MM +++P C ++ +  ++ R++ LKR +  I EM G  YSGFGWN E KC   E +
Sbjct: 239 QLVHMMAEKLPECQVRATTVIDYRIKTLKRIFQAIAEMRGPAYSGFGWNNEEKCIIAEKE 298

Query: 181 IFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEVGS 226
           +FD WV+SHP+AKGL +  FP+Y++L  +F +D+ TG  A T A++GS
Sbjct: 299 LFDNWVRSHPAAKGLLNKLFPYYNELTYVFSRDRTTGRFAETFADLGS 326

BLAST of ClCG01G013835 vs. ExPASy TrEMBL
Match: A0A5A7TJS2 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold335G00710 PE=4 SV=1)

HSP 1 Score: 191.4 bits (485), Expect = 5.7e-45
Identity = 99/242 (40.91%), Positives = 143/242 (59.09%), Query Frame = 0

Query: 4   GRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMGTNSLLDEGKGSQAGPIQKDTENIE 63
           GRWAI+RGKSYY V+VQC+TI  CCLLHNLI +EM  + ++D+     +       + I 
Sbjct: 84  GRWAILRGKSYYPVNVQCRTIMACCLLHNLINREMTNSEIIDDLDEGDSTYATTGGDEIN 143

Query: 64  FVETSNVFTAWRDDFANQILNGL-MHVHRTTPSCSS-------------------LISGH 123
           ++E SN ++  RD  A  + +   +    T+ S +S                   + +G 
Sbjct: 144 YIEVSNEWSELRDQLAYTMFSDWELRDQMTSSSRASKHTWTKEEEAKLVECLVELVSAGG 203

Query: 124 WRADNGTFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGF 183
           WR++NGTFR G+LA + RMM +++   +IQ S  ++ RV+ LK+       M G   SGF
Sbjct: 204 WRSNNGTFRHGYLAQLQRMMVKKLSDTNIQGSLPMDCRVKSLKKHLPSNFRMRGPSCSGF 263

Query: 184 GWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEV 226
           GWN + +C   E  +FD WVKSHP+A+GL H SFP+YDDL+ +F KD+AT +R  T A+V
Sbjct: 264 GWNEKFQCIIAERDLFDNWVKSHPTAEGLLHKSFPYYDDLSYVFDKDRATEARLETFADV 323

BLAST of ClCG01G013835 vs. ExPASy TrEMBL
Match: A0A5D3D8J6 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold307G00030 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 7.4e-45
Identity = 105/281 (37.37%), Positives = 158/281 (56.23%), Query Frame = 0

Query: 4   GRWAIIRGKSYYLVDVQCKTIHVCCLLHNLIIQEMGTNSLLDEGKGSQAGPIQKDTENIE 63
           GRWAI+RGKSYY V+VQC+TI  CCLLHNLI +EM  + ++D+     +       + I 
Sbjct: 84  GRWAILRGKSYYPVNVQCRTIMACCLLHNLINREMTNSEIIDDLDEGDSTYATTGGDEIN 143

Query: 64  FVETSNVFTAWRDDFANQILNGL-MHVHRTTPSCSS-------------------LISGH 123
           ++E SN ++  RD  A  + +   +    T+ S +S                   + +G 
Sbjct: 144 YIEVSNEWSELRDQLAYTMFSDWELRDQMTSSSRASKHTWTKEEEAKLVECLVELVSAGG 203

Query: 124 WRADNGTFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGF 183
           WR++NGTFRLG+LA + RMM +++   +IQ S  ++ RV+ LK+       + G   SGF
Sbjct: 204 WRSNNGTFRLGYLAQLQRMMVKKLSDTNIQGSLPMDCRVKSLKKHLPSNFRIRGPSCSGF 263

Query: 184 GWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEV 243
           GWN + +C   E  +FD WVKSHP+A+GL H SFP+YDDL+ +F KD+AT +R  T A+V
Sbjct: 264 GWNEKFQCIIAERDLFDNWVKSHPTAEGLLHKSFPYYDDLSYVFDKDRATEARLETFADV 323

Query: 244 GSKPVVEEENEDILNNQPPDFENFYIPDPPFTSSTTLEDLP 265
           GS  +      ++ N+  P  +++    P   S   + D P
Sbjct: 324 GSNVL------NMFNDGVPLGDSYDQDIPAMYSQGAIADWP 358

BLAST of ClCG01G013835 vs. TAIR 10
Match: AT5G27260.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 70.5 bits (171), Expect = 2.8e-12
Identity = 43/149 (28.86%), Positives = 73/149 (48.99%), Query Frame = 0

Query: 100 ISGHWRADNGTFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLG 159
           I+ +WR  NGT           M +     C  +   +  SR++ LK QY   +++    
Sbjct: 33  INNNWRDSNGTIS-KLTVETKFMPEINKEFCRSKNYNHYLSRMKYLKIQYQSCLDLQRFS 92

Query: 160 YSGFGWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATT 219
            SGFGW+   K      +++  ++K+HP+ K LR+++F F+D+L IIFG+  ATG  A  
Sbjct: 93  -SGFGWDPLTKRFTASDEVWSDYLKAHPNNKQLRYDTFEFFDELQIIFGEGVATGKNAIG 152

Query: 220 IAEVGSKPVVEEENEDILNNQPPDFENFY 249
           + +  +  +     E+       DF+N Y
Sbjct: 153 LCD-STDGLTYRAGENPRKEYVDDFDNVY 178

BLAST of ClCG01G013835 vs. TAIR 10
Match: AT1G30140.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 65.1 bits (157), Expect = 1.2e-10
Identity = 39/125 (31.20%), Positives = 69/125 (55.20%), Query Frame = 0

Query: 99  LISGHWRADNGTF-RLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLG 158
           LI  +WR  +G   +L   + +L  + +R+ GC+ +   N  SR++ LK  Y   +++  
Sbjct: 28  LIRQNWRDSSGIIGKLTVESKLLPALNKRL-GCN-KNHKNYMSRLKFLKNLYQSYLDLKR 87

Query: 159 LGYSGFGWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRA 218
              SGFGW+ E K      +++  ++K+HP+ K ++  S   ++DL IIFG   ATGS A
Sbjct: 88  FS-SGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDHFEDLQIIFGDVVATGSFA 147

Query: 219 TTIAE 223
             +++
Sbjct: 148 VGMSD 149

BLAST of ClCG01G013835 vs. TAIR 10
Match: AT2G29880.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27260.1); Has 260 Blast hits to 212 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 10; Plants - 240; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink). )

HSP 1 Score: 56.6 bits (135), Expect = 4.2e-08
Identity = 50/192 (26.04%), Positives = 83/192 (43.23%), Query Frame = 0

Query: 104 WRADNGTFRLGFLANVLRMMQQRIPGCSIQVSPNLESRVRMLKRQYSVIVEMLGLGYSGF 163
           WR  NGT     +   +  +  +   C+ +   N  SR++ +K++YSV   +     SGF
Sbjct: 42  WRDKNGTISKTTVERKILPLLNKKFKCN-KTYTNYLSRMKSMKKEYSVYAALFWFS-SGF 101

Query: 164 GWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSFPFYDDLAIIFGKDKATGSRATTIAEV 223
           GW+   K       ++ A++  HP+   +R ++F  ++DL +IF    A G+ A  +   
Sbjct: 102 GWDPITKQFTAPDDVWAAYLMGHPNHHHMRTSTFEDFEDLQLIFESAIAKGNNAFGLGGD 161

Query: 224 GSKPVVEEENEDILNNQPPDFENFYIPDPPFTSSTTLEDLPTTLGDRGSGSSMSTRSRRS 283
            +    EEE++    +     E   I D     +   E LPT        S  +    RS
Sbjct: 162 SNAETFEEEDDLQAGDNVNHME---INDDEVNETLPKEKLPTR-----KRSKTNRNGDRS 221

Query: 284 RSSSIGEYSEVV 296
            S + GE SE V
Sbjct: 222 DSINHGESSEKV 223

BLAST of ClCG01G013835 vs. TAIR 10
Match: AT2G24960.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes - 50 (source: NCBI BLink). )

HSP 1 Score: 47.8 bits (112), Expect = 2.0e-05
Identity = 27/101 (26.73%), Positives = 49/101 (48.51%), Query Frame = 0

Query: 143 RMLKRQYSVIVEMLG-----LGYSGFGWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSF 202
           R+L+ +Y+ +++        L   GF W+  R     +  ++D+++K HP A+  R  S 
Sbjct: 221 RVLRHRYNKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSL 280

Query: 203 PFYDDLAIIFG--KDKATGSRAT-TIAEVGSKPVVEEENED 236
           P Y+DL  IF    ++ T  R   + A+       +E+N D
Sbjct: 281 PSYNDLDTIFACQAEQGTDHRDDGSAAQTSETKASQEQNSD 321

BLAST of ClCG01G013835 vs. TAIR 10
Match: AT2G24960.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 47.8 bits (112), Expect = 2.0e-05
Identity = 27/101 (26.73%), Positives = 49/101 (48.51%), Query Frame = 0

Query: 143 RMLKRQYSVIVEMLG-----LGYSGFGWNAERKCNDCEPKIFDAWVKSHPSAKGLRHNSF 202
           R+L+ +Y+ +++        L   GF W+  R     +  ++D+++K HP A+  R  S 
Sbjct: 221 RVLRHRYNKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSL 280

Query: 203 PFYDDLAIIFG--KDKATGSRAT-TIAEVGSKPVVEEENED 236
           P Y+DL  IF    ++ T  R   + A+       +E+N D
Sbjct: 281 PSYNDLDTIFACQAEQGTDHRDDGSAAQTSETKASQEQNSD 321

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
ADN33754.1      7.4e-55  39.43     retrotransposon protein [Cucumis melo subsp. melo]
ADN34114.1      2.9e-51  39.27     retrotransposon protein [Cucumis melo subsp. melo]
XP_039026053.1  3.3e-47  44.14     uncharacterized protein LOC120159549 [Hibiscus syriacus]
TYK03086.1      1.4e-45  40.79     retrotransposon protein [Cucumis melo var. makuwa]
KAA0043564.1    1.2e-44  40.91     retrotransposon protein [Cucumis melo var. makuwa]

Match Name      E-value  Identity  Description
O82368          5.9e-07  26.04     Uncharacterized protein At2g29880 OS=Arabidopsis thaliana OX=3702 GN=At2g29880 P...

Match Name      E-value  Identity  Description
E5GBB2          3.6e-55  39.43     Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1
E5GCB5          1.4e-51  39.27     Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1
A0A5D3BY22      6.7e-46  40.79     Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7TJS2      5.7e-45  40.91     Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3D8J6      7.4e-45  37.37     Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

Match Name      E-value  Identity  Description
AT5G27260.1     2.8e-12  28.86     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G30140.1     1.2e-10  31.20     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G29880.1     4.2e-08  26.04     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G24960.1     2.0e-05  26.73     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G24960.2     2.0e-05  26.73     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
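
The Identity column in these summaries is the matched-residue count divided by the alignment length, as reported on each HSP header line. The sketch below recomputes both percentages from such a line. The regular expression and the function name `hsp_percentages` are illustrative assumptions, not part of BLAST or of this database. The pattern also accepts the misspelling "Postives", which occurs in some exports of these reports.

```python
import re

# Matches the statistics line printed under each HSP header.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

line = "Identity = 125/317 (39.43%), Positives = 179/317 (56.47%), Query Frame = 0"
print(hsp_percentages(line))  # (39.43, 56.47)
```

For the top hit, 125 identical residues over a 317-column alignment gives 39.43%, the value listed in the table.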
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term  Source Description                                Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                               coord: 255..289
None      No IPR available  PANTHER      PTHR46250    MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATED  coord: 100..287

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G013835.1  ClCG01G013835.1  mRNA