Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CATTAACCCAAAAATTAGAGGGTGGTTAAGCTATAATTCCGGATTACTGAAATTCCCACAACCAAATCCCCAATTTAAAACCCTAAAATTCGTCTCCATCAATCCCCAATTCTCCCCGTAAAATTCCCCAAATCCAAAACCCTCTCTCCGTCGTCTTCCTCCTCTGTAACCATGAGTCTGAATTGCCTCTCATGTCAACTCTTACAGAGATCGGACTCCGACAGAGACCGCGACCACCAAGATTATTTCTCTGATCCCTCTCACTCGCCGGAGAGAAGCTGGTCCGGCAACCTCTCGTTCCGGCCTCCCACTCGCCAAAACAGAGGAGGGTTTCGGGCCATGGCGGAGAAGAAGGTGGCGCCGATGGGCCACCGCCGTCTTCACAGTACCGGCGCCGTCGCTTTCGGCGGCCCCGGTAAGGAGCCCAGGCTGATTAGAAGCTCGGGGATGAGGAGGGATTGGAGCTTTGAGGATCTCAGAGCCATTCGAGAGGAAAAGGGGCCATCTGCCAATTCCTAACAACTTACTAACTTCTTTTTTTTTTCCTTGTTCTTTTCTATTTTTTTATTATATATATATATTTGAAATTTGTTTGGACATGAATATATAAAAAAGTAGAAGGAATGGGGAGA
mRNA sequence
CATTAACCCAAAAATTAGAGGGTGGTTAAGCTATAATTCCGGATTACTGAAATTCCCACAACCAAATCCCCAATTTAAAACCCTAAAATTCGTCTCCATCAATCCCCAATTCTCCCCGTAAAATTCCCCAAATCCAAAACCCTCTCTCCGTCGTCTTCCTCCTCTGTAACCATGAGTCTGAATTGCCTCTCATGTCAACTCTTACAGAGATCGGACTCCGACAGAGACCGCGACCACCAAGATTATTTCTCTGATCCCTCTCACTCGCCGGAGAGAAGCTGGTCCGGCAACCTCTCGTTCCGGCCTCCCACTCGCCAAAACAGAGGAGGGTTTCGGGCCATGGCGGAGAAGAAGGTGGCGCCGATGGGCCACCGCCGTCTTCACAGTACCGGCGCCGTCGCTTTCGGCGGCCCCGGTAAGGAGCCCAGGCTGATTAGAAGCTCGGGGATGAGGAGGGATTGGAGCTTTGAGGATCTCAGAGCCATTCGAGAGGAAAAGGGGCCATCTGCCAATTCCTAACAACTTACTAACTTCTTTTTTTTTTCCTTGTTCTTTTCTATTTTTTTATTATATATATATATTTGAAATTTGTTTGGACATGAATATATAAAAAAGTAGAAGGAATGGGGAGA
Coding sequence (CDS)
ATGAGTCTGAATTGCCTCTCATGTCAACTCTTACAGAGATCGGACTCCGACAGAGACCGCGACCACCAAGATTATTTCTCTGATCCCTCTCACTCGCCGGAGAGAAGCTGGTCCGGCAACCTCTCGTTCCGGCCTCCCACTCGCCAAAACAGAGGAGGGTTTCGGGCCATGGCGGAGAAGAAGGTGGCGCCGATGGGCCACCGCCGTCTTCACAGTACCGGCGCCGTCGCTTTCGGCGGCCCCGGTAAGGAGCCCAGGCTGATTAGAAGCTCGGGGATGAGGAGGGATTGGAGCTTTGAGGATCTCAGAGCCATTCGAGAGGAAAAGGGGCCATCTGCCAATTCCTAA
Protein sequence
MSLNCLSCQLLQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKGPSANS
Homology
BLAST of Tan0015631 vs. NCBI nr
Match:
XP_022952265.1 (uncharacterized protein LOC111454968 [Cucurbita moschata] >KAG6596735.1 hypothetical protein SDJN03_09915, partial [Cucurbita argyrosperma subsp. sororia] >KAG7028271.1 hypothetical protein SDJN02_09452, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 188.0 bits (476), Expect = 4.7e-44
Identity = 93/116 (80.17%), Postives = 101/116 (87.07%), Query Frame = 0
Query: 1 MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAE 60
MSLNCLSCQL LQRSDSD+D + +YF+D + SP+RSWSGNLSFRPPTRQN GFR E
Sbjct: 1 MSLNCLSCQLFLQRSDSDKDLEQHEYFTDQAKSPDRSWSGNLSFRPPTRQNSDGFRVRTE 60
Query: 61 KKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKGPSANS 116
KVAPMGHRRLHSTGAVAFGGPGKEPRLIRS+GMRRDWSFEDLRAIRE+K S NS
Sbjct: 61 NKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSAGMRRDWSFEDLRAIREDKESSPNS 116
BLAST of Tan0015631 vs. NCBI nr
Match:
XP_023539506.1 (uncharacterized protein LOC111800149 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 186.8 bits (473), Expect = 1.0e-43
Identity = 93/116 (80.17%), Postives = 100/116 (86.21%), Query Frame = 0
Query: 1 MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAE 60
MSLNCLSCQL LQRSDSD+D + +YF+D + SP+RSWSGNLSFRPPTRQN GFR E
Sbjct: 1 MSLNCLSCQLFLQRSDSDKDLEQHEYFTDQAKSPDRSWSGNLSFRPPTRQNSDGFRVRTE 60
Query: 61 KKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKGPSANS 116
KVAP GHRRLHSTGAVAFGGPGKEPRLIRS+GMRRDWSFEDLRAIREEK S NS
Sbjct: 61 NKVAPTGHRRLHSTGAVAFGGPGKEPRLIRSAGMRRDWSFEDLRAIREEKESSPNS 116
BLAST of Tan0015631 vs. NCBI nr
Match:
XP_038905810.1 (uncharacterized protein LOC120091761 [Benincasa hispida])
HSP 1 Score: 148.3 bits (373), Expect = 4.1e-32
Identity = 81/116 (69.83%), Postives = 89/116 (76.72%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAM 60
MSLNCLSCQ+LQR+DS+R RD Q Y SD S ERSWSGNLSFRP R NRGGFR +
Sbjct: 1 MSLNCLSCQILQRTDSERHRDQQIQTYYASDEFDSSERSWSGNLSFRPHDRHNRGGFRGL 60
Query: 61 AEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKGPSA 114
E KVAP+ HRR AV+FG GKEPRL+RSSGMRRDWSFEDLR IREE+ PSA
Sbjct: 61 PENKVAPISHRR-----AVSFG--GKEPRLVRSSGMRRDWSFEDLRTIREEREPSA 109
BLAST of Tan0015631 vs. NCBI nr
Match:
XP_031737872.1 (uncharacterized protein LOC116402546 [Cucumis sativus])
HSP 1 Score: 139.0 bits (349), Expect = 2.5e-29
Identity = 84/130 (64.62%), Postives = 91/130 (70.00%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRG----- 60
MSLNCLSCQ+LQR+DS+R RD Q Y SD +S ERSWSGNL RP NRG
Sbjct: 1 MSLNCLSCQILQRTDSERHRDRQVQTYYTSDEFNSSERSWSGNLCLRP----NRGGGGGG 60
Query: 61 -------GFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAI 116
GFR MA+ KVAP+GHRR AV+FG GKEPRLIRSSGMRRDWSFEDLRAI
Sbjct: 61 GGGGGGRGFRGMADNKVAPIGHRR-----AVSFG--GKEPRLIRSSGMRRDWSFEDLRAI 119
BLAST of Tan0015631 vs. NCBI nr
Match:
XP_008448684.1 (PREDICTED: uncharacterized protein LOC103490783 [Cucumis melo] >KAA0053043.1 uncharacterized protein E6C27_scaffold344G001590 [Cucumis melo var. makuwa] >TYK11498.1 uncharacterized protein E5676_scaffold139G001620 [Cucumis melo var. makuwa])
HSP 1 Score: 136.0 bits (341), Expect = 2.1e-28
Identity = 81/119 (68.07%), Postives = 88/119 (73.95%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRA 60
MSLNCLSCQLLQR+DS+R RD Q Y SD +RSWSGNLS RQNRGG FR
Sbjct: 1 MSLNCLSCQLLQRTDSERHRDRQIQTYYTSDEFDPSQRSWSGNLSL----RQNRGGVFRG 60
Query: 61 MAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKGPSANS 116
MA+ KVAP+ HRR AV+FG GKEPRL+RSSGMRRDWSFEDLR IREEK PS NS
Sbjct: 61 MADNKVAPVCHRR-----AVSFG--GKEPRLVRSSGMRRDWSFEDLRTIREEKEPSPNS 108
BLAST of Tan0015631 vs. ExPASy TrEMBL
Match:
A0A6J1GL99 (uncharacterized protein LOC111454968 OS=Cucurbita moschata OX=3662 GN=LOC111454968 PE=4 SV=1)
HSP 1 Score: 188.0 bits (476), Expect = 2.3e-44
Identity = 93/116 (80.17%), Postives = 101/116 (87.07%), Query Frame = 0
Query: 1 MSLNCLSCQL-LQRSDSDRDRDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMAE 60
MSLNCLSCQL LQRSDSD+D + +YF+D + SP+RSWSGNLSFRPPTRQN GFR E
Sbjct: 1 MSLNCLSCQLFLQRSDSDKDLEQHEYFTDQAKSPDRSWSGNLSFRPPTRQNSDGFRVRTE 60
Query: 61 KKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKGPSANS 116
KVAPMGHRRLHSTGAVAFGGPGKEPRLIRS+GMRRDWSFEDLRAIRE+K S NS
Sbjct: 61 NKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSAGMRRDWSFEDLRAIREDKESSPNS 116
BLAST of Tan0015631 vs. ExPASy TrEMBL
Match:
A0A0A0L198 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G011635 PE=4 SV=1)
HSP 1 Score: 140.6 bits (353), Expect = 4.1e-30
Identity = 84/126 (66.67%), Postives = 91/126 (72.22%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRG----- 60
MSLNCLSCQ+LQR+DS+R RD Q Y SD +S ERSWSGNL RP NRG
Sbjct: 1 MSLNCLSCQILQRTDSERHRDRQVQTYYTSDEFNSSERSWSGNLCLRP----NRGGGGGG 60
Query: 61 ---GFRAMAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEK 116
GFR MA+ KVAP+GHRR AV+FG GKEPRLIRSSGMRRDWSFEDLRAIREEK
Sbjct: 61 GGRGFRGMADNKVAPIGHRR-----AVSFG--GKEPRLIRSSGMRRDWSFEDLRAIREEK 115
BLAST of Tan0015631 vs. ExPASy TrEMBL
Match:
A0A5A7UAS3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001620 PE=4 SV=1)
HSP 1 Score: 136.0 bits (341), Expect = 1.0e-28
Identity = 81/119 (68.07%), Postives = 88/119 (73.95%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRA 60
MSLNCLSCQLLQR+DS+R RD Q Y SD +RSWSGNLS RQNRGG FR
Sbjct: 1 MSLNCLSCQLLQRTDSERHRDRQIQTYYTSDEFDPSQRSWSGNLSL----RQNRGGVFRG 60
Query: 61 MAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKGPSANS 116
MA+ KVAP+ HRR AV+FG GKEPRL+RSSGMRRDWSFEDLR IREEK PS NS
Sbjct: 61 MADNKVAPVCHRR-----AVSFG--GKEPRLVRSSGMRRDWSFEDLRTIREEKEPSPNS 108
BLAST of Tan0015631 vs. ExPASy TrEMBL
Match:
A0A1S3BJN6 (uncharacterized protein LOC103490783 OS=Cucumis melo OX=3656 GN=LOC103490783 PE=4 SV=1)
HSP 1 Score: 136.0 bits (341), Expect = 1.0e-28
Identity = 81/119 (68.07%), Postives = 88/119 (73.95%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRDRDHQ---DYFSDPSHSPERSWSGNLSFRPPTRQNRGG-FRA 60
MSLNCLSCQLLQR+DS+R RD Q Y SD +RSWSGNLS RQNRGG FR
Sbjct: 1 MSLNCLSCQLLQRTDSERHRDRQIQTYYTSDEFDPSQRSWSGNLSL----RQNRGGVFRG 60
Query: 61 MAEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEKGPSANS 116
MA+ KVAP+ HRR AV+FG GKEPRL+RSSGMRRDWSFEDLR IREEK PS NS
Sbjct: 61 MADNKVAPVCHRR-----AVSFG--GKEPRLVRSSGMRRDWSFEDLRTIREEKEPSPNS 108
BLAST of Tan0015631 vs. ExPASy TrEMBL
Match:
A0A6J1L834 (uncharacterized protein LOC111500186 OS=Cucurbita maxima OX=3661 GN=LOC111500186 PE=4 SV=1)
HSP 1 Score: 132.9 bits (333), Expect = 8.6e-28
Identity = 69/111 (62.16%), Postives = 83/111 (74.77%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRDRDH--QDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAMA 60
M+LNCLSCQLLQR+DS+RD D Q+Y+S RSWSGNLSFRPP R + RA+
Sbjct: 1 MTLNCLSCQLLQRTDSERDPDPQLQNYYSGQIEPSGRSWSGNLSFRPPDRPKKEALRALP 60
Query: 61 EKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREEK 110
E + P+ RRLHS+G ++ G KEP+L+RSSGMRRDWSFEDLRAIREEK
Sbjct: 61 EDQAPPVAPRRLHSSGPISLG--SKEPKLVRSSGMRRDWSFEDLRAIREEK 109
BLAST of Tan0015631 vs. TAIR 10
Match:
AT2G35215.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G46770.1); Has 19 Blast hits to 19 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 19; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 67.8 bits (164), Expect = 6.6e-12
Identity = 44/111 (39.64%), Postives = 61/111 (54.95%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRD---RDHQDYFSDPSHSPERSWSGNLSFRPPTRQNRGGFRAM 60
MSLNCL+C +LQR+DSDRD R + + + S N S P R+
Sbjct: 1 MSLNCLACHILQRTDSDRDMGSRKDSSFKENFATSAFEKMVRNRSSLPVVRR-------- 60
Query: 61 AEKKVAPMGHRRLHSTGAVAFGGPGKEPRLIRSSGMRRDWSFEDLRAIREE 109
GHRRL+S + + G EP+L+RSSG+RRDWSFEDL+ +++
Sbjct: 61 -----VNKGHRRLYSADIMVY-GELDEPKLVRSSGIRRDWSFEDLKKHKDQ 97
BLAST of Tan0015631 vs. TAIR 10
Match:
AT5G46770.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 60.1 bits (144), Expect = 1.4e-09
Identity = 52/132 (39.39%), Postives = 69/132 (52.27%), Query Frame = 0
Query: 1 MSLNCLSCQLLQRSDSDRDRDHQDYFSDP-----------------SHSPERSWSGNLSF 60
MSLNCLSCQ L R+DS++D D S P + R+WSGNLS
Sbjct: 1 MSLNCLSCQALPRTDSNKDVD----LSGPGPPRVEINNVLGKTCCVNPIGGRNWSGNLSP 60
Query: 61 RPPTRQNR-GGFRAMAEKKVAPMGHRRLHS-TGAVAFGGPGK--EPRLIRSSGMRRDWSF 110
R + R G A KKV + H RL G+ P + +P+L+RS+G+RR+WSF
Sbjct: 61 RIYEKIGRPGSSLAHKMKKVKKIHHVRLSGPVGSSPSNVPTRPEQPKLVRSTGVRRNWSF 120
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022952265.1 | 4.7e-44 | 80.17 | uncharacterized protein LOC111454968 [Cucurbita moschata] >KAG6596735.1 hypothet... | [more] |
XP_023539506.1 | 1.0e-43 | 80.17 | uncharacterized protein LOC111800149 [Cucurbita pepo subsp. pepo] | [more] |
XP_038905810.1 | 4.1e-32 | 69.83 | uncharacterized protein LOC120091761 [Benincasa hispida] | [more] |
XP_031737872.1 | 2.5e-29 | 64.62 | uncharacterized protein LOC116402546 [Cucumis sativus] | [more] |
XP_008448684.1 | 2.1e-28 | 68.07 | PREDICTED: uncharacterized protein LOC103490783 [Cucumis melo] >KAA0053043.1 unc... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1GL99 | 2.3e-44 | 80.17 | uncharacterized protein LOC111454968 OS=Cucurbita moschata OX=3662 GN=LOC1114549... | [more] |
A0A0A0L198 | 4.1e-30 | 66.67 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G011635 PE=4 SV=1 | [more] |
A0A5A7UAS3 | 1.0e-28 | 68.07 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3BJN6 | 1.0e-28 | 68.07 | uncharacterized protein LOC103490783 OS=Cucumis melo OX=3656 GN=LOC103490783 PE=... | [more] |
A0A6J1L834 | 8.6e-28 | 62.16 | uncharacterized protein LOC111500186 OS=Cucurbita maxima OX=3661 GN=LOC111500186... | [more] |
Match Name | E-value | Identity | Description | |
AT2G35215.1 | 6.6e-12 | 39.64 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT5G46770.1 | 1.4e-09 | 39.39 | unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -... | [more] |