Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, CDS, polypeptide, three_prime_UTR
GTGAGCGAATCGCTGGGGATAAATTGAGTGGGAGCAAAAAAACCTCTACAACACACACCTTCCCAATTCGCTTCACACAAGATCACCATTTTTCTCTCTTCTTCTTCCCTCCGCCCCATGGCCAGACCTTTCCCAACCATTTCAAATCTCTCTCATCCTCTCCATCTTCTATTTTTCTCAACTCATTTCTCATTTCTCATCCTTTCAACCTCAGTTATCCTCTCCATCTTCGCCCTCCTCATTTTCCTCTGCACATCTTCAAGAAAATCCAATAAATCGCAGCAGGGGAGGAATAATTTTGTTTCCAAAATGAACAGTAACATCAGTTCTAGAGCAATTTCAATGGCCAAGATGATTTCGTGGAGGAAAGTGGAAGCAGCCGAGGAAGAGGAAGAAGAAGAAGAAGAAAGAGGATCAGGAGGTTGTGATTTTATTGATAAAGATGAAGAAGAAGAGGTTTGGAGGAAAACGATTATTAGAGGTGAACGATGTCGTCCGTTAGAATTTTCTGGTAAAATTGATTATGATTCTGATGGAAATCTGTTGTGTGATTCAAATAGGGATTTCAAATAATCTGATTCTTCTTTTTCTCCTGCGCTCTTTCTAATTCTTCAAATAATCTGACTTTAAATAACCCTGAAAACGTACAAACATCAATAGAAATGTTTACATCAAAATTTAGTACTAT
mRNA sequence
GTGAGCGAATCGCTGGGGATAAATTGAGTGGGAGCAAAAAAACCTCTACAACACACACCTTCCCAATTCGCTTCACACAAGATCACCATTTTTCTCTCTTCTTCTTCCCTCCGCCCCATGGCCAGACCTTTCCCAACCATTTCAAATCTCTCTCATCCTCTCCATCTTCTATTTTTCTCAACTCATTTCTCATTTCTCATCCTTTCAACCTCAGTTATCCTCTCCATCTTCGCCCTCCTCATTTTCCTCTGCACATCTTCAAGAAAATCCAATAAATCGCAGCAGGGGAGGAATAATTTTGTTTCCAAAATGAACAGTAACATCAGTTCTAGAGCAATTTCAATGGCCAAGATGATTTCGTGGAGGAAAGTGGAAGCAGCCGAGGAAGAGGAAGAAGAAGAAGAAGAAAGAGGATCAGGAGGTTGTGATTTTATTGATAAAGATGAAGAAGAAGAGGTTTGGAGGAAAACGATTATTAGAGGTGAACGATGTCGTCCGTTAGAATTTTCTGGTAAAATTGATTATGATTCTGATGGAAATCTGTTGTGTGATTCAAATAGGGATTTCAAATAATCTGATTCTTCTTTTTCTCCTGCGCTCTTTCTAATTCTTCAAATAATCTGACTTTAAATAACCCTGAAAACGTACAAACATCAATAGAAATGTTTACATCAAAATTTAGTACTAT
Coding sequence (CDS)
ATGGCCAGACCTTTCCCAACCATTTCAAATCTCTCTCATCCTCTCCATCTTCTATTTTTCTCAACTCATTTCTCATTTCTCATCCTTTCAACCTCAGTTATCCTCTCCATCTTCGCCCTCCTCATTTTCCTCTGCACATCTTCAAGAAAATCCAATAAATCGCAGCAGGGGAGGAATAATTTTGTTTCCAAAATGAACAGTAACATCAGTTCTAGAGCAATTTCAATGGCCAAGATGATTTCGTGGAGGAAAGTGGAAGCAGCCGAGGAAGAGGAAGAAGAAGAAGAAGAAAGAGGATCAGGAGGTTGTGATTTTATTGATAAAGATGAAGAAGAAGAGGTTTGGAGGAAAACGATTATTAGAGGTGAACGATGTCGTCCGTTAGAATTTTCTGGTAAAATTGATTATGATTCTGATGGAAATCTGTTGTGTGATTCAAATAGGGATTTCAAATAA
Protein sequence
MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNNFVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTIIRGERCRPLEFSGKIDYDSDGNLLCDSNRDFK*
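The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies; the function name `translate` is illustrative and not part of the source database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear table applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven and last six codons of the CDS listed above.
print(translate("ATGGCCAGACCTTTCCCAACC"))  # MARPFPT
print(translate("AATAGGGATTTCAAATAA"))     # NRDFK*
```

Passing the full CDS string to `translate` should reproduce the listed protein sequence, including the trailing `*` for the TAA stop codon.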
Homology
BLAST of CSPI03G21890 vs. ExPASy TrEMBL
Match:
A0A0A0L7R8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G348940 PE=4 SV=1)
HSP 1 Score: 289.7 bits (740), Expect = 7.3e-75
Identity = 151/151 (100.00%), Positives = 151/151 (100.00%), Query Frame = 0
Query: 1 MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNN 60
MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNN
Sbjct: 1 MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNN 60
Query: 61 FVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTII 120
FVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTII
Sbjct: 61 FVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTII 120
Query: 121 RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 152
RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK
Sbjct: 121 RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 151
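The Identity and Positives fields in each HSP header are match counts over the alignment length, with the percentage shown to two decimals. A minimal sketch of that arithmetic follows; the helper name `percent` is hypothetical, and the input counts are taken from HSP headers in this section.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


print(percent(151, 151))  # 100.00
print(percent(130, 151))  # 86.09
print(percent(74, 87))    # 85.06
```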
BLAST of CSPI03G21890 vs. ExPASy TrEMBL
Match:
A0A5D3DFS8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold712G00010 PE=4 SV=1)
HSP 1 Score: 236.9 bits (603), Expect = 5.6e-59
Identity = 130/151 (86.09%), Positives = 136/151 (90.07%), Query Frame = 0
Query: 1 MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNN 60
MARPFPTISNLSH HLLFFSTHFSF ILSTS ILSIFALLIFLCTSS KSNKSQQG+
Sbjct: 1 MARPFPTISNLSHHPHLLFFSTHFSFPILSTSTILSIFALLIFLCTSSTKSNKSQQGKTT 60
Query: 61 FVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTII 120
FVSKMNSNISSRAISMAK+ISWRKVEAA +E EEERGSG CD + ++EEEVWRKTII
Sbjct: 61 FVSKMNSNISSRAISMAKIISWRKVEAA---DELEEERGSGSCD--ELEDEEEVWRKTII 120
Query: 121 RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 152
RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK
Sbjct: 121 RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 146
BLAST of CSPI03G21890 vs. ExPASy TrEMBL
Match:
A0A5A7V4T5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold72G00580 PE=4 SV=1)
HSP 1 Score: 140.2 bits (352), Expect = 7.2e-30
Identity = 74/87 (85.06%), Positives = 79/87 (90.80%), Query Frame = 0
Query: 65 MNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTIIRGER 124
MNSNISSRAISMAK+ISWRKVEAA +E EEERGSG CD + ++EEEVWRKTIIRGER
Sbjct: 1 MNSNISSRAISMAKIISWRKVEAA---DELEEERGSGSCD--ELEDEEEVWRKTIIRGER 60
Query: 125 CRPLEFSGKIDYDSDGNLLCDSNRDFK 152
CRPLEFSGKIDYDSDGNLLCDSNRDFK
Sbjct: 61 CRPLEFSGKIDYDSDGNLLCDSNRDFK 82
BLAST of CSPI03G21890 vs. ExPASy TrEMBL
Match:
A0A5E4G755 (PREDICTED: LOC100277003 OS=Prunus dulcis OX=3755 GN=ALMOND_2B003044 PE=4 SV=1)
HSP 1 Score: 115.5 bits (288), Expect = 1.9e-22
Identity = 75/158 (47.47%), Positives = 103/158 (65.19%), Query Frame = 0
Query: 1 MARPF-PTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRN 60
MARP P+ S SH HL +HF F ++ V S+F+L+IFLC +SRKS KS + +
Sbjct: 1 MARPLAPSFSMASH--HLFQHQSHFLFALI---VFFSMFSLVIFLC-ASRKSKKSHKKKE 60
Query: 61 -----------NFVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDK 120
F++K+NS ISS+A++MAKM+SWRK+EA EE+++++++ D
Sbjct: 61 EAITNSESKDAKFIAKLNSKISSKALAMAKMVSWRKMEAGEEDQKDDDD---------DD 120
Query: 121 DEEEEVWRKTIIRGERCRPLEFSGKIDYDSDGNLLCDS 147
+E VWRK+II GERC PL FSGKIDYDSDGNL +S
Sbjct: 121 HSDEAVWRKSIIMGERCAPLNFSGKIDYDSDGNLQPES 143
BLAST of CSPI03G21890 vs. ExPASy TrEMBL
Match:
A0A6J5WQZ0 (Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=ORAREDHAP_LOCUS21513 PE=4 SV=1)
HSP 1 Score: 114.0 bits (284), Expect = 5.5e-22
Identity = 79/177 (44.63%), Positives = 107/177 (60.45%), Query Frame = 0
Query: 1 MARPF-PTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTS--SRKSNK---- 60
MARP P+ S SH HL +HF F + VI S+F+LLIFLC S S+KSN+
Sbjct: 1 MARPLAPSFSMASH--HLFQHPSHFLF---APIVIFSMFSLLIFLCASHKSKKSNEKKEE 60
Query: 61 ----SQQGRNNFVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERG---------- 120
S+ F++K+NS ISS+A++MAKM+SWRK+EA EE+++++++
Sbjct: 61 AITNSESKDAKFIAKLNSKISSKALAMAKMVSWRKMEAGEEDQKDDDDDDHSDEAVWRKS 120
Query: 121 ----------SGGCDFIDKDEEEEVWRKTIIRGERCRPLEFSGKIDYDSDGNLLCDS 147
+ D D D +E VWRK+II GERC PL FSGKIDYDS+GNLL +S
Sbjct: 121 IIMGERCTPLNDDDDDDDDDRDEAVWRKSIIMGERCAPLNFSGKIDYDSEGNLLPES 172
BLAST of CSPI03G21890 vs. NCBI nr
Match:
KGN57848.1 (hypothetical protein Csa_010872 [Cucumis sativus])
HSP 1 Score: 289.7 bits (740), Expect = 1.5e-74
Identity = 151/151 (100.00%), Positives = 151/151 (100.00%), Query Frame = 0
Query: 1 MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNN 60
MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNN
Sbjct: 1 MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNN 60
Query: 61 FVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTII 120
FVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTII
Sbjct: 61 FVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTII 120
Query: 121 RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 152
RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK
Sbjct: 121 RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 151
BLAST of CSPI03G21890 vs. NCBI nr
Match:
TYK22452.1 (hypothetical protein E5676_scaffold712G00010 [Cucumis melo var. makuwa])
HSP 1 Score: 236.9 bits (603), Expect = 1.2e-58
Identity = 130/151 (86.09%), Positives = 136/151 (90.07%), Query Frame = 0
Query: 1 MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQGRNN 60
MARPFPTISNLSH HLLFFSTHFSF ILSTS ILSIFALLIFLCTSS KSNKSQQG+
Sbjct: 1 MARPFPTISNLSHHPHLLFFSTHFSFPILSTSTILSIFALLIFLCTSSTKSNKSQQGKTT 60
Query: 61 FVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTII 120
FVSKMNSNISSRAISMAK+ISWRKVEAA +E EEERGSG CD + ++EEEVWRKTII
Sbjct: 61 FVSKMNSNISSRAISMAKIISWRKVEAA---DELEEERGSGSCD--ELEDEEEVWRKTII 120
Query: 121 RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 152
RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK
Sbjct: 121 RGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 146
BLAST of CSPI03G21890 vs. NCBI nr
Match:
KAG7036703.1 (hypothetical protein SDJN02_00323, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 169.1 bits (427), Expect = 3.0e-38
Identity = 99/152 (65.13%), Positives = 116/152 (76.32%), Query Frame = 0
Query: 1 MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKS-QQGRN 60
MA+PFP+ SN H HL F S ++++ +LSIFAL+IFLCTSSRKS K +
Sbjct: 1 MAKPFPSFSN--HSYHLPFSSPS----LVASIAVLSIFALVIFLCTSSRKSKKPILLQQR 60
Query: 61 NFVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTI 120
NFV+K+NSNISSRAIS+AKMISWRKVEAA +E+E G GG D D ++EVWRKTI
Sbjct: 61 NFVAKVNSNISSRAISIAKMISWRKVEAA---DEDEGGGGGGGFDLSGDDYDDEVWRKTI 120
Query: 121 IRGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 152
IRGERCRPLEFSGKIDYDSDGNLLCDS R+FK
Sbjct: 121 IRGERCRPLEFSGKIDYDSDGNLLCDSKREFK 143
BLAST of CSPI03G21890 vs. NCBI nr
Match:
KAG6607004.1 (hypothetical protein SDJN03_00346, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 167.2 bits (422), Expect = 1.1e-37
Identity = 99/152 (65.13%), Positives = 116/152 (76.32%), Query Frame = 0
Query: 1 MARPFPTISNLSHPLHLLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKS-QQGRN 60
MA+PFP+ SN H HL F S ++++ +LSIFAL+IFLCTSSRKS K +
Sbjct: 1 MAKPFPSFSN--HSYHLPFSSPS----LVASIAVLSIFALVIFLCTSSRKSKKPILLQQR 60
Query: 61 NFVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDEEEEVWRKTI 120
NFV+K+NSNISSRAIS+AKMISWRKVEAA +E+E G GG D D ++EVWRKTI
Sbjct: 61 NFVAKVNSNISSRAISIAKMISWRKVEAA-DEDEGGGGGGGGGFDLSGDDYDDEVWRKTI 120
Query: 121 IRGERCRPLEFSGKIDYDSDGNLLCDSNRDFK 152
IRGERCRPLEFSGKIDYDSDGNLLCDS R+FK
Sbjct: 121 IRGERCRPLEFSGKIDYDSDGNLLCDSKREFK 145
BLAST of CSPI03G21890 vs. NCBI nr
Match:
XP_038906595.1 (uncharacterized protein LOC120092546 [Benincasa hispida])
HSP 1 Score: 160.6 bits (405), Expect = 1.1e-35
Identity = 102/138 (73.91%), Positives = 113/138 (81.88%), Query Frame = 0
Query: 1 MARPFPTISNLSHPLH---LLFFSTHFSFLILSTSVILSIFALLIFLCTSSRKSNKSQQG 60
MAR FP+ISN SH H L F ST+FSFLI S+ +LSIFAL++FLCTSSRKSNKSQQ
Sbjct: 1 MARLFPSISNPSHHHHHHLLPFSSTNFSFLI-SSIAVLSIFALVVFLCTSSRKSNKSQQ- 60
Query: 61 RNNFVSKMNSNISSRAISMAKMISWRKVEAAEEEEEEEEERGSGGCDFIDKDE-EEEVWR 120
R NFVSKMNSNISSRAISMAKMISWRKVEAA +EE+EEE RGS D++E EEEVWR
Sbjct: 61 RRNFVSKMNSNISSRAISMAKMISWRKVEAA-DEEDEEERRGSCNLSGDDEEEDEEEVWR 120
Query: 121 KTIIRGERCRPLEFSGKI 135
KTIIRGERCRPLEFS +
Sbjct: 121 KTIIRGERCRPLEFSDSV 135
BLAST of CSPI03G21890 vs. TAIR 10
Match:
AT1G49000.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: stem; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G18560.1); Has 105 Blast hits to 105 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 105; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 44.7 bits (104), Expect = 7.9e-05
Identity = 18/35 (51.43%), Positives = 26/35 (74.29%), Query Frame = 0
Query: 109 DEEEEVWRKTIIRGERCRPLEFSGKIDYDSDGNLL 144
+EE +W++ I+ G +C PL+FSG I YDS+G LL
Sbjct: 102 EEEHGLWQREILMGGKCEPLDFSGVIYYDSNGRLL 136
The following BLAST results are available for this feature:
BLAST of CSPI03G21890 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0L7R8 | 7.3e-75 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G348940 PE=4 SV=1
A0A5D3DFS8 | 5.6e-59 | 86.09 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7V4T5 | 7.2e-30 | 85.06 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5E4G755 | 1.9e-22 | 47.47 | PREDICTED: LOC100277003 OS=Prunus dulcis OX=3755 GN=ALMOND_2B003044 PE=4 SV=1
A0A6J5WQZ0 | 5.5e-22 | 44.63 | Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=ORAREDHAP_LOCUS21513 PE=...
BLAST of CSPI03G21890 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
KGN57848.1 | 1.5e-74 | 100.00 | hypothetical protein Csa_010872 [Cucumis sativus]
TYK22452.1 | 1.2e-58 | 86.09 | hypothetical protein E5676_scaffold712G00010 [Cucumis melo var. makuwa]
KAG7036703.1 | 3.0e-38 | 65.13 | hypothetical protein SDJN02_00323, partial [Cucurbita argyrosperma subsp. argyro...
KAG6607004.1 | 1.1e-37 | 65.13 | hypothetical protein SDJN03_00346, partial [Cucurbita argyrosperma subsp. sorori...
XP_038906595.1 | 1.1e-35 | 73.91 | uncharacterized protein LOC120092546 [Benincasa hispida]
BLAST of CSPI03G21890 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G49000.1 | 7.9e-05 | 51.43 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
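The pipe-delimited summary rows above can be loaded into typed records for filtering or sorting. The following is a minimal sketch under stated assumptions: the names `BlastHit` and `parse_row` are hypothetical, only the first four fields of a row are read, and descriptions are assumed to contain no `|` character.

```python
from typing import NamedTuple


class BlastHit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity, as printed in the table
    description: str


def parse_row(row: str) -> BlastHit:
    """Parse one 'name | e-value | identity | description' row.

    Any fields after the fourth are ignored.
    """
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return BlastHit(name, float(evalue), float(identity), description)


hit = parse_row(
    "KGN57848.1 | 1.5e-74 | 100.00 | "
    "hypothetical protein Csa_010872 [Cucumis sativus]"
)
print(hit.name, hit.evalue, hit.identity)
```

Header rows are not handled; a caller would skip any row whose second field is not numeric.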