Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
CTGAGTATGTAGTTTGATTATCATTATTGCATCACATCTTCTTCTTCTTTGTCTCTTAATCTTATCCAAATTAAGCCAAACCAAAAACCTCGATAAATCCTTCCTCTTGTTGCTTATGCCGTTTCTTCCCACTTTCCTCTCTAACAAATTCCCAGTGACCCTCATCATACTCTTCCTACCCTAAAATCAGCAATGCCCTTTCCAGTTCATCACAATTTGCAAAGATTCAACATCTCCGCTCCAACCCCTTCGATTGACATTGGACGAATCTCCTCTAAACCCTCTTCTATTATCATCCCCTGAATCAGTTTCCAATGCCATAATCCCACTCCCCCCAAATGTCTCAGCTAAATCCCTCTGGTGCTGAAGACGATTTTTCTTGAACAATTCGACCCCCACTCGTATTCTTTAGCTTCCGGTAACCGATGACTACTTCTTTGCCATTAACTTCACCGTTGTTCTCTGCATTTTCGCCGAGAAAGACCCTTTTTTCGCTCAAGCTTAATCGGCCCTCCATCACTCGGAGTACACAGTCTCTCCACTTCTCGAGTCCATTTGTAAATGTTCCTCATTTCAACTGTTTTGATCCTGTTTCCAGGACTTCTCGGATCATTCGTACTGTCCCTCGAAGTTCTTCAAATGGGTTTCTCGAAGATGACGAGATTATCCCTTCATTTGAGGAGAAGCCGGTTAAAGTTCTGTTGTTGGTTCTGTTTTGGGCATCTTTATCCCTTGCTTGGTTTGCTGCTTCTGGGGATGCTAAAGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGGCTAAAGATCGCCAGCGCATTGCAAAACTCAGGCTGGCCTGCTGAGGCTGTAGTGTTTGCCCTTGCTACGCTTCCTGTAATTGAGCTCCGTGGGGCCATTCCTGTTGGTTACTGGATGCAGCTTAAGCCTGTTGCTCTAACCGTTCTATCAGTACTTGGGTGAGTTCTCACGTTTTAATTTCTGAAAACTCCATTTTTTATTCTACGTGTTCTCCCAATCTGTTACTTATCTCTAGAAATTAACAAATCGTCAGTGCTATGATCAACTAGTTCCTCTTCAATCTGCCCTTTGGAGAGAAAGGCCGTTTGCTTTGGCTTGCTGGTGTGTCCACAGTATTATGGGTTTTGTGAGTGAGTGCAACTGAAGGGTGTTTAGAGGTGTGGAAGGAAACTAGGGAGTTATTGTCCCTTTTGGCTTCCATTTCAAAACCTTTTTGTAATTATTCCGTAGGAATATATTCGTATAATTGACGTCTCTTCTTGCAAAGGGAGTCCCTCTTGTTTTTGTGTGTCCGTGTATTCCTTTATTTTTTTTCAATGAAAAAAAGATATTATATTTCTCCTCAACTGCCGCTGGCTTTTGTGTTTGTGTACTTTTCATTCACATACACATCGATGTAAAGACAAGGTGAAAATTTCAGTTCAAATAAACTAGATATGGAGTAGATTGCTACAAGCTCATGCTTTATTGGTGTTACAAATGTAAGTGGATCTTTATGTTTATTGAGGAATAGATGGTATGTTCCGTCAAACAAGATGTGGAGATTTCAAACCTCAACGTGTTTTCATGGAGACACAATCTTCCTATGAATCATTGAGGGCCTAAATCTTGAAGAATTAGCCAAGTCCTTTAAACATTGATGAATGTTTGAAGTGAACTTTGAAATGGTTCATAAGTTTTTTGGAGAAGTGAGTGATGCATTTTTTTTCCATTTTCTATGTTAACTTAAATAACCAACACCTCTTCTGCACAAGTGATTTTTAAAGTCTGATTAGATGAAATAAATGAACTGCAGGAACATGGTTCCTGTACCCTTTATCATACTCTATTTGAAGAAATTTGCAACTTTTCTAGCGGGAAGGAATGCATCTGCCTCTCAATTCCTCGATATGTTATTCAAGAGGGCCAAAGAGAAAGCTGCACCTGTTGAAGAGTTTCAGTGGCTTGGTCTAATGCTATTTGTGGCCGTGCCTTTCCCCG
GAACAGGAGCTTGGACTGGTGCCATAATAGCTTCCATCTTAGACATGCCATTCTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTAGTGGCAGGGCTTCTAGTCAACTTGTTGGTGAATCTTGGTCTGAAGGAGGCCATTGTCACTGGAGTGATTCTTTTCATTATATCAACATTCATGTGGAGCATTCTTCGAATGATAAAGAAAAGCTTTGAGAAAATGAACTAATCCAAAAGGTTGTATTTCTATGTCTACTGACAATGAACCACTTTGGTCTGATCTTAGTCTGATATATCTTGACTTAGATTTGTTACTTGTTAGTTTAAGTTGTAAGTTTGGAACTCTGATCAATTGATTCCTTACTCTTTTCTTGGTGGCATAGATTTTTTTTTCTCATATCATATTAAGAATAGAGTAATTGACAGTGTGGAGATACTTAACAGTAGATAGAAAGCTTGTCTTGCGAACACTGTTCAGAAGAGAACATCCACAAACAACTTTCATTTGCCATACTAAGCCGAGAAATGTTGTGTAAATTCCGGTATTCCGATTGCTCTTTGCACGTGTAGAATTGTGAATCTTGTTCTTTAGATGTCAGCAGGAGCAAATATAGTTGTGCTTCGAACGTTGAAAATTCGTGCTCGAATGTTGTGGTCTGGTTTGATTTACAGGCAAGCATCCTTAGTTGGGGTGCAAAAATGGCAAGAAACTGAACCACCCAAAAACCATGGATTTGCCTTTAATTGCAATAATATATGAACCTGATGATTACTTCTTTAATGATTGACGTGGGTTCACAAATCTGAACATAATTCAACGGTTGATATGGGATCTCGAGTTGACTCTTTGGTATCTATACTATGCGGCTCTCATCTTGTTCTTAGGTTTGTCTGCCATTTTGTTGCATGTGTTTTTTTGCTGCTGCACATCCTGAGATTGATTGTACATTTGTACTACAAATGAAAGATCCTTCATGAATCTGATTTTTTTCATGGTTTGTTCCCCAAATTAAAATTTTTTAGTGTTGATCCTTTAAGCTCGGTATTGCTTTCTACAGAAATCCTCCTCTTAAATATCTTGTTGCGCTCTTCTAATGTTGCGTTCGTCACTCTTTGATGACAGGTTTATGGCCACCCTTTCAAGTTGGGTATTTTTCTATAGCCATTCTTCATCATTTTCAAGGGGATTTTCCATGACCAAGCCTCAAAAGTTTTTATGACAAAGACTTTAGATTTGCAGTGACATTCTTCAAATTCAAGTTCCAATTTATCTTAAAAGTTCTGAATTCCTCATCTCTTGAATTTAGCTTTATTCTTTATGACACCTAAAAACTTTCTCAAGAATTAATGCGAATCAATATAATTATGAAAATAGCTGTACC
mRNA sequence
CTGAGTATGTAGTTTGATTATCATTATTGCATCACATCTTCTTCTTCTTTGTCTCTTAATCTTATCCAAATTAAGCCAAACCAAAAACCTCGATAAATCCTTCCTCTTGTTGCTTATGCCGTTTCTTCCCACTTTCCTCTCTAACAAATTCCCAGTGACCCTCATCATACTCTTCCTACCCTAAAATCAGCAATGCCCTTTCCAGTTCATCACAATTTGCAAAGATTCAACATCTCCGCTCCAACCCCTTCGATTGACATTGGACGAATCTCCTCTAAACCCTCTTCTATTATCATCCCCTGAATCAGTTTCCAATGCCATAATCCCACTCCCCCCAAATGTCTCAGCTAAATCCCTCTGGTGCTGAAGACGATTTTTCTTGAACAATTCGACCCCCACTCGTATTCTTTAGCTTCCGGTAACCGATGACTACTTCTTTGCCATTAACTTCACCGTTGTTCTCTGCATTTTCGCCGAGAAAGACCCTTTTTTCGCTCAAGCTTAATCGGCCCTCCATCACTCGGAGTACACAGTCTCTCCACTTCTCGAGTCCATTTGTAAATGTTCCTCATTTCAACTGTTTTGATCCTGTTTCCAGGACTTCTCGGATCATTCGTACTGTCCCTCGAAGTTCTTCAAATGGGTTTCTCGAAGATGACGAGATTATCCCTTCATTTGAGGAGAAGCCGGTTAAAGTTCTGTTGTTGGTTCTGTTTTGGGCATCTTTATCCCTTGCTTGGTTTGCTGCTTCTGGGGATGCTAAAGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGGCTAAAGATCGCCAGCGCATTGCAAAACTCAGGCTGGCCTGCTGAGGCTGTAGTGTTTGCCCTTGCTACGCTTCCTGTAATTGAGCTCCGTGGGGCCATTCCTGTTGGTTACTGGATGCAGCTTAAGCCTGTTGCTCTAACCGTTCTATCAGTACTTGGGAACATGGTTCCTGTACCCTTTATCATACTCTATTTGAAGAAATTTGCAACTTTTCTAGCGGGAAGGAATGCATCTGCCTCTCAATTCCTCGATATGTTATTCAAGAGGGCCAAAGAGAAAGCTGCACCTGTTGAAGAGTTTCAGTGGCTTGGTCTAATGCTATTTGTGGCCGTGCCTTTCCCCGGAACAGGAGCTTGGACTGGTGCCATAATAGCTTCCATCTTAGACATGCCATTCTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTAGTGGCAGGGCTTCTAGTCAACTTGTTGGTGAATCTTGGTCTGAAGGAGGCCATTGTCACTGGAGTGATTCTTTTCATTATATCAACATTCATGTGGAGCATTCTTCGAATGATAAAGAAAAGCTTTGAGAAAATGAACTAATCCAAAAGGTTTATGGCCACCCTTTCAAGTTGGGTATTTTTCTATAGCCATTCTTCATCATTTTCAAGGGGATTTTCCATGACCAAGCCTCAAAAGTTTTTATGACAAAGACTTTAGATTTGCAGTGACATTCTTCAAATTCAAGTTCCAATTTATCTTAAAAGTTCTGAATTCCTCATCTCTTGAATTTAGCTTTATTCTTTATGACACCTAAAAACTTTCTCAAGAATTAATGCGAATCAATATAATTATGAAAATAGCTGTACC
Coding sequence (CDS)
ATGACTACTTCTTTGCCATTAACTTCACCGTTGTTCTCTGCATTTTCGCCGAGAAAGACCCTTTTTTCGCTCAAGCTTAATCGGCCCTCCATCACTCGGAGTACACAGTCTCTCCACTTCTCGAGTCCATTTGTAAATGTTCCTCATTTCAACTGTTTTGATCCTGTTTCCAGGACTTCTCGGATCATTCGTACTGTCCCTCGAAGTTCTTCAAATGGGTTTCTCGAAGATGACGAGATTATCCCTTCATTTGAGGAGAAGCCGGTTAAAGTTCTGTTGTTGGTTCTGTTTTGGGCATCTTTATCCCTTGCTTGGTTTGCTGCTTCTGGGGATGCTAAAGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGGCTAAAGATCGCCAGCGCATTGCAAAACTCAGGCTGGCCTGCTGAGGCTGTAGTGTTTGCCCTTGCTACGCTTCCTGTAATTGAGCTCCGTGGGGCCATTCCTGTTGGTTACTGGATGCAGCTTAAGCCTGTTGCTCTAACCGTTCTATCAGTACTTGGGAACATGGTTCCTGTACCCTTTATCATACTCTATTTGAAGAAATTTGCAACTTTTCTAGCGGGAAGGAATGCATCTGCCTCTCAATTCCTCGATATGTTATTCAAGAGGGCCAAAGAGAAAGCTGCACCTGTTGAAGAGTTTCAGTGGCTTGGTCTAATGCTATTTGTGGCCGTGCCTTTCCCCGGAACAGGAGCTTGGACTGGTGCCATAATAGCTTCCATCTTAGACATGCCATTCTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTAGTGGCAGGGCTTCTAGTCAACTTGTTGGTGAATCTTGGTCTGAAGGAGGCCATTGTCACTGGAGTGATTCTTTTCATTATATCAACATTCATGTGGAGCATTCTTCGAATGATAAAGAAAAGCTTTGAGAAAATGAACTAA
Protein sequence
MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTSRIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIRASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFMWSILRMIKKSFEKMN*
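The protein sequence above is the conceptual translation of the CDS, with the trailing `*` marking the stop codon. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the relationship can be checked in Python. The function name `translate` and the constants are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    # Codons containing ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGACTACTTCTTTGCCATTAACTTCACCG"))
print(translate("GAGAAAATGAACTAA"))
```

Passing the full CDS string to `translate` should reproduce the complete protein sequence, including the terminal `*`.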
Homology
BLAST of CsGy5G022890 vs. NCBI nr
Match:
XP_004135199.1 (uncharacterized protein LOC101204187 [Cucumis sativus] >XP_011655666.1 uncharacterized protein LOC101204187 [Cucumis sativus] >KGN51880.1 hypothetical protein Csa_008845 [Cucumis sativus])
HSP 1 Score: 602 bits (1553), Expect = 8.80e-217
Identity = 315/315 (100.00%), Positives = 315/315 (100.00%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS
Sbjct: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
Query: 301 WSILRMIKKSFEKMN 315
WSILRMIKKSFEKMN
Sbjct: 301 WSILRMIKKSFEKMN 315
BLAST of CsGy5G022890 vs. NCBI nr
Match:
XP_008446308.1 (PREDICTED: uncharacterized protein LOC103489082 [Cucumis melo] >KAA0034383.1 Sm_multidrug_ex domain-containing protein [Cucumis melo var. makuwa] >TYK15536.1 Sm_multidrug_ex domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 584 bits (1506), Expect = 1.28e-209
Identity = 305/315 (96.83%), Positives = 309/315 (98.10%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSIT+ST SLHFSSPFVNVPH NC DPVS TS
Sbjct: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITQSTHSLHFSSPFVNVPHSNCSDPVSSTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
RIIRTVPRSSSNGFLEDDEIIPSFEEKP+KVL+LVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPIKVLILVLFWASLSLAWFAASGDAKAAVDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQ+SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPV LTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVTLTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTG ILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGAILFIISTFM 300
Query: 301 WSILRMIKKSFEKMN 315
WSILRMIKKSFEKMN
Sbjct: 301 WSILRMIKKSFEKMN 315
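The identity percentage in each HSP header is the number of identical aligned positions divided by the alignment length. The sketch below illustrates that arithmetic in Python; the helper names `count_identities` and `percent` are hypothetical, and the example strings are residues 31 to 40 of the Query and Sbjct rows of the Cucumis melo hit above.

```python
def count_identities(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical positions, alignment length) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    # Gap characters ('-') never count as identities.
    matches = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    return matches, len(query)


def percent(matches: int, length: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100 * matches / length, 2)


# Residues 31-40 of the alignment: two substitutions (R/Q and Q/H).
print(count_identities("ITRSTQSLHF", "ITQSTHSLHF"))

# Header values for this hit: 305 identities and 309 positives over 315 columns.
print(percent(305, 315), percent(309, 315))
```

Positives additionally count conservative substitutions (the `+` symbols in the midline), so that count depends on the scoring matrix and cannot be derived from the two rows alone.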
BLAST of CsGy5G022890 vs. NCBI nr
Match:
XP_038893409.1 (uncharacterized protein LOC120082205 [Benincasa hispida])
HSP 1 Score: 537 bits (1384), Expect = 4.63e-191
Identity = 284/313 (90.73%), Positives = 293/313 (93.61%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
MTTSLP TSPL S FSPRKTLFSLKLNRPSIT+S QSLH SS VNV HFN F V TS
Sbjct: 1 MTTSLPFTSPLISVFSPRKTLFSLKLNRPSITQSKQSLHLSSSCVNVRHFNYFKHVFSTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
RI RTV RSSSNGFLEDDEI+PSFEEKPVKV+LLVLFWASLSLAWFAASGDAKAA DSIR
Sbjct: 61 RIFRTVTRSSSNGFLEDDEILPSFEEKPVKVVLLVLFWASLSLAWFAASGDAKAAADSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQ+SGW EAVVFALATLPVIELRGAIPVGYW+ LKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQSSGWSPEAVVFALATLPVIELRGAIPVGYWLHLKPVALTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYL+KFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLRKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
Query: 301 WSILRMIKKSFEK 313
WSILR+I+K+F K
Sbjct: 301 WSILRLIRKAFRK 313
BLAST of CsGy5G022890 vs. NCBI nr
Match:
KAG7013172.1 (hypothetical protein SDJN02_25928 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 516 bits (1328), Expect = 1.75e-182
Identity = 271/313 (86.58%), Positives = 289/313 (92.33%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
M TSL LTSPL SAFS RKTL SL LNRPSI++ QSL SS +N+ HFNCF+PV TS
Sbjct: 1 MATSLVLTSPLMSAFSARKTLISLNLNRPSISQGKQSLPRSSSCINIRHFNCFNPVFSTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
R+ TV R SSN FLE D+I+PSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61 RMSCTVTRCSSNEFLEVDDILPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQ+SGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQSSGWPAEAIVFALATLPVVELRGAIPVGYWMQLKPVALTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYLKKFATFLAGRNA+ASQFLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNATASQFLDMLFKRAKAKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSG+SANFFGVV+AGLLVNLLVNLGLKEA+ TGVILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGLSANFFGVVLAGLLVNLLVNLGLKEAVFTGVILFIISTFM 300
Query: 301 WSILRMIKKSFEK 313
WSILR+I+K+F K
Sbjct: 301 WSILRLIRKAFNK 313
BLAST of CsGy5G022890 vs. NCBI nr
Match:
XP_022944972.1 (uncharacterized protein LOC111449349 [Cucurbita moschata] >XP_022944973.1 uncharacterized protein LOC111449349 [Cucurbita moschata])
HSP 1 Score: 514 bits (1323), Expect = 1.01e-181
Identity = 270/313 (86.26%), Positives = 288/313 (92.01%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
M TSL LTSPL SAFS RKTL SL LNRPSI++ QSL SS +N+ HFNCF+PV TS
Sbjct: 1 MATSLVLTSPLMSAFSARKTLISLNLNRPSISQGNQSLPRSSSCINIRHFNCFNPVFSTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
R+ TV R SSN FLE D+I+PSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61 RLSCTVTRCSSNEFLEVDDILPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQ+SGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQSSGWPAEAIVFALATLPVVELRGAIPVGYWMQLKPVALTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYLKK ATFLAGRNA+ASQFLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKLATFLAGRNATASQFLDMLFKRAKAKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSG+SANFFGVV+AGLLVNLLVNLGLKEA+ TGVILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGLSANFFGVVLAGLLVNLLVNLGLKEAVFTGVILFIISTFM 300
Query: 301 WSILRMIKKSFEK 313
WSILR+I+K+F K
Sbjct: 301 WSILRLIQKAFNK 313
BLAST of CsGy5G022890 vs. ExPASy TrEMBL
Match:
A0A0A0KQH3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G604240 PE=4 SV=1)
HSP 1 Score: 602 bits (1553), Expect = 4.26e-217
Identity = 315/315 (100.00%), Positives = 315/315 (100.00%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS
Sbjct: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
Query: 301 WSILRMIKKSFEKMN 315
WSILRMIKKSFEKMN
Sbjct: 301 WSILRMIKKSFEKMN 315
BLAST of CsGy5G022890 vs. ExPASy TrEMBL
Match:
A0A5A7SV67 (Sm_multidrug_ex domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G00280 PE=4 SV=1)
HSP 1 Score: 584 bits (1506), Expect = 6.21e-210
Identity = 305/315 (96.83%), Positives = 309/315 (98.10%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSIT+ST SLHFSSPFVNVPH NC DPVS TS
Sbjct: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITQSTHSLHFSSPFVNVPHSNCSDPVSSTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
RIIRTVPRSSSNGFLEDDEIIPSFEEKP+KVL+LVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPIKVLILVLFWASLSLAWFAASGDAKAAVDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQ+SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPV LTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVTLTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTG ILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGAILFIISTFM 300
Query: 301 WSILRMIKKSFEKMN 315
WSILRMIKKSFEKMN
Sbjct: 301 WSILRMIKKSFEKMN 315
BLAST of CsGy5G022890 vs. ExPASy TrEMBL
Match:
A0A1S3BER4 (uncharacterized protein LOC103489082 OS=Cucumis melo OX=3656 GN=LOC103489082 PE=4 SV=1)
HSP 1 Score: 584 bits (1506), Expect = 6.21e-210
Identity = 305/315 (96.83%), Positives = 309/315 (98.10%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSIT+ST SLHFSSPFVNVPH NC DPVS TS
Sbjct: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITQSTHSLHFSSPFVNVPHSNCSDPVSSTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
RIIRTVPRSSSNGFLEDDEIIPSFEEKP+KVL+LVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPIKVLILVLFWASLSLAWFAASGDAKAAVDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQ+SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPV LTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVTLTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTG ILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGAILFIISTFM 300
Query: 301 WSILRMIKKSFEKMN 315
WSILRMIKKSFEKMN
Sbjct: 301 WSILRMIKKSFEKMN 315
BLAST of CsGy5G022890 vs. ExPASy TrEMBL
Match:
A0A6J1FZI4 (uncharacterized protein LOC111449349 OS=Cucurbita moschata OX=3662 GN=LOC111449349 PE=4 SV=1)
HSP 1 Score: 514 bits (1323), Expect = 4.89e-182
Identity = 270/313 (86.26%), Positives = 288/313 (92.01%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
M TSL LTSPL SAFS RKTL SL LNRPSI++ QSL SS +N+ HFNCF+PV TS
Sbjct: 1 MATSLVLTSPLMSAFSARKTLISLNLNRPSISQGNQSLPRSSSCINIRHFNCFNPVFSTS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
R+ TV R SSN FLE D+I+PSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR
Sbjct: 61 RLSCTVTRCSSNEFLEVDDILPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIASALQ+SGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIASALQSSGWPAEAIVFALATLPVVELRGAIPVGYWMQLKPVALTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVPFIILYLKK ATFLAGRNA+ASQFLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPFIILYLKKLATFLAGRNATASQFLDMLFKRAKAKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSG+SANFFGVV+AGLLVNLLVNLGLKEA+ TGVILFIISTFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGLSANFFGVVLAGLLVNLLVNLGLKEAVFTGVILFIISTFM 300
Query: 301 WSILRMIKKSFEK 313
WSILR+I+K+F K
Sbjct: 301 WSILRLIQKAFNK 313
BLAST of CsGy5G022890 vs. ExPASy TrEMBL
Match:
A0A6J1DB35 (uncharacterized protein LOC111019066 OS=Momordica charantia OX=3673 GN=LOC111019066 PE=4 SV=1)
HSP 1 Score: 511 bits (1315), Expect = 7.25e-181
Identity = 265/313 (84.66%), Positives = 287/313 (91.69%), Query Frame = 0
Query: 1 MTTSLPLTSPLFSAFSPRKTLFSLKLNRPSITRSTQSLHFSSPFVNVPHFNCFDPVSRTS 60
M TS+ T P+ SAFSPRKT LKLNRPS+++S QSLH SSP +NV HFN F P+ TS
Sbjct: 1 MGTSVASTPPIMSAFSPRKTHIFLKLNRPSVSQSKQSLHSSSPCINVRHFNHFSPIFATS 60
Query: 61 RIIRTVPRSSSNGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIR 120
RI RTV R+ SNGF+E+D+I+PSFEEKPVK+LLLVLFWASLSL+WFAASGDAKAA DSIR
Sbjct: 61 RIFRTVTRAFSNGFVEEDDIMPSFEEKPVKILLLVLFWASLSLSWFAASGDAKAAGDSIR 120
Query: 121 ASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
ASNFGLKIA+ L++SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM
Sbjct: 121 ASNFGLKIATTLRSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGNM 180
Query: 181 VPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPG 240
VPVP IILYLKKFATFLAGRNASAS+FLDMLFKR KEKAAPVEEFQWLGLMLFVAVPFPG
Sbjct: 181 VPVPLIILYLKKFATFLAGRNASASRFLDMLFKRGKEKAAPVEEFQWLGLMLFVAVPFPG 240
Query: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTFM 300
TGAWTGAIIASILDMPFWSGVSANFFGVV+AGLLVNLLVNLGLKEAIVTGV LFI+STFM
Sbjct: 241 TGAWTGAIIASILDMPFWSGVSANFFGVVLAGLLVNLLVNLGLKEAIVTGVFLFIVSTFM 300
Query: 301 WSILRMIKKSFEK 313
WSILR+I K+F K
Sbjct: 301 WSILRLISKAFRK 313
BLAST of CsGy5G022890 vs. TAIR 10
Match:
AT2G02590.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Putative small multi-drug export (InterPro:IPR009577); Has 405 Blast hits to 405 proteins in 185 species: Archae - 65; Bacteria - 295; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 22 (source: NCBI BLink). )
HSP 1 Score: 353.6 bits (906), Expect = 1.6e-97
Identity = 186/253 (73.52%), Positives = 212/253 (83.79%), Query Frame = 0
Query: 69 SSSNGFL-------EDDEII--PSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSI 128
SS +GFL E +EII PS PVK + V+ WAS SL WFA SGDAKAA DSI
Sbjct: 68 SSPDGFLRNTKDDEEGNEIIQLPSIGVNPVKFAICVVLWASFSLLWFARSGDAKAATDSI 127
Query: 129 RASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVALTVLSVLGN 188
++S+FGL+IAS L+ GWP EAVVFALATLPVIELRGAIPVGYWMQLKPV LT SVLGN
Sbjct: 128 KSSSFGLRIASTLRRFGWPDEAVVFALATLPVIELRGAIPVGYWMQLKPVVLTSFSVLGN 187
Query: 189 MVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFP 248
MVPVPFI+LYLK FA+F+AG++ +AS+ LD+LFKRAKEKA PVEEF+WLGLMLFVAVPFP
Sbjct: 188 MVPVPFIVLYLKTFASFVAGKSQTASKLLDILFKRAKEKAGPVEEFKWLGLMLFVAVPFP 247
Query: 249 GTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGVILFIISTF 308
GTGAWTGAIIASILDMPFWS VS+NF GVV+AGLLVNLLVNLGLK+AIV G+ LF +STF
Sbjct: 248 GTGAWTGAIIASILDMPFWSAVSSNFCGVVLAGLLVNLLVNLGLKQAIVAGIALFFVSTF 307
Query: 309 MWSILRMIKKSFE 313
MWS+LR I+KS +
Sbjct: 308 MWSVLRNIRKSIK 320
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004135199.1 | 8.80e-217 | 100.00 | uncharacterized protein LOC101204187 [Cucumis sativus] >XP_011655666.1 uncharact...
XP_008446308.1 | 1.28e-209 | 96.83 | PREDICTED: uncharacterized protein LOC103489082 [Cucumis melo] >KAA0034383.1 Sm_...
XP_038893409.1 | 4.63e-191 | 90.73 | uncharacterized protein LOC120082205 [Benincasa hispida]
KAG7013172.1 | 1.75e-182 | 86.58 | hypothetical protein SDJN02_25928 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022944972.1 | 1.01e-181 | 86.26 | uncharacterized protein LOC111449349 [Cucurbita moschata] >XP_022944973.1 unchar...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KQH3 | 4.26e-217 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G604240 PE=4 SV=1
A0A5A7SV67 | 6.21e-210 | 96.83 | Sm_multidrug_ex domain-containing protein OS=Cucumis melo var. makuwa OX=1194695...
A0A1S3BER4 | 6.21e-210 | 96.83 | uncharacterized protein LOC103489082 OS=Cucumis melo OX=3656 GN=LOC103489082 PE=...
A0A6J1FZI4 | 4.89e-182 | 86.26 | uncharacterized protein LOC111449349 OS=Cucurbita moschata OX=3662 GN=LOC1114493...
A0A6J1DB35 | 7.25e-181 | 84.66 | uncharacterized protein LOC111019066 OS=Momordica charantia OX=3673 GN=LOC111019...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G02590.1 | 1.6e-97 | 73.52 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...