Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAAAAAAAATGCAATCAATCTCATTAGCTTTCTCCCACTCTCTTCAATTTTCCCAATCCCATTTCCACTCCAACTCCAAAATTTCTCTGCTAAAACCTTACTGTCACTGCAGCAGCCATGGAAGCAAGAGGACCAGATCAAGCCTCAGTCTTACAACCACTTGGCCTTCCATCTCCATCGCCCTCTTCGGCTCAGGCTTTCTCTTAGGCCCTCTTCTCGATGGACTCCATTCTCGCGTCAATCTCGTTGTTTATCGAACAGGATCCATCGTCATCGGCCCCCTCCATACTAACATCTGGGTAACATTTTTGAACCTCCATTGTAAAACCCTAATGTTTCGCGAAATGTCTATTTATTTTCTAGGGCAATTCATGATTTTCTTACTAGCGGTTTTGAATCTGAATGCTGTTAGGTTCCTTTCTTGCTGGGATTGTTTTACTGTACAGTTGGTTTGATTCAACTCTACATAGATGAGAAATTTTCACCAAAAAGATCAGAGGGAAGCTTGGGTAGGACAGTAGCTTCTTTAATGTGAGTGTTCTTTTTCATTCGATTCATTTGGTGAAACCATTGGTTGTTTCATTCATATGTATTATTTGTTATTGTTGATTTAGAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAAATGTACAAAGCTGGCGTAGCTGACAATATTGAGGCCTATGCATTATTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCGTTGCTTGGCTTCTCGCTGGCTTGTGTTCTTGGCCTTGGCTGCCCTCTGGCTGAGATTCCCATTATGAAGTATGTGAACTCCCCCACTTTTCAATTCATATACTCGTAGTTTATCATATGTATTGGTTGGTTTGATCATAATTATAGTTCTTGTTCTTTGCTCTTGTATTATTTTGGTTCTTGTAGTTTCTTATGTTATGATTTATATGCAGAAGTTAGTTCCCTAGTGTTAGAAGTTTAGTTGTTAGTGAGATATAGTTGCAGAAGCTAGTTCCCTAATGTTCATATACTAGTAGTTTATGGTATGCATTGGTTGGTTTAAATCATAATTATAGATTTTTGTTTATGTACATTCTTATGTGATGTCTTATATGCAGAAGTTAGTTCCCTAGTGTTAGAAGTTTAGTTGTTAGTGAGATATAAGAATGAGTTTATACTTGGACTACATTTGAGTTTTAGAGATGTTTATCTACCTTATTCACTTGGCCCAACCTCAAATACCTACAAGTTCCTACTTGATGAGTTTTTAAAATAAAACTGGAATGTCAAGAACTTAGGCCCCATTTCATAACTGTAATTATTAGTTTTTGGTTTTTGAAAATTAACCTTATAAACACTACTTCTACATACGAGTTCATATCGACCTTTCATCTCAATTTTACAAAAGAACATTCAGCCTTTTGTGACTAGCACGTAATCTACCTTGGATTTTACCATGGAAACAAATATAGTTCACTCAAGTCAGTTTACTGAAAATCTGTGAGATGTTCAAGTGACATTTGATCCGTAAAAAAAAATTTCAACTTCGATCCTGTTTTCACTTACAAAATGCTACTCAACTCCTTGACTAATGATTGTGCAGGTTCTTCCATCTCTGGGATTATCCGAAAGCAAACATAGAGATTTTTGGTGAGGTAAGTGATGGAAAGACACATTAGAAAGTGATGGGAAAATGCAAGAATTCTTTCCTAGAAATTTTTTAAAGCCTGCATTTGTGTATTACAGGGGATAATCAGCTGGACAATGACTTGCTATTTTGTGTACACTCCATTTTTGATTAATTTATCAAGATGGCTCAAGTCTGTGGTGGATGCTGCTGCTACTACTGTAAATGAAGATGGGCCTGCATAGTCAAATTTTTTATATGCCAAATAGATTGGAAGGATGGAACTGGGAAGGTAATACTTGTTTTATAACCTTTTGATTTTTGAAAAGGAATCTTATAAACAGTAATTCTATTTATAAGTTTCTTTGTTTTGTGCTCCACTTTACCACTTTATATCAATGTTTTAAAAAATCAAGCCAAGTTTTGAAAACTACAAAAAGAAATCAAATAAAGTTTTAAAAACATGTTTTTTTTTTTTGAAATTTCAGTAAGAATTCAAATATATCTTTAAGAAATGTTAAAACCATAATATAACACTGAGAGGAAACAAATACAATTTTAACATAAAACTACAAACAAAAGATTGTGAAATGAAGCTTAAATAATTCATTTTGATTATTATTTATCATTCTGTGATTTTTATTTAATAGATGAATGATCAGGAAAGTGAGTTTGAAAATTATTTGTTTTTTGCTTGTGGTTAGAAAATTCTTACACTTAGTCCTTCCAGCCTTTTCTAGTTGCGCCCTAAGATAGCGTTTACTTACAAGTTGGGATGCTCATTTTTCCTGGTCAG
mRNA sequence
AAAAAAAAAAAAATGCAATCAATCTCATTAGCTTTCTCCCACTCTCTTCAATTTTCCCAATCCCATTTCCACTCCAACTCCAAAATTTCTCTGCTAAAACCTTACTGTCACTGCAGCAGCCATGGAAGCAAGAGGACCAGATCAAGCCTCAGTCTTACAACCACTTGGCCTTCCATCTCCATCGCCCTCTTCGGCTCAGGCTTTCTCTTAGGCCCTCTTCTCGATGGACTCCATTCTCGCGTCAATCTCGTTGTTTATCGAACAGGATCCATCGTCATCGGCCCCCTCCATACTAACATCTGGGTTCCTTTCTTGCTGGGATTGTTTTACTGTACAGTTGGTTTGATTCAACTCTACATAGATGAGAAATTTTCACCAAAAAGATCAGAGGGAAGCTTGGGTAGGACAGTAGCTTCTTTAATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAAATGTACAAAGCTGGCGTAGCTGACAATATTGAGGCCTATGCATTATTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCGTTGCTTGGCTTCTCGCTGGCTTGTGTTCTTGGCCTTGGCTGCCCTCTGGCTGAGATTCCCATTATGAAGTTCTTCCATCTCTGGGATTATCCGAAAGCAAACATAGAGATTTTTGGTGAGGGGATAATCAGCTGGACAATGACTTGCTATTTTGTGTACACTCCATTTTTGATTAATTTATCAAGATGGCTCAAGTCTGTGGTGGATGCTGCTGCTACTACTGTAAATGAAGATGGGCCTGCATAGTCAAATTTTTTATATGCCAAATAGATTGGAAGGATGGAACTGGGAAGGTAATACTTGTTTTATAACCTTTTGATTTTTGAAAAGGAATCTTATAAACAGTAATTCTATTTATAAGTTTCTTTGTTTTGTGCTCCACTTTACCACTTTATATCAATGTTTTAAAAAATCAAGCCAAGTTTTGAAAACTACAAAAAGAAATCAAATAAAGTTTTAAAAACATGTTTTTTTTTTTTGAAATTTCAGTAAGAATTCAAATATATCTTTAAGAAATGTTAAAACCATAATATAACACTGAGAGGAAACAAATACAATTTTAACATAAAACTACAAACAAAAGATTGTGAAATGAAGCTTAAATAATTCATTTTGATTATTATTTATCATTCTGTGATTTTTATTTAATAGATGAATGATCAGGAAAGTGAGTTTGAAAATTATTTGTTTTTTGCTTGTGGTTAGAAAATTCTTACACTTAGTCCTTCCAGCCTTTTCTAGTTGCGCCCTAAGATAGCGTTTACTTACAAGTTGGGATGCTCATTTTTCCTGGTCAG
Coding sequence (CDS)
ATGCAATCAATCTCATTAGCTTTCTCCCACTCTCTTCAATTTTCCCAATCCCATTTCCACTCCAACTCCAAAATTTCTCTGCTAAAACCTTACTGTCACTGCAGCAGCCATGGAAGCAAGAGGACCAGATCAAGCCTCAGTCTTACAACCACTTGGCCTTCCATCTCCATCGCCCTCTTCGGCTCAGGCTTTCTCTTAGGCCCTCTTCTCGATGGACTCCATTCTCGCGTCAATCTCGTTGTTTATCGAACAGGATCCATCGTCATCGGCCCCCTCCATACTAACATCTGGGTTCCTTTCTTGCTGGGATTGTTTTACTGTACAGTTGGTTTGATTCAACTCTACATAGATGAGAAATTTTCACCAAAAAGATCAGAGGGAAGCTTGGGTAGGACAGTAGCTTCTTTAATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAAATGTACAAAGCTGGCGTAGCTGACAATATTGAGGCCTATGCATTATTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCGTTGCTTGGCTTCTCGCTGGCTTGTGTTCTTGGCCTTGGCTGCCCTCTGGCTGAGATTCCCATTATGAAGTTCTTCCATCTCTGGGATTATCCGAAAGCAAACATAGAGATTTTTGGTGAGGGGATAATCAGCTGGACAATGACTTGCTATTTTGTGTACACTCCATTTTTGATTAATTTATCAAGATGGCTCAAGTCTGTGGTGGATGCTGCTGCTACTACTGTAAATGAAGATGGGCCTGCATAG
Protein sequence
MQSISLAFSHSLQFSQSHFHSNSKISLLKPYCHCSSHGSKRTRSSLSLTTTWPSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYIDEKFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINLSRWLKSVVDAAATTVNEDGPA
Homology
BLAST of Clc10G10820 vs. NCBI nr
Match:
XP_038904924.1 (uncharacterized protein LOC120091134 [Benincasa hispida] >XP_038904925.1 uncharacterized protein LOC120091134 [Benincasa hispida] >XP_038904926.1 uncharacterized protein LOC120091134 [Benincasa hispida] >XP_038904927.1 uncharacterized protein LOC120091134 [Benincasa hispida] >XP_038904928.1 uncharacterized protein LOC120091134 [Benincasa hispida])
HSP 1 Score: 453.4 bits (1165), Expect = 1.3e-123
Identity = 231/259 (89.19%), Postives = 241/259 (93.05%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQFSQSHFHSNSKISLLKPYCHCSSHGSKRTRSSLSLTTTWPSISIAL 60
MQSI +L FSHSLQFSQSHFHSNSKISL KP+C SSHGS+R R+SLSL TTWPSISIAL
Sbjct: 8 MQSIYALGFSHSLQFSQSHFHSNSKISLQKPHCTSSSHGSRRARTSLSLRTTWPSISIAL 67
Query: 61 FGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYIDEK 120
FGSGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVPFLLGLFYC+VGLIQLY+DE
Sbjct: 68 FGSGFLLGPLLDGLHSRVNLVVYRTGSIHIGPLHTNIWVPFLLGLFYCSVGLIQLYLDEN 127
Query: 121 FSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLL 180
FSP++SEG GRTVASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLL
Sbjct: 128 FSPRKSEGCFGRTVASLIALGLFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLL 187
Query: 181 GFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINLSR 240
GFSLACVLGL CPLAEIPIMKFFHLW YPKANIEIFGEGI+SWT+TCYFVYTPFLINLSR
Sbjct: 188 GFSLACVLGLVCPLAEIPIMKFFHLWYYPKANIEIFGEGIVSWTITCYFVYTPFLINLSR 247
Query: 241 WLKSVVDAAATTVNEDGPA 259
WLKSVVDAAA NEDG A
Sbjct: 248 WLKSVVDAAA--ANEDGSA 264
BLAST of Clc10G10820 vs. NCBI nr
Match:
XP_008447464.1 (PREDICTED: uncharacterized protein LOC103489906 [Cucumis melo] >XP_008447465.1 PREDICTED: uncharacterized protein LOC103489906 [Cucumis melo])
HSP 1 Score: 441.4 bits (1134), Expect = 5.3e-120
Identity = 228/263 (86.69%), Postives = 241/263 (91.63%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQFSQSH----FHSNSKISLLKPYCHCSSHGSKRTRSSLSLTTTWPSI 60
MQSI +L FSHSLQF SH FHSNS+ISL KP HC+SHGS ++R +LSL TTWPSI
Sbjct: 1 MQSIFALGFSHSLQFPHSHSHSYFHSNSRISLQKP--HCTSHGSNKSRPTLSLRTTWPSI 60
Query: 61 SIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLY 120
SI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVPFLLGLFYCTVGLIQLY
Sbjct: 61 SISLFASGFLLGPLLDGLHSRVNLVVYRTGSIHIGPLHTNIWVPFLLGLFYCTVGLIQLY 120
Query: 121 IDEKFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLD 180
+DEKFSPK+S+GSL +TVASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLD
Sbjct: 121 LDEKFSPKQSQGSLRKTVASLIALGLFIELSAEMYKAGVADNIEAYALFAGAEFIWALLD 180
Query: 181 SSLLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLI 240
SSLLGFSLACVLGLGCPLAEIPIMKFFHLW+YPKANIEIFGEGIISWT+TCYFVYTPFLI
Sbjct: 181 SSLLGFSLACVLGLGCPLAEIPIMKFFHLWEYPKANIEIFGEGIISWTVTCYFVYTPFLI 240
Query: 241 NLSRWLKSVV--DAAATTVNEDG 257
NLSRWLKSVV DAAA VNEDG
Sbjct: 241 NLSRWLKSVVDADAAAAAVNEDG 261
BLAST of Clc10G10820 vs. NCBI nr
Match:
XP_004150901.1 (uncharacterized protein LOC101205226 [Cucumis sativus] >XP_031739074.1 uncharacterized protein LOC101205226 [Cucumis sativus] >XP_031739075.1 uncharacterized protein LOC101205226 [Cucumis sativus] >KGN58127.1 hypothetical protein Csa_017408 [Cucumis sativus])
HSP 1 Score: 439.5 bits (1129), Expect = 2.0e-119
Identity = 226/259 (87.26%), Postives = 239/259 (92.28%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQF--SQSHFHSNSKISLLKPYCHCSSHGSKRTRSSLSLTTTWPSISI 60
MQSI +L FS SLQF S SHFHSNS+IS+ KP HCSSHGSK+ R SLSL TTWPSISI
Sbjct: 1 MQSIFALGFSQSLQFPHSHSHFHSNSRISVQKP--HCSSHGSKKPRISLSLRTTWPSISI 60
Query: 61 ALFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYID 120
+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVPFLLGLFYCTVGLIQLY+D
Sbjct: 61 SLFASGFLLGPLLDGLHSRVNLVVYRTGSIHIGPLHTNIWVPFLLGLFYCTVGLIQLYLD 120
Query: 121 EKFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSS 180
EKFS K+S+GSLG+TVASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSS
Sbjct: 121 EKFSLKQSQGSLGKTVASLIALGLFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSS 180
Query: 181 LLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINL 240
LLGFSLACVLGLGCPLAEIPIMKFFHLW+YPKANI+IFGEGIISWT+TCYFVYTPFLINL
Sbjct: 181 LLGFSLACVLGLGCPLAEIPIMKFFHLWEYPKANIDIFGEGIISWTVTCYFVYTPFLINL 240
Query: 241 SRWLKSVVDAAATTVNEDG 257
SRWLKSVVDAAA +E G
Sbjct: 241 SRWLKSVVDAAAVNEDESG 257
BLAST of Clc10G10820 vs. NCBI nr
Match:
KAG7036627.1 (hypothetical protein SDJN02_00246 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 413.3 bits (1061), Expect = 1.5e-111
Identity = 216/260 (83.08%), Postives = 233/260 (89.62%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQFSQSHFHSNSKISLLKPYCHCSSHGSKR-TRSSLSLTTTWPSISIA 60
MQSI +L FSHSLQFSQSHFH N K SLL+ +SHGS R R+SLSL T+WPSISIA
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRT---LASHGSNRKARTSLSLRTSWPSISIA 60
Query: 61 LFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYIDE 120
LFGSGFLLGPLLDGLHSRVNLVVY+ GS+ IGPL TNIWVPFLLG+FYCTVGLIQLYIDE
Sbjct: 61 LFGSGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDE 120
Query: 121 KFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSL 180
FSP R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSL
Sbjct: 121 NFSPNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSL 180
Query: 181 LGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINLS 240
LGFSLACV+GL CPLAEIPIMKFFHLW YP+AN+EIFGEG+ISWT+TCYFVYTPFLINLS
Sbjct: 181 LGFSLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLS 240
Query: 241 RWLKSVVDAAATTVNEDGPA 259
RWLKSV+D+AA V +DG A
Sbjct: 241 RWLKSVLDSAA--VKKDGSA 255
BLAST of Clc10G10820 vs. NCBI nr
Match:
XP_022949544.1 (uncharacterized protein LOC111452860 [Cucurbita moschata] >XP_022949545.1 uncharacterized protein LOC111452860 [Cucurbita moschata] >XP_022949546.1 uncharacterized protein LOC111452860 [Cucurbita moschata])
HSP 1 Score: 412.9 bits (1060), Expect = 2.0e-111
Identity = 216/260 (83.08%), Postives = 232/260 (89.23%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQFSQSHFHSNSKISLLKPYCHCSSHGSKR-TRSSLSLTTTWPSISIA 60
MQSI +L FSHSLQFSQSHFH N K SLL+ SSHGS R R+SLSL T+WPSISIA
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRT---LSSHGSNRKARTSLSLRTSWPSISIA 60
Query: 61 LFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYIDE 120
LFGSGFLLGPLLDGLHSRVNLVVY+ GS+ IGPL TNIWVPFLLG+FYCTVGLIQLYIDE
Sbjct: 61 LFGSGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDE 120
Query: 121 KFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSL 180
FSP R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSL
Sbjct: 121 NFSPNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSL 180
Query: 181 LGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINLS 240
LGFSLACV+GL CPLAEIPIMKFFHLW YP+AN+EIFGEG+ISWT+TCYFVYTPFLINLS
Sbjct: 181 LGFSLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLS 240
Query: 241 RWLKSVVDAAATTVNEDGPA 259
RWLKSV+D+ A V +DG A
Sbjct: 241 RWLKSVLDSVA--VKKDGSA 255
BLAST of Clc10G10820 vs. ExPASy TrEMBL
Match:
A0A1S3BI47 (uncharacterized protein LOC103489906 OS=Cucumis melo OX=3656 GN=LOC103489906 PE=4 SV=1)
HSP 1 Score: 441.4 bits (1134), Expect = 2.5e-120
Identity = 228/263 (86.69%), Postives = 241/263 (91.63%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQFSQSH----FHSNSKISLLKPYCHCSSHGSKRTRSSLSLTTTWPSI 60
MQSI +L FSHSLQF SH FHSNS+ISL KP HC+SHGS ++R +LSL TTWPSI
Sbjct: 1 MQSIFALGFSHSLQFPHSHSHSYFHSNSRISLQKP--HCTSHGSNKSRPTLSLRTTWPSI 60
Query: 61 SIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLY 120
SI+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVPFLLGLFYCTVGLIQLY
Sbjct: 61 SISLFASGFLLGPLLDGLHSRVNLVVYRTGSIHIGPLHTNIWVPFLLGLFYCTVGLIQLY 120
Query: 121 IDEKFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLD 180
+DEKFSPK+S+GSL +TVASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLD
Sbjct: 121 LDEKFSPKQSQGSLRKTVASLIALGLFIELSAEMYKAGVADNIEAYALFAGAEFIWALLD 180
Query: 181 SSLLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLI 240
SSLLGFSLACVLGLGCPLAEIPIMKFFHLW+YPKANIEIFGEGIISWT+TCYFVYTPFLI
Sbjct: 181 SSLLGFSLACVLGLGCPLAEIPIMKFFHLWEYPKANIEIFGEGIISWTVTCYFVYTPFLI 240
Query: 241 NLSRWLKSVV--DAAATTVNEDG 257
NLSRWLKSVV DAAA VNEDG
Sbjct: 241 NLSRWLKSVVDADAAAAAVNEDG 261
BLAST of Clc10G10820 vs. ExPASy TrEMBL
Match:
A0A0A0LBT1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G535110 PE=4 SV=1)
HSP 1 Score: 439.5 bits (1129), Expect = 9.7e-120
Identity = 226/259 (87.26%), Postives = 239/259 (92.28%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQF--SQSHFHSNSKISLLKPYCHCSSHGSKRTRSSLSLTTTWPSISI 60
MQSI +L FS SLQF S SHFHSNS+IS+ KP HCSSHGSK+ R SLSL TTWPSISI
Sbjct: 1 MQSIFALGFSQSLQFPHSHSHFHSNSRISVQKP--HCSSHGSKKPRISLSLRTTWPSISI 60
Query: 61 ALFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYID 120
+LF SGFLLGPLLDGLHSRVNLVVYRTGSI IGPLHTNIWVPFLLGLFYCTVGLIQLY+D
Sbjct: 61 SLFASGFLLGPLLDGLHSRVNLVVYRTGSIHIGPLHTNIWVPFLLGLFYCTVGLIQLYLD 120
Query: 121 EKFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSS 180
EKFS K+S+GSLG+TVASLIAL LFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSS
Sbjct: 121 EKFSLKQSQGSLGKTVASLIALGLFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSS 180
Query: 181 LLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINL 240
LLGFSLACVLGLGCPLAEIPIMKFFHLW+YPKANI+IFGEGIISWT+TCYFVYTPFLINL
Sbjct: 181 LLGFSLACVLGLGCPLAEIPIMKFFHLWEYPKANIDIFGEGIISWTVTCYFVYTPFLINL 240
Query: 241 SRWLKSVVDAAATTVNEDG 257
SRWLKSVVDAAA +E G
Sbjct: 241 SRWLKSVVDAAAVNEDESG 257
BLAST of Clc10G10820 vs. ExPASy TrEMBL
Match:
A0A6J1GD48 (uncharacterized protein LOC111452860 OS=Cucurbita moschata OX=3662 GN=LOC111452860 PE=4 SV=1)
HSP 1 Score: 412.9 bits (1060), Expect = 9.7e-112
Identity = 216/260 (83.08%), Postives = 232/260 (89.23%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQFSQSHFHSNSKISLLKPYCHCSSHGSKR-TRSSLSLTTTWPSISIA 60
MQSI +L FSHSLQFSQSHFH N K SLL+ SSHGS R R+SLSL T+WPSISIA
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRT---LSSHGSNRKARTSLSLRTSWPSISIA 60
Query: 61 LFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYIDE 120
LFGSGFLLGPLLDGLHSRVNLVVY+ GS+ IGPL TNIWVPFLLG+FYCTVGLIQLYIDE
Sbjct: 61 LFGSGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDE 120
Query: 121 KFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSL 180
FSP R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSL
Sbjct: 121 NFSPNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSL 180
Query: 181 LGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINLS 240
LGFSLACV+GL CPLAEIPIMKFFHLW YP+AN+EIFGEG+ISWT+TCYFVYTPFLINLS
Sbjct: 181 LGFSLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLS 240
Query: 241 RWLKSVVDAAATTVNEDGPA 259
RWLKSV+D+ A V +DG A
Sbjct: 241 RWLKSVLDSVA--VKKDGSA 255
BLAST of Clc10G10820 vs. ExPASy TrEMBL
Match:
A0A6J1DRY1 (uncharacterized protein LOC111023831 OS=Momordica charantia OX=3673 GN=LOC111023831 PE=4 SV=1)
HSP 1 Score: 403.7 bits (1036), Expect = 5.9e-109
Identity = 212/255 (83.14%), Postives = 222/255 (87.06%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQFSQSHFHSNSKISLLKPYCHCSSHGSKRTRSSLSLTTTWPSISIAL 60
MQSI L S LQF Q F S SK S LKP C SHGS+ R+SLSL TTWPSISIAL
Sbjct: 1 MQSIYGLGSSRYLQFPQFPFRSISKHSPLKPRC---SHGSESRRTSLSLRTTWPSISIAL 60
Query: 61 FGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYIDEK 120
FGSGFLLGPLLDGLHSRVNLVVY+TGSI +GPLHTNIWVPFLLGLFY TVGL+QLYIDE
Sbjct: 61 FGSGFLLGPLLDGLHSRVNLVVYQTGSIDVGPLHTNIWVPFLLGLFYSTVGLMQLYIDEN 120
Query: 121 FSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSLL 180
FS SEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAE IWA LDSSLL
Sbjct: 121 FSRNSSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAELIWAFLDSSLL 180
Query: 181 GFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINLSR 240
GFSLACV+GLGCPLAEIPIMKFFHLW YP+AN+EIFGEGIISW +TCYFVYTPFLINLSR
Sbjct: 181 GFSLACVVGLGCPLAEIPIMKFFHLWSYPQANVEIFGEGIISWIITCYFVYTPFLINLSR 240
Query: 241 WLKSVVDAAATTVNE 255
WLKSVVDAAA +E
Sbjct: 241 WLKSVVDAAAVKKDE 252
BLAST of Clc10G10820 vs. ExPASy TrEMBL
Match:
A0A6J1KGW6 (uncharacterized protein LOC111493103 OS=Cucurbita maxima OX=3661 GN=LOC111493103 PE=4 SV=1)
HSP 1 Score: 399.4 bits (1025), Expect = 1.1e-107
Identity = 214/261 (81.99%), Postives = 229/261 (87.74%), Query Frame = 0
Query: 1 MQSI-SLAFSHSLQFSQSHFHSNSKISLLKPYCHCSSHGSKR-TRSSLSLTTTWPSISIA 60
MQSI +L FSHSLQFSQSHFH N K SLL+ +HGS R R+SLSL T+WPSISIA
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRT---LGTHGSNRQARTSLSLRTSWPSISIA 60
Query: 61 LFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIWVPFLLGLFYCTVGLIQLYIDE 120
LFGSGFLLGPLLDGLHSRVNLVVY+ GS+ IGPL TNI VPFLLGLFYCTVGLIQLYIDE
Sbjct: 61 LFGSGFLLGPLLDGLHSRVNLVVYQIGSLDIGPLRTNICVPFLLGLFYCTVGLIQLYIDE 120
Query: 121 KFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSL 180
F P R EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSL
Sbjct: 121 NFLPNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSL 180
Query: 181 LGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGEGIISWTMTCYFVYTPFLINLS 240
LGFSLACV+GL CPLAEIPIMKFFHLW YP+AN+EIFGEG+ISWT+TCYFVYTPFLINLS
Sbjct: 181 LGFSLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLS 240
Query: 241 RWL-KSVVDAAATTVNEDGPA 259
RWL SVVD+AA V +DG A
Sbjct: 241 RWLMMSVVDSAA--VKKDGSA 256
BLAST of Clc10G10820 vs. TAIR 10
Match:
AT4G01935.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; Has 37 Blast hits to 37 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 30; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )
HSP 1 Score: 295.4 bits (755), Expect = 4.3e-80
Identity = 143/218 (65.60%), Postives = 178/218 (81.65%), Query Frame = 0
Query: 39 SKRTRSSLSLTTTW-PSISIALFGSGFLLGPLLDGLHSRVNLVVYRTGSIVIGPLHTNIW 98
S+R +S S +W +S++LFGSGF+LGPLLDG+HSRV+LVVY+ G+ IGPLHTNIW
Sbjct: 34 SQRKQSKTSTGKSWIVPVSLSLFGSGFVLGPLLDGIHSRVDLVVYQNGAFQIGPLHTNIW 93
Query: 99 VPFLLGLFYCTVGLIQLYIDEKFSPKRSEGSLGRTVASLIALALFIELSAEMYKAGVADN 158
VPFLLGLFYCTVGL+QL +DE S GSL +TV SL+AL F+ELSAEMYKAGV+DN
Sbjct: 94 VPFLLGLFYCTVGLLQLLLDETTSASPPRGSLDKTVISLLALMFFLELSAEMYKAGVSDN 153
Query: 159 IEAYALFAGAEFIWALLDSSLLGFSLACVLGLGCPLAEIPIMKFFHLWDYPKANIEIFGE 218
IEAY LFA AEFIW LD + + F++A +LG+ CPLAEIPIM+FFHLW YP+ANIEIFG+
Sbjct: 154 IEAYILFALAEFIWFSLDRTWICFTIATLLGVACPLAEIPIMQFFHLWYYPEANIEIFGQ 213
Query: 219 GIISWTMTCYFVYTPFLINLSRWLKSVVDAAATTVNED 256
G+++WT TCYFVYTPFLINL+RWL++V++ TT+ D
Sbjct: 214 GLVTWTTTCYFVYTPFLINLARWLRTVME--RTTIEVD 249
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038904924.1 | 1.3e-123 | 89.19 | uncharacterized protein LOC120091134 [Benincasa hispida] >XP_038904925.1 unchara... | [more] |
XP_008447464.1 | 5.3e-120 | 86.69 | PREDICTED: uncharacterized protein LOC103489906 [Cucumis melo] >XP_008447465.1 P... | [more] |
XP_004150901.1 | 2.0e-119 | 87.26 | uncharacterized protein LOC101205226 [Cucumis sativus] >XP_031739074.1 uncharact... | [more] |
KAG7036627.1 | 1.5e-111 | 83.08 | hypothetical protein SDJN02_00246 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022949544.1 | 2.0e-111 | 83.08 | uncharacterized protein LOC111452860 [Cucurbita moschata] >XP_022949545.1 unchar... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3BI47 | 2.5e-120 | 86.69 | uncharacterized protein LOC103489906 OS=Cucumis melo OX=3656 GN=LOC103489906 PE=... | [more] |
A0A0A0LBT1 | 9.7e-120 | 87.26 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G535110 PE=4 SV=1 | [more] |
A0A6J1GD48 | 9.7e-112 | 83.08 | uncharacterized protein LOC111452860 OS=Cucurbita moschata OX=3662 GN=LOC1114528... | [more] |
A0A6J1DRY1 | 5.9e-109 | 83.14 | uncharacterized protein LOC111023831 OS=Momordica charantia OX=3673 GN=LOC111023... | [more] |
A0A6J1KGW6 | 1.1e-107 | 81.99 | uncharacterized protein LOC111493103 OS=Cucurbita maxima OX=3661 GN=LOC111493103... | [more] |
Match Name | E-value | Identity | Description | |
AT4G01935.1 | 4.3e-80 | 65.60 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |