Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAAAATTTGAACTCTCAAATTTATTCAAATGTTATAATTTATATATATATATATATAAATTTAAAAGTTAAATTTGACTTAATTTTAAACTTTCTAAAAAGTTGAGGGACTCAATTTTTTAAATTGAAAATTTAAAATTGAGATTACAACCCCCACTATTCTTTCCTGGGTGGTTTTTTTTGCAATTTATTATTATTATTATTATTTTTGTTTTGTTTTGTAAAAAGAAGACAAAATTAAGCGTGGAGTTGTGGACTCGTACTCGTACACGTAATTACGAAACGACATGTCACACTTACAACGCCAGACCTAAAACCCTTATCCGACGTCGTTTTACACTTCCAGATCCCTTTCTTCTCTCTCCGAACCATCGTCTTCTTCCTCTTCGAAGGGATCAGATTTCTCACGGCGGCACTTCCCTCTCTTCACTCCCTCCGTTCCCGCCTCCGAAATGTCTGATCAATCTCCTCCTCCTCCTCCTCCCGCCGGCGAGTCCGAGTCCCGTCCCGTCGGTGGCACCGAGCACAGCTGGTGCCGCGCCGTCCCCGGCGGCACCGGCACCACTGTCCTCGGCCTACTCCTCTCAAAACCTCCCGATATTCCCCATCTCCAATCCTCCCTCCACACTCTCCAAAACCTCCACCCAATCCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGATTTCTCCTTCCTCATTCCTCCTTCTCCGCCGCTTAACCTCCAGATCCTCGACCTCGCCGCCACCGCAAGCGCTATCGCCTCTCATCCCGACGCCAAGGATCCTTCCGTCTCCGATTTCCACAAGATCCACGAAACGGAGATCAACCGCGCCACGTGGTTCGATCCAAACCATCCGTCGTACTCCGACACCGACGTGATGTTCGCTACCGTCTACACCGTATGCGACAGCCAATGGGCGGTATTCCTCCGCCTCCACACGGCGACATGCGATCGTGCCGCGGCGGCGGCACTGTTGAGAGAACTGCTAGTGCTTGTGGCGGCCGGAGGGGAAATAGAGGGCGGAGGATTTGAAATTGGGGATAATGGTGAGATCGGATTAGGGATTGAAAATCTAATCCCTAACGGTAAAGCGAATAAATCTCTGTGGGCGCGTGGATTAGATATGCTTGGTTACTCCTTGAATTCGTTCCGATTAGCGAATTTGGAATTCAAAGACGCGAATTCTGAAAGATTTTCTCAGATGATTAGGTTGAAGATGAACTCCGATGAGACTCAGAAACTTCTCGCTGTAAGTTTTTTCTCTTTCTCCGCATAATTTCCTTCATTTTCTCGGGAAAATGTTGGGAGTTTTTTCTTTTTAATTTTACGTAGGTATATCAATATTCAAATTTCAACAATGATTAATGCAAGTGGGACCCTCTTCTATCTCAAAACCCAATCAGATTTCACTTTCAATCTAAATGATAATATTCATATACAATTTTCCCATTTTCTGTTTTTTTTATGTACCAAAATATTTCTAGCAGTTTTTAATCTTTACTTTTGTATTTATATTGTTTATAAACCCTGAATTTAATCCTTTTGTGTATTATCATTTATCAACCTAGGTTTTCATTCTTCATAAAACATTCAAATTCTTTCATTTTAATACTTAGTGAATTCACGATTCTTTTAAGACATAATTAACAGATATTTAAAGGAGAAAAAAGTATAAGAAAAGTTTATAATTTCTACTAAATTGTAAAAAATTTAGAGGATGAATTAGAAACTTTCATAATTAAAAAAAAAAAAACACTATTGGTCCTTAACTTTCACGGAGTACCAATATAATTTCTAAACTTTAGTTTGTAACTATTTAGTTATTATCAATTTACTAATAATTTAATCTTGAACTTTAGTACGTAACAATTTAGTCATTTTAATTTTAAATTTATTAGATATCAACTTTATGATGTATACTTTGTATTTATAAGAATATTGAATTTCTAATTAATTTATCGATTTATTTATATAAAAAATCTTAAGATTAATTTCTACAAAAGTGATTAAACCATTACAAATTTCAAAGTATGAGAAGTGAATTGTTACACAGTAAATTTTAAGGCCTAAATTATTACAAAATTAATAATACATGAACTAAATCATTACAAAAATAAAAATTAGAGACTTAATTGTTACTTTACCTAAAAGTGATTTTTTAGCATGATAAAAGGATTAAAATCTTTTAGGAATTTTTTTAAAAAATTTATTTATAGGGAAATAACATTTGTATTTGCTAATTTCAACTAAACAATCATATTTGTAAATCTTAGAAGAAAAAAAAAGTTGTCTTTTCAATATATTTATTGAGTAAAAGTCTTTTCAATATTTTAATTTATACTCGACACAATGTTTGATTATTATTATTTTAAAAGGGAATGAGTTTTATTACGGTGTGGTTAGGTGTAAGGAGTCAGGTTTGGTAGTGTGAGATGTCCACGCTGTCACTGTCGTTTGGCTTTGTACGTTTGATGTAAAGTGCGGGTGGTGGGGACGACATGTAGCCTCGAAATAATAAATATTTATTTTCTATTATAAAGCCTTCCATTGTTATAAACATGCTTTTATATGGTATTATAATTATAAAAAAATACTCTTTTTCTTTTTTTAAGTATGTTTTCAAATATATCAAAATAAACTCTATAAAATATAAAAAATTTATATTGTCTATCTGCAATAAACTACAATAAATTTCTAGCGATAGAATACAATAATTTATATATTTATAAATAATTTGATATTTTTATTTATTTATAATAAATTTTCTTCCTTTCATTATTTTGTTTGTTTGTGTTTAATGATGTACTTTTTCTTTTCAACTAAACAAGGGAAAAAAAAACTTATATTATTTTGTTTGTTTGTGTTTAATGATGTACTTTTTCTTTTCAACTTAAAAAAAAACTTATGTCCTCAAAATTTTGAGAAATAATAGGACAAATTTCGTTGGCAATTTATATTTTATTTTATATTTTTAGATTTATTGTGTTAAACTACTACTTAGATTATATTTTTAAAAATTGTTTTTGCATTTTGTGCTAAACAATGCAAATGAAATTAACATATTTTTTTTATTGTATCTATATTTTAAATTTTAAATTTAGAATTATATTAAATAAAGATTAAAATATTTATTAATACAATATTTTTGTAAATTCAATTTATTTGTTTTAAAATTAAAATGTCTATAAATATTACAAAATAATAGTTATACAATTTTTTTAACATAACATGTTCACAAAGAAATTACTAATAGTTTGTTGTCAACTAATATTTATTGGATAACTATTATTAGTTTTATGATATATAAATATATTATCAAATACATTTTAAATAATTGTTCAAGTGAGATATTCAATTTTTGAATCAAAAATACTTGAAATAGACTTTTAGTTTTTCTTAAGGTGCTTTTTTAGTTACCAAAATTTTAAGAATAGGTTTTAAATGTGATTGAAGTTTTTTTTTAAATTTTTATTTTATTTTATTTTTTTTCATTTTAAACAATAGTATGATATTTTGAATCAGAAATTGTTTTAAAAAAAAATTATTTATTTCTCTTCCACTTTCTTTTCTGTTTCTTTCTTTAGTTCAATATGCGAACAAATGGCTACCAGTTTCAAACTACACGTTTCCATTTAACTATTTCTAGTAAATTAATTTTATTTTTAACCGAATATAATAGTTATGCTTTTTTTCTAACCAAAAAAATAATAATAATAATTAAAAAAAAAATAATAACACTATCGAAGAAGAAGAGAGAGGATATGATGTGAGGAGAGAAGGAGAATGTTGGAAGGGAGGGAGAGGAGATAAATTTTTAAAACTATTTTTTAAATTTAAAATGTAAAAAATTGTTTAAAGGATCATATGAACCTATTCTTAAAGGTTAGACTAACTAAAAATTATTTTAAAAATTAAGTCTTCAAAGTTTTTGAAAATCAAGCTTATTTCTTCTCAATTTTTTTTTCGATGATTTTTATCCCTTTTACGTAAAAGAGTTTAATTTTTTGCCAAATTTTAAAAATAATAATATATTTGTTTTTAGTTTTAAAAAATATGATTTGGTTTTTAAAAAATTAGTAGAAAGTAGACAACCAAATGTATAAATTTTATGGTAAAAATTAGTGCTTATAAACTTAATTTTAAAAAACAAAAACTAAGGCCACGTTTTATTGGGCCGATAAACACTTGTTCCATCTCTAAATTCTTTAAATCTATTTTTTAGTTTTTACCAATGTTTTTAAAAACCAAACCTAATTTTAAAAAAAAAATAGTTTTAGAAAAACTTTTTGTATTTTTTAAATTTGACTAAGAATACAACTCTTCTACTCAACTACTCAAGTAAGAGTAATATCATTGTCATAAATTGAGAGAAAGTTGGCTTAAATTAAAAAAAAAAAAACGAAATAGTTATCAAACGAAACCTAAATAATAGGTTATTAAACCATGCTTGTATTTTCTCTGATTTGATATGAATGTTTGAAGGGCTGCAAATCGAGAGGCATTAAGCTCTGTGGAGCTTTGGCAGCTGCTGGATTGATTGCTACTCGTTGTTCTAAGGACCTTCCTCCTCACCAGAGGGAGAAATATGCTGTTGTTACTCTCAATGATTGTCGTTCCCTCCTTGATCCTCCCCTCACAAGCCACCATTTAGGTAAATTCAACTCCCCTTCTCTTTTCTGGACTCGCTCCTATATCGCTCCATTAGCTCTACGTTTTTTCAACTTTCTCTGCATATTAAGAACTACGGATACTTCAGTTTTTCAAAAGAGTATAAAGTCTCACTATGTACGTGTGTCGTATATGTGCCTTGTATTTATATCCATGTTTCTTAACTAGTGTGTCATGTGAACATGTCAACCTCCCACATTGGATGATTTGTCTTGTAGATTGTTAAATCGCCAATGTAATCAACCCAAATGCTTAAGTTAATGGGTGATGATAAATTTAATTAATCACTTGACATTTCCCGTTCACTTGTAGGTTTGAAAATTTGTAAAAGTCCTAACAAGTGGAAAGCATAGTTAGTTGGTGAGAAAATGACATTACATAGGTTTGAACGCACACTCTGATACCATATTAAAACACTAATCAACTCAATAAAGCTTAAGTTGATGGGTTATGGTGAATTTAATTATATCAACACTTTAACATGATTTAACCTTATAACTATTGTAGTTTTTGCTAACTTGAGGATAGTTAGGATGGTTAAGGCATATATGTCCACAACTAAGAGGTCAGAAGTTCAAATATTTCATACTCGGTCATTGAACTAAGGAAACGGTTGTAATTTTTCATGTAAAAATACTCTAAATTCTCGATTGATAACAATTTGGCGTATTTTGTTTTTGAAAATTAAGCTTATAAACAATTTGGTGTGTTTTGTTTTTGAAAATTAAGCTTACAAATGCAACTTTCGCCTATAAGTTTATTTGCTTTGTTATCTTTTCCAAAACTCAAAACAAAAAGTCAAACAGTTATCGAACATGACCTTGATTCTCAGTTGAACAAACAAAAAAGCATATAAATTGAATAGAAGATAATCATTGGGATTAAGTTTTTTAGTTATTAATTATTTTTCTTTTTTACATTTCAGGATTCTATCACTCTGCCATCCTCAACACACATGACATATCAGCTGAAGATACACTATGGGAAGTGGCAAAGCGATGCTATTTTTCCTTCTCAAATGCCAAAGACAGCAACAAGCATTTCTCAGACATGTCTGACTTGAACTTCCTCATGTGCAAAGCGATTGAAAATCCCAGCCTCACTCCCTCTTCGTCCATGAGAACTGCCCTGATCTCGGTCTTTGAAGATCCCATCATCGAAACTTCCGGTCCTGCGCAGCAGCACATCGGCCTACACGACTACATCGGTTGTGCCTCTGCACACGGTGTTGGGCCATCGATCGCCTTCTTTGATATGATTCGTGACGGTCAGTTGGATTGTGCTTGTGTGTACCCGTCGCCTTTGTTCTCTCGAGAACAAATGAACCAAATTTTTGATGAGATGAAGAAAATTCTGGTGAATAATGCCATGGAAGTAGTTGAAGGTTAAGAATTTAGTTGTTGGACTGGGATCGAAGTTCCAGGTAGCTCGATAAGGGGAAGATCCACGATATATTAGCGTGGACAATTATCTTTTATATTCAAAATAAAGTCACGAACACTTATATTCAAAAAAGTTAATATCATATAACTGTGAAGTAATTAATGGGTCATTGTCTTTAACAATAGTCATAATGCCTTTTAACATTCAACAAATATTTAATGACGACAAATTCAACTGTCCAAACAAATTATTTCAT
mRNA sequence
TAAAATTTGAACTCTCAAATTTATTCAAATGTTATAATTTATATATATATATATATAAATTTAAAAGTTAAATTTGACTTAATTTTAAACTTTCTAAAAAGTTGAGGGACTCAATTTTTTAAATTGAAAATTTAAAATTGAGATTACAACCCCCACTATTCTTTCCTGGGTGGTTTTTTTTGCAATTTATTATTATTATTATTATTTTTGTTTTGTTTTGTAAAAAGAAGACAAAATTAAGCGTGGAGTTGTGGACTCGTACTCGTACACGTAATTACGAAACGACATGTCACACTTACAACGCCAGACCTAAAACCCTTATCCGACGTCGTTTTACACTTCCAGATCCCTTTCTTCTCTCTCCGAACCATCGTCTTCTTCCTCTTCGAAGGGATCAGATTTCTCACGGCGGCACTTCCCTCTCTTCACTCCCTCCGTTCCCGCCTCCGAAATGTCTGATCAATCTCCTCCTCCTCCTCCTCCCGCCGGCGAGTCCGAGTCCCGTCCCGTCGGTGGCACCGAGCACAGCTGGTGCCGCGCCGTCCCCGGCGGCACCGGCACCACTGTCCTCGGCCTACTCCTCTCAAAACCTCCCGATATTCCCCATCTCCAATCCTCCCTCCACACTCTCCAAAACCTCCACCCAATCCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGATTTCTCCTTCCTCATTCCTCCTTCTCCGCCGCTTAACCTCCAGATCCTCGACCTCGCCGCCACCGCAAGCGCTATCGCCTCTCATCCCGACGCCAAGGATCCTTCCGTCTCCGATTTCCACAAGATCCACGAAACGGAGATCAACCGCGCCACGTGGTTCGATCCAAACCATCCGTCGTACTCCGACACCGACGTGATGTTCGCTACCGTCTACACCGTATGCGACAGCCAATGGGCGGTATTCCTCCGCCTCCACACGGCGACATGCGATCGTGCCGCGGCGGCGGCACTGTTGAGAGAACTGCTAGTGCTTGTGGCGGCCGGAGGGGAAATAGAGGGCGGAGGATTTGAAATTGGGGATAATGGTGAGATCGGATTAGGGATTGAAAATCTAATCCCTAACGGTAAAGCGAATAAATCTCTGTGGGCGCGTGGATTAGATATGCTTGGTTACTCCTTGAATTCGTTCCGATTAGCGAATTTGGAATTCAAAGACGCGAATTCTGAAAGATTTTCTCAGATGATTAGGTTGAAGATGAACTCCGATGAGACTCAGAAACTTCTCGCTGGCTGCAAATCGAGAGGCATTAAGCTCTGTGGAGCTTTGGCAGCTGCTGGATTGATTGCTACTCGTTGTTCTAAGGACCTTCCTCCTCACCAGAGGGAGAAATATGCTGTTGTTACTCTCAATGATTGTCGTTCCCTCCTTGATCCTCCCCTCACAAGCCACCATTTAGGATTCTATCACTCTGCCATCCTCAACACACATGACATATCAGCTGAAGATACACTATGGGAAGTGGCAAAGCGATGCTATTTTTCCTTCTCAAATGCCAAAGACAGCAACAAGCATTTCTCAGACATGTCTGACTTGAACTTCCTCATGTGCAAAGCGATTGAAAATCCCAGCCTCACTCCCTCTTCGTCCATGAGAACTGCCCTGATCTCGGTCTTTGAAGATCCCATCATCGAAACTTCCGGTCCTGCGCAGCAGCACATCGGCCTACACGACTACATCGGTTGTGCCTCTGCACACGGTGTTGGGCCATCGATCGCCTTCTTTGATATGATTCGTGACGGTCAGTTGGATTGTGCTTGTGTGTACCCGTCGCCTTTGTTCTCTCGAGAACAAATGAACCAAATTTTTGATGAGATGAAGAAAATTCTGGTGAATAATGCCATGGAAGTAGTTGAAGGTTAAGAATTTAGTTGTTGGACTGGGATCGAAGTTCCAGGTAGCTCGATAAGGGGAAGATCCACGATATATTAGCGTGGACAATTATCTTTTATATTCAAAATAAAGTCACGAACACTTATATTCAAAAAAGTTAATATCATATAACTGTGAAGTAATTAATGGGTCATTGTCTTTAACAATAGTCATAATGCCTTTTAACATTCAACAAATATTTAATGACGACAAATTCAACTGTCCAAACAAATTATTTCAT
Coding sequence (CDS)
ATGTCTGATCAATCTCCTCCTCCTCCTCCTCCCGCCGGCGAGTCCGAGTCCCGTCCCGTCGGTGGCACCGAGCACAGCTGGTGCCGCGCCGTCCCCGGCGGCACCGGCACCACTGTCCTCGGCCTACTCCTCTCAAAACCTCCCGATATTCCCCATCTCCAATCCTCCCTCCACACTCTCCAAAACCTCCACCCAATCCTCCGCTCCAAAATCCACCACGATCCTTCCCGACGAGATTTCTCCTTCCTCATTCCTCCTTCTCCGCCGCTTAACCTCCAGATCCTCGACCTCGCCGCCACCGCAAGCGCTATCGCCTCTCATCCCGACGCCAAGGATCCTTCCGTCTCCGATTTCCACAAGATCCACGAAACGGAGATCAACCGCGCCACGTGGTTCGATCCAAACCATCCGTCGTACTCCGACACCGACGTGATGTTCGCTACCGTCTACACCGTATGCGACAGCCAATGGGCGGTATTCCTCCGCCTCCACACGGCGACATGCGATCGTGCCGCGGCGGCGGCACTGTTGAGAGAACTGCTAGTGCTTGTGGCGGCCGGAGGGGAAATAGAGGGCGGAGGATTTGAAATTGGGGATAATGGTGAGATCGGATTAGGGATTGAAAATCTAATCCCTAACGGTAAAGCGAATAAATCTCTGTGGGCGCGTGGATTAGATATGCTTGGTTACTCCTTGAATTCGTTCCGATTAGCGAATTTGGAATTCAAAGACGCGAATTCTGAAAGATTTTCTCAGATGATTAGGTTGAAGATGAACTCCGATGAGACTCAGAAACTTCTCGCTGGCTGCAAATCGAGAGGCATTAAGCTCTGTGGAGCTTTGGCAGCTGCTGGATTGATTGCTACTCGTTGTTCTAAGGACCTTCCTCCTCACCAGAGGGAGAAATATGCTGTTGTTACTCTCAATGATTGTCGTTCCCTCCTTGATCCTCCCCTCACAAGCCACCATTTAGGATTCTATCACTCTGCCATCCTCAACACACATGACATATCAGCTGAAGATACACTATGGGAAGTGGCAAAGCGATGCTATTTTTCCTTCTCAAATGCCAAAGACAGCAACAAGCATTTCTCAGACATGTCTGACTTGAACTTCCTCATGTGCAAAGCGATTGAAAATCCCAGCCTCACTCCCTCTTCGTCCATGAGAACTGCCCTGATCTCGGTCTTTGAAGATCCCATCATCGAAACTTCCGGTCCTGCGCAGCAGCACATCGGCCTACACGACTACATCGGTTGTGCCTCTGCACACGGTGTTGGGCCATCGATCGCCTTCTTTGATATGATTCGTGACGGTCAGTTGGATTGTGCTTGTGTGTACCCGTCGCCTTTGTTCTCTCGAGAACAAATGAACCAAATTTTTGATGAGATGAAGAAAATTCTGGTGAATAATGCCATGGAAGTAGTTGAAGGTTAA
Protein sequence
MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHKIHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLRELLVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQREKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG
Homology
BLAST of Clc10G18370 vs. NCBI nr
Match:
XP_038905440.1 (uncharacterized protein LOC120091472 [Benincasa hispida])
HSP 1 Score: 899.4 bits (2323), Expect = 1.3e-257
Identity = 444/478 (92.89%), Postives = 459/478 (96.03%), Query Frame = 0
Query: 1 MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
MSDQS PPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1 MSDQS---PPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
Query: 61 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
QNLHPILRSKIHHDPSRRDFSFLI PSPPL+LQILDL ATA AIASHPDA DPSVSDFHK
Sbjct: 61 QNLHPILRSKIHHDPSRRDFSFLISPSPPLHLQILDLPATARAIASHPDANDPSVSDFHK 120
Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
IHE EIN ATWFDPNHPSYSDTDVMFATVYT+ DSQWA+FLRLHTATCDRAAAAALLREL
Sbjct: 121 IHEQEINSATWFDPNHPSYSDTDVMFATVYTMSDSQWAIFLRLHTATCDRAAAAALLREL 180
Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
LVL A GGEIEGGGFEIGDNGEIGLGIE+LIPNGKANKSLWARGLDMLGYSLNSFRLANL
Sbjct: 181 LVLTATGGEIEGGGFEIGDNGEIGLGIEDLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQR 300
EFKDANSERFSQMIRLKMNS ETQKLLAGCK RG+KLCGALAAAGL+ATRCSKDLP HQ+
Sbjct: 241 EFKDANSERFSQMIRLKMNSHETQKLLAGCKLRGVKLCGALAAAGLLATRCSKDLPLHQK 300
Query: 301 EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDS 360
EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFS+SNAKD+
Sbjct: 301 EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSYSNAKDN 360
Query: 361 NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGC 420
NKHFSDMSDLNFLMCKAIENP LTPSSSMRTALISVFEDPII+TSGP QQ++GLHDY GC
Sbjct: 361 NKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIIDTSGPEQQNLGLHDYSGC 420
Query: 421 ASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
ASAHGVGPSIA FDMIRDGQLDCACVYPSPLFSR+QMN+IFDEMKKILVNNAMEVVEG
Sbjct: 421 ASAHGVGPSIALFDMIRDGQLDCACVYPSPLFSRDQMNRIFDEMKKILVNNAMEVVEG 475
BLAST of Clc10G18370 vs. NCBI nr
Match:
XP_008442855.1 (PREDICTED: uncharacterized protein LOC103486623 [Cucumis melo])
HSP 1 Score: 869.8 bits (2246), Expect = 1.1e-248
Identity = 437/479 (91.23%), Postives = 450/479 (93.95%), Query Frame = 0
Query: 1 MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
MSDQS PPPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1 MSDQS--PPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
Query: 61 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
QNLHPILRSKIHHDP RRDFSFLIP SP L+LQILDLAAT AIASHPDA DPSVSDFHK
Sbjct: 61 QNLHPILRSKIHHDPLRRDFSFLIPASPSLHLQILDLAATTRAIASHPDADDPSVSDFHK 120
Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
IHE EINR WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 121 IHEHEINRVIWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 180
Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
LVL A GGEIEGG FEIGDNGEIGLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 181 LVLAADGGEIEGGRFEIGDNGEIGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 240
Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
EFKD NSERFSQMIRLKMNSDETQKLLAGCK RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 241 EFKDPNSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDHLPPYQ 300
Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
+EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAED LWEVA RCYFSFSNAKD
Sbjct: 301 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDKLWEVANRCYFSFSNAKD 360
Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
+NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGP Q++GL+DYIG
Sbjct: 361 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPGHQNLGLNDYIG 420
Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSR+QMNQIFDEMKKILVN+A+EV EG
Sbjct: 421 YASAHGVGPSIALFDTIRDGQLDCACVYPSPLFSRDQMNQIFDEMKKILVNSAVEVNEG 477
BLAST of Clc10G18370 vs. NCBI nr
Match:
XP_004149221.3 (uncharacterized protein LOC101208906 [Cucumis sativus] >KGN59146.1 hypothetical protein Csa_000929 [Cucumis sativus])
HSP 1 Score: 868.6 bits (2243), Expect = 2.5e-248
Identity = 433/479 (90.40%), Postives = 452/479 (94.36%), Query Frame = 0
Query: 1 MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
MSDQS PPPP ES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 50 MSDQSLPPPP--AESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 109
Query: 61 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
QNLHPILRSKIHHDPSRRDFSFLIPPSPPL+LQILDLAATA AIASHPDA DPSVSDFHK
Sbjct: 110 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAIASHPDADDPSVSDFHK 169
Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
IHE EINR WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 170 IHEHEINRVMWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 229
Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
LVL A GGEIEGGGFE GDNGE+GLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 230 LVLAAGGGEIEGGGFETGDNGEVGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 289
Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
EFKD N+ERFSQMIRL+MNSDETQKLLAGCK RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 290 EFKDPNTERFSQMIRLRMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDHLPPYQ 349
Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
+EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDT+WEVA RCYFSFSNAKD
Sbjct: 350 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTVWEVASRCYFSFSNAKD 409
Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
+NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIE SGP QQ++GLHDYIG
Sbjct: 410 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIEISGPEQQNLGLHDYIG 469
Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
ASAHGVGPSIA FD IRDGQLD ACVYPSPLFSR+QMN+IFD+MKKILVN+++EV EG
Sbjct: 470 YASAHGVGPSIAIFDTIRDGQLDSACVYPSPLFSRDQMNRIFDDMKKILVNSSVEVNEG 526
BLAST of Clc10G18370 vs. NCBI nr
Match:
KAA0043871.1 (GATA zinc finger domain-containing protein isoform 1 [Cucumis melo var. makuwa] >TYK25265.1 GATA zinc finger domain-containing protein isoform 1 [Cucumis melo var. makuwa])
HSP 1 Score: 826.6 bits (2134), Expect = 1.1e-235
Identity = 420/479 (87.68%), Postives = 433/479 (90.40%), Query Frame = 0
Query: 1 MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
MSDQS PPPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1 MSDQS--PPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
Query: 61 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
QNLHPILRSKIHHDP RRDFSFLIP SP L+LQILDLAAT AIASHPDA DPSVSDFHK
Sbjct: 61 QNLHPILRSKIHHDPLRRDFSFLIPASPSLHLQILDLAATTRAIASHPDADDPSVSDFHK 120
Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
IHE EINR WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 121 IHEHEINRVIWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 180
Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
LVL A GGEIEGG FEIGDNGEIGLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 181 LVLAADGGEIEGGRFEIGDNGEIGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 240
Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
EFKD NSERFSQMI RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 241 EFKDPNSERFSQMI------------------RGIKLCGALAAAGLIATRCSKDHLPPYQ 300
Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
+EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAED LWEVA RCYFSFSNAKD
Sbjct: 301 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDKLWEVANRCYFSFSNAKD 360
Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
+NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGP Q++GL+DYIG
Sbjct: 361 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPGHQNLGLNDYIG 420
Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSR+QMNQIFDEMKKILVN+A+EV EG
Sbjct: 421 YASAHGVGPSIALFDTIRDGQLDCACVYPSPLFSRDQMNQIFDEMKKILVNSAVEVNEG 459
BLAST of Clc10G18370 vs. NCBI nr
Match:
XP_022982938.1 (uncharacterized protein LOC111481632 [Cucurbita maxima])
HSP 1 Score: 823.5 bits (2126), Expect = 9.1e-235
Identity = 405/465 (87.10%), Postives = 431/465 (92.69%), Query Frame = 0
Query: 14 ESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILRSKIHH 73
E RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDI HLQ+SLH LQNLHPILRSKIHH
Sbjct: 4 EIRIRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHH 63
Query: 74 DPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHKIHETEINRATWFD 133
DPSRRDFSFLIPPSP ++LQILDLAA A AIASHPDA DPS+SDFHKI E EINRA W +
Sbjct: 64 DPSRRDFSFLIPPSPSIHLQILDLAAAARAIASHPDADDPSISDFHKILEHEINRAKWLN 123
Query: 134 PNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLRELLVLVAAGGEIEGG 193
P+HPSYSDTDVMFATVY V D QWAVFL LHTA CDR AAA+LLRELLVL AA G+IEGG
Sbjct: 124 PSHPSYSDTDVMFATVYAVSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAEGKIEGG 183
Query: 194 GFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEFKDANSERFSQM 253
GF+IGDNGEIG GIE+LIP+GKA+K LWARGLDMLGYSLNSFR ANLEFKDA+SERFSQM
Sbjct: 184 GFKIGDNGEIGSGIEDLIPSGKAHKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQM 243
Query: 254 IRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQREKYAVVTLNDCRS 313
IRLK+NSDETQKLLAGCKSRGIKLCGAL AAGLIATRCSKDLPPHQ EKYAVVTL DCRS
Sbjct: 244 IRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPHQTEKYAVVTLIDCRS 303
Query: 314 LLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSNKHFSDMSDLNFL 373
LLDPPLT+HHLGFYHSAILNTHDISAEDTLWEV++RCYFSFSNAKD+NKHF+DMSDLNFL
Sbjct: 304 LLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVSERCYFSFSNAKDNNKHFTDMSDLNFL 363
Query: 374 MCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGCASAHGVGPSIAFF 433
M KAIENP LTPSSSMRTALIS FEDPII TS PAQQH+G+ DYIGCASAHGVGPSIA F
Sbjct: 364 MGKAIENPGLTPSSSMRTALISAFEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALF 423
Query: 434 DMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
D+IRDGQLDCACVYPSPLFSR+QMNQ+FDEMKKILV++AMEVVEG
Sbjct: 424 DLIRDGQLDCACVYPSPLFSRDQMNQLFDEMKKILVSSAMEVVEG 468
BLAST of Clc10G18370 vs. ExPASy TrEMBL
Match:
A0A1S3B7G8 (uncharacterized protein LOC103486623 OS=Cucumis melo OX=3656 GN=LOC103486623 PE=4 SV=1)
HSP 1 Score: 869.8 bits (2246), Expect = 5.4e-249
Identity = 437/479 (91.23%), Postives = 450/479 (93.95%), Query Frame = 0
Query: 1 MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
MSDQS PPPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1 MSDQS--PPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
Query: 61 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
QNLHPILRSKIHHDP RRDFSFLIP SP L+LQILDLAAT AIASHPDA DPSVSDFHK
Sbjct: 61 QNLHPILRSKIHHDPLRRDFSFLIPASPSLHLQILDLAATTRAIASHPDADDPSVSDFHK 120
Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
IHE EINR WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 121 IHEHEINRVIWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 180
Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
LVL A GGEIEGG FEIGDNGEIGLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 181 LVLAADGGEIEGGRFEIGDNGEIGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 240
Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
EFKD NSERFSQMIRLKMNSDETQKLLAGCK RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 241 EFKDPNSERFSQMIRLKMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDHLPPYQ 300
Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
+EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAED LWEVA RCYFSFSNAKD
Sbjct: 301 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDKLWEVANRCYFSFSNAKD 360
Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
+NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGP Q++GL+DYIG
Sbjct: 361 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPGHQNLGLNDYIG 420
Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSR+QMNQIFDEMKKILVN+A+EV EG
Sbjct: 421 YASAHGVGPSIALFDTIRDGQLDCACVYPSPLFSRDQMNQIFDEMKKILVNSAVEVNEG 477
BLAST of Clc10G18370 vs. ExPASy TrEMBL
Match:
A0A0A0LGP2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G777540 PE=4 SV=1)
HSP 1 Score: 868.6 bits (2243), Expect = 1.2e-248
Identity = 433/479 (90.40%), Postives = 452/479 (94.36%), Query Frame = 0
Query: 1 MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
MSDQS PPPP ES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 50 MSDQSLPPPP--AESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 109
Query: 61 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
QNLHPILRSKIHHDPSRRDFSFLIPPSPPL+LQILDLAATA AIASHPDA DPSVSDFHK
Sbjct: 110 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLHLQILDLAATARAIASHPDADDPSVSDFHK 169
Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
IHE EINR WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 170 IHEHEINRVMWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 229
Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
LVL A GGEIEGGGFE GDNGE+GLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 230 LVLAAGGGEIEGGGFETGDNGEVGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 289
Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
EFKD N+ERFSQMIRL+MNSDETQKLLAGCK RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 290 EFKDPNTERFSQMIRLRMNSDETQKLLAGCKLRGIKLCGALAAAGLIATRCSKDHLPPYQ 349
Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
+EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDT+WEVA RCYFSFSNAKD
Sbjct: 350 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTVWEVASRCYFSFSNAKD 409
Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
+NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIE SGP QQ++GLHDYIG
Sbjct: 410 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIEISGPEQQNLGLHDYIG 469
Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
ASAHGVGPSIA FD IRDGQLD ACVYPSPLFSR+QMN+IFD+MKKILVN+++EV EG
Sbjct: 470 YASAHGVGPSIAIFDTIRDGQLDSACVYPSPLFSRDQMNRIFDDMKKILVNSSVEVNEG 526
BLAST of Clc10G18370 vs. ExPASy TrEMBL
Match:
A0A5A7TQM7 (GATA zinc finger domain-containing protein isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003980 PE=4 SV=1)
HSP 1 Score: 826.6 bits (2134), Expect = 5.2e-236
Identity = 420/479 (87.68%), Postives = 433/479 (90.40%), Query Frame = 0
Query: 1 MSDQSPPPPPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
MSDQS PPPPAGES+SRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL
Sbjct: 1 MSDQS--PPPPAGESKSRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTL 60
Query: 61 QNLHPILRSKIHHDPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHK 120
QNLHPILRSKIHHDP RRDFSFLIP SP L+LQILDLAAT AIASHPDA DPSVSDFHK
Sbjct: 61 QNLHPILRSKIHHDPLRRDFSFLIPASPSLHLQILDLAATTRAIASHPDADDPSVSDFHK 120
Query: 121 IHETEINRATWFDPNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLREL 180
IHE EINR WFDP HPSYSDTDVMFATVYTV +SQWAVFL LHTATCDRAAAAALLREL
Sbjct: 121 IHEHEINRVIWFDPTHPSYSDTDVMFATVYTVSESQWAVFLSLHTATCDRAAAAALLREL 180
Query: 181 LVLVAAGGEIEGGGFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANL 240
LVL A GGEIEGG FEIGDNGEIGLGIE+LIPNGKANKSLWARG DMLGYSLNSFRLANL
Sbjct: 181 LVLAADGGEIEGGRFEIGDNGEIGLGIEDLIPNGKANKSLWARGFDMLGYSLNSFRLANL 240
Query: 241 EFKDANSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKD-LPPHQ 300
EFKD NSERFSQMI RGIKLCGALAAAGLIATRCSKD LPP+Q
Sbjct: 241 EFKDPNSERFSQMI------------------RGIKLCGALAAAGLIATRCSKDHLPPYQ 300
Query: 301 REKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKD 360
+EKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAED LWEVA RCYFSFSNAKD
Sbjct: 301 KEKYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDKLWEVANRCYFSFSNAKD 360
Query: 361 SNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIG 420
+NKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGP Q++GL+DYIG
Sbjct: 361 NNKHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPIIETSGPGHQNLGLNDYIG 420
Query: 421 CASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
ASAHGVGPSIA FD IRDGQLDCACVYPSPLFSR+QMNQIFDEMKKILVN+A+EV EG
Sbjct: 421 YASAHGVGPSIALFDTIRDGQLDCACVYPSPLFSRDQMNQIFDEMKKILVNSAVEVNEG 459
BLAST of Clc10G18370 vs. ExPASy TrEMBL
Match:
A0A6J1J6C5 (uncharacterized protein LOC111481632 OS=Cucurbita maxima OX=3661 GN=LOC111481632 PE=4 SV=1)
HSP 1 Score: 823.5 bits (2126), Expect = 4.4e-235
Identity = 405/465 (87.10%), Postives = 431/465 (92.69%), Query Frame = 0
Query: 14 ESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILRSKIHH 73
E RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDI HLQ+SLH LQNLHPILRSKIHH
Sbjct: 4 EIRIRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHH 63
Query: 74 DPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHKIHETEINRATWFD 133
DPSRRDFSFLIPPSP ++LQILDLAA A AIASHPDA DPS+SDFHKI E EINRA W +
Sbjct: 64 DPSRRDFSFLIPPSPSIHLQILDLAAAARAIASHPDADDPSISDFHKILEHEINRAKWLN 123
Query: 134 PNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLRELLVLVAAGGEIEGG 193
P+HPSYSDTDVMFATVY V D QWAVFL LHTA CDR AAA+LLRELLVL AA G+IEGG
Sbjct: 124 PSHPSYSDTDVMFATVYAVSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAEGKIEGG 183
Query: 194 GFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEFKDANSERFSQM 253
GF+IGDNGEIG GIE+LIP+GKA+K LWARGLDMLGYSLNSFR ANLEFKDA+SERFSQM
Sbjct: 184 GFKIGDNGEIGSGIEDLIPSGKAHKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQM 243
Query: 254 IRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQREKYAVVTLNDCRS 313
IRLK+NSDETQKLLAGCKSRGIKLCGAL AAGLIATRCSKDLPPHQ EKYAVVTL DCRS
Sbjct: 244 IRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPHQTEKYAVVTLIDCRS 303
Query: 314 LLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSNKHFSDMSDLNFL 373
LLDPPLT+HHLGFYHSAILNTHDISAEDTLWEV++RCYFSFSNAKD+NKHF+DMSDLNFL
Sbjct: 304 LLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVSERCYFSFSNAKDNNKHFTDMSDLNFL 363
Query: 374 MCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGCASAHGVGPSIAFF 433
M KAIENP LTPSSSMRTALIS FEDPII TS PAQQH+G+ DYIGCASAHGVGPSIA F
Sbjct: 364 MGKAIENPGLTPSSSMRTALISAFEDPIIYTSDPAQQHLGISDYIGCASAHGVGPSIALF 423
Query: 434 DMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
D+IRDGQLDCACVYPSPLFSR+QMNQ+FDEMKKILV++AMEVVEG
Sbjct: 424 DLIRDGQLDCACVYPSPLFSRDQMNQLFDEMKKILVSSAMEVVEG 468
BLAST of Clc10G18370 vs. ExPASy TrEMBL
Match:
A0A6J1F424 (uncharacterized protein LOC111442199 OS=Cucurbita moschata OX=3662 GN=LOC111442199 PE=4 SV=1)
HSP 1 Score: 815.8 bits (2106), Expect = 9.2e-233
Identity = 402/465 (86.45%), Postives = 428/465 (92.04%), Query Frame = 0
Query: 14 ESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILRSKIHH 73
E + RPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDI HLQ+SLH LQNLHPILRSKIHH
Sbjct: 4 EIKFRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDISHLQASLHNLQNLHPILRSKIHH 63
Query: 74 DPSRRDFSFLIPPSPPLNLQILDLAATASAIASHPDAKDPSVSDFHKIHETEINRATWFD 133
DPSRRDFS LIPPSP ++LQILDLAA A AIASHPDA +PS+SDFHKI E EINRA W +
Sbjct: 64 DPSRRDFSLLIPPSPSIHLQILDLAAAARAIASHPDADNPSISDFHKILEHEINRAKWLN 123
Query: 134 PNHPSYSDTDVMFATVYTVCDSQWAVFLRLHTATCDRAAAAALLRELLVLVAAGGEIEGG 193
P+HPSYSDTDVMFATVY + D QWAVFL LHTA CDR AAA+LLRELLVL AAGG+IEGG
Sbjct: 124 PSHPSYSDTDVMFATVYALSDGQWAVFLTLHTAACDRVAAASLLRELLVLTAAGGKIEGG 183
Query: 194 GFEIGDNGEIGLGIENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEFKDANSERFSQM 253
GFEIGDNGEIG GIE+LIP+GKA K LWARGLDMLGYSLNSFR ANLEFKDA+SERFSQM
Sbjct: 184 GFEIGDNGEIGSGIEDLIPSGKAYKPLWARGLDMLGYSLNSFRFANLEFKDASSERFSQM 243
Query: 254 IRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQREKYAVVTLNDCRS 313
IRLK+NSDETQKLLAGCKSRGIKLCGAL AAGLIATRCSKDLPP+Q EKYAVVTL DCRS
Sbjct: 244 IRLKLNSDETQKLLAGCKSRGIKLCGALEAAGLIATRCSKDLPPYQTEKYAVVTLIDCRS 303
Query: 314 LLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSNKHFSDMSDLNFL 373
LLDPPLT+HHLGFYHSAILNTHDISAEDTLWEVA+RCYFSFSN K++NKHF+DMSDLNFL
Sbjct: 304 LLDPPLTTHHLGFYHSAILNTHDISAEDTLWEVAERCYFSFSNGKENNKHFTDMSDLNFL 363
Query: 374 MCKAIENPSLTPSSSMRTALISVFEDPIIETSGPAQQHIGLHDYIGCASAHGVGPSIAFF 433
M KAIENP LTPSSSMRTALIS FEDPII TS PAQQH+G+ DYIGCASAHGVGPSIA F
Sbjct: 364 MGKAIENPGLTPSSSMRTALISAFEDPIIYTSDPAQQHLGIFDYIGCASAHGVGPSIALF 423
Query: 434 DMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNAMEVVEG 479
DMIRDGQLDCACVYPSPLFSR+QMN +FDEMKKILV+ AMEVVEG
Sbjct: 424 DMIRDGQLDCACVYPSPLFSRDQMNLLFDEMKKILVSGAMEVVEG 468
BLAST of Clc10G18370 vs. TAIR 10
Match:
AT3G52610.1 (unknown protein; Has 68 Blast hits to 67 proteins in 21 species: Archae - 0; Bacteria - 11; Metazoa - 0; Fungi - 0; Plants - 55; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )
HSP 1 Score: 473.0 bits (1216), Expect = 2.8e-133
Identity = 252/472 (53.39%), Postives = 331/472 (70.13%), Query Frame = 0
Query: 9 PPPAGESESRPVGGTEHSWCRAVPGGTGTTVLGLLLSKPPDIPHLQSSLHTLQNLHPILR 68
P +S +RPVGGTE+SWCRA+ GGTG V+ LLLS+ P + +LQ++L LQ HP LR
Sbjct: 4 PNRVPKSMTRPVGGTEYSWCRAIDGGTGIAVIALLLSRTPKLQNLQNTLDKLQIYHPTLR 63
Query: 69 SKIHHDPSRRDFSFLIPPSPPLNLQI--LDLAATASAIASHPDAKDPSVSDFHKIHETEI 128
S I D S FSF++ + +++I D +TA I D+ DP I E E+
Sbjct: 64 SNIRFDASANSFSFVVTSAADSHVEIHPFDSVSTAQIIR---DSDDPCADPHRIILEHEM 123
Query: 129 NRATWFDPNHPSYSDTDVMFATVYTVCD--SQWAVFLRLHTATCDRAAAAALLRELLVLV 188
N+ TW +P+ S++ V ++Y + D Q + RL+TA DR AA LLRE +
Sbjct: 124 NKNTWINPHRWIKSESRVFIVSLYDLTDDGEQRILTFRLNTAAVDRTAAVTLLREFMKET 183
Query: 189 AAGGEIEGGGFEIGDNGEIGLG--IENLIPNGKANKSLWARGLDMLGYSLNSFRLANLEF 248
AA G G +GLG IE LIP+GK +K WARG+D+LGYSLN+FR +NL F
Sbjct: 184 AADG-FGNGPVVAATETAVGLGKAIEELIPSGKGDKPFWARGIDVLGYSLNAFRFSNLNF 243
Query: 249 KDA-NSERFSQMIRLKMNSDETQKLLAGCKSRGIKLCGALAAAGLIATRCSKDLPPHQRE 308
DA NS R SQ++RLK++ D+T KL+AGCK+RG+KL ALA++ LIA SK+LPP+Q E
Sbjct: 244 VDAENSNRRSQLVRLKLDRDQTLKLVAGCKARGLKLWAALASSALIAAYSSKNLPPYQGE 303
Query: 309 KYAVVTLNDCRSLLDPPLTSHHLGFYHSAILNTHDISAEDTLWEVAKRCYFSFSNAKDSN 368
KYAVVTL+DCRS+L+PPLTS+ GFYH+ IL+THD++ E+ LW++AKRCY SF+++K+SN
Sbjct: 304 KYAVVTLSDCRSILEPPLTSNDFGFYHAGILHTHDLTGEEKLWDLAKRCYDSFTSSKNSN 363
Query: 369 KHFSDMSDLNFLMCKAIENPSLTPSSSMRTALISVFEDPII-ETSGPAQQHIGLHDYIGC 428
K F+DMSDLNFLMCKAIENP+LTPSSS+RTA IS+FEDP+I E+ P +G+ DYIGC
Sbjct: 364 KQFTDMSDLNFLMCKAIENPNLTPSSSLRTAFISIFEDPVIDESPEPELASLGVQDYIGC 423
Query: 429 ASAHGVGPSIAFFDMIRDGQLDCACVYPSPLFSREQMNQIFDEMKKILVNNA 473
AS HGVGPS+A FD +RDG+LDCA VYPSPL SREQM+ + MK IL+ +
Sbjct: 424 ASIHGVGPSVAVFDALRDGKLDCAFVYPSPLHSREQMDGLIQHMKTILLEGS 471
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038905440.1 | 1.3e-257 | 92.89 | uncharacterized protein LOC120091472 [Benincasa hispida] | [more] |
XP_008442855.1 | 1.1e-248 | 91.23 | PREDICTED: uncharacterized protein LOC103486623 [Cucumis melo] | [more] |
XP_004149221.3 | 2.5e-248 | 90.40 | uncharacterized protein LOC101208906 [Cucumis sativus] >KGN59146.1 hypothetical ... | [more] |
KAA0043871.1 | 1.1e-235 | 87.68 | GATA zinc finger domain-containing protein isoform 1 [Cucumis melo var. makuwa] ... | [more] |
XP_022982938.1 | 9.1e-235 | 87.10 | uncharacterized protein LOC111481632 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3B7G8 | 5.4e-249 | 91.23 | uncharacterized protein LOC103486623 OS=Cucumis melo OX=3656 GN=LOC103486623 PE=... | [more] |
A0A0A0LGP2 | 1.2e-248 | 90.40 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G777540 PE=4 SV=1 | [more] |
A0A5A7TQM7 | 5.2e-236 | 87.68 | GATA zinc finger domain-containing protein isoform 1 OS=Cucumis melo var. makuwa... | [more] |
A0A6J1J6C5 | 4.4e-235 | 87.10 | uncharacterized protein LOC111481632 OS=Cucurbita maxima OX=3661 GN=LOC111481632... | [more] |
A0A6J1F424 | 9.2e-233 | 86.45 | uncharacterized protein LOC111442199 OS=Cucurbita moschata OX=3662 GN=LOC1114421... | [more] |
Match Name | E-value | Identity | Description | |
AT3G52610.1 | 2.8e-133 | 53.39 | unknown protein; Has 68 Blast hits to 67 proteins in 21 species: Archae - 0; Bac... | [more] |