Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAGAAGAAGAAGAAGAAGGAGAACTCCGTCGCTCGCACAACCTGAAGATACTGCGAACCTCTTCAAACTCAATCAATGGCGTATGAAATCCCTCGCGATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACCATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCTGCGGCAGGGAACAGAACACCGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGTAGGTTCTTCTATTTTTATAGGAATCTTGGGGTTGTTTTCTTTTTCTTGTATATTTGATAAGATTCGTGTAGGTATAGGGAAAAGTTAGACATCAGCTTTGCCTGTTGTTCTAACGTAGATGCTTGTGTTTGAAATCAGCTACGTGCTAGTCTCACTTAGCCACATTGTTGCTAAAATTTTGTTGCTGATTCTGGAATTCAGTATTGGTTCTTTACTTCTTATAGTCTAAAGGGTTGATGGAAATTTTGTTTCTAGTTTCAGAGAAAATCTCTTAGAATTTCAAACCGGGACTTGCTATCTTAATTCTAATGTTCATGGTTGTGGCATTCGGGGAGCTTGTCTGTGAATTCAAGAATATATCTGAAATGGATAGTTTTCGAAATTGTGCTTATTGGATTCATACTCTTTGTGGAAGTTGAGAGCCCAAAAGTATTTTCTGAATTTGGTTCAATAGGATTAGATAATTGATTGATGTTGTTGCATGCCTGGGATTCCTTTGGGCAGATCCTTTGAAATGTATTTCGAAATCTTTGGCAGGAGATGGTGGGACCAATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGATCTCAGACGAAACCCCGGCTCCAAGTAAAAGTTCCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTGCTTCAAAAGCATCTGATGAGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCATCTTCCGCGACGGCAGCAAGGACCACTAAACATGAGAATGATGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAATTGATCCTTTTGCTGTTTCTGGGGTCAATATATCTTCCTCTTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCCCTCTTCATCAATGGCCAATGACTGGTTTCAACAAGATGATTTATGGAGTAGTTCTAATCACGAAACGATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTTATTAAAGATGATGATTCAGCTGATGCTTGGGATGATTTTACTAGCTCAACTGGTGTGCAAGGCCCCTCTGATGATTCTAGGAAAGACATTGTGAATGACATTGTGCCAAAGGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCACCTCAAGGGATAGTGATTTTGGAAACTCTTCTCAGCCAAATTCATTTGCAGATGCATTCCCCAAATCCGTAGAAAAAGCAACGTGGCCAGATGCTTCTGATTTAAGCAGGTATGTAGAAACTAATCAAATATTTCTGTAAAGCAATCTGTTAGGAGGTTCCCTGTTTTCATTGGCACAAATATTGATTGGCAACATGGCCAAATATTTCTGGAAGAATTTTTTTTTTGTAAAAATAAACCACTGGTACAGTTGAAGCCCATGGCTAGGAAACCCTGTTTTCATTTTTCCTTCGATGCTCACTAGTACTGATTGGTTTTTTAAGGCTATTTTGCTTTTAGACTCTCCATCAATAATCACTTTGTTCTTCATGCAAATGCGTCACTCCCTTCAAGCTTCATTTATATGTTATGCTTATGCTTTATGGAAACTATTGGACAACTTGTTCCCTAGCATTTACATTTTTCATGTTGTAGGATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATGAAGCGTCAAGCTGCATCAGGTCCTAGTTCAAGTTCTGATGATATACAGATGATGATGGCGAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGATGCATCTTTAATTCTTCTTCTGAAGCACTCTGCCACTGAGCTTTTTTTGTATTTTTCTTTCCCACTTTCTTTTTAAATCTGTAGCAGTATAGTGTTAGTTTAGTTATTACGGAATGCATTCTTTGATTTTATAAAAT
mRNA sequence
GAAGAAGAAGAAGAAGAAGGAGAACTCCGTCGCTCGCACAACCTGAAGATACTGCGAACCTCTTCAAACTCAATCAATGGCGTATGAAATCCCTCGCGATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACCATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCTGCGGCAGGGAACAGAACACCGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTGGGACCAATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGATCTCAGACGAAACCCCGGCTCCAAGTAAAAGTTCCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTGCTTCAAAAGCATCTGATGAGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCATCTTCCGCGACGGCAGCAAGGACCACTAAACATGAGAATGATGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAATTGATCCTTTTGCTGTTTCTGGGGTCAATATATCTTCCTCTTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCCCTCTTCATCAATGGCCAATGACTGGTTTCAACAAGATGATTTATGGAGTAGTTCTAATCACGAAACGATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTTATTAAAGATGATGATTCAGCTGATGCTTGGGATGATTTTACTAGCTCAACTGGTGTGCAAGGCCCCTCTGATGATTCTAGGAAAGACATTGTGAATGACATTGTGCCAAAGGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCACCTCAAGGGATAGTGATTTTGGAAACTCTTCTCAGCCAAATTCATTTGCAGATGCATTCCCCAAATCCGTAGAAAAAGCAACGTGGCCAGATGCTTCTGATTTAAGCAGGATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATGAAGCGTCAAGCTGCATCAGGTCCTAGTTCAAGTTCTGATGATATACAGATGATGATGGCGAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGATGCATCTTTAATTCTTCTTCTGAAGCACTCTGCCACTGAGCTTTTTTTGTATTTTTCTTTCCCACTTTCTTTTTAAATCTGTAGCAGTATAGTGTTAGTTTAGTTATTACGGAATGCATTCTTTGATTTTATAAAAT
Coding sequence (CDS)
ATGGCGTATGAAATCCCTCGCGATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACCATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCTGCGGCAGGGAACAGAACACCGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTGGGACCAATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGATCTCAGACGAAACCCCGGCTCCAAGTAAAAGTTCCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTGCTTCAAAAGCATCTGATGAGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCATCTTCCGCGACGGCAGCAAGGACCACTAAACATGAGAATGATGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAATTGATCCTTTTGCTGTTTCTGGGGTCAATATATCTTCCTCTTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCCCTCTTCATCAATGGCCAATGACTGGTTTCAACAAGATGATTTATGGAGTAGTTCTAATCACGAAACGATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTTATTAAAGATGATGATTCAGCTGATGCTTGGGATGATTTTACTAGCTCAACTGGTGTGCAAGGCCCCTCTGATGATTCTAGGAAAGACATTGTGAATGACATTGTGCCAAAGGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCACCTCAAGGGATAGTGATTTTGGAAACTCTTCTCAGCCAAATTCATTTGCAGATGCATTCCCCAAATCCGTAGAAAAAGCAACGTGGCCAGATGCTTCTGATTTAAGCAGGATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATGAAGCGTCAAGCTGCATCAGGTCCTAGTTCAAGTTCTGATGATATACAGATGATGATGGCGAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGA
Protein sequence
MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSFADAFPKSVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
Homology
BLAST of ClCG02G009040 vs. NCBI nr
Match:
XP_038902680.1 (uncharacterized protein LOC120089318 [Benincasa hispida])
HSP 1 Score: 836.3 bits (2159), Expect = 1.4e-238
Identity = 431/497 (86.72%), Postives = 460/497 (92.56%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MAYEIP DLI QLQISLRN AKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHC G
Sbjct: 1 MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSF+CVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMV PI+LKESNR
Sbjct: 61 RLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVEPINLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKS+LNLA VDLD+YFSEEKKDT SK
Sbjct: 121 GKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSK 180
Query: 181 ASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHD 240
AS+EPPPLNKQTVEDNVDLSLFD VPSS TA RTTKHE+ DSFSGWEASFQ ASSAT HD
Sbjct: 181 ASNEPPPLNKQTVEDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHD 240
Query: 241 NSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNH 300
NSKS+DPFAVS VNISSSLETTFGD +KSRSGE++DTKNPSSS+ NDWFQQ DLWSSSNH
Sbjct: 241 NSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQDLWSSSNH 300
Query: 301 ETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSA 360
ETIRMPDQ+EQTGI+IDGRAAETANYSSSASVDWFQ DQRQGGSQKKPDDKS K D SA
Sbjct: 301 ETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSA 360
Query: 361 DAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSFA 420
DAWDDFTSSTGV GPSD+SRKDIVND+V KVDEISEVDFFSTT +SDF NSSQPNSFA
Sbjct: 361 DAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDFFSTT---NSDFRNSSQPNSFA 420
Query: 421 DAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAK 480
+AFP S+ KATW DASDLSRM+EE+GE+GENS+A++ Q+ASGPSSS+DD+QMMM K
Sbjct: 421 EAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEK 480
Query: 481 MHDLSFMLESNLSVPPK 494
MHDLSFMLESNLS+PPK
Sbjct: 481 MHDLSFMLESNLSIPPK 493
BLAST of ClCG02G009040 vs. NCBI nr
Match:
KAA0034793.1 (dentin sialophosphoprotein [Cucumis melo var. makuwa])
HSP 1 Score: 800.0 bits (2065), Expect = 1.1e-227
Identity = 417/498 (83.73%), Postives = 451/498 (90.56%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKG
Sbjct: 1 MAYEIPRDLIKQLQISLRNEAKISSYDPHHPSLPNLPSFNQTIAELDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+SLDLDGSEMVGPIDLKESNR
Sbjct: 61 RLLRDLKSFICVFCGREQYSDVPPNPINFKNTIACRWLLQSLDLDGSEMVGPIDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKSPEQFPLT+LLDLEIRWPES+K GI DETPAPSKS+LNLAGVDL +YF+EEK DT SK
Sbjct: 121 GKSPEQFPLTDLLDLEIRWPESDKNGIIDETPAPSKSTLNLAGVDLGYYFTEEKNDTTSK 180
Query: 181 ASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHD 240
ASD PP +KQTVEDN DLSLFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS D
Sbjct: 181 ASDVLPPPSKQTVEDNADLSLFDKFPSSESATRTTKHESDDSFSGWEASFQTASSATSLD 240
Query: 241 NSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN 300
NSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS NDWF QQDDLWSSSN
Sbjct: 241 NSKSIDPFVVSGVNVSSS-EMTFGDQNKSRSGETEDTKDPSSSTTNDWFQQQDDLWSSSN 300
Query: 301 HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDS 360
H+T+ MPDQ+EQTGILIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDS
Sbjct: 301 HKTVHMPDQVEQTGILIDGRATETANYSSSATVDWFQDDQWQGGSQKKPDDKSVFKDDDS 360
Query: 361 ADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSF 420
ADAWD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDFFSTTT++DSDF +SSQP SF
Sbjct: 361 ADAWDNFTSSTGVQGPSDNSRKDIVKD-VPKVDEISEVDFFSTTTTKDSDFRDSSQPISF 420
Query: 421 ADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMA 480
A+AFP SVEKA WPDASDL+RM EENG+S ENS+A + QAASG SS+DD QM+M
Sbjct: 421 AEAFPNPNGTSVEKAIWPDASDLTRMGEENGKSRENSDAAQHQAASG-GSSTDDAQMIME 480
Query: 481 KMHDLSFMLESNLSVPPK 494
KMHDLSFMLESNLS+PPK
Sbjct: 481 KMHDLSFMLESNLSIPPK 495
BLAST of ClCG02G009040 vs. NCBI nr
Match:
XP_011649988.1 (uncharacterized protein LOC101209977 [Cucumis sativus] >KGN63225.1 hypothetical protein Csa_021941 [Cucumis sativus])
HSP 1 Score: 799.3 bits (2063), Expect = 1.9e-227
Identity = 415/498 (83.33%), Postives = 447/498 (89.76%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MAYEIPRDLI QLQISLRN A ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHCKG
Sbjct: 1 MAYEIPRDLIKQLQISLRNEANISSYDPHHPSLPNLPSFNETIADLDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+SLDLDGSEMVG IDLKESNR
Sbjct: 61 RLLRDLKSFICVFCGREQYSDVPPDPINFNNTIACRWLLQSLDLDGSEMVGTIDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKS+LNLAGVDL YF+EEK DT SK
Sbjct: 121 GKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLGNYFTEEKNDTTSK 180
Query: 181 ASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHD 240
ASD PP +K+TVEDN DLSLFDK PS TA RTTKHE+DDSFSGWEASFQ ASSAT D
Sbjct: 181 ASDGLPPPSKRTVEDNADLSLFDKFPSFETATRTTKHESDDSFSGWEASFQPASSATPLD 240
Query: 241 NSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN 300
NSKS+DPF VSGVNISSSLETTFG+ +KS SGE+EDTKNPSSS NDWF QQDDLWSSSN
Sbjct: 241 NSKSVDPFVVSGVNISSSLETTFGNQNKSSSGETEDTKNPSSSTTNDWFQQQDDLWSSSN 300
Query: 301 HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDS 360
H+TI MPDQ+EQTGILIDGR ETANYSSSA+VDWFQDDQ QG SQKKPDDKSV KDD S
Sbjct: 301 HKTIHMPDQVEQTGILIDGRTTETANYSSSATVDWFQDDQLQGVSQKKPDDKSVFKDDGS 360
Query: 361 ADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSF 420
ADAWDDFTSSTGVQGP D+S+KDIVND VPKVDEISEVDFFST T++DSDF +SSQP SF
Sbjct: 361 ADAWDDFTSSTGVQGPFDNSKKDIVND-VPKVDEISEVDFFSTMTTKDSDFRDSSQPISF 420
Query: 421 ADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMA 480
A+AFP SVEKA WPDASDLSRM+EENG++ ENS+A++RQAASGPSSS+DD +MMM
Sbjct: 421 AEAFPNPNGTSVEKAIWPDASDLSRMSEENGKTRENSDAVQRQAASGPSSSTDDAKMMME 480
Query: 481 KMHDLSFMLESNLSVPPK 494
KMHDLSFMLES LS+PPK
Sbjct: 481 KMHDLSFMLESKLSIPPK 497
BLAST of ClCG02G009040 vs. NCBI nr
Match:
XP_008455912.1 (PREDICTED: uncharacterized protein LOC103495983 [Cucumis melo] >TYK09594.1 dentin sialophosphoprotein [Cucumis melo var. makuwa])
HSP 1 Score: 798.1 bits (2060), Expect = 4.2e-227
Identity = 415/498 (83.33%), Postives = 449/498 (90.16%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKG
Sbjct: 1 MAYEIPRDLIKQLQISLRNEAKISSYDPHHPSLPNLPSFNQTIAELDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+SLDLDGSEMVGPIDLKESNR
Sbjct: 61 RLLRDLKSFICVFCGREQYSDVPPNPINFKNTIACRWLLQSLDLDGSEMVGPIDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKS+LNLAGVDL +YF+EEK DT SK
Sbjct: 121 GKSPEQFPLTDLLDLEIRWPESDKNGITDETPAPSKSTLNLAGVDLGYYFTEEKNDTTSK 180
Query: 181 ASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHD 240
ASD PP +KQTVEDN DLSLFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS D
Sbjct: 181 ASDVLPPASKQTVEDNADLSLFDKFPSSESATRTTKHESDDSFSGWEASFQTASSATSLD 240
Query: 241 NSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN 300
NSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS NDWF QQDDLWSSSN
Sbjct: 241 NSKSIDPFVVSGVNVSSS-EMTFGDQNKSRSGETEDTKDPSSSTTNDWFQQQDDLWSSSN 300
Query: 301 HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDS 360
H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDS
Sbjct: 301 HKTVHMPDQVEQTGILIDGRATETTNYSSSATVDWFQDDQWQGGSQKKPDDKSVFKDDDS 360
Query: 361 ADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSF 420
AD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDFFSTTT++DSDF +SSQP SF
Sbjct: 361 ADTWDNFTSSTGVQGPSDNSRKDIVKD-VPKVDEISEVDFFSTTTTKDSDFRDSSQPISF 420
Query: 421 ADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMA 480
A+AFP SVEKA WPDASDL+RM EENG+S ENS+A QAASG SS+DD QM+M
Sbjct: 421 AEAFPNPNGTSVEKAIWPDASDLTRMGEENGKSRENSDAAPHQAASG-GSSTDDAQMIME 480
Query: 481 KMHDLSFMLESNLSVPPK 494
KMHDLSFMLESNLS+PPK
Sbjct: 481 KMHDLSFMLESNLSIPPK 495
BLAST of ClCG02G009040 vs. NCBI nr
Match:
XP_022970990.1 (uncharacterized protein LOC111469795 [Cucurbita maxima])
HSP 1 Score: 722.6 bits (1864), Expect = 2.3e-204
Identity = 384/507 (75.74%), Postives = 431/507 (85.01%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MA++IP DLI QLQISLRN AK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHCKG
Sbjct: 1 MAFQIPNDLIKQLQISLRNEAKLSSYDPHDSSLPNLPSLHETIAKLDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLESLDLDGSEMVG +DLKESNR
Sbjct: 61 RLLRDLKSFVCVFCGKEQNTEVPPDPINFKNTIACRWLLESLDLDGSEMVGHMDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKS E+FPLT+LLDL+IRWPESEK+G+SD T APSKS+LNLA VDLD YFSEE KDT K
Sbjct: 121 GKSAEEFPLTDLLDLKIRWPESEKRGLSDNTLAPSKSTLNLAEVDLDNYFSEENKDTTLK 180
Query: 181 ASDEPPPLNKQ-------TVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTA 240
SDE PLN+Q T +DNVDLSLF V SS TA R +HE+ DSFSGWEA+FQT
Sbjct: 181 VSDE--PLNQQIDGSERKTFQDNVDLSLFGNVQSSETATRINEHESSDSFSGWEANFQTV 240
Query: 241 SSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD 300
+SATSH+NSKS+DPFA+SGV+IS SLE T G +K RSGE E+TKNPSSSM +DWF QQD
Sbjct: 241 NSATSHNNSKSVDPFAISGVDISYSLELTSGHQNKYRSGEIEETKNPSSSMTSDWFQQQD 300
Query: 301 DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKS 360
DLWSSSNHETI P+Q++QTG DG+ TA+YSSSASVDWFQDDQ QGGS KKPDD S
Sbjct: 301 DLWSSSNHETICTPEQVDQTG--FDGKTVGTADYSSSASVDWFQDDQWQGGS-KKPDDNS 360
Query: 361 VIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGN 420
KDDDSADAWDDFTSSTG+QG D+ KDIVN+IVPKVDEISE+DFFSTTTS+D +FGN
Sbjct: 361 DFKDDDSADAWDDFTSSTGMQGSLDNFGKDIVNNIVPKVDEISEIDFFSTTTSKDINFGN 420
Query: 421 SSQPNSFADAFPK-----SVEKATWPDASDLSRMNEENGESGENSEAMKR-QAASGPSSS 480
SQPN F +AFP S EKAT PDASDLSRM+EENG+SGENS+A K QA+S PSS+
Sbjct: 421 FSQPNLFVEAFPNLNGGTSEEKATRPDASDLSRMSEENGKSGENSKATKEIQASSAPSSN 480
Query: 481 SDDIQMMMAKMHDLSFMLESNLSVPPK 494
DD+QMMMAKMHDLSFMLES+LS+PPK
Sbjct: 481 LDDVQMMMAKMHDLSFMLESHLSIPPK 502
BLAST of ClCG02G009040 vs. ExPASy TrEMBL
Match:
A0A5A7SW96 (Dentin sialophosphoprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold213G00190 PE=4 SV=1)
HSP 1 Score: 800.0 bits (2065), Expect = 5.4e-228
Identity = 417/498 (83.73%), Postives = 451/498 (90.56%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKG
Sbjct: 1 MAYEIPRDLIKQLQISLRNEAKISSYDPHHPSLPNLPSFNQTIAELDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+SLDLDGSEMVGPIDLKESNR
Sbjct: 61 RLLRDLKSFICVFCGREQYSDVPPNPINFKNTIACRWLLQSLDLDGSEMVGPIDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKSPEQFPLT+LLDLEIRWPES+K GI DETPAPSKS+LNLAGVDL +YF+EEK DT SK
Sbjct: 121 GKSPEQFPLTDLLDLEIRWPESDKNGIIDETPAPSKSTLNLAGVDLGYYFTEEKNDTTSK 180
Query: 181 ASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHD 240
ASD PP +KQTVEDN DLSLFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS D
Sbjct: 181 ASDVLPPPSKQTVEDNADLSLFDKFPSSESATRTTKHESDDSFSGWEASFQTASSATSLD 240
Query: 241 NSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN 300
NSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS NDWF QQDDLWSSSN
Sbjct: 241 NSKSIDPFVVSGVNVSSS-EMTFGDQNKSRSGETEDTKDPSSSTTNDWFQQQDDLWSSSN 300
Query: 301 HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDS 360
H+T+ MPDQ+EQTGILIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDS
Sbjct: 301 HKTVHMPDQVEQTGILIDGRATETANYSSSATVDWFQDDQWQGGSQKKPDDKSVFKDDDS 360
Query: 361 ADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSF 420
ADAWD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDFFSTTT++DSDF +SSQP SF
Sbjct: 361 ADAWDNFTSSTGVQGPSDNSRKDIVKD-VPKVDEISEVDFFSTTTTKDSDFRDSSQPISF 420
Query: 421 ADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMA 480
A+AFP SVEKA WPDASDL+RM EENG+S ENS+A + QAASG SS+DD QM+M
Sbjct: 421 AEAFPNPNGTSVEKAIWPDASDLTRMGEENGKSRENSDAAQHQAASG-GSSTDDAQMIME 480
Query: 481 KMHDLSFMLESNLSVPPK 494
KMHDLSFMLESNLS+PPK
Sbjct: 481 KMHDLSFMLESNLSIPPK 495
BLAST of ClCG02G009040 vs. ExPASy TrEMBL
Match:
A0A0A0LMS7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G416160 PE=4 SV=1)
HSP 1 Score: 799.3 bits (2063), Expect = 9.2e-228
Identity = 415/498 (83.33%), Postives = 447/498 (89.76%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MAYEIPRDLI QLQISLRN A ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHCKG
Sbjct: 1 MAYEIPRDLIKQLQISLRNEANISSYDPHHPSLPNLPSFNETIADLDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+SLDLDGSEMVG IDLKESNR
Sbjct: 61 RLLRDLKSFICVFCGREQYSDVPPDPINFNNTIACRWLLQSLDLDGSEMVGTIDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKS+LNLAGVDL YF+EEK DT SK
Sbjct: 121 GKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLGNYFTEEKNDTTSK 180
Query: 181 ASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHD 240
ASD PP +K+TVEDN DLSLFDK PS TA RTTKHE+DDSFSGWEASFQ ASSAT D
Sbjct: 181 ASDGLPPPSKRTVEDNADLSLFDKFPSFETATRTTKHESDDSFSGWEASFQPASSATPLD 240
Query: 241 NSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN 300
NSKS+DPF VSGVNISSSLETTFG+ +KS SGE+EDTKNPSSS NDWF QQDDLWSSSN
Sbjct: 241 NSKSVDPFVVSGVNISSSLETTFGNQNKSSSGETEDTKNPSSSTTNDWFQQQDDLWSSSN 300
Query: 301 HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDS 360
H+TI MPDQ+EQTGILIDGR ETANYSSSA+VDWFQDDQ QG SQKKPDDKSV KDD S
Sbjct: 301 HKTIHMPDQVEQTGILIDGRTTETANYSSSATVDWFQDDQLQGVSQKKPDDKSVFKDDGS 360
Query: 361 ADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSF 420
ADAWDDFTSSTGVQGP D+S+KDIVND VPKVDEISEVDFFST T++DSDF +SSQP SF
Sbjct: 361 ADAWDDFTSSTGVQGPFDNSKKDIVND-VPKVDEISEVDFFSTMTTKDSDFRDSSQPISF 420
Query: 421 ADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMA 480
A+AFP SVEKA WPDASDLSRM+EENG++ ENS+A++RQAASGPSSS+DD +MMM
Sbjct: 421 AEAFPNPNGTSVEKAIWPDASDLSRMSEENGKTRENSDAVQRQAASGPSSSTDDAKMMME 480
Query: 481 KMHDLSFMLESNLSVPPK 494
KMHDLSFMLES LS+PPK
Sbjct: 481 KMHDLSFMLESKLSIPPK 497
BLAST of ClCG02G009040 vs. ExPASy TrEMBL
Match:
A0A5D3CEG4 (Dentin sialophosphoprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold458G00230 PE=4 SV=1)
HSP 1 Score: 798.1 bits (2060), Expect = 2.0e-227
Identity = 415/498 (83.33%), Postives = 449/498 (90.16%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKG
Sbjct: 1 MAYEIPRDLIKQLQISLRNEAKISSYDPHHPSLPNLPSFNQTIAELDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+SLDLDGSEMVGPIDLKESNR
Sbjct: 61 RLLRDLKSFICVFCGREQYSDVPPNPINFKNTIACRWLLQSLDLDGSEMVGPIDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKS+LNLAGVDL +YF+EEK DT SK
Sbjct: 121 GKSPEQFPLTDLLDLEIRWPESDKNGITDETPAPSKSTLNLAGVDLGYYFTEEKNDTTSK 180
Query: 181 ASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHD 240
ASD PP +KQTVEDN DLSLFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS D
Sbjct: 181 ASDVLPPASKQTVEDNADLSLFDKFPSSESATRTTKHESDDSFSGWEASFQTASSATSLD 240
Query: 241 NSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN 300
NSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS NDWF QQDDLWSSSN
Sbjct: 241 NSKSIDPFVVSGVNVSSS-EMTFGDQNKSRSGETEDTKDPSSSTTNDWFQQQDDLWSSSN 300
Query: 301 HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDS 360
H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDS
Sbjct: 301 HKTVHMPDQVEQTGILIDGRATETTNYSSSATVDWFQDDQWQGGSQKKPDDKSVFKDDDS 360
Query: 361 ADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSF 420
AD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDFFSTTT++DSDF +SSQP SF
Sbjct: 361 ADTWDNFTSSTGVQGPSDNSRKDIVKD-VPKVDEISEVDFFSTTTTKDSDFRDSSQPISF 420
Query: 421 ADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMA 480
A+AFP SVEKA WPDASDL+RM EENG+S ENS+A QAASG SS+DD QM+M
Sbjct: 421 AEAFPNPNGTSVEKAIWPDASDLTRMGEENGKSRENSDAAPHQAASG-GSSTDDAQMIME 480
Query: 481 KMHDLSFMLESNLSVPPK 494
KMHDLSFMLESNLS+PPK
Sbjct: 481 KMHDLSFMLESNLSIPPK 495
BLAST of ClCG02G009040 vs. ExPASy TrEMBL
Match:
A0A1S3C2P9 (uncharacterized protein LOC103495983 OS=Cucumis melo OX=3656 GN=LOC103495983 PE=4 SV=1)
HSP 1 Score: 798.1 bits (2060), Expect = 2.0e-227
Identity = 415/498 (83.33%), Postives = 449/498 (90.16%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKG
Sbjct: 1 MAYEIPRDLIKQLQISLRNEAKISSYDPHHPSLPNLPSFNQTIAELDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+SLDLDGSEMVGPIDLKESNR
Sbjct: 61 RLLRDLKSFICVFCGREQYSDVPPNPINFKNTIACRWLLQSLDLDGSEMVGPIDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKS+LNLAGVDL +YF+EEK DT SK
Sbjct: 121 GKSPEQFPLTDLLDLEIRWPESDKNGITDETPAPSKSTLNLAGVDLGYYFTEEKNDTTSK 180
Query: 181 ASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHD 240
ASD PP +KQTVEDN DLSLFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS D
Sbjct: 181 ASDVLPPASKQTVEDNADLSLFDKFPSSESATRTTKHESDDSFSGWEASFQTASSATSLD 240
Query: 241 NSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN 300
NSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS NDWF QQDDLWSSSN
Sbjct: 241 NSKSIDPFVVSGVNVSSS-EMTFGDQNKSRSGETEDTKDPSSSTTNDWFQQQDDLWSSSN 300
Query: 301 HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDS 360
H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDS
Sbjct: 301 HKTVHMPDQVEQTGILIDGRATETTNYSSSATVDWFQDDQWQGGSQKKPDDKSVFKDDDS 360
Query: 361 ADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSF 420
AD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDFFSTTT++DSDF +SSQP SF
Sbjct: 361 ADTWDNFTSSTGVQGPSDNSRKDIVKD-VPKVDEISEVDFFSTTTTKDSDFRDSSQPISF 420
Query: 421 ADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMA 480
A+AFP SVEKA WPDASDL+RM EENG+S ENS+A QAASG SS+DD QM+M
Sbjct: 421 AEAFPNPNGTSVEKAIWPDASDLTRMGEENGKSRENSDAAPHQAASG-GSSTDDAQMIME 480
Query: 481 KMHDLSFMLESNLSVPPK 494
KMHDLSFMLESNLS+PPK
Sbjct: 481 KMHDLSFMLESNLSIPPK 495
BLAST of ClCG02G009040 vs. ExPASy TrEMBL
Match:
A0A6J1I4G5 (uncharacterized protein LOC111469795 OS=Cucurbita maxima OX=3661 GN=LOC111469795 PE=4 SV=1)
HSP 1 Score: 722.6 bits (1864), Expect = 1.1e-204
Identity = 384/507 (75.74%), Postives = 431/507 (85.01%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKG 60
MA++IP DLI QLQISLRN AK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHCKG
Sbjct: 1 MAFQIPNDLIKQLQISLRNEAKLSSYDPHDSSLPNLPSLHETIAKLDPSPPYLRCKHCKG 60
Query: 61 RLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMVGPIDLKESNR 120
RLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLESLDLDGSEMVG +DLKESNR
Sbjct: 61 RLLRDLKSFVCVFCGKEQNTEVPPDPINFKNTIACRWLLESLDLDGSEMVGHMDLKESNR 120
Query: 121 GKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASK 180
GKS E+FPLT+LLDL+IRWPESEK+G+SD T APSKS+LNLA VDLD YFSEE KDT K
Sbjct: 121 GKSAEEFPLTDLLDLKIRWPESEKRGLSDNTLAPSKSTLNLAEVDLDNYFSEENKDTTLK 180
Query: 181 ASDEPPPLNKQ-------TVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTA 240
SDE PLN+Q T +DNVDLSLF V SS TA R +HE+ DSFSGWEA+FQT
Sbjct: 181 VSDE--PLNQQIDGSERKTFQDNVDLSLFGNVQSSETATRINEHESSDSFSGWEANFQTV 240
Query: 241 SSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD 300
+SATSH+NSKS+DPFA+SGV+IS SLE T G +K RSGE E+TKNPSSSM +DWF QQD
Sbjct: 241 NSATSHNNSKSVDPFAISGVDISYSLELTSGHQNKYRSGEIEETKNPSSSMTSDWFQQQD 300
Query: 301 DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKS 360
DLWSSSNHETI P+Q++QTG DG+ TA+YSSSASVDWFQDDQ QGGS KKPDD S
Sbjct: 301 DLWSSSNHETICTPEQVDQTG--FDGKTVGTADYSSSASVDWFQDDQWQGGS-KKPDDNS 360
Query: 361 VIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGN 420
KDDDSADAWDDFTSSTG+QG D+ KDIVN+IVPKVDEISE+DFFSTTTS+D +FGN
Sbjct: 361 DFKDDDSADAWDDFTSSTGMQGSLDNFGKDIVNNIVPKVDEISEIDFFSTTTSKDINFGN 420
Query: 421 SSQPNSFADAFPK-----SVEKATWPDASDLSRMNEENGESGENSEAMKR-QAASGPSSS 480
SQPN F +AFP S EKAT PDASDLSRM+EENG+SGENS+A K QA+S PSS+
Sbjct: 421 FSQPNLFVEAFPNLNGGTSEEKATRPDASDLSRMSEENGKSGENSKATKEIQASSAPSSN 480
Query: 481 SDDIQMMMAKMHDLSFMLESNLSVPPK 494
DD+QMMMAKMHDLSFMLES+LS+PPK
Sbjct: 481 LDDVQMMMAKMHDLSFMLESHLSIPPK 502
BLAST of ClCG02G009040 vs. TAIR 10
Match:
AT1G05090.1 (dentin sialophosphoprotein-related )
HSP 1 Score: 190.7 bits (483), Expect = 2.9e-48
Identity = 194/710 (27.32%), Postives = 279/710 (39.30%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCK 60
MA EI DLINQL++SLR AK++S D D S P+LP+ E IAELD S PYLRC++CK
Sbjct: 1 MAMEISVDLINQLKVSLRKEAKLTSVDDCSDSSFPSLPTSEEAIAELDASAPYLRCRNCK 60
Query: 61 GRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWLLESLDLDGSEMVGPI-DLKE 120
G+LLR ++S ICVFCG +Q T D PPDPI F +T A +W L SL+LDGSEMV P+ +
Sbjct: 61 GKLLRGIESLICVFCGNQQRTSDNPPDPIKFTSTSAYKWFLTSLNLDGSEMVEPLKETDG 120
Query: 121 SNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEE 180
S+RG K+P + L+ LDLEI+W E+K D K+ LNL G++LD YF E
Sbjct: 121 SSRGATKAPPSKGIALSKFLDLEIQWSALEEKS-DDGQSVQKKNPLNLGGINLDDYFVER 180
Query: 181 KKDTASKASDEPPPLNKQTVEDNVDLSLFDKVPSSAT----------------------- 240
+ D + E P+ +D LSLFD V S
Sbjct: 181 RGDLSKVEQAESKPVEDDDFKDPRSLSLFDSVKSQGVVGSQQHDNVGLFDKKDAPKSVVS 240
Query: 241 --------------------------------------------------AARTTKHEND 300
A RT+ ++D
Sbjct: 241 SGEHENLSLFAGRDAQEKDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDD 300
Query: 301 DSF--------------------------------------------------------- 360
+SF
Sbjct: 301 ESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKS 360
Query: 361 ---------SGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGES 420
S W++ FQ+A S DPF S V++++ +++ FG +
Sbjct: 361 FDDKIVAASSDWDSDFQSADQNLSQKKIDG-DPFVSSPVDLAAHMDSVFGSGKDLLYAQP 420
Query: 421 EDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQL--EQTGILIDGRAAETANYSSSASV 480
D+ S A DW QDDL+ + E + + G ++ G N +SS +
Sbjct: 421 ADSSTAYVSKAGDWL-QDDLFGNVTGEAQTNDSAVHDKNEGQIVGG------NGNSSMDI 480
Query: 481 DWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSS----------------------- 493
DW DD Q +K + +DD D W+DF SS
Sbjct: 481 DWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFY 540
BLAST of ClCG02G009040 vs. TAIR 10
Match:
AT4G20720.1 (dentin sialophosphoprotein-related )
HSP 1 Score: 183.7 bits (465), Expect = 3.5e-46
Identity = 119/283 (42.05%), Postives = 160/283 (56.54%), Query Frame = 0
Query: 1 MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCK 60
MA EI DLINQL++SLR AK++S D D S P+LP+ E IAELD S PYLRC++CK
Sbjct: 1 MAMEISVDLINQLKVSLRKEAKLTSVDDCSDSSFPSLPTSEEAIAELDASAPYLRCRNCK 60
Query: 61 GRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWLLESLDLDGSEMVGPI-DLKE 120
G+LLR ++S ICVFCG +Q T D PPDPI F +T A +W L SL+LDGSEMV P+ +
Sbjct: 61 GKLLRGIESLICVFCGNQQRTSDNPPDPIKFTSTSAYKWFLTSLNLDGSEMVEPLKETDG 120
Query: 121 SNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEE 180
S+RG K+P + L+ LDLEI+W E+K D K+ LNL G++LD YF E
Sbjct: 121 SSRGATKAPPSKGIALSKFLDLEIQWSALEEKS-DDGQSVQKKNPLNLGGINLDDYFVER 180
Query: 181 KKDTASKASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTA 240
+ D + E P+ +D LSLFD V S + +H+N F +A
Sbjct: 181 RGDLSKVEQAESKPVEDDDFKDPRSLSLFDSVKSQGVVG-SQQHDNVGLFDKKDAPKSVV 240
Query: 241 SSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESED 277
SS + S A V+ ++ F + +R+ ED
Sbjct: 241 SSGEHENLSLFAGRDAQESVSFAAQGNFGFFEEKDARNSFKED 281
HSP 2 Score: 49.7 bits (117), Expect = 7.9e-06
Identity = 93/349 (26.65%), Postives = 150/349 (42.98%), Query Frame = 0
Query: 162 AGVDLDFYFSEEKKDTASKASDEPP----PLNKQTVEDNV---DLSLFDKVPSSATAART 221
A D D F ++ + K D P P++ D+V L P+ ++ A
Sbjct: 390 ASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYV 449
Query: 222 TKHEN---DDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTF-GDHSKSR 281
+K + DD F QT SA HD ++ + G N +SS++ + GD
Sbjct: 450 SKAGDWLQDDLFGNVTGEAQTNDSAV-HDKNEG----QIVGGNGNSSMDIDWIGDDLWQT 509
Query: 282 SGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMP--DQLEQTGILIDGRAAETANYSS 341
+ + K P+ +D +D SS+N +T P +E + I A+ N
Sbjct: 510 NEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVK 569
Query: 342 SASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIV 401
SV D++Q D ++DD WD FTSST +Q S +
Sbjct: 570 EQSV-----DEKQNTDTSVMSDIGKCQEDDLFGTWDSFTSSTILQ----TSLQPPTIHAN 629
Query: 402 PKVDEISEVDFF-STTTSRDSDFGNSSQPNSFADAF---PKSVEKATWPD-ASDLSRMNE 461
P ++ E++ F +RD DF + S+ + F+++ S E P S L R ++
Sbjct: 630 PSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPSGTSTLDRPSD 689
Query: 462 ENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPP 493
+G + + + + P S SD + +M++MHDLSFMLE+ LSVPP
Sbjct: 690 PDGSKDQTVDLVVGTTTTVPKSKSDVAEELMSQMHDLSFMLETKLSVPP 724
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038902680.1 | 1.4e-238 | 86.72 | uncharacterized protein LOC120089318 [Benincasa hispida] | [more] |
KAA0034793.1 | 1.1e-227 | 83.73 | dentin sialophosphoprotein [Cucumis melo var. makuwa] | [more] |
XP_011649988.1 | 1.9e-227 | 83.33 | uncharacterized protein LOC101209977 [Cucumis sativus] >KGN63225.1 hypothetical ... | [more] |
XP_008455912.1 | 4.2e-227 | 83.33 | PREDICTED: uncharacterized protein LOC103495983 [Cucumis melo] >TYK09594.1 denti... | [more] |
XP_022970990.1 | 2.3e-204 | 75.74 | uncharacterized protein LOC111469795 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7SW96 | 5.4e-228 | 83.73 | Dentin sialophosphoprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff... | [more] |
A0A0A0LMS7 | 9.2e-228 | 83.33 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G416160 PE=4 SV=1 | [more] |
A0A5D3CEG4 | 2.0e-227 | 83.33 | Dentin sialophosphoprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... | [more] |
A0A1S3C2P9 | 2.0e-227 | 83.33 | uncharacterized protein LOC103495983 OS=Cucumis melo OX=3656 GN=LOC103495983 PE=... | [more] |
A0A6J1I4G5 | 1.1e-204 | 75.74 | uncharacterized protein LOC111469795 OS=Cucurbita maxima OX=3661 GN=LOC111469795... | [more] |