Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAAGGCAATCACGCATGCAATTTCAAAATTACAGAGATATGAAGAAGAAGATGATGAAAAGACGATGAGGTACGTGGTTGGGTCATGGTGACCTTACATCCTCTTTCAAGCTTTCTACGCTTTCCTTTTAGAAGAATTATACATTACCATATAAAAGAAGGAAAATAAAGTAATTTTTTTTTTAAAAAAAATAAATAATAAATAAAATGTGAAAAAAAGAAAGAAAGATGAGGATATTAGGGGTGGAAAATAAATGTTTATGGGTTTTGATTAGGGTCCATTATATATAAAAATGAATTTGAATGGTAATAGCCGTTAGGTTTGAAAAAATTAATTAATATAAGTAGAGAGGAGAAGAAAGAAAGAAAAGGAGATAGGGATAGGGAATGAAGATGATTCATTTGAATTTGGGAGTGTGTAGTGATTTCGATCTTGAGAGAGCAGTTTGTAACCATGGGCAGTTTATGATGCCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCTAATTCAAACTCTTCTGTATTTGTCTCTATCAACCAAACTTCGTCTTTTCTTCTCACCATTCAAATCCACTCTTCTTCTGCTGCTCTCTCTCCCCAAGATCAACAAACTATATTGGTATGCTGAATTATTACATTATTTTATAATTTTTTAGATTCTAATTAATTAATGAACTCCTACGTATTTATGTAGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAGGATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGGCTTTTTCGGTCTCCCACTGTTTTTGAAGATGCACTCAAGTCCATCCTTCTATGCAATACCACGTAATTAATTAATTGATTAATGAATATATAATAGTATTAGCTACATGAAAATTATATTATTAATATATGCATATGCATGGGGTGAGTTGACAGGTGGAAAAGGACACTGGCAATGGCTGGACAGCTGTGTGAGCTCCAAGCCAGAATGAGCAGCCAAAATAGGAAGAGAAAAAGGAAATTAATTGGGAATTTTCCAAATGCAGAAGAAGTTTGTAGAATGGGCGTTGAATTGTTGAAGAAGCATAACCTTGGTTACAGAGCTGGTTTCATCATTAACTTTGCTCAACGCGTTCAAAATGCCACAATTGATCTCCAAAATCCTAATAATTTCCCTAAAATCAAAGGCTTCGGACCTTTTGCAACCGCTAATCTACTCATGTGCCTCGGATTTTACCGTCAACTTCCTATTGATACTGAAACTATAAGGCACATAAAACAGGTTTCATTTTTTTTACAAAAAATAATTATGTATATTCCTAAATTTAATTCCCTCTTTCATTATAACATTATTAATTATTTGAATTTACAAGGTACATGGAAGACAATTTTGCAACAATAAGACAGTACGGGAAGATGTCAAACAAATTTACGACAAGTATGCTCCATTCCAGTGCTTGGCCTATCGGTACGTCAATCAATTTTCTTTTTCCTTTTTTTTTTTTAAGCCTAAAATATTTGCTTTAAATGCTTTATTTTGGTTTTAAATATTTTTATTTTAATTTCAGGTTGGAGCTTGTGGAATATTACGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATCATAAGATCAGTGGCGCCACCCTCAACCTTTGA
mRNA sequence
ATGAAGGCAATCACGCATGCAATTTCAAAATTACAGAGATATGAAGAAGAAGATGATGAAAAGACGATGAGTGATTTCGATCTTGAGAGAGCAGTTTGTAACCATGGGCAGTTTATGATGCCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCTAATTCAAACTCTTCTGTATTTGTCTCTATCAACCAAACTTCGTCTTTTCTTCTCACCATTCAAATCCACTCTTCTTCTGCTGCTCTCTCTCCCCAAGATCAACAAACTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAGGATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGGCTTTTTCGGTCTCCCACTGTTTTTGAAGATGCACTCAAGTCCATCCTTCTATGCAATACCACGTGGAAAAGGACACTGGCAATGGCTGGACAGCTGTGTGAGCTCCAAGCCAGAATGAGCAGCCAAAATAGGAAGAGAAAAAGGAAATTAATTGGGAATTTTCCAAATGCAGAAGAAGTTTGTAGAATGGGCGTTGAATTGTTGAAGAAGCATAACCTTGGTTACAGAGCTGGTTTCATCATTAACTTTGCTCAACGCGTTCAAAATGCCACAATTGATCTCCAAAATCCTAATAATTTCCCTAAAATCAAAGGCTTCGGACCTTTTGCAACCGCTAATCTACTCATGTGCCTCGGATTTTACCGTCAACTTCCTATTGATACTGAAACTATAAGGCACATAAAACAGGTACATGGAAGACAATTTTGCAACAATAAGACAGTACGGGAAGATGTCAAACAAATTTACGACAAGTATGCTCCATTCCAGTGCTTGGCCTATCGGTTGGAGCTTGTGGAATATTACGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATCATAAGATCAGTGGCGCCACCCTCAACCTTTGA
Coding sequence (CDS)
ATGAAGGCAATCACGCATGCAATTTCAAAATTACAGAGATATGAAGAAGAAGATGATGAAAAGACGATGAGTGATTTCGATCTTGAGAGAGCAGTTTGTAACCATGGGCAGTTTATGATGCCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTCTCTAATTCAAACTCTTCTGTATTTGTCTCTATCAACCAAACTTCGTCTTTTCTTCTCACCATTCAAATCCACTCTTCTTCTGCTGCTCTCTCTCCCCAAGATCAACAAACTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAGGATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGGCTTTTTCGGTCTCCCACTGTTTTTGAAGATGCACTCAAGTCCATCCTTCTATGCAATACCACGTGGAAAAGGACACTGGCAATGGCTGGACAGCTGTGTGAGCTCCAAGCCAGAATGAGCAGCCAAAATAGGAAGAGAAAAAGGAAATTAATTGGGAATTTTCCAAATGCAGAAGAAGTTTGTAGAATGGGCGTTGAATTGTTGAAGAAGCATAACCTTGGTTACAGAGCTGGTTTCATCATTAACTTTGCTCAACGCGTTCAAAATGCCACAATTGATCTCCAAAATCCTAATAATTTCCCTAAAATCAAAGGCTTCGGACCTTTTGCAACCGCTAATCTACTCATGTGCCTCGGATTTTACCGTCAACTTCCTATTGATACTGAAACTATAAGGCACATAAAACAGGTACATGGAAGACAATTTTGCAACAATAAGACAGTACGGGAAGATGTCAAACAAATTTACGACAAGTATGCTCCATTCCAGTGCTTGGCCTATCGGTTGGAGCTTGTGGAATATTACGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATCATAAGATCAGTGGCGCCACCCTCAACCTTTGA
Protein sequence
MKAITHAISKLQRYEEEDDEKTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRKLIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATLNL
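The protein sequence above should follow from the coding sequence by codon-by-codon translation. The sketch below illustrates that relationship, assuming the standard nuclear genetic code (NCBI translation table 1); the page itself does not state which table was used, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the CDS listed above.
print(translate("ATGAAGGCAATCACGCATGCA"))  # MKAITHA
# Last 12 nucleotides of the CDS, including the TGA stop codon.
print(translate("CTCAACCTTTGA"))  # LNL
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.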
Homology
BLAST of Clc08G00010 vs. NCBI nr
Match:
XP_038877617.1 (uncharacterized protein LOC120069874 [Benincasa hispida])
HSP 1 Score: 497.3 bits (1279), Expect = 1.0e-136
Identity = 251/287 (87.46%), Positives = 264/287 (91.99%), Query Frame = 0
Query: 22 TMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQI 81
++SDFDLE+AVCNHGQFMMPPNQWIPSSKTLQRPLRLS+S+SSVFVSINQ SS LLTIQI
Sbjct: 11 SVSDFDLEKAVCNHGQFMMPPNQWIPSSKTLQRPLRLSDSHSSVFVSINQPSSSLLTIQI 70
Query: 82 HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTVFED 141
HSSS LSPQDQQ ILDQVVRMLRLTEKDEDELRKFQSLHP+AKQMGFGRLFRSPT+FED
Sbjct: 71 HSSSTPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPRAKQMGFGRLFRSPTLFED 130
Query: 142 ALKSILLCNTTWKRTLAMAGQLCELQARMSSQ-NRKRKRKL------IGNFPNAEEVCRM 201
ALKSILLCNTTWKRTLAMAGQLCELQA+M Q RKRKRKL IGNFPNAEEVCRM
Sbjct: 131 ALKSILLCNTTWKRTLAMAGQLCELQAKMRRQITRKRKRKLGEKEGEIGNFPNAEEVCRM 190
Query: 202 GVELLKKHNLGYRAGFIINFAQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQ 261
GVELLKKH LGYRA +IINFA+ VQ+ IDLQNPN FPKIKGFGPFATAN+LMCLG YRQ
Sbjct: 191 GVELLKKHCLGYRAAYIINFAKCVQSGKIDLQNPNYFPKIKGFGPFATANVLMCLGLYRQ 250
Query: 262 LPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLE 302
LPIDTETIRH+KQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAY LE
Sbjct: 251 LPIDTETIRHLKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYWLE 297
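The percentages in each HSP statistics line are the ratio of matching positions to alignment length. A minimal sketch, assuming the line layout shown in the HSP above, recomputes them from the raw counts; `hsp_percentages` is a hypothetical helper, and the pattern accepts both "Positives" and the "Postives" spelling that some exports of these pages contain.

```python
import re

# Matches the identity and positives counts in an HSP statistics line.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 251/287 (87.46%), Positives = 264/287 (91.99%), Query Frame = 0"
print(hsp_percentages(line))  # (87.46, 91.99)
```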
BLAST of Clc08G00010 vs. NCBI nr
Match:
KAG6585875.1 (hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia] >KAG7020778.1 hypothetical protein SDJN02_17466, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 455.3 bits (1170), Expect = 4.5e-124
Identity = 226/319 (70.85%), Positives = 270/319 (84.64%), Query Frame = 0
Query: 23 MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIH 82
+SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH
Sbjct: 10 VSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIH 69
Query: 83 SSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTVFEDA 142
S +L P+D+ ILDQV RMLRLTEKDEDE+R+FQ+LHP AKQ+GFGR+FRSP++FED
Sbjct: 70 -SPRSLPPKDEVAILDQVARMLRLTEKDEDEIRRFQNLHPTAKQIGFGRIFRSPSLFEDV 129
Query: 143 LKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELL 202
+KSIL+CNT+W+RTL MA +LCE+QA+M +++KRKRK GNFPNA EVCRMGVE L
Sbjct: 130 VKSILMCNTSWRRTLEMAEKLCEVQAKM-RESKKRKRKGNNERGNFPNAREVCRMGVEAL 189
Query: 203 KKHNLGYRAGFIINFAQRVQNATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFY 262
K H LGYRA +++ FAQ V++ I+LQ +P+ FPKIKGFGPFATAN+ MCLGFY
Sbjct: 190 KNHCLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFY 249
Query: 263 RQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK 322
QLPIDTETIRH+KQVHG Q+C KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Sbjct: 250 HQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGK 309
Query: 323 LSELCSLDYHKISGATLNL 332
LSEL S DYHKISG+TL+L
Sbjct: 310 LSELSSFDYHKISGSTLHL 326
BLAST of Clc08G00010 vs. NCBI nr
Match:
XP_022951918.1 (uncharacterized protein LOC111454659 [Cucurbita moschata])
HSP 1 Score: 455.3 bits (1170), Expect = 4.5e-124
Identity = 226/319 (70.85%), Positives = 270/319 (84.64%), Query Frame = 0
Query: 23 MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIH 82
+SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH
Sbjct: 10 VSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIH 69
Query: 83 SSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTVFEDA 142
S +L P+D+ ILDQV RMLRLTEKDEDE+R+FQ+LHP AKQ+GFGR+FRSP++FED
Sbjct: 70 -SPRSLPPKDEVAILDQVARMLRLTEKDEDEIRRFQNLHPTAKQIGFGRIFRSPSLFEDV 129
Query: 143 LKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELL 202
+KSIL+CNT+W+RTL MA +LCE+QA+M +++KRKRK GNFPNA EVCRMGVE L
Sbjct: 130 VKSILMCNTSWRRTLEMAEKLCEVQAKM-RESKKRKRKGNNERGNFPNAREVCRMGVEAL 189
Query: 203 KKHNLGYRAGFIINFAQRVQNATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFY 262
K H LGYRA +++ FAQ V++ I+LQ +P+ FPKIKGFGPFATAN+ MCLGFY
Sbjct: 190 KNHCLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFY 249
Query: 263 RQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK 322
QLPIDTETIRH+KQVHG Q+C KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Sbjct: 250 HQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGK 309
Query: 323 LSELCSLDYHKISGATLNL 332
LSEL S DYHKISG+TL+L
Sbjct: 310 LSELSSFDYHKISGSTLHL 326
BLAST of Clc08G00010 vs. NCBI nr
Match:
XP_022156993.1 (uncharacterized protein LOC111023822 [Momordica charantia])
HSP 1 Score: 432.2 bits (1110), Expect = 4.1e-117
Identity = 223/322 (69.25%), Positives = 257/322 (79.81%), Query Frame = 0
Query: 21 KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQ 80
+T S FDLERAVCNHG FMMPPN+WIPSSKTLQRPLRL++S +SV VSI+Q SS LL IQ
Sbjct: 14 ETTSGFDLERAVCNHGFFMMPPNKWIPSSKTLQRPLRLADSTTSVLVSISQPSSHLLNIQ 73
Query: 81 IHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTVFE 140
IH SS + SP D+Q ILDQV RMLR+TE+DE+ +R FQ+LH KAK++GFGRLFRSPT+FE
Sbjct: 74 IH-SSPSFSPLDRQAILDQVTRMLRITERDEENIRNFQNLHAKAKEIGFGRLFRSPTLFE 133
Query: 141 DALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAE 200
DA+KSILLCN TW+RTLAMAGQLCELQA++ + +KRKRK GNFP A
Sbjct: 134 DAVKSILLCNATWRRTLAMAGQLCELQAKLGRGPITDGKKRKRKGKGECELEGGNFPTAA 193
Query: 201 EVCRMGVELLKKHNLGYRAGFIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLL 260
E+CRM V LL+KH +GYRA +II+ AQRVQN IDLQ +FPKIKGFGPF TAN+
Sbjct: 194 ELCRMSVLLLQKHFIGYRAVYIIDLAQRVQNGKIDLQKIERALSFPKIKGFGPFTTANVF 253
Query: 261 MCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY 320
MCLG Y +LPIDTETIRH+KQVHGRQ CN KT E VK +YDKYAPFQCLAY +ELVEYY
Sbjct: 254 MCLGLYDRLPIDTETIRHLKQVHGRQDCNMKTAEEAVKDVYDKYAPFQCLAYWMELVEYY 313
Query: 321 ESKFGKLSELCSLDYHKISGAT 329
ES+FGKLSEL DY KISG T
Sbjct: 314 ESRFGKLSELGWHDYKKISGTT 334
BLAST of Clc08G00010 vs. NCBI nr
Match:
XP_021905122.1 (uncharacterized protein LOC110820055 isoform X2 [Carica papaya])
HSP 1 Score: 326.2 bits (835), Expect = 3.2e-85
Identity = 172/323 (53.25%), Positives = 223/323 (69.04%), Query Frame = 0
Query: 26 FDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTS-SFLLTIQIHSS 85
F+LE+AVCNHG FMMPPN W PS KTL+RPLRLSN +SSV+ SI+ S S L IQ+H
Sbjct: 14 FNLEKAVCNHGFFMMPPNLWSPSKKTLERPLRLSNVSSSVYASISHPSNSTFLVIQLHHI 73
Query: 86 SAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTVFEDALK 145
+S D+ IL+QV RMLR+++KDE+ +R+FQ +H AK GFGR+FRSP++FED +K
Sbjct: 74 H-NISSSDKHAILEQVGRMLRISKKDEEVVREFQKVHEAAKNKGFGRVFRSPSLFEDVVK 133
Query: 146 SILLCNTTWKRTLAMAGQLCELQ---ARMSSQNRKRKRKLI----------------GNF 205
S+LLCN TW RTL MA LCELQ R S +++KRK GNF
Sbjct: 134 SLLLCNCTWGRTLKMAKSLCELQYEIVRGISVEKRKKRKRTTNRSINDTMNQEYFSKGNF 193
Query: 206 PNAEEVCRMGVELLKKH-NLGYRAGFIINFAQRVQNATIDLQNPNNFPKIKGFGPFATAN 265
PNAEE+ + +LL++ LGYRA ++IN AQ V++ +DL N + KIKGFG F AN
Sbjct: 194 PNAEELAGLSPDLLEERCKLGYRANYVINLAQLVKSGRLDLTNIQDLVKIKGFGSFVCAN 253
Query: 266 LLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVE 325
+ MC+GFY+ +P DTET+RH+KQVHG + C+ T+ +DVK IYDKY+PFQ LAY EL+
Sbjct: 254 VSMCIGFYQNIPADTETMRHLKQVHGLETCSRSTLVKDVKAIYDKYSPFQALAYWFELLN 313
Query: 326 YYESKFGKLSELCSLDYHKISGA 328
YYESK GKLSEL Y ++G+
Sbjct: 314 YYESKCGKLSELPCSKYPSVTGS 335
BLAST of Clc08G00010 vs. ExPASy TrEMBL
Match:
A0A6J1GJ25 (uncharacterized protein LOC111454659 OS=Cucurbita moschata OX=3662 GN=LOC111454659 PE=4 SV=1)
HSP 1 Score: 455.3 bits (1170), Expect = 2.2e-124
Identity = 226/319 (70.85%), Positives = 270/319 (84.64%), Query Frame = 0
Query: 23 MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIH 82
+SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH
Sbjct: 10 VSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRLSNSDTSLLVSINQSSSSLLTLQIH 69
Query: 83 SSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTVFEDA 142
S +L P+D+ ILDQV RMLRLTEKDEDE+R+FQ+LHP AKQ+GFGR+FRSP++FED
Sbjct: 70 -SPRSLPPKDEVAILDQVARMLRLTEKDEDEIRRFQNLHPTAKQIGFGRIFRSPSLFEDV 129
Query: 143 LKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELL 202
+KSIL+CNT+W+RTL MA +LCE+QA+M +++KRKRK GNFPNA EVCRMGVE L
Sbjct: 130 VKSILMCNTSWRRTLEMAEKLCEVQAKM-RESKKRKRKGNNERGNFPNAREVCRMGVEAL 189
Query: 203 KKHNLGYRAGFIINFAQRVQNATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFY 262
K H LGYRA +++ FAQ V++ I+LQ +P+ FPKIKGFGPFATAN+ MCLGFY
Sbjct: 190 KNHCLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATANIFMCLGFY 249
Query: 263 RQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK 322
QLPIDTETIRH+KQVHG Q+C KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Sbjct: 250 HQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELVQYYETKFGK 309
Query: 323 LSELCSLDYHKISGATLNL 332
LSEL S DYHKISG+TL+L
Sbjct: 310 LSELSSFDYHKISGSTLHL 326
BLAST of Clc08G00010 vs. ExPASy TrEMBL
Match:
A0A6J1DS88 (uncharacterized protein LOC111023822 OS=Momordica charantia OX=3673 GN=LOC111023822 PE=4 SV=1)
HSP 1 Score: 432.2 bits (1110), Expect = 2.0e-117
Identity = 223/322 (69.25%), Positives = 257/322 (79.81%), Query Frame = 0
Query: 21 KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQ 80
+T S FDLERAVCNHG FMMPPN+WIPSSKTLQRPLRL++S +SV VSI+Q SS LL IQ
Sbjct: 14 ETTSGFDLERAVCNHGFFMMPPNKWIPSSKTLQRPLRLADSTTSVLVSISQPSSHLLNIQ 73
Query: 81 IHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTVFE 140
IH SS + SP D+Q ILDQV RMLR+TE+DE+ +R FQ+LH KAK++GFGRLFRSPT+FE
Sbjct: 74 IH-SSPSFSPLDRQAILDQVTRMLRITERDEENIRNFQNLHAKAKEIGFGRLFRSPTLFE 133
Query: 141 DALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAE 200
DA+KSILLCN TW+RTLAMAGQLCELQA++ + +KRKRK GNFP A
Sbjct: 134 DAVKSILLCNATWRRTLAMAGQLCELQAKLGRGPITDGKKRKRKGKGECELEGGNFPTAA 193
Query: 201 EVCRMGVELLKKHNLGYRAGFIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLL 260
E+CRM V LL+KH +GYRA +II+ AQRVQN IDLQ +FPKIKGFGPF TAN+
Sbjct: 194 ELCRMSVLLLQKHFIGYRAVYIIDLAQRVQNGKIDLQKIERALSFPKIKGFGPFTTANVF 253
Query: 261 MCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY 320
MCLG Y +LPIDTETIRH+KQVHGRQ CN KT E VK +YDKYAPFQCLAY +ELVEYY
Sbjct: 254 MCLGLYDRLPIDTETIRHLKQVHGRQDCNMKTAEEAVKDVYDKYAPFQCLAYWMELVEYY 313
Query: 321 ESKFGKLSELCSLDYHKISGAT 329
ES+FGKLSEL DY KISG T
Sbjct: 314 ESRFGKLSELGWHDYKKISGTT 334
BLAST of Clc08G00010 vs. ExPASy TrEMBL
Match:
A0A6A1W9S6 (Uncharacterized protein OS=Morella rubra OX=262757 GN=CJ030_MR3G027886 PE=4 SV=1)
HSP 1 Score: 320.9 bits (821), Expect = 6.4e-84
Identity = 177/370 (47.84%), Positives = 242/370 (65.41%), Query Frame = 0
Query: 20 EKTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQ----TSSF 79
E+ + F++E+AVCNHG FMM PN WIPS+KTLQRPLRL+NS SV VSI+ T+++
Sbjct: 11 EECVRTFNMEKAVCNHGFFMMAPNAWIPSTKTLQRPLRLANSAVSVLVSISHPASGTANY 70
Query: 80 LLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRS 139
+L IQ+H + +SPQD++ IL+QV RMLR++E+DE LR+FQ+LHP+AK+ GFGR FRS
Sbjct: 71 IL-IQVHDTD-KVSPQDEKAILEQVARMLRISERDERNLREFQNLHPEAKEKGFGRCFRS 130
Query: 140 PTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKRK 199
P++FEDA+KS+LLCN TW RTL MA LCELQ AR S+ R KRK
Sbjct: 131 PSLFEDAIKSLLLCNCTWTRTLDMAKALCELQWELANGLIPDKCENLARQYSRKRGLKRK 190
Query: 200 L------------------------------IGNFPNAEEVCRMGVELLKKH-NLGYRAG 259
+GNFP+++EV + L+ H NLGYRA
Sbjct: 191 QATRKQSKVKKCERNCSDNSQLPLKGKDCRPLGNFPSSKEVAMLNEYFLENHCNLGYRAR 250
Query: 260 FIINFAQRVQNATIDLQNPNN------------FPKIKGFGPFATANLLMCLGFYRQLPI 319
+I+ A++V++ + L+ ++ KIKGFGPFA AN++MC+G+Y+ +P+
Sbjct: 251 YIVKLAKQVESGKLKLKEFDDDHSATCEELYEKLTKIKGFGPFACANVMMCMGYYQLVPV 310
Query: 320 DTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELC 328
DTET+RH++QVHGR+ +TV EDVK +YDK+APFQ LAY EL+E+YE KFGKLSEL
Sbjct: 311 DTETVRHLRQVHGRK---KETVHEDVKDVYDKHAPFQSLAYWFELLEHYERKFGKLSELP 370
BLAST of Clc08G00010 vs. ExPASy TrEMBL
Match:
A0A2P5ACW8 (DNA glycosylase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_344830 PE=4 SV=1)
HSP 1 Score: 311.6 bits (797), Expect = 3.9e-81
Identity = 174/357 (48.74%), Positives = 236/357 (66.11%), Query Frame = 0
Query: 21 KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLT 80
++ S F++E+AVCNHG FMM PN+W PS+KTLQRPLRL++ SSV VSI+ + S LL
Sbjct: 13 ESKSSFNMEKAVCNHGFFMMAPNRWSPSAKTLQRPLRLADGASSVTVSISHSPLHSHLLY 72
Query: 81 IQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSP 140
I++ S S ALS D IL+QV RMLR+T++DE ++R+FQ +HP+AK+ GFGR+FRSP
Sbjct: 73 IRVLLQSPSKALSLSDSNAILEQVGRMLRITKRDERDVREFQKVHPQAKERGFGRVFRSP 132
Query: 141 TVFEDALKSILLCNTTWKRTLAMAGQLCELQARM---------------SSQNRKRKR-- 200
++FEDA+KSILLCN +W RTL MA LC+LQ + S++ KRKR
Sbjct: 133 SLFEDAVKSILLCNCSWARTLKMAEALCKLQFEVTENHVHPIKKTTTSTSNKGLKRKRAK 192
Query: 201 ---------KLIGNFPNAEEVCRMGVE-LLKKHN--LGYRAGFIINFAQRVQNATID--- 260
+++GNFPNA E+ + L+K+ LGYRA I++ A+ ++ ++
Sbjct: 193 TKATDDDDSQIMGNFPNAREIASLDKSYFLEKYTPILGYRAKHILSLAKDFESGKLNGLE 252
Query: 261 ---------LQNPNN---FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQ 320
L + KI+GFGPF AN+LMC+ Y +P D+ETIRH++QVHGR+
Sbjct: 253 VAEKAEEEALHHEEMILIMKKIRGFGPFVCANVLMCIRIYENVPADSETIRHLQQVHGRK 312
Query: 321 FCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL 330
CN KT+ ++VK+IYDKYAPFQCLAY +EL+EYYE KFGKLSEL Y ISG+ L
Sbjct: 313 NCNKKTILKEVKEIYDKYAPFQCLAYWMELLEYYEDKFGKLSELPESSYKTISGSRL 369
BLAST of Clc08G00010 vs. ExPASy TrEMBL
Match:
A0A2P5FT40 (DNA glycosylase OS=Trema orientale OX=63057 GN=TorRG33x02_031220 PE=4 SV=1)
HSP 1 Score: 310.1 bits (793), Expect = 1.1e-80
Identity = 172/358 (48.04%), Positives = 235/358 (65.64%), Query Frame = 0
Query: 21 KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLT 80
++ S F++E+AVCNHG FMM PN+W PS+KTLQRPLRL++ SSV VSI+ + S LL
Sbjct: 13 ESKSSFNMEKAVCNHGFFMMAPNRWSPSAKTLQRPLRLADGASSVTVSISHSPLHSHLLY 72
Query: 81 IQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSP 140
I++ S S LS D IL+QV RMLR+TE+DE ++R+FQ +HP+AK+ GFGR+FRSP
Sbjct: 73 IRVLLQSPSKGLSLSDSNAILEQVGRMLRITERDERDVREFQKVHPQAKERGFGRVFRSP 132
Query: 141 TVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKR-- 200
++FEDA+KSILLCN +W RTL MA LC+LQ + S+++ KRKR
Sbjct: 133 SLFEDAVKSILLCNCSWARTLKMAEALCKLQFEVTENHVHTIRRTTSSTSNKDLKRKRAK 192
Query: 201 ----------KLIGNFPNAEEVCRM-GVELLKKHN--LGYRAGFIINFAQRVQNATID-L 260
+++GNFPNA E+ + L+K+ LGYRA I++ A+ ++ ++ L
Sbjct: 193 SKASTDDDDSQIVGNFPNAREIASLDNSYFLEKYTPILGYRAKHILSLAKDFESGKLNGL 252
Query: 261 QNPNN--------------FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGR 320
+ I+GFGPF AN+LMC+ Y +P D+ETIRH++QVH R
Sbjct: 253 EEAEKAAEEVLHHEEMIMIMKNIRGFGPFVCANVLMCIRIYENVPADSETIRHLQQVHAR 312
Query: 321 QFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL 330
+ CN KT++++VK+IYDKYAPFQCLAY +EL+EYYE KFGKLSEL Y ISG+ L
Sbjct: 313 KNCNKKTIQKEVKEIYDKYAPFQCLAYWMELLEYYEDKFGKLSELPESSYKTISGSRL 370
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038877617.1 | 1.0e-136 | 87.46 | uncharacterized protein LOC120069874 [Benincasa hispida]
KAG6585875.1 | 4.5e-124 | 70.85 | hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sorori...
XP_022951918.1 | 4.5e-124 | 70.85 | uncharacterized protein LOC111454659 [Cucurbita moschata]
XP_022156993.1 | 4.1e-117 | 69.25 | uncharacterized protein LOC111023822 [Momordica charantia]
XP_021905122.1 | 3.2e-85 | 53.25 | uncharacterized protein LOC110820055 isoform X2 [Carica papaya]
Match Name | E-value | Identity (%) | Description
A0A6J1GJ25 | 2.2e-124 | 70.85 | uncharacterized protein LOC111454659 OS=Cucurbita moschata OX=3662 GN=LOC1114546...
A0A6J1DS88 | 2.0e-117 | 69.25 | uncharacterized protein LOC111023822 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6A1W9S6 | 6.4e-84 | 47.84 | Uncharacterized protein OS=Morella rubra OX=262757 GN=CJ030_MR3G027886 PE=4 SV=1
A0A2P5ACW8 | 3.9e-81 | 48.74 | DNA glycosylase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_344830 PE=4 SV=1
A0A2P5FT40 | 1.1e-80 | 48.04 | DNA glycosylase OS=Trema orientale OX=63057 GN=TorRG33x02_031220 PE=4 SV=1