Homology
BLAST of HG10003629 vs. NCBI nr
Match:
XP_038886411.1 (protein SET DOMAIN GROUP 41 [Benincasa hispida])
HSP 1 Score: 1095.5 bits (2832), Expect = 0.0e+00
Identity = 549/654 (83.94%), Postives = 584/654 (89.30%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEM AMEDIEMAEDITPPL PLT+ALHDSFL THCSSCFS LPNPPISHSNLLRYCS
Sbjct: 1 MEMEMIAMEDIEMAEDITPPLLPLTSALHDSFLFTHCSSCFSLLPNPPISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPS--SDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
KCSLSHSDPLTAAFFS HPFPS S TSDLRASLRLLHLLLSHP A S PPERIFGLLT
Sbjct: 61 KCSLSHSDPLTAAFFSTHPFPSPFSYTSDLRASLRLLHLLLSHPPASLSPPPERIFGLLT 120
Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
NRHKLM PQ D+++F KLREGVDAIAA SADI HG+ L EA LCLV TNAVDV DS
Sbjct: 121 NRHKLMFPQHDAELFPKLREGVDAIAALL---SADIPHGHTLAEAALCLVFTNAVDVHDS 180
Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
GRTIGIAVY PTFCWINHSCSPNACYRFET S STTTR RIAPSCTDL+T +GSC+QMG
Sbjct: 181 TGRTIGIAVYPPTFCWINHSCSPNACYRFETSSASTTTRSRIAPSCTDLLTGQGSCSQMG 240
Query: 241 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300
TVRSNLSDFI EDFQG GPRV+VRSIK IR+GEAVTIAYCDLLQPKAMRQSELWSRYQFV
Sbjct: 241 TVRSNLSDFITEDFQGNGPRVMVRSIKSIRRGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300
Query: 301 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 360
CSCQRCSA+PLTYVDHALQE+SA KVE DSTSISNFDHD+AVRRIDDYV++AITEYLSI
Sbjct: 301 CSCQRCSAKPLTYVDHALQELSASKVELHDSTSISNFDHDKAVRRIDDYVNSAITEYLSI 360
Query: 361 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 420
SPESC EKL+NLLTLGF DEQAED E+KQPV +RLHPLHFLSLN YTALASAYKVRSCD
Sbjct: 361 GSPESCCEKLRNLLTLGFYDEQAEDGEQKQPVNLRLHPLHFLSLNVYTALASAYKVRSCD 420
Query: 421 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 480
LLALSS+MD D+E+Q AS M + SAAYSLFLAGATHHLFLS+PSLI SA+ CWV+AGES
Sbjct: 421 LLALSSEMDCDNEDQCNASTMCKASAAYSLFLAGATHHLFLSEPSLIVSASTCWVLAGES 480
Query: 481 LLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISN 540
LL LA HS LWATTN+SKWG PVGKRMCS CSWVDKFNASRI G+ IEADF EFSIGISN
Sbjct: 481 LLTLARHSLLWATTNTSKWGFPVGKRMCSTCSWVDKFNASRIHGQPIEADFREFSIGISN 540
Query: 541 CIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKD 600
CIANMS+KSWSFLTHGCPYLKAFTDP +FSWPK I YS+ RD++AHSID CACS +KD
Sbjct: 541 CIANMSRKSWSFLTHGCPYLKAFTDPFNFSWPKMIPMYSSDRDIRAHSIDRLCACSNSKD 600
Query: 601 VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
VCFQ EPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 VCFQCEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILYDLN 651
BLAST of HG10003629 vs. NCBI nr
Match:
XP_008463080.1 (PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo])
HSP 1 Score: 1066.2 bits (2756), Expect = 1.1e-307
Identity = 539/655 (82.29%), Postives = 574/655 (87.63%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1 MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60
Query: 63 SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
SLSHSDPLTAAFFS HP P SSDTSDLRASLRL LHLLLSHPS S PP RIFGLLT
Sbjct: 61 SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120
Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
NRHKLM PQ+ S+VFLKLRE +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180
Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240
Query: 243 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 302
VRSN+ DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQFV
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300
Query: 303 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 362
CSCQRCSA PLTYVDHALQEISAVKVE LDS ISNFDHD AVRRID+YVDNAITEYLSI
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSI 360
Query: 363 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 422
SPESC EKLQNLLT GF DEQ ED E KQPV +RLHP HFL LNAYTAL SAYKVRSCD
Sbjct: 361 GSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCD 420
Query: 423 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 482
LLALSS+MD D+EN+ A MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGES
Sbjct: 421 LLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGES 480
Query: 483 LLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGIS 542
LLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF EFSIGIS
Sbjct: 481 LLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGIS 540
Query: 543 NCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTK 602
NCIA++S+K WSFLTHGCPYLKAFTDP DFSWPK +N D+ H ID SCACSKTK
Sbjct: 541 NCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACSKTK 600
Query: 603 DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
D+CF+ EPQ SNQERESI GLGIHCL+YGGYLASICYG+HSHLASQIQNIL+DL+
Sbjct: 601 DICFECEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650
BLAST of HG10003629 vs. NCBI nr
Match:
XP_011656459.1 (protein SET DOMAIN GROUP 41 [Cucumis sativus])
HSP 1 Score: 1048.5 bits (2710), Expect = 2.4e-302
Identity = 529/655 (80.76%), Postives = 571/655 (87.18%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEM A+EDIEMAEDI+PPLFPLT+ALHDSFL THCSSCFS LPNPPISHS L YCSL
Sbjct: 1 MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60
Query: 61 KCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
KCSLSHSDPLT AFFS HPFP SSDTSDLRASLRLLHLLLSHPS S PP+RI+GLLT
Sbjct: 61 KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120
Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
NRHKLM PQ+DS+VFLKLREG +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180
Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
IG+TIGIAVYA TF WINHSCSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240
Query: 241 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300
VRSN+ DFIREDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQFV
Sbjct: 241 NVRSNILDFIREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300
Query: 301 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 360
CSCQRCSA PLTYVDHALQEIS+VKVE LDST ISNFDHD AVRRID+YVDNAITEYLS
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYLST 360
Query: 361 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 420
SSPESC EKLQNLLT GF DEQ ED E KQ V +RLHPLHFL LNAYTAL SAYKVRSCD
Sbjct: 361 SSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRSCD 420
Query: 421 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 480
L+ALSS+MD D+ N+ A M +TSAAY+LFLAGATH LFL +PSL+ASAANCWVVAGES
Sbjct: 421 LVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAGES 480
Query: 481 LLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGIS 540
LLILA HSSLWA TTN+S W P+GKRMC NCSWVD+FNASRI G+ ++ADF EFSIGIS
Sbjct: 481 LLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIGIS 540
Query: 541 NCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTK 600
NCIA++SQK WS LTHGCPYLKAFT P DFSWPK +N +D+ ID SCACSKT+
Sbjct: 541 NCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPK-----TNEQDICGRGIDHSCACSKTQ 600
Query: 601 DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
DVC + +PQ SNQERESI GLGIHCL+YGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 DVCLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDLN 650
BLAST of HG10003629 vs. NCBI nr
Match:
XP_023520942.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 972.6 bits (2513), Expect = 1.7e-279
Identity = 507/652 (77.76%), Postives = 546/652 (83.74%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEMRAMEDIEMAEDITPPL PLTAALHD+FLLTHCSSCFSPLPN ISHSNLLRYCS
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFLLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
C SHSD LTAA FS FP SDTSDLRASLRLLHLLLS PSA+ SAPPERIFGLLTNR
Sbjct: 61 IC--SHSDSLTAAVFSTGQFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNR 120
Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
KLM+ DDS+VF+K+REG DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLADDDSEVFVKIREGSDAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180
Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
RTIGIAVY PTFCWINHSCSPNACYRFETPSDS TRLRI+P CTD+ T EGSC+QM TV
Sbjct: 181 RTIGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTV 240
Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCS 300
R N S FI +DFQGYGPRV+VRSIK IR GEAVTIAYCDLLQPKAMRQSEL SRY+FVCS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSIRNGEAVTIAYCDLLQPKAMRQSELRSRYKFVCS 300
Query: 301 CQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISS 360
CQRCSA+P TYVDHALQEISAV VE LDSTSISNFD+D A+ RIDDYV+NAI EYLSI S
Sbjct: 301 CQRCSAKPPTYVDHALQEISAVNVELLDSTSISNFDYDTAIARIDDYVNNAIAEYLSIGS 360
Query: 361 PESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCDLL 420
ESC EKLQNLLTLGF DEQAED + KQ + +RLHP+HFL LNAYTALASAYKVRS
Sbjct: 361 SESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSW--- 420
Query: 421 ALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLL 480
N DENQ A+ MS+TSAAYSLFLAGATHHLFLS+PSLIASAANCWVVAGESLL
Sbjct: 421 -------NGDENQCNAT-MSKTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLL 480
Query: 481 ILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCI 540
IL HSSLW +N+SK P+G+ C NCSWVDKFN SRI GRSIEADF EFSIGISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEADFREFSIGISNCI 540
Query: 541 ANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDVC 600
AN+SQK WSFL H C YLKAFTDP DFSWPK ITT SNYR D SC CSK +DV
Sbjct: 541 ANISQKYWSFLAHECSYLKAFTDPFDFSWPKTITTCSNYR-------DRSCDCSKIQDV- 600
Query: 601 FQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
S+Q+R+SI LGIHCLFYGGYLASICYGHHSHLASQIQ ILHD++
Sbjct: 601 -------SDQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIQCILHDMN 623
BLAST of HG10003629 vs. NCBI nr
Match:
XP_022932824.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata])
HSP 1 Score: 951.0 bits (2457), Expect = 5.2e-273
Identity = 494/652 (75.77%), Postives = 540/652 (82.82%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEMRAMEDIEMAEDITPPL PLTAALHD+F LTHCSSCFSPLPN ISHSNLLRYCS
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
C S SD LTAA FS FP SDTSDLRASLRLLHLLLS SA+ SAPPERIFGLLTNR
Sbjct: 61 IC--SRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNR 120
Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
KLM+ +DDS+VF+K+R+G DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180
Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
+TIGIAVY PTFCWINHSCSPNACYRFETPSDS TRLRI+P CTD+ T EGSCNQM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTV 240
Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCS 300
R N S FI +DFQGYGPRV+VRSIK +RKGEAVTIAYCDLLQPKA+RQSEL SRY+FVCS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCS 300
Query: 301 CQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISS 360
CQRCSA+P TYVDHALQEISA VE LDSTSISNFD+D A+RRIDDYV+NAI EYLSI S
Sbjct: 301 CQRCSAKPPTYVDHALQEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGS 360
Query: 361 PESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCDLL 420
PESC EKLQNLLTLGF DEQAED + KQ + +RLHP+HFL LN YTALASAYKVRS
Sbjct: 361 PESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW--- 420
Query: 421 ALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLL 480
NDDENQ A+ MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLL
Sbjct: 421 -------NDDENQCNAT-MSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLL 480
Query: 481 ILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCI 540
IL HSSLW +N+SK P+G+ C NCSWVDKFN +RI GRSIEADF EFSIGISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCI 540
Query: 541 ANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDVC 600
A++S K WSFL H C YLKAFTDP DFSWPK ITT NY SC CSK +DV
Sbjct: 541 ADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQDV- 600
Query: 601 FQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
S Q+R+SI LGIHCLFYGGYLASICYGH SHLASQI+ ILHD++
Sbjct: 601 -------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623
BLAST of HG10003629 vs. ExPASy Swiss-Prot
Match:
Q3ECY6 (Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1)
HSP 1 Score: 374.4 bits (960), Expect = 2.6e-102
Identity = 244/649 (37.60%), Postives = 350/649 (53.93%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
ME+RA EDIE+ D+ PPL PL ++L+DSFL +HCSSCFS LP P YCS C
Sbjct: 1 MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60
Query: 63 SLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNRHK 122
S LT +F ++ FP T L + +R LL+ + S+ P R+ LLTN H
Sbjct: 61 S------LTDSFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTSSSPHRLNNLLTNHHL 120
Query: 123 LMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRT 182
LM D + + + + IA R N R LEEA +C VLTNAV+V DS G
Sbjct: 121 LMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLA 180
Query: 183 IGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRS 242
+GIA+Y +F WINHSCSPN+CYRF ++ T+ + + T+ +N Q+
Sbjct: 181 LGIALYNSSFSWINHSCSPNSCYRF---VNNRTSYHDVHVTNTETSSNLELQEQVCGTSL 240
Query: 243 NLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCSCQ 302
N + G GP+++VRSIK I+ GE +T++Y DLLQP +RQS+LWS+Y+F+C+C
Sbjct: 241 NSGN-------GNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCG 300
Query: 303 RCSAEPLTYVDHALQEISAVKVEFLDSTSISNFD----HDQAVRRIDDYVDNAITEYLSI 362
RC+A P YVD L+ + ++ E T++ +FD D+AV +++DY+ AI ++LS
Sbjct: 301 RCAASPPAYVDSILEGVLTLESE---KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSD 360
Query: 363 S-SPESCYEKLQNLLTLGFCDEQAEDKEEKQP-VMRLHPLHFLSLNAYTALASAYKVRSC 422
+ P++C E ++++L G + KE+ QP +RLH H+++LNAY LA+AY++RS
Sbjct: 361 NIDPKTCCEMIESVLHHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI 420
Query: 423 DLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
D ++ G MSR SAAYSLFLAG +HHLF ++ S SAA W AG
Sbjct: 421 D-------------SETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAG 480
Query: 483 ESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGI 542
E L LA + + S C+ C ++ N+ R D E S I
Sbjct: 481 ELLFDLAPKLLMELSVESDV--------KCTKCLMLETSNSHR--------DIKEKSRQI 540
Query: 543 SNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKT 602
+C+ ++SQ +WSFLT GCPYL+ F P DFS +T + R+
Sbjct: 541 LSCVRDISQVTWSFLTRGCPYLEKFRSPVDFS----LTRTNGERE--------------- 557
Query: 603 KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ 645
+ S + ++L L HCL Y L +CYG SHL S+ +
Sbjct: 601 ---------ESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557
BLAST of HG10003629 vs. ExPASy Swiss-Prot
Match:
Q9H7B4 (Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 SV=4)
HSP 1 Score: 49.3 bits (116), Expect = 1.9e-04
Identity = 31/140 (22.14%), Postives = 54/140 (38.57%), Query Frame = 0
Query: 168 VLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDL 227
V+ N+ + ++ + +G+ +Y P+ +NHSC PN F
Sbjct: 178 VICNSFTICNAEMQEVGVGLY-PSISLLNHSCDPNCSIVFN------------------- 237
Query: 228 VTNEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMR 287
GP +++R+++ I GE +TI Y D+L R
Sbjct: 238 ----------------------------GPHLLLRAVRDIEVGEELTICYLDMLMTSEER 269
Query: 288 QSELWSRYQFVCSCQRCSAE 308
+ +L +Y F C C RC +
Sbjct: 298 RKQLRDQYCFECDCFRCQTQ 269
BLAST of HG10003629 vs. ExPASy Swiss-Prot
Match:
Q9CWR2 (Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 SV=1)
HSP 1 Score: 49.3 bits (116), Expect = 1.9e-04
Identity = 31/140 (22.14%), Postives = 54/140 (38.57%), Query Frame = 0
Query: 168 VLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDL 227
V+ N+ + ++ + +G+ +Y P+ +NHSC PN F
Sbjct: 178 VICNSFTICNAEMQEVGVGLY-PSMSLLNHSCDPNCSIVFN------------------- 237
Query: 228 VTNEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMR 287
GP +++R+++ I GE +TI Y D+L R
Sbjct: 238 ----------------------------GPHLLLRAVREIEAGEELTICYLDMLMTSEER 269
Query: 288 QSELWSRYQFVCSCQRCSAE 308
+ +L +Y F C C RC +
Sbjct: 298 RKQLRDQYCFECDCIRCQTQ 269
BLAST of HG10003629 vs. ExPASy Swiss-Prot
Match:
Q557F7 (SET and MYND domain-containing protein DDB_G0273589 OS=Dictyostelium discoideum OX=44689 GN=DDB_G0273589 PE=3 SV=1)
HSP 1 Score: 47.0 bits (110), Expect = 9.6e-04
Identity = 39/154 (25.32%), Postives = 61/154 (39.61%), Query Frame = 0
Query: 154 IRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDS 213
IR N +++ N + + IG+AV +P+ + NHSC PN
Sbjct: 218 IRKINEKSRSIIHKTRCNQFGIWTKNDKCIGVAV-SPSSSYFNHSCIPN----------- 277
Query: 214 TTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAV 273
CTD +R+ G + +S+ I+KG+ +
Sbjct: 278 ----------CTD---------------------VRD-----GSNMTFKSLYPIKKGDQL 323
Query: 274 TIAYCDLLQPKAMRQSELWSRYQFVCSCQRCSAE 308
TI+Y +L QP R+ EL Y F C C RC+ +
Sbjct: 338 TISYIELDQPIQDRKDELKYGYYFDCICPRCNGD 323
BLAST of HG10003629 vs. ExPASy TrEMBL
Match:
A0A1S3CIT0 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)
HSP 1 Score: 1066.2 bits (2756), Expect = 5.3e-308
Identity = 539/655 (82.29%), Postives = 574/655 (87.63%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1 MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60
Query: 63 SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
SLSHSDPLTAAFFS HP P SSDTSDLRASLRL LHLLLSHPS S PP RIFGLLT
Sbjct: 61 SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120
Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
NRHKLM PQ+ S+VFLKLRE +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180
Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240
Query: 243 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 302
VRSN+ DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQFV
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300
Query: 303 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 362
CSCQRCSA PLTYVDHALQEISAVKVE LDS ISNFDHD AVRRID+YVDNAITEYLSI
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSI 360
Query: 363 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 422
SPESC EKLQNLLT GF DEQ ED E KQPV +RLHP HFL LNAYTAL SAYKVRSCD
Sbjct: 361 GSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCD 420
Query: 423 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 482
LLALSS+MD D+EN+ A MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGES
Sbjct: 421 LLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGES 480
Query: 483 LLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGIS 542
LLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF EFSIGIS
Sbjct: 481 LLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGIS 540
Query: 543 NCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTK 602
NCIA++S+K WSFLTHGCPYLKAFTDP DFSWPK +N D+ H ID SCACSKTK
Sbjct: 541 NCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACSKTK 600
Query: 603 DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
D+CF+ EPQ SNQERESI GLGIHCL+YGGYLASICYG+HSHLASQIQNIL+DL+
Sbjct: 601 DICFECEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650
BLAST of HG10003629 vs. ExPASy TrEMBL
Match:
A0A0A0KAK3 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV=1)
HSP 1 Score: 1036.6 bits (2679), Expect = 4.5e-299
Identity = 526/657 (80.06%), Postives = 568/657 (86.45%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEM A+EDIEMAEDI+PPLFPLT+ALHDSFL THCSSCFS LPNPPISHS L YCSL
Sbjct: 1 MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60
Query: 61 KCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
KCSLSHSDPLT AFFS HPFP SSDTSDLRASLRLLHLLLSHPS S PP+RI+GLLT
Sbjct: 61 KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120
Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
NRHKLM PQ+DS+VFLKLREG +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180
Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
IG+TIGIAVYA TF WINHSCSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240
Query: 241 TVRSNLSDFIRED--FQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQ 300
VRSN+ DFIRE G GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQ
Sbjct: 241 NVRSNILDFIREGALLNGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300
Query: 301 FVCSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYL 360
FVCSCQRCSA PLTYVDHALQEIS+VKVE LDST ISNFDHD AVRRID+YVDNAITEYL
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYL 360
Query: 361 SISSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRS 420
S SSPESC EKLQNLLT GF DEQ ED E KQ V +RLHPLHFL LNAYTAL SAYKVRS
Sbjct: 361 STSSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRS 420
Query: 421 CDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 480
CDL+ALSS+MD D+ N+ A M +TSAAY+LFLAGATH LFL +PSL+ASAANCWVVAG
Sbjct: 421 CDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAG 480
Query: 481 ESLLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIG 540
ESLLILA HSSLWA TTN+S W P+GKRMC NCSWVD+FNASRI G+ ++ADF EFSIG
Sbjct: 481 ESLLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIG 540
Query: 541 ISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSK 600
ISNCIA++SQK WS LTHGCPYLKAFT P DFSWPK +N +D+ ID SCACSK
Sbjct: 541 ISNCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPK-----TNEQDICGRGIDHSCACSK 600
Query: 601 TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
T+DVC + +PQ SNQERESI GLGIHCL+YGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 TQDVCLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDLN 652
BLAST of HG10003629 vs. ExPASy TrEMBL
Match:
A0A6J1EY39 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)
HSP 1 Score: 951.0 bits (2457), Expect = 2.5e-273
Identity = 494/652 (75.77%), Postives = 540/652 (82.82%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEMEMRAMEDIEMAEDITPPL PLTAALHD+F LTHCSSCFSPLPN ISHSNLLRYCS
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
C S SD LTAA FS FP SDTSDLRASLRLLHLLLS SA+ SAPPERIFGLLTNR
Sbjct: 61 IC--SRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNR 120
Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
KLM+ +DDS+VF+K+R+G DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180
Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
+TIGIAVY PTFCWINHSCSPNACYRFETPSDS TRLRI+P CTD+ T EGSCNQM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTV 240
Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCS 300
R N S FI +DFQGYGPRV+VRSIK +RKGEAVTIAYCDLLQPKA+RQSEL SRY+FVCS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCS 300
Query: 301 CQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISS 360
CQRCSA+P TYVDHALQEISA VE LDSTSISNFD+D A+RRIDDYV+NAI EYLSI S
Sbjct: 301 CQRCSAKPPTYVDHALQEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGS 360
Query: 361 PESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCDLL 420
PESC EKLQNLLTLGF DEQAED + KQ + +RLHP+HFL LN YTALASAYKVRS
Sbjct: 361 PESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW--- 420
Query: 421 ALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLL 480
NDDENQ A+ MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLL
Sbjct: 421 -------NDDENQCNAT-MSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLL 480
Query: 481 ILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCI 540
IL HSSLW +N+SK P+G+ C NCSWVDKFN +RI GRSIEADF EFSIGISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCI 540
Query: 541 ANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDVC 600
A++S K WSFL H C YLKAFTDP DFSWPK ITT NY SC CSK +DV
Sbjct: 541 ADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQDV- 600
Query: 601 FQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
S Q+R+SI LGIHCLFYGGYLASICYGH SHLASQI+ ILHD++
Sbjct: 601 -------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623
BLAST of HG10003629 vs. ExPASy TrEMBL
Match:
A0A6J1I954 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)
HSP 1 Score: 943.7 bits (2438), Expect = 4.0e-271
Identity = 493/653 (75.50%), Postives = 537/653 (82.24%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
MEME+RAMEDIEMAEDITPPL PLTAALHDSFLLTHCSSCFSPLPN PISHSNLLRYCS
Sbjct: 1 MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60
Query: 61 KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
C S+SD LTAA FS F SDTSDLRASLRLLHLLLS SA+ S PPERIFGLLTNR
Sbjct: 61 IC--SYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNR 120
Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
KLM+ DDS+VF K+R+G DAIA SRR NSADIR+ NALEEA++CLVLTNAV+VQDS+G
Sbjct: 121 EKLMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVG 180
Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
+TIGIAVY PTFCWINHSCSPNACYRFETPSDS TRLRI+P CTD+ T EGSC+QM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTV 240
Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCS 300
R N S FI +DFQGYGPRV+VRSIK IRKGEAVTIAYCDLLQPKAMRQSEL SRY+FVCS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFVCS 300
Query: 301 CQRCSAEPLTYVDHALQEISAVKV-EFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSIS 360
CQRCSA+P TYVDHALQEI AV V E LDSTSISNFD+D A+ RIDDYV+NAI EYLSI
Sbjct: 301 CQRCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIG 360
Query: 361 SPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCDL 420
SPESC EKLQNLLTLGF DEQA+D + KQ + +RLHP+HFL LN YTALASAYKVRS
Sbjct: 361 SPESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSW-- 420
Query: 421 LALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESL 480
ND+ENQ S MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESL
Sbjct: 421 --------NDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESL 480
Query: 481 LILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNC 540
L L HSSLW +N+SK P+G+ C NCSWVDKFN SRI GRSIE DF EFSIGISNC
Sbjct: 481 LRLVRHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFSIGISNC 540
Query: 541 IANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDV 600
IAN+S K WSFLTH CPYLKAFTDP DFSWPK ITT SNYR D C SK +DV
Sbjct: 541 IANISHKYWSFLTHECPYLKAFTDPFDFSWPKTITTCSNYR-------DRLCDYSKIQDV 600
Query: 601 CFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
S+Q+R+SI LGIHCLFYGGYLASICYGH SHL+SQIQ IL D++
Sbjct: 601 --------SDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQCILQDMN 625
BLAST of HG10003629 vs. ExPASy TrEMBL
Match:
A0A1S3CJZ3 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)
HSP 1 Score: 865.5 bits (2235), Expect = 1.4e-247
Identity = 444/532 (83.46%), Postives = 468/532 (87.97%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1 MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60
Query: 63 SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
SLSHSDPLTAAFFS HP P SSDTSDLRASLRL LHLLLSHPS S PP RIFGLLT
Sbjct: 61 SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120
Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
NRHKLM PQ+ S+VFLKLRE +AIAA RRKN ADI G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180
Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240
Query: 243 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 302
VRSN+ DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQFV
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300
Query: 303 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 362
CSCQRCSA PLTYVDHALQEISAVKVE LDS ISNFDHD AVRRID+YVDNAITEYLSI
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSI 360
Query: 363 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 422
SPESC EKLQNLLT GF DEQ ED E KQPV +RLHP HFL LNAYTAL SAYKVRSCD
Sbjct: 361 GSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCD 420
Query: 423 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 482
LLALSS+MD D+EN+ A MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGES
Sbjct: 421 LLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGES 480
Query: 483 LLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADF 529
LLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF
Sbjct: 481 LLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADF 532
BLAST of HG10003629 vs. TAIR 10
Match:
AT1G43245.1 (SET domain-containing protein )
HSP 1 Score: 374.4 bits (960), Expect = 1.9e-103
Identity = 244/649 (37.60%), Postives = 350/649 (53.93%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
ME+RA EDIE+ D+ PPL PL ++L+DSFL +HCSSCFS LP P YCS C
Sbjct: 1 MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60
Query: 63 SLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNRHK 122
S LT +F ++ FP T L + +R LL+ + S+ P R+ LLTN H
Sbjct: 61 S------LTDSFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTSSSPHRLNNLLTNHHL 120
Query: 123 LMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRT 182
LM D + + + + IA R N R LEEA +C VLTNAV+V DS G
Sbjct: 121 LMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLA 180
Query: 183 IGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRS 242
+GIA+Y +F WINHSCSPN+CYRF ++ T+ + + T+ +N Q+
Sbjct: 181 LGIALYNSSFSWINHSCSPNSCYRF---VNNRTSYHDVHVTNTETSSNLELQEQVCGTSL 240
Query: 243 NLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCSCQ 302
N + G GP+++VRSIK I+ GE +T++Y DLLQP +RQS+LWS+Y+F+C+C
Sbjct: 241 NSGN-------GNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCG 300
Query: 303 RCSAEPLTYVDHALQEISAVKVEFLDSTSISNFD----HDQAVRRIDDYVDNAITEYLSI 362
RC+A P YVD L+ + ++ E T++ +FD D+AV +++DY+ AI ++LS
Sbjct: 301 RCAASPPAYVDSILEGVLTLESE---KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSD 360
Query: 363 S-SPESCYEKLQNLLTLGFCDEQAEDKEEKQP-VMRLHPLHFLSLNAYTALASAYKVRSC 422
+ P++C E ++++L G + KE+ QP +RLH H+++LNAY LA+AY++RS
Sbjct: 361 NIDPKTCCEMIESVLHHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI 420
Query: 423 DLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
D ++ G MSR SAAYSLFLAG +HHLF ++ S SAA W AG
Sbjct: 421 D-------------SETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAG 480
Query: 483 ESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGI 542
E L LA + + S C+ C ++ N+ R D E S I
Sbjct: 481 ELLFDLAPKLLMELSVESDV--------KCTKCLMLETSNSHR--------DIKEKSRQI 540
Query: 543 SNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKT 602
+C+ ++SQ +WSFLT GCPYL+ F P DFS +T + R+
Sbjct: 541 LSCVRDISQVTWSFLTRGCPYLEKFRSPVDFS----LTRTNGERE--------------- 557
Query: 603 KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ 645
+ S + ++L L HCL Y L +CYG SHL S+ +
Sbjct: 601 ---------ESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557
BLAST of HG10003629 vs. TAIR 10
Match:
AT2G17900.1 (SET domain group 37 )
HSP 1 Score: 44.3 bits (103), Expect = 4.4e-04
Identity = 39/135 (28.89%), Postives = 48/135 (35.56%), Query Frame = 0
Query: 171 NAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTN 230
NA + DS R GI ++ P INHSCSPNA FE
Sbjct: 189 NAHSICDSELRPQGIGLF-PLVSIINHSCSPNAVLVFE---------------------- 248
Query: 231 EGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSE 290
QM VVR++ I K +TI+Y + RQ
Sbjct: 249 ----EQM---------------------AVVRAMDNISKDSEITISYIETAGSTLTRQKS 275
Query: 291 LWSRYQFVCSCQRCS 306
L +Y F C C RCS
Sbjct: 309 LKEQYLFHCQCARCS 275
BLAST of HG10003629 vs. TAIR 10
Match:
AT1G26760.1 (SET domain protein 35 )
HSP 1 Score: 43.1 bits (100), Expect = 9.8e-04
Identity = 20/56 (35.71%), Postives = 31/56 (55.36%), Query Frame = 0
Query: 256 GPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCSCQRCSAEPLTY 312
G V+V + + I+ GE ++ AY D+L P R+ E+ + F C C RC E + Y
Sbjct: 354 GDYVIVHASRDIKTGEEISFAYFDVLSPLEKRK-EMAESWGFCCGCSRCKFESVLY 408
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q3ECY6 | 2.6e-102 | 37.60 | Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1 | [more] |
Q9H7B4 | 1.9e-04 | 22.14 | Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 S... | [more] |
Q9CWR2 | 1.9e-04 | 22.14 | Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 ... | [more] |
Q557F7 | 9.6e-04 | 25.32 | SET and MYND domain-containing protein DDB_G0273589 OS=Dictyostelium discoideum ... | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3CIT0 | 5.3e-308 | 82.29 | protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 P... | [more] |
A0A0A0KAK3 | 4.5e-299 | 80.06 | SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV... | [more] |
A0A6J1EY39 | 2.5e-273 | 75.77 | protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143... | [more] |
A0A6J1I954 | 4.0e-271 | 75.50 | protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114726... | [more] |
A0A1S3CJZ3 | 1.4e-247 | 83.46 | protein SET DOMAIN GROUP 41 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501316 P... | [more] |