HG10003629 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10003629
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SET domain-containing protein
Location: Chr08: 4515096 .. 4519922 (-)
RNA-Seq Expression: HG10003629
Synteny: HG10003629
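
The location field above packs chromosome, coordinates, and strand into one string. A minimal sketch of how it could be split apart follows. It assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page. The function name `parse_location` is hypothetical.

```python
import re

# Hypothetical helper; assumes the "Chr: start .. end (strand)" layout shown above
# and 1-based inclusive coordinates (an assumption, not stated in the record).
LOCATION_PATTERN = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into chromosome, start, end, strand, and span length."""
    match = LOCATION_PATTERN.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive span: both endpoints count as bases of the feature.
        "length": end - start + 1,
    }

print(parse_location("Chr08: 4515096 .. 4519922 (-)"))
```

Under the inclusive-coordinate assumption, the span for this gene works out to 4827 bases on the minus strand.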
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGTTTCCCCTCACCGCCGCTCTCCATGATTCCTTCCTCCTCACTCATTGTTCCTCCTGCTTCTCCCCTCTCCCAAATCCCCCAATTTCTCACTCCAATCTCCTCCGCTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCGCCGCCTTCTTCTCCGCCCATCCCTTCCCCTCCTCCGACACCTCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCCATCCCTCCGCTTTCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACGATTCCCAAGTCTTCCTCAAGCTTCGGGAAGGGGTAGACGCAATAGCCGCTTCCAGAAGGAAGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCTGTTCTCTGCCTTGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGTCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCTTCGGATTCCACCACTACGAGGTTACGCATCGCCCCTTCCTGTACTGATCTTGTGACGAATGAAGGAAGTTGTAATCAAGTAATTGTTTGAACTACGAATAATTTTTGTGTTTATGGGGTTTTTTTCTCCCTTTCTGGTGAAATTAGTGAGAGCTTTTGCATGAATGTTTTTGGCAGATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGGTGCGTTTCTTCACGTTTGCATTCAATTTAGTGCTGTAAGTGTTGTCTCTGTTTGTATATTTATGTGAAGTTTGATGAAGATTTTCAGGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGGGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGTATTGTAGCTATATCTTAGTTTTGTTAACTTCCCTTCAGGATATTTCAGAGGGAATTATTTTCTGAAGTAAGTTCCACATTAGCATGCCAAGTTTCATTATTACCTTAGCCCCATTTGATAAATATGGTTTTTTGTTTTTGAAAATTTAGCTTACTTCCACCTATAAGTTTCTATGGTTTGTTTTCTACTTTCTATGGTTTGTTTTCTACTTTCTACTCATGTTTTAAAAAATGGAGCCAAGTTTTAAAAGATAATAGAGTAGTTTTCAAACACATGTTCTTGTATTTAGATAATGAAATTGAGGGAAACAAGCATAAAATTTAATATATAGAAAACTACAAAACGAAATTGTTATCAAATGGCTTCTTAGTATATATTTTAACTCAAAGATTTTCAGAGTAAATGTATTTGGTTCTTGTGGAATATCGTTGTTAATTGCCTCTTCTTGTTGAGGAATTATGATGAAATGACTGGCTATAGATTTGTGAGAAATTTGTTGGTTTTGTCTCAGGCAATGAGGCAGTCAGAGTTGTGGTCAAGATATCAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCGAGCCCCTAACTTATGTGGACCATGCTTTGCAAGTAAGAAAAACTAAAAGATGAGTAGTTTTTGAAAACTTGTTTTTGTTTTTGAAATTTGGCTAAGAATTCAAATGTTTCCTTTAACAAAGACAAAAACCATTGTAAAAGGAGGGGAGAAACAAACACAATTACCAATTTGTTAGGACTCTTCAGAAAACCCCAACAAAAACACAAATCTGGCAAAAACTTCCTTGATTTTTATATTTATATTCTGGAAAAGTATCACAGCAAAGTCAACAACAAGGGAAATAGCAAGAACAAGCAAAACCAATCAAAGCTAACCCAACACATTCGGGAGAGTTGGGAAATTCTCTCCTAGAAGACTACTCTCCTCTACTAAAAACTTCACACCCAAAAC
ACTATTGAAAAACCCTACTTAAGCCCTCAGACATCCAATCTCTCTTAGCCATTCTTGTGGTCACTTCCCTTTTTGCTTAGCATAACAACCCTCCACTTCCACTAACAGTTTGCCCCCTTGGAAAAGATTCTCTTGCCCTTCCTTTCATATGTGTGAATAATTGGTGGCCTAACGATTCCCTTGGGTTTGAAACTCACCTTATCCTGGTGGAACCTAGGGAACGATCTATTCATCATCTCCGTGTCTTCCCAAGTGGCTTCACCCTTAGGTAGATTTGTCCACTTTATGAACCATTCATCACGAGCTCTATCACTGTTCCACCTCTTACCCAACACCACAACTGGTGTTACTTGCAATTCAAATTCATTAGACAAACCCAGGGGGCACTCTTGTGCAGCCACATTCAACCCATCACCTTCTTCAACTGTGAAACATGAAAAACATCATGGATCTTTGCCTCTGGGGTAACTCCAGCCTATAAGCTACTTCTCTTATACATTCTTTAATCACCTTGGCGACAATACACTTGGCAAATGATGCCAAGTTGTGGTGGAGGTCCTGTTACATGGACATCCAATAAGGTTGTTGTACCATCAACGCATGGGAAAAACTGAAACAAGAATGCGCAGCGGTTGGTGGGTGATCGTATGGGAGAAGTAGCTTATAGGTTGGAGTTACCTCCAAAGGCAAAGATTCCTGATGTTTTCCATGTCTCACAGTTGAAGAAGGTGATAGGATCGAATGTGGTTACACAAAAGTGCCCCCGGATTTGTCTGATAAATTTGAATTGCAAGTAACACTAGCTGTGGTGTTGGGTAAGAGGTGGAACAACGAAAGAACAAAAGAGCTCGTGATGAATGCCTTATAAAGTGGACTAAGCTATCTGAGGATGAAGCCACCTTCAGGACCAAGTGAGTTTCAAACCCGAGGGAATTGTTAGGCCACCAATTATTCACACATATGAAAGGAAGGGCAAGAGAGTCTTTTCCAAGGGGCATTCAATTAGTAGAAGTGAAGGATTGTTAAGCGAAAAATGGAGGGGACCAGGAGAATGGGGCTGAGAGCGCCTGGACGTCTGAGGGCTTAAGTAGGGTTTTTTCAATAGTGTTTTGGGTGTGAAGTTTTTAGTAGAAGAGAGACTAGGAGAGAATTTCCAAGCTTTCTTTAATGTTCTGGGTTAGCTTTGGTTAATGGTTTTACTTCTTGTTATTTCTCTTATTGACTTTGCTGTGATAGTTTTTCCAGAATATAAATATATAAATAAGGGAAGTTCTTGCCCTCTTTTCTGGGATTTGTGTGTTTTTGTTGGGATTTTCTGTATGCAGGTGGAAGATTCCTAACATTATCAAAGTGGGCCTAGCAATTAGGATGAATTTGGTATCACTCTTGACATAATGAAAAATGTATCTGGCCTTTTGAAAACTTTCAAGATGTTTGATTTAAAAGATTCTGACTATGCATAGAACACTTGGAAAAAAAAAAGAACTTAAGCATTTAAAAGCAGTTTAGAAGGAGAGTTTGGTATAGTATTTGGGAGAGTGAAAAGTGCTTTTAATTAATCAAAAGCACTTTTCCAAATTTGTATGGTGGATTGTAACATAAACAGTGTGATTTAAAAAAACTCTAAAAACACTTGAAGGCATTTTAAAGTTTTCCCCTTAGAAATGATCATTTTAGAAAACTCATTACAAACTCAAACCTATTCTCAAATTGTCGGTGATTAGATTTCTGGTTATATATGTTGTTCTTTTGAATATTTTGTCGTTGTTTTTTGGAGCTCTACCTTATTTTTTCTCTTATGAGGGAAGATAAATTTCTTCAGGAAATCTCTGCTGTCAAGGTGGAATTTCTTGATTCAACTTCCATTAGCAACTTTGATCATGACCAAGCAGTGAGAAGAATAGATGATTATGTCGACAATGCTATCACCGAGTACCTGTCTATCAGTTCTCCTGAATCGTGTTATGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCT
GTGATGAGCAAGCGGAAGACAAGGAAGAAAAACAGCCAGTTATGAGGCTGCATCCTTTGCACTTCCTGTCGCTGAATGCATACACTGCTCTCGCATCAGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCCAAAATGGACAATGACGATGAAAATCAACGTGGAGCATCTATCATGAGCAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGACCCATCTTTGATTGCGTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTATTCTTGCTAGTCACAGCTCATTATGGGCTACTACTAACTCTTCAAAATGGGGTTTACCTGTTGGAAAAAGAATGTGCTCTAACTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCTTGGTCGATCTATCGAAGCTGATTTTCACGAGTTTTCAATTGGTATTTCAAATTGTATTGCTAATATGTCACAAAAATCTTGGAGCTTTCTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTCTGATTTCAGCTGGCCAAAGGCTATCACAACATATTCGAATTACCGAGATCTACAGGCTCATAGCATCGATCCTTCGTGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAACCAAGAGAGAGAATCTATCCTTGGGCTTGGCATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTAA

mRNA sequence

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGTTTCCCCTCACCGCCGCTCTCCATGATTCCTTCCTCCTCACTCATTGTTCCTCCTGCTTCTCCCCTCTCCCAAATCCCCCAATTTCTCACTCCAATCTCCTCCGCTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCGCCGCCTTCTTCTCCGCCCATCCCTTCCCCTCCTCCGACACCTCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCCATCCCTCCGCTTTCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACGATTCCCAAGTCTTCCTCAAGCTTCGGGAAGGGGTAGACGCAATAGCCGCTTCCAGAAGGAAGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCTGTTCTCTGCCTTGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGTCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCTTCGGATTCCACCACTACGAGGTTACGCATCGCCCCTTCCTGTACTGATCTTGTGACGAATGAAGGAAGTTGTAATCAAATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGATTTTCAGGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGGGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGCAATGAGGCAGTCAGAGTTGTGGTCAAGATATCAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCGAGCCCCTAACTTATGTGGACCATGCTTTGCAAGAAATCTCTGCTGTCAAGGTGGAATTTCTTGATTCAACTTCCATTAGCAACTTTGATCATGACCAAGCAGTGAGAAGAATAGATGATTATGTCGACAATGCTATCACCGAGTACCTGTCTATCAGTTCTCCTGAATCGTGTTATGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTGTGATGAGCAAGCGGAAGACAAGGAAGAAAAACAGCCAGTTATGAGGCTGCATCCTTTGCACTTCCTGTCGCTGAATGCATACACTGCTCTCGCATCAGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCCAAAATGGACAATGACGATGAAAATCAACGTGGAGCATCTATCATGAGCAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGACCCATCTTTGATTGCGTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTATTCTTGCTAGTCACAGCTCATTATGGGCTACTACTAACTCTTCAAAATGGGGTTTACCTGTTGGAAAAAGAATGTGCTCTAACTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCTTGGTCGATCTATCGAAGCTGATTTTCACGAGTTTTCAATTGGTATTTCAAATTGTATTGCTAATATGTCACAAAAATCTTGGAGCTTTCTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTCTGATTTCAGCTGGCCAAAGGCTATCACAACATATTCGAATTACCGAGATCTACAGGCTCATAGCATCGATCCTTCGTGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAACCAAGAGAGAGAATCTATCCTTGGGCTTGGCATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTAA

Coding sequence (CDS)

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGTTTCCCCTCACCGCCGCTCTCCATGATTCCTTCCTCCTCACTCATTGTTCCTCCTGCTTCTCCCCTCTCCCAAATCCCCCAATTTCTCACTCCAATCTCCTCCGCTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCGCCGCCTTCTTCTCCGCCCATCCCTTCCCCTCCTCCGACACCTCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCCATCCCTCCGCTTTCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACGATTCCCAAGTCTTCCTCAAGCTTCGGGAAGGGGTAGACGCAATAGCCGCTTCCAGAAGGAAGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCTGTTCTCTGCCTTGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGTCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCTTCGGATTCCACCACTACGAGGTTACGCATCGCCCCTTCCTGTACTGATCTTGTGACGAATGAAGGAAGTTGTAATCAAATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGATTTTCAGGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGGGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGCAATGAGGCAGTCAGAGTTGTGGTCAAGATATCAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCGAGCCCCTAACTTATGTGGACCATGCTTTGCAAGAAATCTCTGCTGTCAAGGTGGAATTTCTTGATTCAACTTCCATTAGCAACTTTGATCATGACCAAGCAGTGAGAAGAATAGATGATTATGTCGACAATGCTATCACCGAGTACCTGTCTATCAGTTCTCCTGAATCGTGTTATGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTGTGATGAGCAAGCGGAAGACAAGGAAGAAAAACAGCCAGTTATGAGGCTGCATCCTTTGCACTTCCTGTCGCTGAATGCATACACTGCTCTCGCATCAGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCCAAAATGGACAATGACGATGAAAATCAACGTGGAGCATCTATCATGAGCAGAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGACCCATCTTTGATTGCGTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTATTCTTGCTAGTCACAGCTCATTATGGGCTACTACTAACTCTTCAAAATGGGGTTTACCTGTTGGAAAAAGAATGTGCTCTAACTGCTCATGGGTCGATAAGTTCAATGCGAGTAGAATCCTTGGTCGATCTATCGAAGCTGATTTTCACGAGTTTTCAATTGGTATTTCAAATTGTATTGCTAATATGTCACAAAAATCTTGGAGCTTTCTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTCTGATTTCAGCTGGCCAAAGGCTATCACAACATATTCGAATTACCGAGATCTACAGGCTCATAGCATCGATCCTTCGTGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAACCAAGAGAGAGAATCTATCCTTGGGCTTGGCATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTAA

Protein sequence

MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISSPESCYEKLQNLLTLGFCDEQAEDKEEKQPVMRLHPLHFLSLNAYTALASAYKVRSCDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD
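
The protein sequence above can in principle be re-derived from the coding sequence. The following is a minimal sketch that assumes the standard nuclear genetic code; the page does not say which translation table was used. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as X rather than guessed.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGAGATGGAAATGAGAGCAATGGAAGAC"))
```

Applied to the first ten codons of the CDS, this yields MEMEMRAMED, which matches the start of the protein sequence shown above.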
Homology
BLAST of HG10003629 vs. NCBI nr
Match: XP_038886411.1 (protein SET DOMAIN GROUP 41 [Benincasa hispida])

HSP 1 Score: 1095.5 bits (2832), Expect = 0.0e+00
Identity = 549/654 (83.94%), Positives = 584/654 (89.30%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
           MEMEM AMEDIEMAEDITPPL PLT+ALHDSFL THCSSCFS LPNPPISHSNLLRYCS 
Sbjct: 1   MEMEMIAMEDIEMAEDITPPLLPLTSALHDSFLFTHCSSCFSLLPNPPISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTAAFFSAHPFPS--SDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
           KCSLSHSDPLTAAFFS HPFPS  S TSDLRASLRLLHLLLSHP A  S PPERIFGLLT
Sbjct: 61  KCSLSHSDPLTAAFFSTHPFPSPFSYTSDLRASLRLLHLLLSHPPASLSPPPERIFGLLT 120

Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
           NRHKLM PQ D+++F KLREGVDAIAA     SADI HG+ L EA LCLV TNAVDV DS
Sbjct: 121 NRHKLMFPQHDAELFPKLREGVDAIAALL---SADIPHGHTLAEAALCLVFTNAVDVHDS 180

Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
            GRTIGIAVY PTFCWINHSCSPNACYRFET S STTTR RIAPSCTDL+T +GSC+QMG
Sbjct: 181 TGRTIGIAVYPPTFCWINHSCSPNACYRFETSSASTTTRSRIAPSCTDLLTGQGSCSQMG 240

Query: 241 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300
           TVRSNLSDFI EDFQG GPRV+VRSIK IR+GEAVTIAYCDLLQPKAMRQSELWSRYQFV
Sbjct: 241 TVRSNLSDFITEDFQGNGPRVMVRSIKSIRRGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300

Query: 301 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 360
           CSCQRCSA+PLTYVDHALQE+SA KVE  DSTSISNFDHD+AVRRIDDYV++AITEYLSI
Sbjct: 301 CSCQRCSAKPLTYVDHALQELSASKVELHDSTSISNFDHDKAVRRIDDYVNSAITEYLSI 360

Query: 361 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 420
            SPESC EKL+NLLTLGF DEQAED E+KQPV +RLHPLHFLSLN YTALASAYKVRSCD
Sbjct: 361 GSPESCCEKLRNLLTLGFYDEQAEDGEQKQPVNLRLHPLHFLSLNVYTALASAYKVRSCD 420

Query: 421 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 480
           LLALSS+MD D+E+Q  AS M + SAAYSLFLAGATHHLFLS+PSLI SA+ CWV+AGES
Sbjct: 421 LLALSSEMDCDNEDQCNASTMCKASAAYSLFLAGATHHLFLSEPSLIVSASTCWVLAGES 480

Query: 481 LLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISN 540
           LL LA HS LWATTN+SKWG PVGKRMCS CSWVDKFNASRI G+ IEADF EFSIGISN
Sbjct: 481 LLTLARHSLLWATTNTSKWGFPVGKRMCSTCSWVDKFNASRIHGQPIEADFREFSIGISN 540

Query: 541 CIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKD 600
           CIANMS+KSWSFLTHGCPYLKAFTDP +FSWPK I  YS+ RD++AHSID  CACS +KD
Sbjct: 541 CIANMSRKSWSFLTHGCPYLKAFTDPFNFSWPKMIPMYSSDRDIRAHSIDRLCACSNSKD 600

Query: 601 VCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
           VCFQ EPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 VCFQCEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILYDLN 651
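
The percentages in each HSP summary line are the matching-position counts divided by the alignment length. A minimal sketch of recomputing them from the summary line follows. The function name `hsp_percentages` is hypothetical, and the pattern accepts both spellings of the positives label so it works on the raw page text as well.

```python
import re

# Matches "Identity = 549/654" and "Positives = 584/654" (or the "Postives" spelling).
STAT_PATTERN = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(summary_line):
    """Recompute identity and positives percentages from an HSP summary line."""
    result = {}
    for label, matched, aligned in STAT_PATTERN.findall(summary_line):
        key = "identity" if label == "Identity" else "positives"
        # Percentage of aligned columns that are identical (or positively scored).
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result

line = "Identity = 549/654 (83.94%), Positives = 584/654 (89.30%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Benincasa hispida hit above, 549 of 654 aligned positions gives 83.94% identity and 584 of 654 gives 89.30% positives, in agreement with the reported values.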

BLAST of HG10003629 vs. NCBI nr
Match: XP_008463080.1 (PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo])

HSP 1 Score: 1066.2 bits (2756), Expect = 1.1e-307
Identity = 539/655 (82.29%), Positives = 574/655 (87.63%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
           MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
           SLSHSDPLTAAFFS HP P  SSDTSDLRASLRL  LHLLLSHPS   S PP RIFGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
           NRHKLM PQ+ S+VFLKLRE  +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
           IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 302
            VRSN+ DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQFV
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 362
           CSCQRCSA PLTYVDHALQEISAVKVE LDS  ISNFDHD AVRRID+YVDNAITEYLSI
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSI 360

Query: 363 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 422
            SPESC EKLQNLLT GF DEQ ED E KQPV +RLHP HFL LNAYTAL SAYKVRSCD
Sbjct: 361 GSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCD 420

Query: 423 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 482
           LLALSS+MD D+EN+  A  MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGES
Sbjct: 421 LLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGES 480

Query: 483 LLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGIS 542
           LLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF EFSIGIS
Sbjct: 481 LLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGIS 540

Query: 543 NCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTK 602
           NCIA++S+K WSFLTHGCPYLKAFTDP DFSWPK     +N  D+  H ID SCACSKTK
Sbjct: 541 NCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACSKTK 600

Query: 603 DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
           D+CF+ EPQ SNQERESI GLGIHCL+YGGYLASICYG+HSHLASQIQNIL+DL+
Sbjct: 601 DICFECEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650

BLAST of HG10003629 vs. NCBI nr
Match: XP_011656459.1 (protein SET DOMAIN GROUP 41 [Cucumis sativus])

HSP 1 Score: 1048.5 bits (2710), Expect = 2.4e-302
Identity = 529/655 (80.76%), Positives = 571/655 (87.18%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
           MEMEM A+EDIEMAEDI+PPLFPLT+ALHDSFL THCSSCFS LPNPPISHS  L YCSL
Sbjct: 1   MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60

Query: 61  KCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
           KCSLSHSDPLT AFFS HPFP  SSDTSDLRASLRLLHLLLSHPS   S PP+RI+GLLT
Sbjct: 61  KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120

Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
           NRHKLM PQ+DS+VFLKLREG +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180

Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
           IG+TIGIAVYA TF WINHSCSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240

Query: 241 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300
            VRSN+ DFIREDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQFV
Sbjct: 241 NVRSNILDFIREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 301 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 360
           CSCQRCSA PLTYVDHALQEIS+VKVE LDST ISNFDHD AVRRID+YVDNAITEYLS 
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYLST 360

Query: 361 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 420
           SSPESC EKLQNLLT GF DEQ ED E KQ V +RLHPLHFL LNAYTAL SAYKVRSCD
Sbjct: 361 SSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRSCD 420

Query: 421 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 480
           L+ALSS+MD D+ N+  A  M +TSAAY+LFLAGATH LFL +PSL+ASAANCWVVAGES
Sbjct: 421 LVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAGES 480

Query: 481 LLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGIS 540
           LLILA HSSLWA TTN+S W  P+GKRMC NCSWVD+FNASRI G+ ++ADF EFSIGIS
Sbjct: 481 LLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIGIS 540

Query: 541 NCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTK 600
           NCIA++SQK WS LTHGCPYLKAFT P DFSWPK     +N +D+    ID SCACSKT+
Sbjct: 541 NCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPK-----TNEQDICGRGIDHSCACSKTQ 600

Query: 601 DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
           DVC + +PQ SNQERESI GLGIHCL+YGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 DVCLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDLN 650

BLAST of HG10003629 vs. NCBI nr
Match: XP_023520942.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 972.6 bits (2513), Expect = 1.7e-279
Identity = 507/652 (77.76%), Positives = 546/652 (83.74%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
           MEMEMRAMEDIEMAEDITPPL PLTAALHD+FLLTHCSSCFSPLPN  ISHSNLLRYCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFLLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
            C  SHSD LTAA FS   FP SDTSDLRASLRLLHLLLS PSA+ SAPPERIFGLLTNR
Sbjct: 61  IC--SHSDSLTAAVFSTGQFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNR 120

Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
            KLM+  DDS+VF+K+REG DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLADDDSEVFVKIREGSDAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180

Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
           RTIGIAVY PTFCWINHSCSPNACYRFETPSDS  TRLRI+P CTD+ T EGSC+QM TV
Sbjct: 181 RTIGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTV 240

Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCS 300
           R N S FI +DFQGYGPRV+VRSIK IR GEAVTIAYCDLLQPKAMRQSEL SRY+FVCS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSIRNGEAVTIAYCDLLQPKAMRQSELRSRYKFVCS 300

Query: 301 CQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISS 360
           CQRCSA+P TYVDHALQEISAV VE LDSTSISNFD+D A+ RIDDYV+NAI EYLSI S
Sbjct: 301 CQRCSAKPPTYVDHALQEISAVNVELLDSTSISNFDYDTAIARIDDYVNNAIAEYLSIGS 360

Query: 361 PESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCDLL 420
            ESC EKLQNLLTLGF DEQAED + KQ + +RLHP+HFL LNAYTALASAYKVRS    
Sbjct: 361 SESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSW--- 420

Query: 421 ALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLL 480
                  N DENQ  A+ MS+TSAAYSLFLAGATHHLFLS+PSLIASAANCWVVAGESLL
Sbjct: 421 -------NGDENQCNAT-MSKTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLL 480

Query: 481 ILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCI 540
           IL  HSSLW  +N+SK   P+G+  C NCSWVDKFN SRI GRSIEADF EFSIGISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDVC 600
           AN+SQK WSFL H C YLKAFTDP DFSWPK ITT SNYR       D SC CSK +DV 
Sbjct: 541 ANISQKYWSFLAHECSYLKAFTDPFDFSWPKTITTCSNYR-------DRSCDCSKIQDV- 600

Query: 601 FQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
                  S+Q+R+SI  LGIHCLFYGGYLASICYGHHSHLASQIQ ILHD++
Sbjct: 601 -------SDQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIQCILHDMN 623

BLAST of HG10003629 vs. NCBI nr
Match: XP_022932824.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata])

HSP 1 Score: 951.0 bits (2457), Expect = 5.2e-273
Identity = 494/652 (75.77%), Positives = 540/652 (82.82%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
           MEMEMRAMEDIEMAEDITPPL PLTAALHD+F LTHCSSCFSPLPN  ISHSNLLRYCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
            C  S SD LTAA FS   FP SDTSDLRASLRLLHLLLS  SA+ SAPPERIFGLLTNR
Sbjct: 61  IC--SRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNR 120

Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
            KLM+ +DDS+VF+K+R+G DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180

Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
           +TIGIAVY PTFCWINHSCSPNACYRFETPSDS  TRLRI+P CTD+ T EGSCNQM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTV 240

Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCS 300
           R N S FI +DFQGYGPRV+VRSIK +RKGEAVTIAYCDLLQPKA+RQSEL SRY+FVCS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCS 300

Query: 301 CQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISS 360
           CQRCSA+P TYVDHALQEISA  VE LDSTSISNFD+D A+RRIDDYV+NAI EYLSI S
Sbjct: 301 CQRCSAKPPTYVDHALQEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGS 360

Query: 361 PESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCDLL 420
           PESC EKLQNLLTLGF DEQAED + KQ + +RLHP+HFL LN YTALASAYKVRS    
Sbjct: 361 PESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW--- 420

Query: 421 ALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLL 480
                  NDDENQ  A+ MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLL
Sbjct: 421 -------NDDENQCNAT-MSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLL 480

Query: 481 ILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCI 540
           IL  HSSLW  +N+SK   P+G+  C NCSWVDKFN +RI GRSIEADF EFSIGISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDVC 600
           A++S K WSFL H C YLKAFTDP DFSWPK ITT  NY          SC CSK +DV 
Sbjct: 541 ADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQDV- 600

Query: 601 FQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
                  S Q+R+SI  LGIHCLFYGGYLASICYGH SHLASQI+ ILHD++
Sbjct: 601 -------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623

BLAST of HG10003629 vs. ExPASy Swiss-Prot
Match: Q3ECY6 (Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1)

HSP 1 Score: 374.4 bits (960), Expect = 2.6e-102
Identity = 244/649 (37.60%), Positives = 350/649 (53.93%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
           ME+RA EDIE+  D+ PPL PL ++L+DSFL +HCSSCFS LP  P        YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60

Query: 63  SLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNRHK 122
           S      LT +F ++  FP   T  L + +R    LL+  +   S+ P R+  LLTN H 
Sbjct: 61  S------LTDSFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTSSSPHRLNNLLTNHHL 120

Query: 123 LMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRT 182
           LM    D  + + +    + IA   R N    R    LEEA +C VLTNAV+V DS G  
Sbjct: 121 LMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLA 180

Query: 183 IGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRS 242
           +GIA+Y  +F WINHSCSPN+CYRF    ++ T+   +  + T+  +N     Q+     
Sbjct: 181 LGIALYNSSFSWINHSCSPNSCYRF---VNNRTSYHDVHVTNTETSSNLELQEQVCGTSL 240

Query: 243 NLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCSCQ 302
           N  +       G GP+++VRSIK I+ GE +T++Y DLLQP  +RQS+LWS+Y+F+C+C 
Sbjct: 241 NSGN-------GNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCG 300

Query: 303 RCSAEPLTYVDHALQEISAVKVEFLDSTSISNFD----HDQAVRRIDDYVDNAITEYLSI 362
           RC+A P  YVD  L+ +  ++ E    T++ +FD     D+AV +++DY+  AI ++LS 
Sbjct: 301 RCAASPPAYVDSILEGVLTLESE---KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSD 360

Query: 363 S-SPESCYEKLQNLLTLGFCDEQAEDKEEKQP-VMRLHPLHFLSLNAYTALASAYKVRSC 422
           +  P++C E ++++L  G      + KE+ QP  +RLH  H+++LNAY  LA+AY++RS 
Sbjct: 361 NIDPKTCCEMIESVLHHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI 420

Query: 423 DLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
           D             ++ G    MSR SAAYSLFLAG +HHLF ++ S   SAA  W  AG
Sbjct: 421 D-------------SETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAG 480

Query: 483 ESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGI 542
           E L  LA    +  +  S           C+ C  ++  N+ R        D  E S  I
Sbjct: 481 ELLFDLAPKLLMELSVESDV--------KCTKCLMLETSNSHR--------DIKEKSRQI 540

Query: 543 SNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKT 602
            +C+ ++SQ +WSFLT GCPYL+ F  P DFS    +T  +  R+               
Sbjct: 541 LSCVRDISQVTWSFLTRGCPYLEKFRSPVDFS----LTRTNGERE--------------- 557

Query: 603 KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ 645
                    + S  +  ++L L  HCL Y   L  +CYG  SHL S+ +
Sbjct: 601 ---------ESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557

BLAST of HG10003629 vs. ExPASy Swiss-Prot
Match: Q9H7B4 (Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 SV=4)

HSP 1 Score: 49.3 bits (116), Expect = 1.9e-04
Identity = 31/140 (22.14%), Positives = 54/140 (38.57%), Query Frame = 0

Query: 168 VLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDL 227
           V+ N+  + ++  + +G+ +Y P+   +NHSC PN    F                    
Sbjct: 178 VICNSFTICNAEMQEVGVGLY-PSISLLNHSCDPNCSIVFN------------------- 237

Query: 228 VTNEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMR 287
                                       GP +++R+++ I  GE +TI Y D+L     R
Sbjct: 238 ----------------------------GPHLLLRAVRDIEVGEELTICYLDMLMTSEER 269

Query: 288 QSELWSRYQFVCSCQRCSAE 308
           + +L  +Y F C C RC  +
Sbjct: 298 RKQLRDQYCFECDCFRCQTQ 269

BLAST of HG10003629 vs. ExPASy Swiss-Prot
Match: Q9CWR2 (Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 SV=1)

HSP 1 Score: 49.3 bits (116), Expect = 1.9e-04
Identity = 31/140 (22.14%), Positives = 54/140 (38.57%), Query Frame = 0

Query: 168 VLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDL 227
           V+ N+  + ++  + +G+ +Y P+   +NHSC PN    F                    
Sbjct: 178 VICNSFTICNAEMQEVGVGLY-PSMSLLNHSCDPNCSIVFN------------------- 237

Query: 228 VTNEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMR 287
                                       GP +++R+++ I  GE +TI Y D+L     R
Sbjct: 238 ----------------------------GPHLLLRAVREIEAGEELTICYLDMLMTSEER 269

Query: 288 QSELWSRYQFVCSCQRCSAE 308
           + +L  +Y F C C RC  +
Sbjct: 298 RKQLRDQYCFECDCIRCQTQ 269

BLAST of HG10003629 vs. ExPASy Swiss-Prot
Match: Q557F7 (SET and MYND domain-containing protein DDB_G0273589 OS=Dictyostelium discoideum OX=44689 GN=DDB_G0273589 PE=3 SV=1)

HSP 1 Score: 47.0 bits (110), Expect = 9.6e-04
Identity = 39/154 (25.32%), Positives = 61/154 (39.61%), Query Frame = 0

Query: 154 IRHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDS 213
           IR  N    +++     N   +     + IG+AV +P+  + NHSC PN           
Sbjct: 218 IRKINEKSRSIIHKTRCNQFGIWTKNDKCIGVAV-SPSSSYFNHSCIPN----------- 277

Query: 214 TTTRLRIAPSCTDLVTNEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAV 273
                     CTD                     +R+     G  +  +S+  I+KG+ +
Sbjct: 278 ----------CTD---------------------VRD-----GSNMTFKSLYPIKKGDQL 323

Query: 274 TIAYCDLLQPKAMRQSELWSRYQFVCSCQRCSAE 308
           TI+Y +L QP   R+ EL   Y F C C RC+ +
Sbjct: 338 TISYIELDQPIQDRKDELKYGYYFDCICPRCNGD 323

BLAST of HG10003629 vs. ExPASy TrEMBL
Match: A0A1S3CIT0 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)

HSP 1 Score: 1066.2 bits (2756), Expect = 5.3e-308
Identity = 539/655 (82.29%), Positives = 574/655 (87.63%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
           MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
           SLSHSDPLTAAFFS HP P  SSDTSDLRASLRL  LHLLLSHPS   S PP RIFGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
           NRHKLM PQ+ S+VFLKLRE  +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
           IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 302
            VRSN+ DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQFV
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 362
           CSCQRCSA PLTYVDHALQEISAVKVE LDS  ISNFDHD AVRRID+YVDNAITEYLSI
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSI 360

Query: 363 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 422
            SPESC EKLQNLLT GF DEQ ED E KQPV +RLHP HFL LNAYTAL SAYKVRSCD
Sbjct: 361 GSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCD 420

Query: 423 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 482
           LLALSS+MD D+EN+  A  MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGES
Sbjct: 421 LLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGES 480

Query: 483 LLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGIS 542
           LLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF EFSIGIS
Sbjct: 481 LLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGIS 540

Query: 543 NCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTK 602
           NCIA++S+K WSFLTHGCPYLKAFTDP DFSWPK     +N  D+  H ID SCACSKTK
Sbjct: 541 NCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACSKTK 600

Query: 603 DVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
           D+CF+ EPQ SNQERESI GLGIHCL+YGGYLASICYG+HSHLASQIQNIL+DL+
Sbjct: 601 DICFECEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650

BLAST of HG10003629 vs. ExPASy TrEMBL
Match: A0A0A0KAK3 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV=1)

HSP 1 Score: 1036.6 bits (2679), Expect = 4.5e-299
Identity = 526/657 (80.06%), Positives = 568/657 (86.45%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
           MEMEM A+EDIEMAEDI+PPLFPLT+ALHDSFL THCSSCFS LPNPPISHS  L YCSL
Sbjct: 1   MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60

Query: 61  KCSLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLT 120
           KCSLSHSDPLT AFFS HPFP  SSDTSDLRASLRLLHLLLSHPS   S PP+RI+GLLT
Sbjct: 61  KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120

Query: 121 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 180
           NRHKLM PQ+DS+VFLKLREG +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180

Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 240
           IG+TIGIAVYA TF WINHSCSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240

Query: 241 TVRSNLSDFIRED--FQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQ 300
            VRSN+ DFIRE     G GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQ
Sbjct: 241 NVRSNILDFIREGALLNGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300

Query: 301 FVCSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYL 360
           FVCSCQRCSA PLTYVDHALQEIS+VKVE LDST ISNFDHD AVRRID+YVDNAITEYL
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYL 360

Query: 361 SISSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRS 420
           S SSPESC EKLQNLLT GF DEQ ED E KQ V +RLHPLHFL LNAYTAL SAYKVRS
Sbjct: 361 STSSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRS 420

Query: 421 CDLLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 480
           CDL+ALSS+MD D+ N+  A  M +TSAAY+LFLAGATH LFL +PSL+ASAANCWVVAG
Sbjct: 421 CDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAG 480

Query: 481 ESLLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIG 540
           ESLLILA HSSLWA TTN+S W  P+GKRMC NCSWVD+FNASRI G+ ++ADF EFSIG
Sbjct: 481 ESLLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIG 540

Query: 541 ISNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSK 600
           ISNCIA++SQK WS LTHGCPYLKAFT P DFSWPK     +N +D+    ID SCACSK
Sbjct: 541 ISNCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPK-----TNEQDICGRGIDHSCACSK 600

Query: 601 TKDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
           T+DVC + +PQ SNQERESI GLGIHCL+YGGYLASICYGHHSHLASQIQNIL+DL+
Sbjct: 601 TQDVCLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDLN 652

BLAST of HG10003629 vs. ExPASy TrEMBL
Match: A0A6J1EY39 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)

HSP 1 Score: 951.0 bits (2457), Expect = 2.5e-273
Identity = 494/652 (75.77%), Positives = 540/652 (82.82%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
           MEMEMRAMEDIEMAEDITPPL PLTAALHD+F LTHCSSCFSPLPN  ISHSNLLRYCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
            C  S SD LTAA FS   FP SDTSDLRASLRLLHLLLS  SA+ SAPPERIFGLLTNR
Sbjct: 61  IC--SRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNR 120

Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
            KLM+ +DDS+VF+K+R+G DA+AASRR NSADIR+ NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180

Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
           +TIGIAVY PTFCWINHSCSPNACYRFETPSDS  TRLRI+P CTD+ T EGSCNQM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTV 240

Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCS 300
           R N S FI +DFQGYGPRV+VRSIK +RKGEAVTIAYCDLLQPKA+RQSEL SRY+FVCS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCS 300

Query: 301 CQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSISS 360
           CQRCSA+P TYVDHALQEISA  VE LDSTSISNFD+D A+RRIDDYV+NAI EYLSI S
Sbjct: 301 CQRCSAKPPTYVDHALQEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGS 360

Query: 361 PESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCDLL 420
           PESC EKLQNLLTLGF DEQAED + KQ + +RLHP+HFL LN YTALASAYKVRS    
Sbjct: 361 PESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW--- 420

Query: 421 ALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLL 480
                  NDDENQ  A+ MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLL
Sbjct: 421 -------NDDENQCNAT-MSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLL 480

Query: 481 ILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNCI 540
           IL  HSSLW  +N+SK   P+G+  C NCSWVDKFN +RI GRSIEADF EFSIGISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDVC 600
           A++S K WSFL H C YLKAFTDP DFSWPK ITT  NY          SC CSK +DV 
Sbjct: 541 ADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQDV- 600

Query: 601 FQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
                  S Q+R+SI  LGIHCLFYGGYLASICYGH SHLASQI+ ILHD++
Sbjct: 601 -------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623

BLAST of HG10003629 vs. ExPASy TrEMBL
Match: A0A6J1I954 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)

HSP 1 Score: 943.7 bits (2438), Expect = 4.0e-271
Identity = 493/653 (75.50%), Positives = 537/653 (82.24%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSL 60
           MEME+RAMEDIEMAEDITPPL PLTAALHDSFLLTHCSSCFSPLPN PISHSNLLRYCS 
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNR 120
            C  S+SD LTAA FS   F  SDTSDLRASLRLLHLLLS  SA+ S PPERIFGLLTNR
Sbjct: 61  IC--SYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNR 120

Query: 121 HKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIG 180
            KLM+  DDS+VF K+R+G DAIA SRR NSADIR+ NALEEA++CLVLTNAV+VQDS+G
Sbjct: 121 EKLMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVG 180

Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTV 240
           +TIGIAVY PTFCWINHSCSPNACYRFETPSDS  TRLRI+P CTD+ T EGSC+QM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTV 240

Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCS 300
           R N S FI +DFQGYGPRV+VRSIK IRKGEAVTIAYCDLLQPKAMRQSEL SRY+FVCS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFVCS 300

Query: 301 CQRCSAEPLTYVDHALQEISAVKV-EFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSIS 360
           CQRCSA+P TYVDHALQEI AV V E LDSTSISNFD+D A+ RIDDYV+NAI EYLSI 
Sbjct: 301 CQRCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIG 360

Query: 361 SPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCDL 420
           SPESC EKLQNLLTLGF DEQA+D + KQ + +RLHP+HFL LN YTALASAYKVRS   
Sbjct: 361 SPESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSW-- 420

Query: 421 LALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESL 480
                   ND+ENQ   S MS+TSAAYSLFLAGATHHLFL++PSLIASAANCWVVAGESL
Sbjct: 421 --------NDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESL 480

Query: 481 LILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGISNC 540
           L L  HSSLW  +N+SK   P+G+  C NCSWVDKFN SRI GRSIE DF EFSIGISNC
Sbjct: 481 LRLVRHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFSIGISNC 540

Query: 541 IANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKTKDV 600
           IAN+S K WSFLTH CPYLKAFTDP DFSWPK ITT SNYR       D  C  SK +DV
Sbjct: 541 IANISHKYWSFLTHECPYLKAFTDPFDFSWPKTITTCSNYR-------DRLCDYSKIQDV 600

Query: 601 CFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILHDLD 652
                   S+Q+R+SI  LGIHCLFYGGYLASICYGH SHL+SQIQ IL D++
Sbjct: 601 --------SDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQCILQDMN 625

BLAST of HG10003629 vs. ExPASy TrEMBL
Match: A0A1S3CJZ3 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)

HSP 1 Score: 865.5 bits (2235), Expect = 1.4e-247
Identity = 444/532 (83.46%), Positives = 468/532 (87.97%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
           MEMRA+EDIEMAEDITPPLFPLT+ALHDSFL THCSSCFS LPNPPISHS LL YCSLKC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  SLSHSDPLTAAFFSAHPFP--SSDTSDLRASLRL--LHLLLSHPSAFHSAPPERIFGLLT 122
           SLSHSDPLTAAFFS HP P  SSDTSDLRASLRL  LHLLLSHPS   S PP RIFGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NRHKLMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDS 182
           NRHKLM PQ+ S+VFLKLRE  +AIAA RRKN ADI  G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMG 242
           IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFV 302
            VRSN+ DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQFV
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CSCQRCSAEPLTYVDHALQEISAVKVEFLDSTSISNFDHDQAVRRIDDYVDNAITEYLSI 362
           CSCQRCSA PLTYVDHALQEISAVKVE LDS  ISNFDHD AVRRID+YVDNAITEYLSI
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSI 360

Query: 363 SSPESCYEKLQNLLTLGFCDEQAEDKEEKQPV-MRLHPLHFLSLNAYTALASAYKVRSCD 422
            SPESC EKLQNLLT GF DEQ ED E KQPV +RLHP HFL LNAYTAL SAYKVRSCD
Sbjct: 361 GSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCD 420

Query: 423 LLALSSKMDNDDENQRGASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGES 482
           LLALSS+MD D+EN+  A  MS+TSAAY+LFLAGATHHLFL +PSLIASAANCWVVAGES
Sbjct: 421 LLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGES 480

Query: 483 LLILASHSSLWA-TTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADF 529
           LLILA HSSLWA TTN+S WG P+GKRMCSNCSWVD+FN SRI GR I+ADF
Sbjct: 481 LLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADF 532

BLAST of HG10003629 vs. TAIR 10
Match: AT1G43245.1 (SET domain-containing protein )

HSP 1 Score: 374.4 bits (960), Expect = 1.9e-103
Identity = 244/649 (37.60%), Positives = 350/649 (53.93%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLTAALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSLKC 62
           ME+RA EDIE+  D+ PPL PL ++L+DSFL +HCSSCFS LP  P        YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60

Query: 63  SLSHSDPLTAAFFSAHPFPSSDTSDLRASLRLLHLLLSHPSAFHSAPPERIFGLLTNRHK 122
           S      LT +F ++  FP   T  L + +R    LL+  +   S+ P R+  LLTN H 
Sbjct: 61  S------LTDSFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTSSSPHRLNNLLTNHHL 120

Query: 123 LMIPQDDSQVFLKLREGVDAIAASRRKNSADIRHGNALEEAVLCLVLTNAVDVQDSIGRT 182
           LM    D  + + +    + IA   R N    R    LEEA +C VLTNAV+V DS G  
Sbjct: 121 LMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLA 180

Query: 183 IGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTNEGSCNQMGTVRS 242
           +GIA+Y  +F WINHSCSPN+CYRF    ++ T+   +  + T+  +N     Q+     
Sbjct: 181 LGIALYNSSFSWINHSCSPNSCYRF---VNNRTSYHDVHVTNTETSSNLELQEQVCGTSL 240

Query: 243 NLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCSCQ 302
           N  +       G GP+++VRSIK I+ GE +T++Y DLLQP  +RQS+LWS+Y+F+C+C 
Sbjct: 241 NSGN-------GNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCG 300

Query: 303 RCSAEPLTYVDHALQEISAVKVEFLDSTSISNFD----HDQAVRRIDDYVDNAITEYLSI 362
           RC+A P  YVD  L+ +  ++ E    T++ +FD     D+AV +++DY+  AI ++LS 
Sbjct: 301 RCAASPPAYVDSILEGVLTLESE---KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSD 360

Query: 363 S-SPESCYEKLQNLLTLGFCDEQAEDKEEKQP-VMRLHPLHFLSLNAYTALASAYKVRSC 422
           +  P++C E ++++L  G      + KE+ QP  +RLH  H+++LNAY  LA+AY++RS 
Sbjct: 361 NIDPKTCCEMIESVLHHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI 420

Query: 423 DLLALSSKMDNDDENQRG-ASIMSRTSAAYSLFLAGATHHLFLSDPSLIASAANCWVVAG 482
           D             ++ G    MSR SAAYSLFLAG +HHLF ++ S   SAA  W  AG
Sbjct: 421 D-------------SETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAG 480

Query: 483 ESLLILASHSSLWATTNSSKWGLPVGKRMCSNCSWVDKFNASRILGRSIEADFHEFSIGI 542
           E L  LA    +  +  S           C+ C  ++  N+ R        D  E S  I
Sbjct: 481 ELLFDLAPKLLMELSVESDV--------KCTKCLMLETSNSHR--------DIKEKSRQI 540

Query: 543 SNCIANMSQKSWSFLTHGCPYLKAFTDPSDFSWPKAITTYSNYRDLQAHSIDPSCACSKT 602
            +C+ ++SQ +WSFLT GCPYL+ F  P DFS    +T  +  R+               
Sbjct: 541 LSCVRDISQVTWSFLTRGCPYLEKFRSPVDFS----LTRTNGERE--------------- 557

Query: 603 KDVCFQSEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQ 645
                    + S  +  ++L L  HCL Y   L  +CYG  SHL S+ +
Sbjct: 601 ---------ESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557

BLAST of HG10003629 vs. TAIR 10
Match: AT2G17900.1 (SET domain group 37 )

HSP 1 Score: 44.3 bits (103), Expect = 4.4e-04
Identity = 39/135 (28.89%), Positives = 48/135 (35.56%), Query Frame = 0

Query: 171 NAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVTN 230
           NA  + DS  R  GI ++ P    INHSCSPNA   FE                      
Sbjct: 189 NAHSICDSELRPQGIGLF-PLVSIINHSCSPNAVLVFE---------------------- 248

Query: 231 EGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSE 290
                QM                      VVR++  I K   +TI+Y +       RQ  
Sbjct: 249 ----EQM---------------------AVVRAMDNISKDSEITISYIETAGSTLTRQKS 275

Query: 291 LWSRYQFVCSCQRCS 306
           L  +Y F C C RCS
Sbjct: 309 LKEQYLFHCQCARCS 275

BLAST of HG10003629 vs. TAIR 10
Match: AT1G26760.1 (SET domain protein 35 )

HSP 1 Score: 43.1 bits (100), Expect = 9.8e-04
Identity = 20/56 (35.71%), Positives = 31/56 (55.36%), Query Frame = 0

Query: 256 GPRVVVRSIKGIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFVCSCQRCSAEPLTY 312
           G  V+V + + I+ GE ++ AY D+L P   R+ E+   + F C C RC  E + Y
Sbjct: 354 GDYVIVHASRDIKTGEEISFAYFDVLSPLEKRK-EMAESWGFCCGCSRCKFESVLY 408
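
The percentages in each HSP summary line above follow directly from the residue counts (for example, 526 identical positions over an aligned length of 657 gives 80.06%). The sketch below is an illustrative consistency check and not part of the database record; the function name `parse_hsp_summary` and the returned dictionary keys are hypothetical, and the regular expression assumes the summary line has exactly the layout shown in this record.

```python
import re

# Pattern for the HSP summary line as it appears in this record.
# "Posi?tives" also accepts the misspelling "Postives" that some
# report renderers emit (an assumption about input variation).
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Parse one HSP summary line and recompute its percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "aligned_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "reported_identity_pct": float(ident_pct),
        "positives": int(pos),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    summary = parse_hsp_summary(
        "Identity = 526/657 (80.06%), Positives = 568/657 (86.45%), Query Frame = 0"
    )
    # The recomputed and reported values should agree for a well-formed line.
    print(summary["identity_pct"] == summary["reported_identity_pct"])
```

Comparing the recomputed and reported percentages is a quick way to spot lines that were damaged during extraction.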

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038886411.1  0.0e+00   83.94     protein SET DOMAIN GROUP 41 [Benincasa hispida]
XP_008463080.1  1.1e-307  82.29     PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo]
XP_011656459.1  2.4e-302  80.76     protein SET DOMAIN GROUP 41 [Cucumis sativus]
XP_023520942.1  1.7e-279  77.76     protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022932824.1  5.2e-273  75.77     protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata]

Match Name      E-value   Identity  Description
Q3ECY6          2.6e-102  37.60     Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1
Q9H7B4          1.9e-04   22.14     Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 S...
Q9CWR2          1.9e-04   22.14     Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 ...
Q557F7          9.6e-04   25.32     SET and MYND domain-containing protein DDB_G0273589 OS=Dictyostelium discoideum ...

Match Name      E-value   Identity  Description
A0A1S3CIT0      5.3e-308  82.29     protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 P...
A0A0A0KAK3      4.5e-299  80.06     SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV...
A0A6J1EY39      2.5e-273  75.77     protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143...
A0A6J1I954      4.0e-271  75.50     protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114726...
A0A1S3CJZ3      1.4e-247  83.46     protein SET DOMAIN GROUP 41 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501316 P...

Match Name      E-value   Identity  Description
AT1G43245.1     1.9e-103  37.60     SET domain-containing protein
AT2G17900.1     4.4e-04   28.89     SET domain group 37
AT1G26760.1     9.8e-04   35.71     SET domain protein 35
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description   Source       Source Term    Source Description           Alignment
None       No IPR available  COILS        Coil           Coil                         coord: 640..651
None       No IPR available  GENE3D       2.170.270.10   SET domain                   coord: 256..304; e-value: 9.7E-14; score: 53.8
None       No IPR available  GENE3D       2.170.270.10   SET domain                   coord: 22..211; e-value: 2.2E-14; score: 55.9
None       No IPR available  GENE3D       6.10.140.2220  -                            coord: 36..82; e-value: 2.2E-14; score: 55.9
None       No IPR available  GENE3D       1.10.220.160   -                            coord: 83..172; e-value: 2.2E-14; score: 55.9
None       No IPR available  PANTHER      PTHR47780      PROTEIN SET DOMAIN GROUP 41  coord: 3..649
None       No IPR available  CDD          cd20071        SET_SMYD                     coord: 151..304; e-value: 9.26277E-19; score: 80.4995
None       No IPR available  SUPERFAMILY  82199          SET domain                   coord: 184..302
IPR001214  SET domain        PFAM         PF00856        SET                          coord: 173..277; e-value: 5.4E-5; score: 23.7

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10003629.1  HG10003629.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0005515      protein binding