Spg004474 (gene) Sponge gourd (cylindrica) v1

Overview
NameSpg004474
Typegene
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein SET DOMAIN GROUP 41 isoform X1
Locationscaffold9: 13143777 .. 13146839 (-)
RNA-Seq ExpressionSpg004474
SyntenySpg004474
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCATAAACAAAACCCCTAACCACTGTTGTAGACGAGCAGAGGAGGAGACTGGAGAGGCAGAAATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGCCAACCCTCACCTCCGCTCTCCATGATTCCTTCCTCCTCACTCACTGCTCCTCCTGCTTCTCCCCTCTCCCCAATCCCCCAATTTCCCATTCCAATCTCCTCCGCTACTGCTCCACCAAATGCTACGATTCCGATTCCGCCACCGCCGCCTTCTTCTCCGCCGACCATCTTCCCTTCTCCGACACCGCTGACTTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCCTGGCACTCTGCTCCTCCCGAGCGCCTCTTTGGCCTTCTCACCAACCGCGAGAAATTGATGCTCGCCGAAGAAGATTCCGAGGTCCTCGTCAGGATTCGGCAAGGGGCCGACGCCATGGCCGCTTTCAGAAGGACGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCCGTTCTATGTCTCGTGATTACCAACGCTGTGGAGGTTCAGGATTCACTCGGCCGCACCGTCGGAATCGCTGTGTACCATCCTATCTTCTGCTGGATCAATCACAGTTGTTCTCCCAACGCATGTTACAGATTTGAAGCTCCGTCGGATTCCTTCAAGACGAGGCTGCAGATTTCCCCCAAATGCACTGACCTTGAGACTGATGAAGGAAGTTCTAATCAAGTAATCGTTTGAACCACGAATAATTTGGTGTTTGTGAGCGTTTATCTCTCTCCCTCTCGTGATGTTAGTGAGAGCTTTTGCATGAATGTTTTTGGCAGATAGGTACTGTTCGTAGCGACATGTTGGATTTCATAAGAAAAGGTGTGTTTCTTCACTTTTGCATTCAGTTAAATGCCATAAATATTGTCTCTGCTTGTATGTTTATGTGAATTTTGATGAAGATTTTCAGGGTGGTCCAAGAGTTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGCGAGGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGTATTGTGGTAATATCTTAATTTAATTCCGGAGGGAAATTTTAGTACTAAGTCTTACATGAGCATGCTAAATTTTATTAAATTTCAGCTCAGAGTTACAGAGTACATGAATATTGTGCTTGTTGAATATCATTGTTTATAGGGTCTGGGATGTTTGGACACTCGAAAGCAAGTTCAAGCTTATAACCATCATGGGTTGGCCCTAATGGTCAATAAGGGCCATAAAAAAAATAAAGGGCTTTAAGGGAATGGGTTCAATCCATGGTGGCCACCTACCTAGGATATTTAATATCCTATGAGTTTCCTTGGCAACCAAATGTAGTAGGGTCAGGCGGTTGTCCCGTGAGATTAGTCGAGGTGCGTAAGCTGGCCCGGACACTCACGGATATCAAAAAAAAAAAAAAAAAGTTCAAGCTTATAGGTGGAGAACTTAATATGTTGAAACCTTTATTGTCTCTGTAACTGTTCTTGGGTTGAGCACAGCCTCCCCAAACACCATAGAAGTGGGCCGGATACCCGGTTAACAAGAAATTATTTCTTGTTGAGGCACTATGATGAAATGGGATGAATGTGTAACTGAAAAATTTGTTGATTTTGTCTTAGGCAATGAGGCAGTCAGAGTTATGGCCAAGGTATCAATTTTTCTGTTGTTGCCAGCGATGTAGTGCCAAGTCCCTAACTTATGTGGACCATGCTTTGCAAGTAAGAATTTAGTGACTTAGTCGTGGACTTCTATATATCTCATTGTGTATTGCTTTGCTATGGTCTTAAAAACAGTATGTGTTTTGAATTTCTTCAGGAAATTTCTGCTGTCAAAGTGGAAATGTTTGTTGATTCAACTTCCATTAGCAACTTTGACCACGACGATGCAGTGAGAAGAATAAACGATTATGTCGACAGTGCAATTGCCCAATACCTATCTATTGGTTCTCCTGAATCATGTTGTGAGAAGCTTGAAAACTTGCTTACTTTAGGGTTCTGTTACAAGCAAGAGGAAGATGAGGAAGGAAAACAGCTGGTTAATTTGAGGCTGCATCCCCTGAACTACCTGTCGCTTAATGCATACACAGCTCTCGCATCGGCTTATAAAGTCTGTTCATGTGATTTATTGGCTTTGAATTCCAAAATGGATGACGACGACGAACATCAACGTAATGCATCGACCATGAGCAAAACAACTGCTGCATACTCCTTGTTCCTTGCAGGTGCTACACACCATCTTTTTCTTTCTGATCCATCCTTAATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTCTGCTTATTCTTGCTAGAAGCAGCTCATTATGGGCTACTACTAACATGTCAAAATGGAGTTTCCCTGTGGAGAAAAGAATGTGCTCTAATTGCTCATGGGTCGATAAGTTCAATGAAAGTAGAATCCACCATCGATCTTTAAAAGGCGATTTTCGCGAGTTTTCAATTGGTATTTCAAATTGCATTGCTAATATTGCACAAAAATCTTGGAGCTTTCTGACTCATGACTGCCCATATTTGAAGGCTTTCATTGATCCCTTTGATTTCAGCTGGCCAAAGATAACCACAGCGTATTCGAATAAATGCAATATACGGGCTCATAGCATCGATCGTTCGTGTGCTTGTAGTAAAGCTAGAGAGGTGGTTTGTCAGTGTGAACTTCATGTGCATTCTGACCAAGAGAGGCAAGCAATCTTTGATCTCGGTATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGCCACCATTCACATTTGGCATCTCAGATTCAGAATATTTTAGAGTAGATGAATCGCTAACAGTTGTTATCCAATAGGATTCTAAATTGTTCTGAGATTGAAACTTCCATCCCACGGGTTTACTGAAGAGAACCTTCCAGTTGTATAAAATCAGGATTTTAGTTCTTGTTTTAAAGGCGTTGGCATGGTTTGCTAGTATTTTAAAGACTAGTCTGACAATCTAAG

mRNA sequence

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGCCAACCCTCACCTCCGCTCTCCATGATTCCTTCCTCCTCACTCACTGCTCCTCCTGCTTCTCCCCTCTCCCCAATCCCCCAATTTCCCATTCCAATCTCCTCCGCTACTGCTCCACCAAATGCTACGATTCCGATTCCGCCACCGCCGCCTTCTTCTCCGCCGACCATCTTCCCTTCTCCGACACCGCTGACTTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCCTGGCACTCTGCTCCTCCCGAGCGCCTCTTTGGCCTTCTCACCAACCGCGAGAAATTGATGCTCGCCGAAGAAGATTCCGAGGTCCTCGTCAGGATTCGGCAAGGGGCCGACGCCATGGCCGCTTTCAGAAGGACGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCCGTTCTATGTCTCGTGATTACCAACGCTGTGGAGGTTCAGGATTCACTCGGCCGCACCGTCGGAATCGCTGTGTACCATCCTATCTTCTGCTGGATCAATCACAGTTGTTCTCCCAACGCATGTTACAGATTTGAAGCTCCGTCGGATTCCTTCAAGACGAGGCTGCAGATTTCCCCCAAATGCACTGACCTTGAGACTGATGAAGGAAGTTCTAATCAAATAGGTACTGTTCGTAGCGACATGTTGGATTTCATAAGAAAAGATTTTCAGGGTGGTCCAAGAGTTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGCGAGGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGCAATGAGGCAGTCAGAGTTATGGCCAAGGTATCAATTTTTCTGTTGTTGCCAGCGATGTAGTGCCAAGTCCCTAACTTATGTGGACCATGCTTTGCAAGAAATTTCTGCTGTCAAAGTGGAAATGTTTGTTGATTCAACTTCCATTAGCAACTTTGACCACGACGATGCAGTGAGAAGAATAAACGATTATGTCGACAGTGCAATTGCCCAATACCTATCTATTGGTTCTCCTGAATCATGTTGTGAGAAGCTTGAAAACTTGCTTACTTTAGGGTTCTGTTACAAGCAAGAGGAAGATGAGGAAGGAAAACAGCTGGTTAATTTGAGGCTGCATCCCCTGAACTACCTGTCGCTTAATGCATACACAGCTCTCGCATCGGCTTATAAAGTCTGTTCATGTGATTTATTGGCTTTGAATTCCAAAATGGATGACGACGACGAACATCAACGTAATGCATCGACCATGAGCAAAACAACTGCTGCATACTCCTTGTTCCTTGCAGGTGCTACACACCATCTTTTTCTTTCTGATCCATCCTTAATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTCTGCTTATTCTTGCTAGAAGCAGCTCATTATGGGCTACTACTAACATGTCAAAATGGAGTTTCCCTGTGGAGAAAAGAATGTGCTCTAATTGCTCATGGGTCGATAAGTTCAATGAAAGTAGAATCCACCATCGATCTTTAAAAGGCGATTTTCGCGAGTTTTCAATTGGTATTTCAAATTGCATTGCTAATATTGCACAAAAATCTTGGAGCTTTCTGACTCATGACTGCCCATATTTGAAGGCTTTCATTGATCCCTTTGATTTCAGCTGGCCAAAGATAACCACAGCGTATTCGAATAAATGCAATATACGGGCTCATAGCATCGATCGTTCGTGTGCTTGTAGTAAAGCTAGAGAGGTGGTTTGTCAGTGTGAACTTCATGTGCATTCTGACCAAGAGAGGCAAGCAATCTTTGATCTCGGTATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGCCACCATTCACATTTGGCATCTCAGATTCAGAATATTTTAGAGTAG

Coding sequence (CDS)

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCATTGCCAACCCTCACCTCCGCTCTCCATGATTCCTTCCTCCTCACTCACTGCTCCTCCTGCTTCTCCCCTCTCCCCAATCCCCCAATTTCCCATTCCAATCTCCTCCGCTACTGCTCCACCAAATGCTACGATTCCGATTCCGCCACCGCCGCCTTCTTCTCCGCCGACCATCTTCCCTTCTCCGACACCGCTGACTTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCCTGGCACTCTGCTCCTCCCGAGCGCCTCTTTGGCCTTCTCACCAACCGCGAGAAATTGATGCTCGCCGAAGAAGATTCCGAGGTCCTCGTCAGGATTCGGCAAGGGGCCGACGCCATGGCCGCTTTCAGAAGGACGAACTCTGCCGATATTCGCCATGGAAACGCCTTGGAAGAGGCCGTTCTATGTCTCGTGATTACCAACGCTGTGGAGGTTCAGGATTCACTCGGCCGCACCGTCGGAATCGCTGTGTACCATCCTATCTTCTGCTGGATCAATCACAGTTGTTCTCCCAACGCATGTTACAGATTTGAAGCTCCGTCGGATTCCTTCAAGACGAGGCTGCAGATTTCCCCCAAATGCACTGACCTTGAGACTGATGAAGGAAGTTCTAATCAAATAGGTACTGTTCGTAGCGACATGTTGGATTTCATAAGAAAAGATTTTCAGGGTGGTCCAAGAGTTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGCGAGGCAGTCACAATCGCATACTGTGACTTGTTGCAACCTAAGGCAATGAGGCAGTCAGAGTTATGGCCAAGGTATCAATTTTTCTGTTGTTGCCAGCGATGTAGTGCCAAGTCCCTAACTTATGTGGACCATGCTTTGCAAGAAATTTCTGCTGTCAAAGTGGAAATGTTTGTTGATTCAACTTCCATTAGCAACTTTGACCACGACGATGCAGTGAGAAGAATAAACGATTATGTCGACAGTGCAATTGCCCAATACCTATCTATTGGTTCTCCTGAATCATGTTGTGAGAAGCTTGAAAACTTGCTTACTTTAGGGTTCTGTTACAAGCAAGAGGAAGATGAGGAAGGAAAACAGCTGGTTAATTTGAGGCTGCATCCCCTGAACTACCTGTCGCTTAATGCATACACAGCTCTCGCATCGGCTTATAAAGTCTGTTCATGTGATTTATTGGCTTTGAATTCCAAAATGGATGACGACGACGAACATCAACGTAATGCATCGACCATGAGCAAAACAACTGCTGCATACTCCTTGTTCCTTGCAGGTGCTACACACCATCTTTTTCTTTCTGATCCATCCTTAATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTCTGCTTATTCTTGCTAGAAGCAGCTCATTATGGGCTACTACTAACATGTCAAAATGGAGTTTCCCTGTGGAGAAAAGAATGTGCTCTAATTGCTCATGGGTCGATAAGTTCAATGAAAGTAGAATCCACCATCGATCTTTAAAAGGCGATTTTCGCGAGTTTTCAATTGGTATTTCAAATTGCATTGCTAATATTGCACAAAAATCTTGGAGCTTTCTGACTCATGACTGCCCATATTTGAAGGCTTTCATTGATCCCTTTGATTTCAGCTGGCCAAAGATAACCACAGCGTATTCGAATAAATGCAATATACGGGCTCATAGCATCGATCGTTCGTGTGCTTGTAGTAAAGCTAGAGAGGTGGTTTGTCAGTGTGAACTTCATGTGCATTCTGACCAAGAGAGGCAAGCAATCTTTGATCTCGGTATCCATTGCTTATTCTATGGGGGCTATTTAGCAAGTATTTGTTATGGCCACCATTCACATTTGGCATCTCAGATTCAGAATATTTTAGAGTAG

Protein sequence

MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSTKCYDSDSATAAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREKLMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRTVGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIGTVRSDMLDFIRKDFQGGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCCCQRCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLSIGSPESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSCDLLALNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLILARSSSLWATTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGISNCIANIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKAREVVCQCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNILE
Homology
BLAST of Spg004474 vs. NCBI nr
Match: XP_038886411.1 (protein SET DOMAIN GROUP 41 [Benincasa hispida])

HSP 1 Score: 953.0 bits (2462), Expect = 1.4e-273
Identity = 488/652 (74.85%), Postives = 541/652 (82.98%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCST 60
           MEMEM AMEDIEMAEDITPPL  LTSALHDSFL THCSSCFS LPNPPISHSNLLRYCS 
Sbjct: 1   MEMEMIAMEDIEMAEDITPPLLPLTSALHDSFLFTHCSSCFSLLPNPPISHSNLLRYCSP 60

Query: 61  KC--YDSDSATAAFFSADHL--PFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLT 120
           KC    SD  TAAFFS      PFS T+D RASLRLLHLLLS P A  S PPER+FGLLT
Sbjct: 61  KCSLSHSDPLTAAFFSTHPFPSPFSYTSDLRASLRLLHLLLSHPPASLSPPPERIFGLLT 120

Query: 121 NREKLMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDS 180
           NR KLM  + D+E+  ++R+G DA+AA     SADI HG+ L EA LCLV TNAV+V DS
Sbjct: 121 NRHKLMFPQHDAELFPKLREGVDAIAALL---SADIPHGHTLAEAALCLVFTNAVDVHDS 180

Query: 181 LGRTVGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIG 240
            GRT+GIAVY P FCWINHSCSPNACYRFE  S S  TR +I+P CTDL T +GS +Q+G
Sbjct: 181 TGRTIGIAVYPPTFCWINHSCSPNACYRFETSSASTTTRSRIAPSCTDLLTGQGSCSQMG 240

Query: 241 TVRSDMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFF 300
           TVRS++ DFI +DFQG GPRV+VRSIKSIR+GEAVTIAYCDLLQPKAMRQSELW RYQF 
Sbjct: 241 TVRSNLSDFITEDFQGNGPRVMVRSIKSIRRGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300

Query: 301 CCCQRCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLS 360
           C CQRCSAK LTYVDHALQE+SA KVE+  DSTSISNFDHD AVRRI+DYV+SAI +YLS
Sbjct: 301 CSCQRCSAKPLTYVDHALQELSASKVELH-DSTSISNFDHDKAVRRIDDYVNSAITEYLS 360

Query: 361 IGSPESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSC 420
           IGSPESCCEKL NLLTLGF  +Q ED E KQ VNLRLHPL++LSLN YTALASAYKV SC
Sbjct: 361 IGSPESCCEKLRNLLTLGFYDEQAEDGEQKQPVNLRLHPLHFLSLNVYTALASAYKVRSC 420

Query: 421 DLLALNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGE 480
           DLLAL+S+MD D+E Q NASTM K +AAYSLFLAGATHHLFLS+PSLI SA+ CWV+AGE
Sbjct: 421 DLLALSSEMDCDNEDQCNASTMCKASAAYSLFLAGATHHLFLSEPSLIVSASTCWVLAGE 480

Query: 481 SLLILARSSSLWATTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGIS 540
           SLL LAR S LWATTN SKW FPV KRMCS CSWVDKFN SRIH + ++ DFREFSIGIS
Sbjct: 481 SLLTLARHSLLWATTNTSKWGFPVGKRMCSTCSWVDKFNASRIHGQPIEADFREFSIGIS 540

Query: 541 NCIANIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKAR 600
           NCIAN+++KSWSFLTH CPYLKAF DPF+FSWPK+   YS+  +IRAHSIDR CACS ++
Sbjct: 541 NCIANMSRKSWSFLTHGCPYLKAFTDPFNFSWPKMIPMYSSDRDIRAHSIDRLCACSNSK 600

Query: 601 EVVCQCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNIL 648
           +V  QCE   HS+QER++I  LGIHCLFYGGYLASICYGHHSHLASQIQNIL
Sbjct: 601 DVCFQCEPQ-HSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNIL 647

BLAST of Spg004474 vs. NCBI nr
Match: XP_023520942.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 944.1 bits (2439), Expect = 6.3e-271
Identity = 488/648 (75.31%), Postives = 541/648 (83.49%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCST 60
           MEMEMRAMEDIEMAEDITPPLP LT+ALHD+FLLTHCSSCFSPLPN  ISHSNLLRYCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFLLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCYDSDSATAAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREK 120
            C  SDS TAA FS    PFSDT+D RASLRLLHLLLSDPSAW SAPPER+FGLLTNREK
Sbjct: 61  ICSHSDSLTAAVFSTGQFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRT 180
           LMLA++DSEV V+IR+G+DAMAA RRTNSADIR+ NALEEA+LCLV+TNAVEVQDS+GRT
Sbjct: 121 LMLADDDSEVFVKIREGSDAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGRT 180

Query: 181 VGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIGTVRS 240
           +GIAVYHP FCWINHSCSPNACYRFE PSDS KTRL+ISP CTD+ T EGS +Q+ TVR 
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 DMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCCCQ 300
           +   FI KDFQG GPRV+VRSIKSIR GEAVTIAYCDLLQPKAMRQSEL  RY+F C CQ
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSIRNGEAVTIAYCDLLQPKAMRQSELRSRYKFVCSCQ 300

Query: 301 RCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLSIGSP 360
           RCSAK  TYVDHALQEISAV VE+ +DSTSISNFD+D A+ RI+DYV++AIA+YLSIGS 
Sbjct: 301 RCSAKPPTYVDHALQEISAVNVEL-LDSTSISNFDYDTAIARIDDYVNNAIAEYLSIGSS 360

Query: 361 ESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSCDLLA 420
           ESCCEKL+NLLTLGF  +Q ED +GKQL+NLRLHP+++L LNAYTALASAYKV S     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSW---- 420

Query: 421 LNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLI 480
                 + DE+Q NA TMSKT+AAYSLFLAGATHHLFLS+PSLIASAANCWVVAGESLLI
Sbjct: 421 ------NGDENQCNA-TMSKTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLI 480

Query: 481 LARSSSLWATTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGISNCIA 540
           L + SSLW  +N SK S P+ +  C NCSWVDKFN SRIH RS++ DFREFSIGISNCIA
Sbjct: 481 LVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEADFREFSIGISNCIA 540

Query: 541 NIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKAREVVC 600
           NI+QK WSFL H+C YLKAF DPFDFSWPK  T  SN         DRSC CSK ++V  
Sbjct: 541 NISQKYWSFLAHECSYLKAFTDPFDFSWPKTITTCSN-------YRDRSCDCSKIQDV-- 600

Query: 601 QCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNIL 648
                  SDQ+RQ+IF+LGIHCLFYGGYLASICYGHHSHLASQIQ IL
Sbjct: 601 -------SDQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIQCIL 619

BLAST of Spg004474 vs. NCBI nr
Match: XP_022974027.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima])

HSP 1 Score: 932.9 bits (2410), Expect = 1.4e-267
Identity = 478/649 (73.65%), Postives = 537/649 (82.74%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCST 60
           MEME+RAMEDIEMAEDITPPLP LT+ALHDSFLLTHCSSCFSPLPN PISHSNLLRYCS 
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  KCYDSDSATAAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREK 120
            C  SDS TAA FS DH  FSDT+D RASLRLLHLLLSD SAW S PPER+FGLLTNREK
Sbjct: 61  ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120

Query: 121 LMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRT 180
           LMLA++DSEV  +IR+GADA+A  RRTNSADIR+ NALEEA++CLV+TNAVEVQDS+G+T
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180

Query: 181 VGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIGTVRS 240
           +GIAVYHP FCWINHSCSPNACYRFE PSDS KTRL+ISP CTD+ T EGS +Q+ TVR 
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 DMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCCCQ 300
           +   FI KDFQG GPRV+VRSIKSIRKGEAVTIAYCDLLQPKAMRQSEL  RY+F C CQ
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFVCSCQ 300

Query: 301 RCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLSIGSP 360
           RCSAK  TYVDHALQEI AV VE  +DSTSISNFD+D A+ RI+DYV++AIA+YLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSCDLLA 420
           ESCCEKL+NLLTLGF  +Q +D +GKQL+NLRLHP+++L LN YTALASAYKV S     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSW---- 420

Query: 421 LNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLI 480
                 +D+E+Q N STMSKT+AAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLL 
Sbjct: 421 ------NDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLR 480

Query: 481 LARSSSLWATTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGISNCIA 540
           L R SSLW  +N SK S P+ +  C NCSWVDKFN SRIH RS++ DF+EFSIGISNCIA
Sbjct: 481 LVRHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFSIGISNCIA 540

Query: 541 NIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKAREVVC 600
           NI+ K WSFLTH+CPYLKAF DPFDFSWPK  T  SN         DR C  SK ++V  
Sbjct: 541 NISHKYWSFLTHECPYLKAFTDPFDFSWPKTITTCSN-------YRDRLCDYSKIQDV-- 600

Query: 601 QCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNILE 649
                  SDQ+RQ+IF+LGIHCLFYGGYLASICYGH SHL+SQIQ IL+
Sbjct: 601 -------SDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQCILQ 622

BLAST of Spg004474 vs. NCBI nr
Match: XP_022932824.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata])

HSP 1 Score: 931.4 bits (2406), Expect = 4.2e-267
Identity = 481/648 (74.23%), Postives = 539/648 (83.18%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCST 60
           MEMEMRAMEDIEMAEDITPPLP LT+ALHD+F LTHCSSCFSPLPN  ISHSNLLRYCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCYDSDSATAAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREK 120
            C  SDS TAA FS DH PFSDT+D RASLRLLHLLLSD SAW SAPPER+FGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRT 180
           LMLAE+DSEV V+IR+GADAMAA RRTNSADIR+ NALEEA+LCLV+TNAVEVQDS+G+T
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 VGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIGTVRS 240
           +GIAVYHP FCWINHSCSPNACYRFE PSDS  TRL+ISP CTD+ T EGS NQ+ TVR 
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 DMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCCCQ 300
           +   FI KDFQG GPRV+VRSIKS+RKGEAVTIAYCDLLQPKA+RQSEL  RY+F C CQ
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCSCQ 300

Query: 301 RCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLSIGSP 360
           RCSAK  TYVDHALQEISA  VE+ +DSTSISNFD+D A+RRI+DYV++AIA+YLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEISAFNVEL-LDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSCDLLA 420
           ESCCEKL+NLLTLGF  +Q ED +GKQL+NLRLHP+++L LN YTALASAYKV S     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW---- 420

Query: 421 LNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLI 480
                 +DDE+Q NA TMSKT+AAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLLI
Sbjct: 421 ------NDDENQCNA-TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLI 480

Query: 481 LARSSSLWATTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGISNCIA 540
           L + SSLW  +N SK S P+ +  C NCSWVDKFN +RIH RS++ DFREFSIGISNCIA
Sbjct: 481 LVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCIA 540

Query: 541 NIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKAREVVC 600
           +I+ K WSFL H+C YLKAF DPFDFSWPK  T   N      H   RSC CSK ++V  
Sbjct: 541 DISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLN-----YHG--RSCDCSKIQDV-- 600

Query: 601 QCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNIL 648
                  S+Q+RQ+IF+LGIHCLFYGGYLASICYGH SHLASQI+ IL
Sbjct: 601 -------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECIL 619

BLAST of Spg004474 vs. NCBI nr
Match: XP_008463080.1 (PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo])

HSP 1 Score: 928.3 bits (2398), Expect = 3.6e-266
Identity = 478/653 (73.20%), Postives = 540/653 (82.70%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSTKC 62
           MEMRA+EDIEMAEDITPPL  LTSALHDSFL THCSSCFS LPNPPISHS LL YCS KC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  --YDSDSATAAFFSADHLP--FSDTADFRASLRL--LHLLLSDPSAWHSAPPERLFGLLT 122
               SD  TAAFFS   LP   SDT+D RASLRL  LHLLLS PS   S PP R+FGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NREKLMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDS 182
           NR KLM  +  SEV +++R+ A+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 LGRTVGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIG 242
           +G+T+GIAVY P F WINHSCSPNACYRFE PSD F TR +I+P CTD  +DEG+  Q+G
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVRSDMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFF 302
            VRS++LDF+R+DFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELW RYQF 
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CCCQRCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLS 362
           C CQRCSA  LTYVDHALQEISAVKVE+ +DS  ISNFDHD AVRRI++YVD+AI +YLS
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVEL-LDSAPISNFDHDTAVRRIDEYVDNAITEYLS 360

Query: 363 IGSPESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSC 422
           IGSPESCCEKL+NLLT GF  +Q ED EGKQ V+LRLHP ++L LNAYTAL SAYKV SC
Sbjct: 361 IGSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSC 420

Query: 423 DLLALNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGE 482
           DLLAL+S+MD D+E++ NA TMSKT+AAY+LFLAGATHHLFL +PSLIASAANCWVVAGE
Sbjct: 421 DLLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGE 480

Query: 483 SLLILARSSSLWA-TTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGI 542
           SLLILAR SSLWA TTN S W FP+ KRMCSNCSWVD+FN SRIH R ++ DFREFSIGI
Sbjct: 481 SLLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGI 540

Query: 543 SNCIANIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKA 602
           SNCIA+I++K WSFLTH CPYLKAF DPFDFSWPK     +N  +I  H IDRSCACSK 
Sbjct: 541 SNCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACSKT 600

Query: 603 REVVCQCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNIL 648
           +++  +CE    S+QER++I  LGIHCL+YGGYLASICYG+HSHLASQIQNIL
Sbjct: 601 KDICFECEPQ-DSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNIL 646

BLAST of Spg004474 vs. ExPASy Swiss-Prot
Match: Q3ECY6 (Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 1.4e-95
Identity = 243/653 (37.21%), Postives = 330/653 (50.54%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSTKC 62
           ME+RA EDIE+  D+ PPL  L S+L+DSFL +HCSSCFS LP  P        YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60

Query: 63  YDSDSAT-AAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREKL 122
             +DS T +  F  +  P    +D R SL LL+    D     S+ P RL  LLTN   L
Sbjct: 61  SLTDSFTNSPQFPPEITPIL-PSDIRTSLHLLNSTAVDT----SSSPHRLNNLLTNHHLL 120

Query: 123 MLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRTV 182
           M    D  + V I   A+ +A   R+N    R    LEEA +C V+TNAVEV DS G  +
Sbjct: 121 M---ADPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLAL 180

Query: 183 GIAVYHPIFCWINHSCSPNACYRFEAPSDSF-KTRLQISPKCTDLETDE---GSSNQIGT 242
           GIA+Y+  F WINHSCSPN+CYRF     S+    +  +   ++LE  E   G+S   G 
Sbjct: 181 GIALYNSSFSWINHSCSPNSCYRFVNNRTSYHDVHVTNTETSSNLELQEQVCGTSLNSGN 240

Query: 243 VRSDMLDFIRKDFQGGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCC 302
                          GP+++VRSIK I+ GE +T++Y DLLQP  +RQS+LW +Y+F C 
Sbjct: 241 -------------GNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCN 300

Query: 303 CQRCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFD----HDDAVRRINDYVDSAIAQY 362
           C RC+A    YVD  L+ +  ++ E     T++ +FD     D+AV ++NDY+  AI  +
Sbjct: 301 CGRCAASPPAYVDSILEGVLTLESE----KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDF 360

Query: 363 LSIG-SPESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKV 422
           LS    P++CCE +E++L  G  +K     E  Q   LRLH  +Y++LNAY  LA+AY++
Sbjct: 361 LSDNIDPKTCCEMIESVLHHGIQFK-----EDSQPHCLRLHACHYVALNAYITLATAYRI 420

Query: 423 CSCDLLALNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVV 482
            S             D        MS+ +AAYSLFLAG +HHLF ++ S   SAA  W  
Sbjct: 421 RSI------------DSETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKN 480

Query: 483 AGESLLILARSSSLWATTNMSKWSFPVEKRM-CSNCSWVDKFNESRIHHRSLKGDFREFS 542
           AGE L  LA    +            VE  + C+ C  ++  N  R        D +E S
Sbjct: 481 AGELLFDLAPKLLM---------ELSVESDVKCTKCLMLETSNSHR--------DIKEKS 540

Query: 543 IGISNCIANIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCAC 602
             I +C+ +I+Q +WSFLT  CPYL+ F  P DFS  +                      
Sbjct: 541 RQILSCVRDISQVTWSFLTRGCPYLEKFRSPVDFSLTRTNG------------------- 557

Query: 603 SKAREVVCQCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQ 645
                     E    S  +   +  L  HCL Y   L  +CYG  SHL S+ +
Sbjct: 601 ----------EREESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557

BLAST of Spg004474 vs. ExPASy Swiss-Prot
Match: Q9CWR2 (Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 SV=1)

HSP 1 Score: 47.8 bits (112), Expect = 5.6e-04
Identity = 33/139 (23.74%), Postives = 53/139 (38.13%), Query Frame = 0

Query: 166 VITNAVEVQDSLGRTVGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDL 225
           VI N+  + ++  + VG+ +Y P    +NHSC PN    F                    
Sbjct: 178 VICNSFTICNAEMQEVGVGLY-PSMSLLNHSCDPNCSIVF-------------------- 237

Query: 226 ETDEGSSNQIGTVRSDMLDFIRKDFQGGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQ 285
                                      GP +++R+++ I  GE +TI Y D+L     R+
Sbjct: 238 --------------------------NGPHLLLRAVREIEAGEELTICYLDMLMTSEERR 269

Query: 286 SELWPRYQFFCCCQRCSAK 305
            +L  +Y F C C RC  +
Sbjct: 298 KQLRDQYCFECDCIRCQTQ 269

BLAST of Spg004474 vs. ExPASy Swiss-Prot
Match: Q9H7B4 (Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 SV=4)

HSP 1 Score: 47.4 bits (111), Expect = 7.3e-04
Identity = 33/139 (23.74%), Postives = 53/139 (38.13%), Query Frame = 0

Query: 166 VITNAVEVQDSLGRTVGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDL 225
           VI N+  + ++  + VG+ +Y P    +NHSC PN    F                    
Sbjct: 178 VICNSFTICNAEMQEVGVGLY-PSISLLNHSCDPNCSIVF-------------------- 237

Query: 226 ETDEGSSNQIGTVRSDMLDFIRKDFQGGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQ 285
                                      GP +++R+++ I  GE +TI Y D+L     R+
Sbjct: 238 --------------------------NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERR 269

Query: 286 SELWPRYQFFCCCQRCSAK 305
            +L  +Y F C C RC  +
Sbjct: 298 KQLRDQYCFECDCFRCQTQ 269

BLAST of Spg004474 vs. ExPASy TrEMBL
Match: A0A6J1I954 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)

HSP 1 Score: 932.9 bits (2410), Expect = 7.0e-268
Identity = 478/649 (73.65%), Postives = 537/649 (82.74%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCST 60
           MEME+RAMEDIEMAEDITPPLP LT+ALHDSFLLTHCSSCFSPLPN PISHSNLLRYCS 
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  KCYDSDSATAAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREK 120
            C  SDS TAA FS DH  FSDT+D RASLRLLHLLLSD SAW S PPER+FGLLTNREK
Sbjct: 61  ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120

Query: 121 LMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRT 180
           LMLA++DSEV  +IR+GADA+A  RRTNSADIR+ NALEEA++CLV+TNAVEVQDS+G+T
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180

Query: 181 VGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIGTVRS 240
           +GIAVYHP FCWINHSCSPNACYRFE PSDS KTRL+ISP CTD+ T EGS +Q+ TVR 
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 DMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCCCQ 300
           +   FI KDFQG GPRV+VRSIKSIRKGEAVTIAYCDLLQPKAMRQSEL  RY+F C CQ
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFVCSCQ 300

Query: 301 RCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLSIGSP 360
           RCSAK  TYVDHALQEI AV VE  +DSTSISNFD+D A+ RI+DYV++AIA+YLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSCDLLA 420
           ESCCEKL+NLLTLGF  +Q +D +GKQL+NLRLHP+++L LN YTALASAYKV S     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSW---- 420

Query: 421 LNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLI 480
                 +D+E+Q N STMSKT+AAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLL 
Sbjct: 421 ------NDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLR 480

Query: 481 LARSSSLWATTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGISNCIA 540
           L R SSLW  +N SK S P+ +  C NCSWVDKFN SRIH RS++ DF+EFSIGISNCIA
Sbjct: 481 LVRHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFSIGISNCIA 540

Query: 541 NIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKAREVVC 600
           NI+ K WSFLTH+CPYLKAF DPFDFSWPK  T  SN         DR C  SK ++V  
Sbjct: 541 NISHKYWSFLTHECPYLKAFTDPFDFSWPKTITTCSN-------YRDRLCDYSKIQDV-- 600

Query: 601 QCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNILE 649
                  SDQ+RQ+IF+LGIHCLFYGGYLASICYGH SHL+SQIQ IL+
Sbjct: 601 -------SDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQCILQ 622

BLAST of Spg004474 vs. ExPASy TrEMBL
Match: A0A6J1EY39 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)

HSP 1 Score: 931.4 bits (2406), Expect = 2.0e-267
Identity = 481/648 (74.23%), Postives = 539/648 (83.18%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCST 60
           MEMEMRAMEDIEMAEDITPPLP LT+ALHD+F LTHCSSCFSPLPN  ISHSNLLRYCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCYDSDSATAAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREK 120
            C  SDS TAA FS DH PFSDT+D RASLRLLHLLLSD SAW SAPPER+FGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRT 180
           LMLAE+DSEV V+IR+GADAMAA RRTNSADIR+ NALEEA+LCLV+TNAVEVQDS+G+T
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 VGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIGTVRS 240
           +GIAVYHP FCWINHSCSPNACYRFE PSDS  TRL+ISP CTD+ T EGS NQ+ TVR 
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 DMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCCCQ 300
           +   FI KDFQG GPRV+VRSIKS+RKGEAVTIAYCDLLQPKA+RQSEL  RY+F C CQ
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCSCQ 300

Query: 301 RCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLSIGSP 360
           RCSAK  TYVDHALQEISA  VE+ +DSTSISNFD+D A+RRI+DYV++AIA+YLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEISAFNVEL-LDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSCDLLA 420
           ESCCEKL+NLLTLGF  +Q ED +GKQL+NLRLHP+++L LN YTALASAYKV S     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW---- 420

Query: 421 LNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLI 480
                 +DDE+Q NA TMSKT+AAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLLI
Sbjct: 421 ------NDDENQCNA-TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLI 480

Query: 481 LARSSSLWATTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGISNCIA 540
           L + SSLW  +N SK S P+ +  C NCSWVDKFN +RIH RS++ DFREFSIGISNCIA
Sbjct: 481 LVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCIA 540

Query: 541 NIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKAREVVC 600
           +I+ K WSFL H+C YLKAF DPFDFSWPK  T   N      H   RSC CSK ++V  
Sbjct: 541 DISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLN-----YHG--RSCDCSKIQDV-- 600

Query: 601 QCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNIL 648
                  S+Q+RQ+IF+LGIHCLFYGGYLASICYGH SHLASQI+ IL
Sbjct: 601 -------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECIL 619

BLAST of Spg004474 vs. ExPASy TrEMBL
Match: A0A1S3CIT0 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)

HSP 1 Score: 928.3 bits (2398), Expect = 1.7e-266
Identity = 478/653 (73.20%), Postives = 540/653 (82.70%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSTKC 62
           MEMRA+EDIEMAEDITPPL  LTSALHDSFL THCSSCFS LPNPPISHS LL YCS KC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  --YDSDSATAAFFSADHLP--FSDTADFRASLRL--LHLLLSDPSAWHSAPPERLFGLLT 122
               SD  TAAFFS   LP   SDT+D RASLRL  LHLLLS PS   S PP R+FGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NREKLMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDS 182
           NR KLM  +  SEV +++R+ A+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 LGRTVGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIG 242
           +G+T+GIAVY P F WINHSCSPNACYRFE PSD F TR +I+P CTD  +DEG+  Q+G
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVRSDMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFF 302
            VRS++LDF+R+DFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELW RYQF 
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CCCQRCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLS 362
           C CQRCSA  LTYVDHALQEISAVKVE+ +DS  ISNFDHD AVRRI++YVD+AI +YLS
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVEL-LDSAPISNFDHDTAVRRIDEYVDNAITEYLS 360

Query: 363 IGSPESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSC 422
           IGSPESCCEKL+NLLT GF  +Q ED EGKQ V+LRLHP ++L LNAYTAL SAYKV SC
Sbjct: 361 IGSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSC 420

Query: 423 DLLALNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGE 482
           DLLAL+S+MD D+E++ NA TMSKT+AAY+LFLAGATHHLFL +PSLIASAANCWVVAGE
Sbjct: 421 DLLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGE 480

Query: 483 SLLILARSSSLWA-TTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSIGI 542
           SLLILAR SSLWA TTN S W FP+ KRMCSNCSWVD+FN SRIH R ++ DFREFSIGI
Sbjct: 481 SLLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGI 540

Query: 543 SNCIANIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACSKA 602
           SNCIA+I++K WSFLTH CPYLKAF DPFDFSWPK     +N  +I  H IDRSCACSK 
Sbjct: 541 SNCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPK-----TNDGDIGGHGIDRSCACSKT 600

Query: 603 REVVCQCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNIL 648
           +++  +CE    S+QER++I  LGIHCL+YGGYLASICYG+HSHLASQIQNIL
Sbjct: 601 KDICFECEPQ-DSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNIL 646

BLAST of Spg004474 vs. ExPASy TrEMBL
Match: A0A0A0KAK3 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV=1)

HSP 1 Score: 895.2 bits (2312), Expect = 1.6e-256
Identity = 463/655 (70.69%), Postives = 529/655 (80.76%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCST 60
           MEMEM A+EDIEMAEDI+PPL  LTSALHDSFL THCSSCFS LPNPPISHS  L YCS 
Sbjct: 1   MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60

Query: 61  KC--YDSDSATAAFFSADHLP--FSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLT 120
           KC    SD  T AFFS    P   SDT+D RASLRLLHLLLS PS   S PP+R++GLLT
Sbjct: 61  KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120

Query: 121 NREKLMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDS 180
           NR KLM  + DSEV +++R+GA+A+AA RR N ADI  G ALEEAVLCLV+TNAV+VQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180

Query: 181 LGRTVGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIG 240
           +G+T+GIAVY   F WINHSCSPNACYRFE PSDS  TR +I+P CTD  +DEGS  Q+G
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240

Query: 241 TVRSDMLDFIRKDF---QGGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQ 300
            VRS++LDFIR+       GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELW RYQ
Sbjct: 241 NVRSNILDFIREGALLNGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300

Query: 301 FFCCCQRCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQY 360
           F C CQRCSA  LTYVDHALQEIS+VKVE+ +DST ISNFDHD AVRRI++YVD+AI +Y
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISSVKVEL-LDSTPISNFDHDTAVRRIDEYVDNAITEY 360

Query: 361 LSIGSPESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVC 420
           LS  SPESCCEKL+NLLT GF  +Q ED EGKQ V+LRLHPL++L LNAYTAL SAYKV 
Sbjct: 361 LSTSSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVR 420

Query: 421 SCDLLALNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVA 480
           SCDL+AL+S+MD D+ ++ NA TM KT+AAY+LFLAGATH LFL +PSL+ASAANCWVVA
Sbjct: 421 SCDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVA 480

Query: 481 GESLLILARSSSLWA-TTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSI 540
           GESLLILAR SSLWA TTN S W FP+ KRMC NCSWVD+FN SRIH + ++ DFREFSI
Sbjct: 481 GESLLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSI 540

Query: 541 GISNCIANIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCACS 600
           GISNCIA+I+QK WS LTH CPYLKAF  PFDFSWPK     +N+ +I    ID SCACS
Sbjct: 541 GISNCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPK-----TNEQDICGRGIDHSCACS 600

Query: 601 KAREVVCQCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQNIL 648
           K ++V  +C+    S+QER++I  LGIHCL+YGGYLASICYGHHSHLASQIQNIL
Sbjct: 601 KTQDVCLECKPQ-DSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNIL 648

BLAST of Spg004474 vs. ExPASy TrEMBL
Match: A0A6J1F365 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 2.5e-225
Identity = 408/533 (76.55%), Postives = 456/533 (85.55%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCST 60
           MEMEMRAMEDIEMAEDITPPLP LT+ALHD+F LTHCSSCFSPLPN  ISHSNLLRYCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCYDSDSATAAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREK 120
            C  SDS TAA FS DH PFSDT+D RASLRLLHLLLSD SAW SAPPER+FGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRT 180
           LMLAE+DSEV V+IR+GADAMAA RRTNSADIR+ NALEEA+LCLV+TNAVEVQDS+G+T
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 VGIAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIGTVRS 240
           +GIAVYHP FCWINHSCSPNACYRFE PSDS  TRL+ISP CTD+ T EGS NQ+ TVR 
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 DMLDFIRKDFQG-GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCCCQ 300
           +   FI KDFQG GPRV+VRSIKS+RKGEAVTIAYCDLLQPKA+RQSEL  RY+F C CQ
Sbjct: 241 NFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCSCQ 300

Query: 301 RCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFDHDDAVRRINDYVDSAIAQYLSIGSP 360
           RCSAK  TYVDHALQEISA  VE+ +DSTSISNFD+D A+RRI+DYV++AIA+YLSIGSP
Sbjct: 301 RCSAKPPTYVDHALQEISAFNVEL-LDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSP 360

Query: 361 ESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKVCSCDLLA 420
           ESCCEKL+NLLTLGF  +Q ED +GKQL+NLRLHP+++L LN YTALASAYKV S     
Sbjct: 361 ESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW---- 420

Query: 421 LNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVVAGESLLI 480
                 +DDE+Q NA TMSKT+AAYSLFLAGATHHLFL++PSLIASAANCWVVAGESLLI
Sbjct: 421 ------NDDENQCNA-TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLI 480

Query: 481 LARSSSLWATTNMSKWSFPVEKRMCSNCSWVDKFNESRIHHRSLKGDFREFSI 533
           L + SSLW  +N SK S P+ +  C NCSWVDKFN +RIH RS++ DFREFSI
Sbjct: 481 LVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSI 520

BLAST of Spg004474 vs. TAIR 10
Match: AT1G43245.1 (SET domain-containing protein )

HSP 1 Score: 352.1 bits (902), Expect = 9.8e-97
Identity = 243/653 (37.21%), Postives = 330/653 (50.54%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLPTLTSALHDSFLLTHCSSCFSPLPNPPISHSNLLRYCSTKC 62
           ME+RA EDIE+  D+ PPL  L S+L+DSFL +HCSSCFS LP  P        YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60

Query: 63  YDSDSAT-AAFFSADHLPFSDTADFRASLRLLHLLLSDPSAWHSAPPERLFGLLTNREKL 122
             +DS T +  F  +  P    +D R SL LL+    D     S+ P RL  LLTN   L
Sbjct: 61  SLTDSFTNSPQFPPEITPIL-PSDIRTSLHLLNSTAVDT----SSSPHRLNNLLTNHHLL 120

Query: 123 MLAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRTV 182
           M    D  + V I   A+ +A   R+N    R    LEEA +C V+TNAVEV DS G  +
Sbjct: 121 M---ADPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNGLAL 180

Query: 183 GIAVYHPIFCWINHSCSPNACYRFEAPSDSF-KTRLQISPKCTDLETDE---GSSNQIGT 242
           GIA+Y+  F WINHSCSPN+CYRF     S+    +  +   ++LE  E   G+S   G 
Sbjct: 181 GIALYNSSFSWINHSCSPNSCYRFVNNRTSYHDVHVTNTETSSNLELQEQVCGTSLNSGN 240

Query: 243 VRSDMLDFIRKDFQGGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCC 302
                          GP+++VRSIK I+ GE +T++Y DLLQP  +RQS+LW +Y+F C 
Sbjct: 241 -------------GNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCN 300

Query: 303 CQRCSAKSLTYVDHALQEISAVKVEMFVDSTSISNFD----HDDAVRRINDYVDSAIAQY 362
           C RC+A    YVD  L+ +  ++ E     T++ +FD     D+AV ++NDY+  AI  +
Sbjct: 301 CGRCAASPPAYVDSILEGVLTLESE----KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDF 360

Query: 363 LSIG-SPESCCEKLENLLTLGFCYKQEEDEEGKQLVNLRLHPLNYLSLNAYTALASAYKV 422
           LS    P++CCE +E++L  G  +K     E  Q   LRLH  +Y++LNAY  LA+AY++
Sbjct: 361 LSDNIDPKTCCEMIESVLHHGIQFK-----EDSQPHCLRLHACHYVALNAYITLATAYRI 420

Query: 423 CSCDLLALNSKMDDDDEHQRNASTMSKTTAAYSLFLAGATHHLFLSDPSLIASAANCWVV 482
            S             D        MS+ +AAYSLFLAG +HHLF ++ S   SAA  W  
Sbjct: 421 RSI------------DSETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKN 480

Query: 483 AGESLLILARSSSLWATTNMSKWSFPVEKRM-CSNCSWVDKFNESRIHHRSLKGDFREFS 542
           AGE L  LA    +            VE  + C+ C  ++  N  R        D +E S
Sbjct: 481 AGELLFDLAPKLLM---------ELSVESDVKCTKCLMLETSNSHR--------DIKEKS 540

Query: 543 IGISNCIANIAQKSWSFLTHDCPYLKAFIDPFDFSWPKITTAYSNKCNIRAHSIDRSCAC 602
             I +C+ +I+Q +WSFLT  CPYL+ F  P DFS  +                      
Sbjct: 541 RQILSCVRDISQVTWSFLTRGCPYLEKFRSPVDFSLTRTNG------------------- 557

Query: 603 SKAREVVCQCELHVHSDQERQAIFDLGIHCLFYGGYLASICYGHHSHLASQIQ 645
                     E    S  +   +  L  HCL Y   L  +CYG  SHL S+ +
Sbjct: 601 ----------EREESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557

BLAST of Spg004474 vs. TAIR 10
Match: AT2G17900.1 (SET domain group 37 )

HSP 1 Score: 45.1 bits (105), Expect = 2.6e-04
Identity = 45/180 (25.00%), Postives = 65/180 (36.11%), Query Frame = 0

Query: 123 LAEEDSEVLVRIRQGADAMAAFRRTNSADIRHGNALEEAVLCLVITNAVEVQDSLGRTVG 182
           ++E D + ++   Q A+ +    +  S D+R          C    NA  + DS  R  G
Sbjct: 147 MSEIDEKQMLLYAQMANLVNLILQFPSVDLREIAENFSKFSC----NAHSICDSELRPQG 206

Query: 183 IAVYHPIFCWINHSCSPNACYRFEAPSDSFKTRLQISPKCTDLETDEGSSNQIGTVRSDM 242
           I ++ P+   INHSCSPNA   FE                                    
Sbjct: 207 IGLF-PLVSIINHSCSPNAVLVFEE----------------------------------- 266

Query: 243 LDFIRKDFQGGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWPRYQFFCCCQRCS 302
                         VVR++ +I K   +TI+Y +       RQ  L  +Y F C C RCS
Sbjct: 267 -----------QMAVVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCS 275

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038886411.11.4e-27374.85protein SET DOMAIN GROUP 41 [Benincasa hispida][more]
XP_023520942.16.3e-27175.31protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022974027.11.4e-26773.65protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima][more]
XP_022932824.14.2e-26774.23protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata][more]
XP_008463080.13.6e-26673.20PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo][more]
Match NameE-valueIdentityDescription
Q3ECY61.4e-9537.21Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1[more]
Q9CWR25.6e-0423.74Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 ... [more]
Q9H7B47.3e-0423.74Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 S... [more]
Match NameE-valueIdentityDescription
A0A6J1I9547.0e-26873.65protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114726... [more]
A0A6J1EY392.0e-26774.23protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
A0A1S3CIT01.7e-26673.20protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 P... [more]
A0A0A0KAK31.6e-25670.69SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV... [more]
A0A6J1F3652.5e-22576.55protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
Match NameE-valueIdentityDescription
AT1G43245.19.8e-9737.21SET domain-containing protein [more]
AT2G17900.12.6e-0425.00SET domain group 37 [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D2.170.270.10SET domaincoord: 135..305
e-value: 1.5E-26
score: 94.5
NoneNo IPR availablePANTHERPTHR47780PROTEIN SET DOMAIN GROUP 41coord: 3..648
NoneNo IPR availableCDDcd20071SET_SMYDcoord: 149..301
e-value: 8.52683E-19
score: 80.4995
NoneNo IPR availableSUPERFAMILY82199SET domaincoord: 180..299
IPR001214SET domainPFAMPF00856SETcoord: 142..274
e-value: 5.9E-7
score: 30.0

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spg004474.1Spg004474.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005515 protein binding