CcUC08G147180 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC08G147180
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Protein SET DOMAIN GROUP 41
Location: CicolChr08: 5265439 .. 5270557 (+)
RNA-Seq Expression: CcUC08G147180
Synteny: CcUC08G147180
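The location field above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re

# Layout assumed from this record: "<chromosome>: <start> .. <end> (<strand>)"
LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("CicolChr08: 5265439 .. 5270557 (+)")
print(chrom, strand, span_length(start, end))
```

Under that inclusive-coordinate assumption the gene spans 5,119 bp on the plus strand of CicolChr08.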
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAGGACATTACTCCGCCATTGTTTCCCCTCGCCGCCGCTCTCCATGATTCCTTCCTCCTCACTCACTGCTCCTCCTGCTTCTCCCCTCTCCCAAACCCCCCAATTTCTCACTCCAATCTCCTCCACTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCACCGCCTTCTTCTCCGCCCTTCCTTTCCCCTCCTCCGACACCGCCGACCTCCGCGCCTCCCTCCGCCTCCTCCTCCTCCTCCTCTCCCATCCCTCCGCCTCCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACCATTCCGAGGTCTTCCTCAAGCTTCGGGAAGGGGCCGCCGCCATAGCCGCTTGCAGAAGGAATAACTCTGCCGATATTTCCCATGGAAACGCCTTGGAAGAGGCTGTCCTGTGTCTCGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGCCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCATCGGATTCCACCACTACGAGGTTACGGATTGCCCCTTCCTGCACCGATCTTGTGGCGAATGAAGGAAGTTGTAATCAAGTAATTGTTTGAATTACGAATAATTTTTGTGTTTTTCTCCCTTTCTCGTGAAATTAGTGGGAGCTATTGCATGAATGTTTTTGGCAGATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGGTGCGTTTCTTCACGGTTGCTTTGAATTTAGTTCTGTAAGTTTTGTCTCTGTTTGTATGTTTATGTGAATTTTGATGAAGATTTTCAGGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCCAAGGTATTCTGGCTATATCTTAGTTTTGTTAACTTCCCTTCAGGATTATTCAGTGGGAATTATTTTCTGAACTAAGTTCCCAAAAAGCATGCTAAGTTTCATTATAACCTTAGCCCCATTTGATAACTATGGTTTTTTGTTTTTTAAAATTTAGCTTATAAACACTACTTCCACTCATAACTTTCTATGGTTTGTTTTCTACTTTCTACTTATGTTTTCAAAAAACTGAACCAAGTTTTAGTAACATAATAGATTAGTTTCAAAAACATCTTCTTGTTTTTGGATAAAGAAAAATTGAGGGAAACAAGCATAACATTTGTGAGAAATTTGTTGATTTTGTCTCAGGCAATGAGGCAGTCAGAGTTGTGGTCAAGATATCAATTCTCCTGTAGTTGCCAGCGATGTAGTGTCGAGCCCCTAACTTATGTGGACCATGCTTTGCAAGTAAGAAAAACTAAAAGATGAATAGTTTTTAAAAAATTGTTTTTGTTTTTGAAATTTAGCTAAGAATTCAAATGTTTCCTTTAGAAAAGACAAAAACCATTGTAAAAGGAAATATGGAAAAACAAACACAATTACCAATTTGTTAGGACTCTTTCACCTGCACACAAATCCCAACAAGGACACACAAATCCGGCAAGAATTTTGTGATTTATGTATTTATATTCTGGAAAAATATCATAGCAAAGTAACTATCATGGATTGGCCTAGTGGTAAAAAAGAGACATATTCCCAATAAATAGCTAAGAGGTCATAGGTTCAATCCATGGTGGCTACTTACCTAATTCCTACTTAAGCCCTCAGATGTCTAATCTCTCTCAGCCATTCTCGTGGTCACCTCCCTTTTTTCTTAGCATAACAACCCTCCACCTCCACTAACAGTTTGCCCTTCCTTTCATGTGTGAATAATTGGTGGCCTAATAATTCCCTCGGGTTTTGAAACTCACATTATCCTGAAGGTGGAACATAGGGAACTATCTATTC
AGCATCTCCTTGTCTTCCCAAGTGGCTTCACCTTTGTGAACCATTCATCACGAGCTCTTTCAATGTTCCACCTCTTACCCCAACAATGCAGAAGCGAGTAAAAAAAATCGATCACTTAGATCAACCTCACTAAATAATTGTACTTCAAGTCTTGAAACCCCAAGCCAGGATCACCATCCAAACTAAAAGTACATAACACCAGTATTTAAAAAAACAATAATCAAATAGAATAACAAACTAAGAACAACTAAAACTATCAACGATTTAGCAGAAGCCTAAATTTACTAGACTATGAATAAGCTCCTTGTGTTATGCCAAAACGCAAACCTTCAATTACTGTCAGAACTATCGCTTCAAAGTTACAAGGATGAGAACAAAAATACCCAACAAGCAACGTGCACAACAAATTCGCAGTACCTTTTCTTCAAAACAAAACGTCTTGAACCATAGAAGTTTAATAAGCAAATCAATTATTTCTCTTTCGTATCCATCATTGGATAGAAGTCAAACTTTCAAACAAAAAGTTTTTCAGTCTTTCAGTCAAAAAATCTCTGTACTTCAGATAATCTTTCTACAAAACCAAAAATGTTTGAACAAACTTTTAGCACTTTTGGAAACAGAATTTTGTGTTTTTTAAAACACAAATCTTTCAATACTTCCAAGAAGGCATCCTTCGGGACTCACAAGGGACATAACAAATTCTTGGTAATTCCTTTTGGGTTGACCAATGCCCTTGCTACAGTTCAATCATTAATGAACCAGGTATTTCGCCCTTTCCTCTAGAGAGGTTTGTTAGTTTTCTCTGATGATATTTTGGTGTACAACCCTAATGAACATATGCACGAAAAATACTTAGCTATGATATTGAATGTGCTAAGAGATAACAAATTGTATGCAAACCATAAGAAATGTGTATTTGGCCGGTCAAGGACACCCTACCTAGGCCATTAGGTCTTTGCCGTGGGCGTTGAAGCTGAGGGAGAGAAGGTATGTGCCATGATTGGCCAATTCCTAAGAATGTTACGGAGTTTGAGGCTTCTTTGGATTGATAGGGTATTATAGAAGGTTTGTGAAGGACTGGTTCTGTGGCAGCGCCTTTGACAAAATTACTCCAAAAGGATGCATTACATTGGAATGACATAGCCACTGAGGTGTTCTATAACCTGAAACAGATGATGGTAACGCTCTCTGTGTTGGCTTTGCCTAATTTTAACTTATTATTTATGATTCAAACATATGCGTCTGGAACTGGGCTGAGGTTGTTTTAATGCGAGAACAGAGGCCAATTGCCTATTATAGCCAAACGCTTTCCACCAGAGCTCAAGGGAAACCCATTTATGAGAGAGAGAGCTAATGAATGTGGTCTTAGCAGTGCAAAGATGGAGGCATTATCTCTTGGGGCGCACGGCTATTTCATATAGAAAAGCCTTAAAGTTTCTCATCAAATAAGGAGAGGTACAATCTCAGTTCCAAAGATGGTTCACCAAGCTTTTAGGCTATGATTTCAAGATCTTATATCAGCCTGGCTTACAAACCAAAGCGGGCAGACACGTTTTCTCAAATGCCCCAGAAGGTCGAGTTGCTTAGTCTAATTGCTCCACCATTAATTGATGTGGATATCATTCAGCAGGAGGTGATAAAAGATAAGGAGCTGAAGAAAATTCGAGAGCAGTTGGAGATGGACCTTGGGGGATTCCTAAGTACTCTCTTGACCAAGGAAGGTTGTTTTATAAAGGAAGCTTGGTGTTATCTAAGACCTCGGTTTGTAAACCTTCTATGATTCTAGTATTTGGGAGAGTGAAAAGTGGTTTTAATTAATCAAAAGCACTTTTCCAAATTTGCATGGTGGATTGTAACCTAAACAGTGTGATTTTAAAAAACACTGAAAACCTTTTCCACTTAGGAAAGATCATTTTAGAAAACTCATTACAAACTCAAACCTATTCTTTGTGGGTGACTAGATTTCTGGTTATAGTTGTTCTTTTGAATATTGTGTCGTT
TTTTTGGAGCTTTCCCTTATTCTTTTTCATAATACTAAAATGATATACATTAAAAATTTCTTCAGGAAATCTCTGCTGTCAAAGTGGAATTGCTCGATTCAACTTCGTTTAGCAACTTTGATCATGACAAAGTAGTGAGAAGAATGAATGATTATGTCGACAATGTTATCACCGAGTACCTGTCTATTGGTTCTCCTGAATCATGTTGTGAGAAGCTTCAAGAGTTGCTTACTTTAGGTTTCTGTGACGAGCAAGCGGAAGATGGGGAAGGAAAACAGCCAGTTAACCTGAGACTGCATCCTTTGCACTTCCTATCGCTGAATGCATACACTGCTCTTGCATCGGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCTGAAATGGACAATAATGATGAAAATCAACGTGACGCATCTACCATGAGCAGAACAAGTGCAGCATATTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTACTCTTGCTAGACACAGCTCACTATGGGCTACTACTAACTCCTCAAAATGGGGTTTCCCTGTCGGAAGAAGAATGTGCTCCAACTGCTCATGGGTCGATAAGTTCAATGCAAGTAGGATCCTTGGTCGATCTATCGAAGCTGATTTTCGTGAGTTTTCATGTATTTCAAATTGTATTGCTAATATGTCACAAAAAACTTGGAGCTTTTTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTTCGATTTCAGCTGGCCAAAGACGATCATGACGTATTCGAGTGACCGGGATATAGGGGCTCGTAGCATTGATCGTTCATGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAATCAAGTGAGAGAGTCTATCATTGGGCTTGGCATCCATTGCTTATTCTATGGAGGCTATTTAGCAAGTATTTTTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTGATATAGTTATTAAGTAAAAATGTAAACTGTTCTGAGATTGAATCTTTTTT

mRNA sequence

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAGGACATTACTCCGCCATTGTTTCCCCTCGCCGCCGCTCTCCATGATTCCTTCCTCCTCACTCACTGCTCCTCCTGCTTCTCCCCTCTCCCAAACCCCCCAATTTCTCACTCCAATCTCCTCCACTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCACCGCCTTCTTCTCCGCCCTTCCTTTCCCCTCCTCCGACACCGCCGACCTCCGCGCCTCCCTCCGCCTCCTCCTCCTCCTCCTCTCCCATCCCTCCGCCTCCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACCATTCCGAGGTCTTCCTCAAGCTTCGGGAAGGGGCCGCCGCCATAGCCGCTTGCAGAAGGAATAACTCTGCCGATATTTCCCATGGAAACGCCTTGGAAGAGGCTGTCCTGTGTCTCGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGCCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCATCGGATTCCACCACTACGAGGTTACGGATTGCCCCTTCCTGCACCGATCTTGTGGCGAATGAAGGAAGTTGTAATCAAATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGATTTTCAGGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCCAAGGCAATGAGGCAGTCAGAGTTGTGGTCAAGATATCAATTCTCCTGTAGTTGCCAGCGATGTAGTGTCGAGCCCCTAACTTATGTGGACCATGCTTTGCAAGAAATCTCTGCTGTCAAAGTGGAATTGCTCGATTCAACTTCGTTTAGCAACTTTGATCATGACAAAGTAGTGAGAAGAATGAATGATTATGTCGACAATGTTATCACCGAGTACCTGTCTATTGGTTCTCCTGAATCATGTTGTGAGAAGCTTCAAGAGTTGCTTACTTTAGGTTTCTGTGACGAGCAAGCGGAAGATGGGGAAGGAAAACAGCCAGTTAACCTGAGACTGCATCCTTTGCACTTCCTATCGCTGAATGCATACACTGCTCTTGCATCGGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCTGAAATGGACAATAATGATGAAAATCAACGTGACGCATCTACCATGAGCAGAACAAGTGCAGCATATTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTACTCTTGCTAGACACAGCTCACTATGGGCTACTACTAACTCCTCAAAATGGGGTTTCCCTGTCGGAAGAAGAATGTGCTCCAACTGCTCATGGGTCGATAAGTTCAATGCAAGTAGGATCCTTGGTCGATCTATCGAAGCTGATTTTCGTGAGTTTTCATGTATTTCAAATTGTATTGCTAATATGTCACAAAAAACTTGGAGCTTTTTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTTCGATTTCAGCTGGCCAAAGACGATCATGACGTATTCGAGTGACCGGGATATAGGGGCTCGTAGCATTGATCGTTCATGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAATCAAGTGAGAGAGTCTATCATTGGGCTTGGCATCCATTGCTTATTCTATGGAGGCTATTTAGCAAGTATTTTTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTGATATAGTTATTAAGTAAAAATGTAAACTGTTCTGAGATTGAATCT
TTTTT

Coding sequence (CDS)

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAGGACATTACTCCGCCATTGTTTCCCCTCGCCGCCGCTCTCCATGATTCCTTCCTCCTCACTCACTGCTCCTCCTGCTTCTCCCCTCTCCCAAACCCCCCAATTTCTCACTCCAATCTCCTCCACTACTGCTCCCTCAAATGCTCCCTTTCCCATTCTGATCCCCTCACCACCGCCTTCTTCTCCGCCCTTCCTTTCCCCTCCTCCGACACCGCCGACCTCCGCGCCTCCCTCCGCCTCCTCCTCCTCCTCCTCTCCCATCCCTCCGCCTCCCACTCCGCTCCTCCCGAACGCATCTTTGGCCTTCTCACCAATCGACACAAATTGATGATCCCCCAAGACCATTCCGAGGTCTTCCTCAAGCTTCGGGAAGGGGCCGCCGCCATAGCCGCTTGCAGAAGGAATAACTCTGCCGATATTTCCCATGGAAACGCCTTGGAAGAGGCTGTCCTGTGTCTCGTCTTGACCAACGCCGTCGATGTTCAGGATTCGATCGGCCGCACCATTGGAATCGCTGTGTACGCTCCTACCTTCTGCTGGATTAATCACAGTTGTTCTCCCAACGCTTGTTACAGATTTGAAACTCCATCGGATTCCACCACTACGAGGTTACGGATTGCCCCTTCCTGCACCGATCTTGTGGCGAATGAAGGAAGTTGTAATCAAATGGGTACTGTTCGTAGCAACCTTTCGGATTTCATAAGAGAAGATTTTCAGGGTTATGGTCCAAGAGTTGTGGTTAGGAGTATAAAGAGTATAAGGAAAGGTGAAGCAGTCACAATCGCATACTGTGACTTGTTGCAACCCAAGGCAATGAGGCAGTCAGAGTTGTGGTCAAGATATCAATTCTCCTGTAGTTGCCAGCGATGTAGTGTCGAGCCCCTAACTTATGTGGACCATGCTTTGCAAGAAATCTCTGCTGTCAAAGTGGAATTGCTCGATTCAACTTCGTTTAGCAACTTTGATCATGACAAAGTAGTGAGAAGAATGAATGATTATGTCGACAATGTTATCACCGAGTACCTGTCTATTGGTTCTCCTGAATCATGTTGTGAGAAGCTTCAAGAGTTGCTTACTTTAGGTTTCTGTGACGAGCAAGCGGAAGATGGGGAAGGAAAACAGCCAGTTAACCTGAGACTGCATCCTTTGCACTTCCTATCGCTGAATGCATACACTGCTCTTGCATCGGCTTACAAAGTCCGTTCATGTGATTTATTGGCTTTGAGTTCTGAAATGGACAATAATGATGAAAATCAACGTGACGCATCTACCATGAGCAGAACAAGTGCAGCATATTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTACTCTTGCTAGACACAGCTCACTATGGGCTACTACTAACTCCTCAAAATGGGGTTTCCCTGTCGGAAGAAGAATGTGCTCCAACTGCTCATGGGTCGATAAGTTCAATGCAAGTAGGATCCTTGGTCGATCTATCGAAGCTGATTTTCGTGAGTTTTCATGTATTTCAAATTGTATTGCTAATATGTCACAAAAAACTTGGAGCTTTTTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTTCGATTTCAGCTGGCCAAAGACGATCATGACGTATTCGAGTGACCGGGATATAGGGGCTCGTAGCATTGATCGTTCATGTGCTTGTAGTAAAACTAAGGATGTTTGTTTTCAGAGTGAACCTCAGCATTCTAATCAAGTGAGAGAGTCTATCATTGGGCTTGGCATCCATTGCTTATTCTATGGAGGCTATTTAGCAAGTATTTTTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAATATTTTACATGACTTGGATTGA

Protein sequence

MEMEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSLKCSLSHSDPLTTAFFSALPFPSSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLTNRHKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCSCQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSIGSPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCDLLALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHSSLWATTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSCISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTKDVCFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD
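The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies (the page does not say which table was used); `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGAGATGGAAATGAGAGCAATGGAAGAC"))
print(translate("CATGACTTGGATTGA"))
```

The two fragments translate to the start (MEMEMRAMED) and the end (HDLD) of the listed protein, with TGA as the terminating codon.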
Homology
BLAST of CcUC08G147180 vs. NCBI nr
Match: XP_038886411.1 (protein SET DOMAIN GROUP 41 [Benincasa hispida])

HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 544/654 (83.18%), Positives = 578/654 (88.38%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSL 60
           MEMEM AMEDIEMAEDITPPL PL +ALHDSFL THCSSCFS LPNPPISHSNLL YCS 
Sbjct: 1   MEMEMIAMEDIEMAEDITPPLLPLTSALHDSFLFTHCSSCFSLLPNPPISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTTAFFSALPFPS--SDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLT 120
           KCSLSHSDPLT AFFS  PFPS  S T+DLRASLRLL LLLSHP AS S PPERIFGLLT
Sbjct: 61  KCSLSHSDPLTAAFFSTHPFPSPFSYTSDLRASLRLLHLLLSHPPASLSPPPERIFGLLT 120

Query: 121 NRHKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDS 180
           NRHKLM PQ  +E+F KLREG  AIAA     SADI HG+ L EA LCLV TNAVDV DS
Sbjct: 121 NRHKLMFPQHDAELFPKLREGVDAIAALL---SADIPHGHTLAEAALCLVFTNAVDVHDS 180

Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMG 240
            GRTIGIAVY PTFCWINHSCSPNACYRFET S STTTR RIAPSCTDL+  +GSC+QMG
Sbjct: 181 TGRTIGIAVYPPTFCWINHSCSPNACYRFETSSASTTTRSRIAPSCTDLLTGQGSCSQMG 240

Query: 241 TVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFS 300
           TVRSNLSDFI EDFQG GPRV+VRSIKSIR+GEAVTIAYCDLLQPKAMRQSELWSRYQF 
Sbjct: 241 TVRSNLSDFITEDFQGNGPRVMVRSIKSIRRGEAVTIAYCDLLQPKAMRQSELWSRYQFV 300

Query: 301 CSCQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSI 360
           CSCQRCS +PLTYVDHALQE+SA KVEL DSTS SNFDHDK VRR++DYV++ ITEYLSI
Sbjct: 301 CSCQRCSAKPLTYVDHALQELSASKVELHDSTSISNFDHDKAVRRIDDYVNSAITEYLSI 360

Query: 361 GSPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCD 420
           GSPESCCEKL+ LLTLGF DEQAEDGE KQPVNLRLHPLHFLSLN YTALASAYKVRSCD
Sbjct: 361 GSPESCCEKLRNLLTLGFYDEQAEDGEQKQPVNLRLHPLHFLSLNVYTALASAYKVRSCD 420

Query: 421 LLALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES 480
           LLALSSEMD ++E+Q +ASTM + SAAYSLFLAGATHHLFLSEPSLI SA+ CWV+AGES
Sbjct: 421 LLALSSEMDCDNEDQCNASTMCKASAAYSLFLAGATHHLFLSEPSLIVSASTCWVLAGES 480

Query: 481 LLTLARHSSLWATTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-ISN 540
           LLTLARHS LWATTN+SKWGFPVG+RMCS CSWVDKFNASRI G+ IEADFREFS  ISN
Sbjct: 481 LLTLARHSLLWATTNTSKWGFPVGKRMCSTCSWVDKFNASRIHGQPIEADFREFSIGISN 540

Query: 541 CIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTKD 600
           CIANMS+K+WSFLTHGCPYLKAFTDPF+FSWPK I  YSSDRDI A SIDR CACS +KD
Sbjct: 541 CIANMSRKSWSFLTHGCPYLKAFTDPFNFSWPKMIPMYSSDRDIRAHSIDRLCACSNSKD 600

Query: 601 VCFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
           VCFQ EPQHSNQ RESI+GLGIHCLFYGGYLASI YGHHSHLASQIQNIL+DL+
Sbjct: 601 VCFQCEPQHSNQERESILGLGIHCLFYGGYLASICYGHHSHLASQIQNILYDLN 651
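The identity and positive-match percentages in an HSP summary line are ratios of matched positions to alignment length, so they can be recomputed from the raw counts. The following is a minimal sketch that parses the summary line in the layout used on this page (other BLAST report formats differ); `parse_hsp_stats` and `percent` are hypothetical helpers, and the pattern tolerates both "Positives" and the misspelled "Postives".

```python
import re

# Summary-line layout assumed from this record.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract match counts and reported percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(matches, length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


stats = parse_hsp_stats(
    "Identity = 544/654 (83.18%), Positives = 578/654 (88.38%), Query Frame = 0"
)
print(percent(stats["identities"], stats["alignment_length"]))
print(percent(stats["positives"], stats["alignment_length"]))
```

For the Benincasa hispida hit, the recomputed values agree with the reported 83.18% identity and 88.38% positives.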

BLAST of CcUC08G147180 vs. NCBI nr
Match: XP_008463080.1 (PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo])

HSP 1 Score: 1066.2 bits (2756), Expect = 1.1e-307
Identity = 539/655 (82.29%), Positives = 575/655 (87.79%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSLKC 62
           MEMRA+EDIEMAEDITPPLFPL +ALHDSFL THCSSCFS LPNPPISHS LLHYCSLKC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  SLSHSDPLTTAFFSALPFP--SSDTADLRASLRL--LLLLLSHPSASHSAPPERIFGLLT 122
           SLSHSDPLT AFFS  P P  SSDT+DLRASLRL  L LLLSHPS S S PP RIFGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NRHKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDS 182
           NRHKLM PQ+ SEVFLKLRE A AIAA RR N ADIS G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMG 242
           IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFS 302
            VRSN+ DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQF 
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CSCQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSI 362
           CSCQRCS  PLTYVDHALQEISAVKVELLDS   SNFDHD  VRR+++YVDN ITEYLSI
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSI 360

Query: 363 GSPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCD 422
           GSPESCCEKLQ LLT GF DEQ EDGEGKQPV+LRLHP HFL LNAYTAL SAYKVRSCD
Sbjct: 361 GSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCD 420

Query: 423 LLALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES 482
           LLALSSEMD ++EN+ +A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGES
Sbjct: 421 LLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGES 480

Query: 483 LLTLARHSSLWA-TTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-IS 542
           LL LARHSSLWA TTN+S WGFP+G+RMCSNCSWVD+FN SRI GR I+ADFREFS  IS
Sbjct: 481 LLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGIS 540

Query: 543 NCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTK 602
           NCIA++S+K WSFLTHGCPYLKAFTDPFDFSWPKT     +D DIG   IDRSCACSKTK
Sbjct: 541 NCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPKT-----NDGDIGGHGIDRSCACSKTK 600

Query: 603 DVCFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
           D+CF+ EPQ SNQ RESI GLGIHCL+YGGYLASI YG+HSHLASQIQNIL+DL+
Sbjct: 601 DICFECEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650

BLAST of CcUC08G147180 vs. NCBI nr
Match: XP_011656459.1 (protein SET DOMAIN GROUP 41 [Cucumis sativus])

HSP 1 Score: 1040.4 bits (2689), Expect = 6.5e-300
Identity = 525/655 (80.15%), Positives = 569/655 (86.87%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSL 60
           MEMEM A+EDIEMAEDI+PPLFPL +ALHDSFL THCSSCFS LPNPPISHS  LHYCSL
Sbjct: 1   MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60

Query: 61  KCSLSHSDPLTTAFFSALPFP--SSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLT 120
           KCSLSHSDPLT AFFS  PFP  SSDT+DLRASLRLL LLLSHPS S S PP+RI+GLLT
Sbjct: 61  KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120

Query: 121 NRHKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDS 180
           NRHKLM PQ+ SEVFLKLREGA AIAA RR N ADI  G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180

Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMG 240
           IG+TIGIAVYA TF WINHSCSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240

Query: 241 TVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFS 300
            VRSN+ DFIREDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQF 
Sbjct: 241 NVRSNILDFIREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 301 CSCQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSI 360
           CSCQRCS  PLTYVDHALQEIS+VKVELLDST  SNFDHD  VRR+++YVDN ITEYLS 
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYLST 360

Query: 361 GSPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCD 420
            SPESCCEKLQ LLT GF DEQ EDGEGKQ V+LRLHPLHFL LNAYTAL SAYKVRSCD
Sbjct: 361 SSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRSCD 420

Query: 421 LLALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES 480
           L+ALSSEMD ++ N+ +A TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGES
Sbjct: 421 LVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAGES 480

Query: 481 LLTLARHSSLWA-TTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-IS 540
           LL LARHSSLWA TTN+S W FP+G+RMC NCSWVD+FNASRI G+ ++ADFREFS  IS
Sbjct: 481 LLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIGIS 540

Query: 541 NCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTK 600
           NCIA++SQK WS LTHGCPYLKAFT PFDFSWPKT     +++DI  R ID SCACSKT+
Sbjct: 541 NCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPKT-----NEQDICGRGIDHSCACSKTQ 600

Query: 601 DVCFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
           DVC + +PQ SNQ RESI GLGIHCL+YGGYLASI YGHHSHLASQIQNIL+DL+
Sbjct: 601 DVCLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDLN 650

BLAST of CcUC08G147180 vs. NCBI nr
Match: XP_023520942.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 947.2 bits (2447), Expect = 7.4e-272
Identity = 497/652 (76.23%), Positives = 536/652 (82.21%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSL 60
           MEMEMRAMEDIEMAEDITPPL PL AALHD+FLLTHCSSCFSPLPN  ISHSNLL YCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFLLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTTAFFSALPFPSSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLTNR 120
            C  SHSD LT A FS   FP SDT+DLRASLRLL LLLS PSA  SAPPERIFGLLTNR
Sbjct: 61  IC--SHSDSLTAAVFSTGQFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNR 120

Query: 121 HKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDSIG 180
            KLM+  D SEVF+K+REG+ A+AA RR NSADI + NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLADDDSEVFVKIREGSDAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180

Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMGTV 240
           RTIGIAVY PTFCWINHSCSPNACYRFETPSDS  TRLRI+P CTD+   EGSC+QM TV
Sbjct: 181 RTIGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTV 240

Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCS 300
           R N S FI +DFQGYGPRV+VRSIKSIR GEAVTIAYCDLLQPKAMRQSEL SRY+F CS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSIRNGEAVTIAYCDLLQPKAMRQSELRSRYKFVCS 300

Query: 301 CQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSIGS 360
           CQRCS +P TYVDHALQEISAV VELLDSTS SNFD+D  + R++DYV+N I EYLSIGS
Sbjct: 301 CQRCSAKPPTYVDHALQEISAVNVELLDSTSISNFDYDTAIARIDDYVNNAIAEYLSIGS 360

Query: 361 PESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCDLL 420
            ESCCEKLQ LLTLGF DEQAEDG+GKQ +NLRLHP+HFL LNAYTALASAYKVRS    
Sbjct: 361 SESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSW--- 420

Query: 421 ALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLL 480
                  N DENQ +A TMS+TSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLL
Sbjct: 421 -------NGDENQCNA-TMSKTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLL 480

Query: 481 TLARHSSLWATTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-ISNCI 540
            L +HSSLW  +N+SK   P+G   C NCSWVDKFN SRI GRSIEADFREFS  ISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTKDVC 600
           AN+SQK WSFL H C YLKAFTDPFDFSWPKTI T S+ R       DRSC CSK +DV 
Sbjct: 541 ANISQKYWSFLAHECSYLKAFTDPFDFSWPKTITTCSNYR-------DRSCDCSKIQDV- 600

Query: 601 FQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
                  S+Q R+SI  LGIHCLFYGGYLASI YGHHSHLASQIQ ILHD++
Sbjct: 601 -------SDQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIQCILHDMN 623

BLAST of CcUC08G147180 vs. NCBI nr
Match: XP_022932824.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata])

HSP 1 Score: 923.3 bits (2385), Expect = 1.2e-264
Identity = 484/652 (74.23%), Positives = 530/652 (81.29%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSL 60
           MEMEMRAMEDIEMAEDITPPL PL AALHD+F LTHCSSCFSPLPN  ISHSNLL YCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTTAFFSALPFPSSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLTNR 120
            C  S SD LT A FS   FP SDT+DLRASLRLL LLLS  SA  SAPPERIFGLLTNR
Sbjct: 61  IC--SRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNR 120

Query: 121 HKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDSIG 180
            KLM+ +D SEVF+K+R+GA A+AA RR NSADI + NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180

Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMGTV 240
           +TIGIAVY PTFCWINHSCSPNACYRFETPSDS  TRLRI+P CTD+   EGSCNQM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTV 240

Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCS 300
           R N S FI +DFQGYGPRV+VRSIKS+RKGEAVTIAYCDLLQPKA+RQSEL SRY+F CS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCS 300

Query: 301 CQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSIGS 360
           CQRCS +P TYVDHALQEISA  VELLDSTS SNFD+D  +RR++DYV+N I EYLSIGS
Sbjct: 301 CQRCSAKPPTYVDHALQEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGS 360

Query: 361 PESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCDLL 420
           PESCCEKLQ LLTLGF DEQAEDG+GKQ +NLRLHP+HFL LN YTALASAYKVRS    
Sbjct: 361 PESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW--- 420

Query: 421 ALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLL 480
                  N+DENQ +A TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESLL
Sbjct: 421 -------NDDENQCNA-TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLL 480

Query: 481 TLARHSSLWATTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-ISNCI 540
            L +HSSLW  +N+SK   P+G   C NCSWVDKFN +RI GRSIEADFREFS  ISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTKDVC 600
           A++S K WSFL H C YLKAFTDPFDFSWPKTI T  +          RSC CSK +DV 
Sbjct: 541 ADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQDV- 600

Query: 601 FQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
                  S Q R+SI  LGIHCLFYGGYLASI YGH SHLASQI+ ILHD++
Sbjct: 601 -------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623

BLAST of CcUC08G147180 vs. ExPASy Swiss-Prot
Match: Q3ECY6 (Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1)

HSP 1 Score: 370.9 bits (951), Expect = 2.9e-101
Identity = 244/648 (37.65%), Positives = 341/648 (52.62%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSLKC 62
           ME+RA EDIE+  D+ PPL PLA++L+DSFL +HCSSCFS LP  P        YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQPL----YCSAAC 60

Query: 63  SLSHSDPLTTAFFSALPFPSSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLTNRHK 122
           S      LT +F ++  FP   T  L + +R  L LL+  +   S+ P R+  LLTN H 
Sbjct: 61  S------LTDSFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTSSSPHRLNNLLTNHHL 120

Query: 123 LMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDSIGRT 182
           LM       + + +   A  IA   R+N  +      LEEA +C VLTNAV+V DS G  
Sbjct: 121 LMA---DPSISVAIHHAANFIATVIRSNRKN----TELEEAAICAVLTNAVEVHDSNGLA 180

Query: 183 IGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMGTVRS 242
           +GIA+Y  +F WINHSCSPN+CYRF    ++ T+   +  + T+  +N     Q+     
Sbjct: 181 LGIALYNSSFSWINHSCSPNSCYRF---VNNRTSYHDVHVTNTETSSNLELQEQVCGTSL 240

Query: 243 NLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCSCQ 302
           N  +       G GP+++VRSIK I+ GE +T++Y DLLQP  +RQS+LWS+Y+F C+C 
Sbjct: 241 NSGN-------GNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCG 300

Query: 303 RCSVEPLTYVDHALQEISAVKVELLDSTSFSNFD----HDKVVRRMNDYVDNVITEYLSI 362
           RC+  P  YVD  L+ +  ++ E    T+  +FD     D+ V +MNDY+   I ++LS 
Sbjct: 301 RCAASPPAYVDSILEGVLTLESE---KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSD 360

Query: 363 G-SPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSC 422
              P++CCE ++ +L  G      +  E  QP  LRLH  H+++LNAY  LA+AY++RS 
Sbjct: 361 NIDPKTCCEMIESVLHHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI 420

Query: 423 DLLALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGE 482
                       D        MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE
Sbjct: 421 ------------DSETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGE 480

Query: 483 SLLTLARHSSLWATTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFS-CIS 542
            L  LA    +  +  S           C+ C  ++  N+ R        D +E S  I 
Sbjct: 481 LLFDLAPKLLMELSVESDV--------KCTKCLMLETSNSHR--------DIKEKSRQIL 540

Query: 543 NCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTK 602
           +C+ ++SQ TWSFLT GCPYL+ F  P DFS  +T    + +R+                
Sbjct: 541 SCVRDISQVTWSFLTRGCPYLEKFRSPVDFSLTRT----NGERE---------------- 557

Query: 603 DVCFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQ 645
                   + S     +++ L  HCL Y   L  + YG  SHL S+ +
Sbjct: 601 --------ESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557

BLAST of CcUC08G147180 vs. ExPASy Swiss-Prot
Match: Q9CWR2 (Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 SV=1)

HSP 1 Score: 50.4 bits (119), Expect = 8.6e-05
Identity = 31/140 (22.14%), Positives = 54/140 (38.57%), Query Frame = 0

Query: 168 VLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDL 227
           V+ N+  + ++  + +G+ +Y P+   +NHSC PN    F                    
Sbjct: 178 VICNSFTICNAEMQEVGVGLY-PSMSLLNHSCDPNCSIVFN------------------- 237

Query: 228 VANEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMR 287
                                       GP +++R+++ I  GE +TI Y D+L     R
Sbjct: 238 ----------------------------GPHLLLRAVREIEAGEELTICYLDMLMTSEER 269

Query: 288 QSELWSRYQFSCSCQRCSVE 308
           + +L  +Y F C C RC  +
Sbjct: 298 RKQLRDQYCFECDCIRCQTQ 269

BLAST of CcUC08G147180 vs. ExPASy Swiss-Prot
Match: Q9H7B4 (Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 SV=4)

HSP 1 Score: 50.1 bits (118), Expect = 1.1e-04
Identity = 31/140 (22.14%), Positives = 54/140 (38.57%), Query Frame = 0

Query: 168 VLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDL 227
           V+ N+  + ++  + +G+ +Y P+   +NHSC PN    F                    
Sbjct: 178 VICNSFTICNAEMQEVGVGLY-PSISLLNHSCDPNCSIVFN------------------- 237

Query: 228 VANEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMR 287
                                       GP +++R+++ I  GE +TI Y D+L     R
Sbjct: 238 ----------------------------GPHLLLRAVRDIEVGEELTICYLDMLMTSEER 269

Query: 288 QSELWSRYQFSCSCQRCSVE 308
           + +L  +Y F C C RC  +
Sbjct: 298 RKQLRDQYCFECDCFRCQTQ 269

BLAST of CcUC08G147180 vs. ExPASy Swiss-Prot
Match: Q9NRG4 (N-lysine methyltransferase SMYD2 OS=Homo sapiens OX=9606 GN=SMYD2 PE=1 SV=2)

HSP 1 Score: 49.3 bits (116), Expect = 1.9e-04
Identity = 39/129 (30.23%), Positives = 54/129 (41.86%), Query Frame = 0

Query: 256 GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCSCQRCSVEPLTYVDHA 315
           G    VR+++ I+ GE V  +Y DLL P   R   L   Y F+C CQ C+          
Sbjct: 219 GTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTT--------- 278

Query: 316 LQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITE------YLSIGSPESCCEKLQ 375
            ++    KVE+      S+    + +R M  Y  NVI E      Y S       CE  Q
Sbjct: 279 -KDKDKAKVEI---RKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQ 334

Query: 376 ELLTLGFCD 379
           E ++  F D
Sbjct: 339 EKMSSVFED 334

BLAST of CcUC08G147180 vs. ExPASy Swiss-Prot
Match: O94256 (SET domain and MYND-type zinc finger protein 6 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=set6 PE=3 SV=1)

HSP 1 Score: 47.0 bits (110), Expect = 9.6e-04
Identity = 36/151 (23.84%), Positives = 59/151 (39.07%), Query Frame = 0

Query: 160 LEEAVLCLVLTNAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLR 219
           L + + C +  NA+++  S   ++G+ +     C +NHSC PN    F+           
Sbjct: 158 LFQKLFCRLAVNAMNLVTSSFDSLGMCL-DTILCRLNHSCDPNCQIIFD----------- 217

Query: 220 IAPSCTDLVANEGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCD 279
                                               G  V + S + I+K E + I+Y D
Sbjct: 218 ------------------------------------GAIVQLVSKRDIKKDEQLFISYID 260

Query: 280 LLQPKAMRQSELWSRYQFSCSCQRCSVEPLT 311
           +  PK++RQ +L  +Y FSC C RC  +  T
Sbjct: 278 IRLPKSIRQKQLLKKYFFSCYCPRCENDHTT 260

BLAST of CcUC08G147180 vs. ExPASy TrEMBL
Match: A0A1S3CIT0 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 PE=4 SV=1)

HSP 1 Score: 1066.2 bits (2756), Expect = 5.3e-308
Identity = 539/655 (82.29%), Positives = 575/655 (87.79%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSLKC 62
           MEMRA+EDIEMAEDITPPLFPL +ALHDSFL THCSSCFS LPNPPISHS LLHYCSLKC
Sbjct: 1   MEMRALEDIEMAEDITPPLFPLTSALHDSFLSTHCSSCFSLLPNPPISHSPLLHYCSLKC 60

Query: 63  SLSHSDPLTTAFFSALPFP--SSDTADLRASLRL--LLLLLSHPSASHSAPPERIFGLLT 122
           SLSHSDPLT AFFS  P P  SSDT+DLRASLRL  L LLLSHPS S S PP RIFGLLT
Sbjct: 61  SLSHSDPLTAAFFSIHPLPDASSDTSDLRASLRLLHLHLLLSHPSPSLSPPPHRIFGLLT 120

Query: 123 NRHKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDS 182
           NRHKLM PQ+ SEVFLKLRE A AIAA RR N ADIS G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDS 180

Query: 183 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMG 242
           IG+TIGIAVYAPTF WINHSCSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG
Sbjct: 181 IGQTIGIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMG 240

Query: 243 TVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFS 302
            VRSN+ DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQF 
Sbjct: 241 NVRSNILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFV 300

Query: 303 CSCQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSI 362
           CSCQRCS  PLTYVDHALQEISAVKVELLDS   SNFDHD  VRR+++YVDN ITEYLSI
Sbjct: 301 CSCQRCSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSI 360

Query: 363 GSPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCD 422
           GSPESCCEKLQ LLT GF DEQ EDGEGKQPV+LRLHP HFL LNAYTAL SAYKVRSCD
Sbjct: 361 GSPESCCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCD 420

Query: 423 LLALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES 482
           LLALSSEMD ++EN+ +A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGES
Sbjct: 421 LLALSSEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGES 480

Query: 483 LLTLARHSSLWA-TTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-IS 542
           LL LARHSSLWA TTN+S WGFP+G+RMCSNCSWVD+FN SRI GR I+ADFREFS  IS
Sbjct: 481 LLILARHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGIS 540

Query: 543 NCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTK 602
           NCIA++S+K WSFLTHGCPYLKAFTDPFDFSWPKT     +D DIG   IDRSCACSKTK
Sbjct: 541 NCIASISRKCWSFLTHGCPYLKAFTDPFDFSWPKT-----NDGDIGGHGIDRSCACSKTK 600

Query: 603 DVCFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
           D+CF+ EPQ SNQ RESI GLGIHCL+YGGYLASI YG+HSHLASQIQNIL+DL+
Sbjct: 601 DICFECEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 650

BLAST of CcUC08G147180 vs. ExPASy TrEMBL
Match: A0A0A0KAK3 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV=1)

HSP 1 Score: 1028.5 bits (2658), Expect = 1.2e-296
Identity = 522/657 (79.45%), Positives = 566/657 (86.15%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSL 60
           MEMEM A+EDIEMAEDI+PPLFPL +ALHDSFL THCSSCFS LPNPPISHS  LHYCSL
Sbjct: 1   MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60

Query: 61  KCSLSHSDPLTTAFFSALPFP--SSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLT 120
           KCSLSHSDPLT AFFS  PFP  SSDT+DLRASLRLL LLLSHPS S S PP+RI+GLLT
Sbjct: 61  KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120

Query: 121 NRHKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDS 180
           NRHKLM PQ+ SEVFLKLREGA AIAA RR N ADI  G ALEEAVLCLVLTNAVDVQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180

Query: 181 IGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMG 240
           IG+TIGIAVYA TF WINHSCSPNACYRFETPSDS TTR RIAPSCTD +++EGSC QMG
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240

Query: 241 TVRSNLSDFIRED--FQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQ 300
            VRSN+ DFIRE     G GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQ
Sbjct: 241 NVRSNILDFIREGALLNGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQ 300

Query: 301 FSCSCQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYL 360
           F CSCQRCS  PLTYVDHALQEIS+VKVELLDST  SNFDHD  VRR+++YVDN ITEYL
Sbjct: 301 FVCSCQRCSAVPLTYVDHALQEISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYL 360

Query: 361 SIGSPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRS 420
           S  SPESCCEKLQ LLT GF DEQ EDGEGKQ V+LRLHPLHFL LNAYTAL SAYKVRS
Sbjct: 361 STSSPESCCEKLQNLLTFGFHDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRS 420

Query: 421 CDLLALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAG 480
           CDL+ALSSEMD ++ N+ +A TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAG
Sbjct: 421 CDLVALSSEMDKDNGNRHNALTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAG 480

Query: 481 ESLLTLARHSSLWA-TTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC- 540
           ESLL LARHSSLWA TTN+S W FP+G+RMC NCSWVD+FNASRI G+ ++ADFREFS  
Sbjct: 481 ESLLILARHSSLWATTTNTSNWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIG 540

Query: 541 ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSK 600
           ISNCIA++SQK WS LTHGCPYLKAFT PFDFSWPKT     +++DI  R ID SCACSK
Sbjct: 541 ISNCIASISQKCWSSLTHGCPYLKAFTGPFDFSWPKT-----NEQDICGRGIDHSCACSK 600

Query: 601 TKDVCFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
           T+DVC + +PQ SNQ RESI GLGIHCL+YGGYLASI YGHHSHLASQIQNIL+DL+
Sbjct: 601 TQDVCLECKPQDSNQERESISGLGIHCLYYGGYLASICYGHHSHLASQIQNILNDLN 652

BLAST of CcUC08G147180 vs. ExPASy TrEMBL
Match: A0A6J1EY39 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)

HSP 1 Score: 923.3 bits (2385), Expect = 5.6e-265
Identity = 484/652 (74.23%), Positives = 530/652 (81.29%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSL 60
           MEMEMRAMEDIEMAEDITPPL PL AALHD+F LTHCSSCFSPLPN  ISHSNLL YCS 
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTTAFFSALPFPSSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLTNR 120
            C  S SD LT A FS   FP SDT+DLRASLRLL LLLS  SA  SAPPERIFGLLTNR
Sbjct: 61  IC--SRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNR 120

Query: 121 HKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDSIG 180
            KLM+ +D SEVF+K+R+GA A+AA RR NSADI + NALEEA+LCLVLTNAV+VQDS+G
Sbjct: 121 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 180

Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMGTV 240
           +TIGIAVY PTFCWINHSCSPNACYRFETPSDS  TRLRI+P CTD+   EGSCNQM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTV 240

Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCS 300
           R N S FI +DFQGYGPRV+VRSIKS+RKGEAVTIAYCDLLQPKA+RQSEL SRY+F CS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAVRQSELLSRYKFVCS 300

Query: 301 CQRCSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSIGS 360
           CQRCS +P TYVDHALQEISA  VELLDSTS SNFD+D  +RR++DYV+N I EYLSIGS
Sbjct: 301 CQRCSAKPPTYVDHALQEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGS 360

Query: 361 PESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCDLL 420
           PESCCEKLQ LLTLGF DEQAEDG+GKQ +NLRLHP+HFL LN YTALASAYKVRS    
Sbjct: 361 PESCCEKLQNLLTLGFYDEQAEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSW--- 420

Query: 421 ALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLL 480
                  N+DENQ +A TMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESLL
Sbjct: 421 -------NDDENQCNA-TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLL 480

Query: 481 TLARHSSLWATTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-ISNCI 540
            L +HSSLW  +N+SK   P+G   C NCSWVDKFN +RI GRSIEADFREFS  ISNCI
Sbjct: 481 ILVKHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEADFREFSIGISNCI 540

Query: 541 ANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTKDVC 600
           A++S K WSFL H C YLKAFTDPFDFSWPKTI T  +          RSC CSK +DV 
Sbjct: 541 ADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCLNYH-------GRSCDCSKIQDV- 600

Query: 601 FQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
                  S Q R+SI  LGIHCLFYGGYLASI YGH SHLASQI+ ILHD++
Sbjct: 601 -------SEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECILHDMN 623

BLAST of CcUC08G147180 vs. ExPASy TrEMBL
Match: A0A6J1I954 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)

HSP 1 Score: 919.1 bits (2374), Expect = 1.1e-263
Identity = 484/653 (74.12%), Positives = 529/653 (81.01%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSL 60
           MEME+RAMEDIEMAEDITPPL PL AALHDSFLLTHCSSCFSPLPN PISHSNLL YCS 
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  KCSLSHSDPLTTAFFSALPFPSSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLTNR 120
            C  S+SD LT A FS   F  SDT+DLRASLRLL LLLS  SA  S PPERIFGLLTNR
Sbjct: 61  IC--SYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNR 120

Query: 121 HKLMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDSIG 180
            KLM+  D SEVF K+R+GA AIA  RR NSADI + NALEEA++CLVLTNAV+VQDS+G
Sbjct: 121 EKLMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVG 180

Query: 181 RTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMGTV 240
           +TIGIAVY PTFCWINHSCSPNACYRFETPSDS  TRLRI+P CTD+   EGSC+QM TV
Sbjct: 181 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTV 240

Query: 241 RSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCS 300
           R N S FI +DFQGYGPRV+VRSIKSIRKGEAVTIAYCDLLQPKAMRQSEL SRY+F CS
Sbjct: 241 RRNFSHFITKDFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELRSRYKFVCS 300

Query: 301 CQRCSVEPLTYVDHALQEISAVKV-ELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSIG 360
           CQRCS +P TYVDHALQEI AV V ELLDSTS SNFD+D  + R++DYV+N I EYLSIG
Sbjct: 301 CQRCSAKPPTYVDHALQEIFAVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIG 360

Query: 361 SPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCDL 420
           SPESCCEKLQ LLTLGF DEQA+DG+GKQ +NLRLHP+HFL LN YTALASAYKVRS   
Sbjct: 361 SPESCCEKLQNLLTLGFYDEQADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSW-- 420

Query: 421 LALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESL 480
                   N++ENQ + STMS+TSAAYSLFLAGATHHLFL+EPSLIASAANCWVVAGESL
Sbjct: 421 --------NDNENQCNTSTMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESL 480

Query: 481 LTLARHSSLWATTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-ISNC 540
           L L RHSSLW  +N+SK   P+G   C NCSWVDKFN SRI GRSIE DF+EFS  ISNC
Sbjct: 481 LRLVRHSSLWG-SNTSKSSSPMGEITCLNCSWVDKFNTSRIHGRSIEVDFQEFSIGISNC 540

Query: 541 IANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTKDV 600
           IAN+S K WSFLTH CPYLKAFTDPFDFSWPKTI T S+ R       DR C  SK +DV
Sbjct: 541 IANISHKYWSFLTHECPYLKAFTDPFDFSWPKTITTCSNYR-------DRLCDYSKIQDV 600

Query: 601 CFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
                   S+Q R+SI  LGIHCLFYGGYLASI YGH SHL+SQIQ IL D++
Sbjct: 601 --------SDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQCILQDMN 625

BLAST of CcUC08G147180 vs. ExPASy TrEMBL
Match: A0A5A7T0X4 (Protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold155G00420 PE=4 SV=1)

HSP 1 Score: 873.2 bits (2255), Expect = 6.6e-250
Identity = 434/530 (81.89%), Positives = 467/530 (88.11%), Query Frame = 0

Query: 124 MIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDSIGRTI 183
           M PQ+ SEVFLKLRE A AIAA RR N ADIS G ALEEAVLCLVLTNAVDVQDSIG+TI
Sbjct: 1   MTPQNGSEVFLKLREAANAIAALRRKNYADISPGTALEEAVLCLVLTNAVDVQDSIGQTI 60

Query: 184 GIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMGTVRSN 243
           GIAVYAPTF WINHSCSPNACYRFETPSD  TTR RIAPSCTD V++EG+C QMG VRSN
Sbjct: 61  GIAVYAPTFSWINHSCSPNACYRFETPSDFFTTRFRIAPSCTDFVSDEGTCRQMGNVRSN 120

Query: 244 LSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCSCQR 303
           + DF+REDFQG GPRVVVRSIK I+KGEAVTIAYCDLLQPKA RQSELWSRYQF CSCQR
Sbjct: 121 ILDFMREDFQGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKARRQSELWSRYQFVCSCQR 180

Query: 304 CSVEPLTYVDHALQEISAVKVELLDSTSFSNFDHDKVVRRMNDYVDNVITEYLSIGSPES 363
           CS  PLTYVDHALQEISAVKVELLDS   SNFDHD  VRR+++YVDN ITEYLSIGSPES
Sbjct: 181 CSAVPLTYVDHALQEISAVKVELLDSAPISNFDHDTAVRRIDEYVDNAITEYLSIGSPES 240

Query: 364 CCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSCDLLALS 423
           CCEKLQ LLT GF DEQ EDGEGKQPV+LRLHP HFL LNAYTAL SAYKVRSCDLLALS
Sbjct: 241 CCEKLQNLLTFGFRDEQVEDGEGKQPVSLRLHPSHFLLLNAYTALTSAYKVRSCDLLALS 300

Query: 424 SEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLA 483
           SEMD ++EN+ +A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLL LA
Sbjct: 301 SEMDKDNENRHNALTMSKTSAAYALFLAGATHHLFLFEPSLIASAANCWVVAGESLLILA 360

Query: 484 RHSSLWA-TTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFSC-ISNCIAN 543
           RHSSLWA TTN+S WGFP+G+RMCSNCSWVD+FN SRI GR I+ADFREFS  ISNCIA+
Sbjct: 361 RHSSLWATTTNTSDWGFPLGKRMCSNCSWVDEFNGSRIHGRRIQADFREFSIGISNCIAS 420

Query: 544 MSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTKDVCFQ 603
           +S+K WSFLTHGCPYLKAFTDPFDFSWPKT     +D DIG   IDRSCACSKTKD+CF+
Sbjct: 421 ISRKCWSFLTHGCPYLKAFTDPFDFSWPKT-----NDGDIGGHGIDRSCACSKTKDICFE 480

Query: 604 SEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQNILHDLD 652
            EPQ SNQ RESI GLGIHCL+YGGYLASI YG+HSHLASQIQNIL+DL+
Sbjct: 481 CEPQDSNQERESISGLGIHCLYYGGYLASICYGYHSHLASQIQNILNDLN 525

BLAST of CcUC08G147180 vs. TAIR 10
Match: AT1G43245.1 (SET domain-containing protein )

HSP 1 Score: 370.9 bits (951), Expect = 2.1e-102
Identity = 244/648 (37.65%), Positives = 341/648 (52.62%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLFPLAAALHDSFLLTHCSSCFSPLPNPPISHSNLLHYCSLKC 62
           ME+RA EDIE+  D+ PPL PLA++L+DSFL +HCSSCFS LP  P        YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQPL----YCSAAC 60

Query: 63  SLSHSDPLTTAFFSALPFPSSDTADLRASLRLLLLLLSHPSASHSAPPERIFGLLTNRHK 122
           S      LT +F ++  FP   T  L + +R  L LL+  +   S+ P R+  LLTN H 
Sbjct: 61  S------LTDSFTNSPQFPPEITPILPSDIRTSLHLLNSTAVDTSSSPHRLNNLLTNHHL 120

Query: 123 LMIPQDHSEVFLKLREGAAAIAACRRNNSADISHGNALEEAVLCLVLTNAVDVQDSIGRT 182
           LM       + + +   A  IA   R+N  +      LEEA +C VLTNAV+V DS G  
Sbjct: 121 LMA---DPSISVAIHHAANFIATVIRSNRKN----TELEEAAICAVLTNAVEVHDSNGLA 180

Query: 183 IGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVANEGSCNQMGTVRS 242
           +GIA+Y  +F WINHSCSPN+CYRF    ++ T+   +  + T+  +N     Q+     
Sbjct: 181 LGIALYNSSFSWINHSCSPNSCYRF---VNNRTSYHDVHVTNTETSSNLELQEQVCGTSL 240

Query: 243 NLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCSCQ 302
           N  +       G GP+++VRSIK I+ GE +T++Y DLLQP  +RQS+LWS+Y+F C+C 
Sbjct: 241 NSGN-------GNGPKLIVRSIKRIKSGEEITVSYIDLLQPTGLRQSDLWSKYRFMCNCG 300

Query: 303 RCSVEPLTYVDHALQEISAVKVELLDSTSFSNFD----HDKVVRRMNDYVDNVITEYLSI 362
           RC+  P  YVD  L+ +  ++ E    T+  +FD     D+ V +MNDY+   I ++LS 
Sbjct: 301 RCAASPPAYVDSILEGVLTLESE---KTTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSD 360

Query: 363 G-SPESCCEKLQELLTLGFCDEQAEDGEGKQPVNLRLHPLHFLSLNAYTALASAYKVRSC 422
              P++CCE ++ +L  G      +  E  QP  LRLH  H+++LNAY  LA+AY++RS 
Sbjct: 361 NIDPKTCCEMIESVLHHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRSI 420

Query: 423 DLLALSSEMDNNDENQRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGE 482
                       D        MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE
Sbjct: 421 ------------DSETGIVCDMSRISAAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGE 480

Query: 483 SLLTLARHSSLWATTNSSKWGFPVGRRMCSNCSWVDKFNASRILGRSIEADFREFS-CIS 542
            L  LA    +  +  S           C+ C  ++  N+ R        D +E S  I 
Sbjct: 481 LLFDLAPKLLMELSVESDV--------KCTKCLMLETSNSHR--------DIKEKSRQIL 540

Query: 543 NCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGARSIDRSCACSKTK 602
           +C+ ++SQ TWSFLT GCPYL+ F  P DFS  +T    + +R+                
Sbjct: 541 SCVRDISQVTWSFLTRGCPYLEKFRSPVDFSLTRT----NGERE---------------- 557

Query: 603 DVCFQSEPQHSNQVRESIIGLGIHCLFYGGYLASIFYGHHSHLASQIQ 645
                   + S     +++ L  HCL Y   L  + YG  SHL S+ +
Sbjct: 601 --------ESSKDQTVNVLLLSSHCLLYADLLTDLCYGQKSHLVSRFR 557

BLAST of CcUC08G147180 vs. TAIR 10
Match: AT2G17900.1 (SET domain group 37 )

HSP 1 Score: 44.7 bits (104), Expect = 3.4e-04
Identity = 39/135 (28.89%), Positives = 49/135 (36.30%), Query Frame = 0

Query: 171 NAVDVQDSIGRTIGIAVYAPTFCWINHSCSPNACYRFETPSDSTTTRLRIAPSCTDLVAN 230
           NA  + DS  R  GI ++ P    INHSCSPNA   FE                      
Sbjct: 189 NAHSICDSELRPQGIGLF-PLVSIINHSCSPNAVLVFE---------------------- 248

Query: 231 EGSCNQMGTVRSNLSDFIREDFQGYGPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSE 290
                QM                      VVR++ +I K   +TI+Y +       RQ  
Sbjct: 249 ----EQM---------------------AVVRAMDNISKDSEITISYIETAGSTLTRQKS 275

Query: 291 LWSRYQFSCSCQRCS 306
           L  +Y F C C RCS
Sbjct: 309 LKEQYLFHCQCARCS 275

BLAST of CcUC08G147180 vs. TAIR 10
Match: AT1G26760.1 (SET domain protein 35 )

HSP 1 Score: 43.9 bits (102), Expect = 5.7e-04
Identity = 23/71 (32.39%), Positives = 38/71 (53.52%), Query Frame = 0

Query: 256 GPRVVVRSIKSIRKGEAVTIAYCDLLQPKAMRQSELWSRYQFSCSCQRCSVEPLTYVDHA 315
           G  V+V + + I+ GE ++ AY D+L P   R+ E+   + F C C RC  E + Y  + 
Sbjct: 354 GDYVIVHASRDIKTGEEISFAYFDVLSPLEKRK-EMAESWGFCCGCSRCKFESVLYATN- 413

Query: 316 LQEISAVKVEL 327
            QE+   ++ L
Sbjct: 414 -QEVREFEMGL 421

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038886411.1  0.0e+00   83.18     protein SET DOMAIN GROUP 41 [Benincasa hispida]
XP_008463080.1  1.1e-307  82.29     PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo]
XP_011656459.1  6.5e-300  80.15     protein SET DOMAIN GROUP 41 [Cucumis sativus]
XP_023520942.1  7.4e-272  76.23     protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022932824.1  1.2e-264  74.23     protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata]

Match Name      E-value   Identity  Description
Q3ECY6          2.9e-101  37.65     Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1
Q9CWR2          8.6e-05   22.14     Histone-lysine N-methyltransferase SMYD3 OS=Mus musculus OX=10090 GN=Smyd3 PE=2 ...
Q9H7B4          1.1e-04   22.14     Histone-lysine N-methyltransferase SMYD3 OS=Homo sapiens OX=9606 GN=SMYD3 PE=1 S...
Q9NRG4          1.9e-04   30.23     N-lysine methyltransferase SMYD2 OS=Homo sapiens OX=9606 GN=SMYD2 PE=1 SV=2
O94256          9.6e-04   23.84     SET domain and MYND-type zinc finger protein 6 OS=Schizosaccharomyces pombe (str...

Match Name      E-value   Identity  Description
A0A1S3CIT0      5.3e-308  82.29     protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501316 P...
A0A0A0KAK3      1.2e-296  79.45     SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV...
A0A6J1EY39      5.6e-265  74.23     protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143...
A0A6J1I954      1.1e-263  74.12     protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114726...
A0A5A7T0X4      6.6e-250  81.89     Protein SET DOMAIN GROUP 41 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN...

Match Name      E-value   Identity  Description
AT1G43245.1     2.1e-102  37.65     SET domain-containing protein
AT2G17900.1     3.4e-04   28.89     SET domain group 37
AT1G26760.1     5.7e-04   32.39     SET domain protein 35

InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                    Source       Source Term   Source Description               Alignment
None       No IPR available                                   GENE3D       2.170.270.10  SET domain                       coord: 147..277; e-value: 9.5E-18; score: 65.9
None       No IPR available                                   PANTHER      PTHR47780     PROTEIN SET DOMAIN GROUP 41      coord: 3..649
None       No IPR available                                   CDD          cd20071       SET_SMYD                         coord: 151..304; e-value: 2.39752E-18; score: 79.3439
None       No IPR available                                   SUPERFAMILY  82199         SET domain                       coord: 184..302
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10    Tetratricopeptide repeat domain  coord: 279..417; e-value: 3.8E-7; score: 32.2
IPR001214  SET domain                                         PFAM         PF00856       SET                              coord: 166..277; e-value: 4.0E-5; score: 24.1

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CcUC08G147180.1  CcUC08G147180.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0005515      protein binding