CmUC10G200320 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC10G200320
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Histone-lysine N-methyltransferase SETD1B-like protein
Location: CmU531Chr10: 32406674 .. 32410284 (+)
RNA-Seq Expression: CmUC10G200320
Synteny: CmUC10G200320
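
As a rough illustration of how the Location field above can be read programmatically, the following Python sketch splits it into chromosome, start, end, and strand, and computes the span length. The function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re

# Location value as shown in the Overview (chromosome: start .. end (strand)).
LOCATION = "CmU531Chr10: 32406674 .. 32410284 (+)"

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts.

    Hypothetical helper; assumes 1-based inclusive coordinates.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive span, so both endpoints are counted.
        "length": end - start + 1,
    }

loc = parse_location(LOCATION)
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 3611 bp on the plus strand.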
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTACTGAAAATCCATATCCTTAACCCCAAACCAACAATTCTCGATTTCATTCCCCCATTTTAGCTTTCGTCTTCTCTTTTGCAGAGCTACTGCCACTCCCATGGCTCAAAAGCACTTACACGAGCTTTTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCTGACAGACGCTCCCTTCTCAAACGCCCTTCTCTCAAATCCCATTTCCATCTTAACAATTCAAAACCCATCTCCCATTCCTCTGGTTTTCCAACTAAATTTTGCAGGAGCGCTTGTTTTTTCTCTTTCAATCACTCCCCTGATCTCGTAAACTCATCTCCGCTCTTTGGATTTCAGTCGCCGGTCAAAACCCCTTGTCGAAACCCCAATCCCATTTTCCTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCCTCGGGAAATCAAATGGATTAGGGCTTCTGGGTTCTTTTCTTAAGCGTTTGACTCTTCGTGGCCGTGCTCGGAAGCGAGAGATCGACGGCGATGGTCGGAGAAATGACCCCCGCGATGGCCCGCCATTGCCGGCGAAAATGGCGATTCAGGAGAACGAGAACGGGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATGTATGCGATAGCCCTTTTCGATTTGTGCTTCAATCGAGCCCTTCCCCCGGTCACCGGACGCCGGACCTCGCTTCGCCGGCGTCTTCTCCGGCTCGCCTAGACCATCAGGTTTTTCTTAGTTTTATTACTGTTTTTGGGTTTCATCTTTCAATGTCTTGGCGTAGAAACTGCATTTTTATGAACTGGGTGTCACCGGCAAAAACCAACTGTCGGCAGTGTTATGTAAATTCTCTGAAATTACCGCCGGAATGTTGCAGTTTCAAGCGTCAAAACACGTCAACCCCACCAAAAGGACTCTGAATTTTGACGAACAAACTGATAAATCTCTCATTTTCAATTCCACGCTTTCCTTTTTCCCGGAAAATTATGACCCTCCAAGAAAAAGACAAACCCCCTTTTGTATTTTCCTTTGTATTCTAACCCTCTTCTTAGAATTGAACATTTTTTCTTTTGGCCTTGGCTTCATTATAATAATTAGAAGAGGAAGAAAAAAAAAAAAAAAAAACACACACACACACAATAAAGTGAAGATGTGTAAATCAATAACTTTACCAAAACCAACTAAATTAAAGATCCCTTTTCACCCTCATTTTCTTTTGTTAGTTTGTGAACTTTTCTTTGCCTTGTTTTCACCCAAATTTCAAGGTCAATGATGTAGAGAGCTTGAAAAAATTGCCGGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCGGTGTTGGATCCTCCGTTTGAGGACGACGACGAAGGACATTACAAGGATGGTGAGGATGAGGATGATTACAATTTGGAACGCAGCTTTGCCATTGTACAAAGTACGTAAGCATGATTCAACCCAACTGCTAATCTCTTATAAGAAACATATATTAATTTTGTTTTTCCATATTGATTCGAGTTTGTTGTATGGTCGACATTTGCATTTAGGTACTCTCTGACTTGTTATTTGTTACTGATAGAAATGTATGTCTGTTACACTCTATTGAAATGGTTAATAGAGCTTCACAAACTCATAAGCTTGTTGCTGTGCTAAGATATGAACACTTGTATCTTTGGTGGGTTCCATTGGGGAGATTTGAATTATCAGAAGGGTAGAGGTGAAACCAAAACAAACTTTTGAATTTAGGTAAATCTGTACTAGAATAAAGGGAAAGAGGGCGAAAATGTTTCGGTTGTCAGATAAATGTTAGACGACGACATCATGAGAAATATCAGCACAAAGTTATTGTTGAACACTTGTTTTCGGCCTTGATGGCCTCTTCTGTTGGAGAAATGTTTTTGGAGGTCTGGCCCTTTTC
TTTTTATTGTTCTTCAAAACTTCAAATCTGTCCCACATTTTGCTGTAAGAGAGGTGCTATAATTGTGGGAGTCCTCTTCAATTTATTAGGCTGTTCAATCATTGAACCAATTTATTCATTGTTTTCGAATGCCGTCATCCTTGTCTGTACTTCAGAACAAGGTCCCCCCCTCCCCCGCCCCGCTCGTCCGATTTTCTTTTTTTACCGTTTGAGCATTTAATAAAGGATTTTGAAGCTTATCAGTTACTACTCTGTGAATGCATGGTAATGAGTTGTTCTAGTCACTAGGCATATGCACTTCATCAGCAAGAGTTGTCTTTGTTGCATTTTCTACTAATTATCTTGTACTTTTTAGCTAGGAAATAGCTAAACTGTCTAGTTCATCCTGGCATATGATTGCATTTGAGCTAGAGATTCCAAATTATACTCTTTGCATTGCCACTTCTTCATTCTCTTTATGAATTTACTGATCCGCTCTTTGCTGAAAATGAAACACTTAGTCAAACTTCAAAGTTTGAAGACCAAAAATATGTCTGTCTTTGTCGATCTGCGATTAAAAGGAGTTTGATTATAATCGCAACCACATAAAGTTTTCTACCTTGTTAGTACAGTTCAGTAGCTTGAGCTTGGTACAAAATTTACCATGAATTCTCGTCATAGCTGTGGCCCCACTTGTTTTTTGTGCAGTCAAAACGGCATAAATGCGAATTTTCGACCAGGATTTCAAATTTTCGTTTCACCTTCTTCCACAATTTAAACTTCCACTGTACTTGGATTCTTGGTAAAAGTGAAACATGAAGTTGAAGTAAAAAAGGCATAGGTTCATGTCAAATGTAAAGTTAATTCTGATCTGTTTGTGTTCGTGTAATATTTGTCATGCTTTTTCAGAGGCGAGGCATCAACTACTGAAAAAACTTCGAAGATTCGAGAGACTAGCAGAACTAGACCCAGTAGAACTCGAGACATTTCTACTAAAAGACGAGGACGAAGACGAACTCGATGATGACGATGAAATCAATCATCTCAAGGAAGAAGAAGACTACGAAAAAGACATCAAACAAAACAACACAGAAGCCAATGACAATTCAAGGTTCCAAATTCCTCACCGACCCGCAAGAGATATGAAGACACTCCTCTGCAATCTCATTACTGAGGAAGAGAGAGACTTGGTTGGGATTGACAAGAGAGAAGAGACGATGAAGAGAGTATACATGAGATCGGATTTGTGGAAACGGGTGGACTCGAGCACAATCGACATGATGGTGGGGCAAGATTTGAAAGCAGAGATTGATGGGTGGAACATAAATAAGGAGCAGAGAGGAGAAATAGGCATAGAGATAGAGTTTGCAATCTTCAACTTGTTGGTGGAGGAAATGCAAACTGAACTTCATTGCTTAACTCATTAACTGCAAACAATATTTTAAATTTCCAGAAAATAATCTCTAGATTTTGAATTACTTTTAGGAATGTATAATCTAACTTTAAGAGCATAGAAGTAGAAAGATTAGGATGAAAGGGACATCATTATGATTTGTAAATTCATTATCCCTCCATATATATTATTATCATCTACCATTTTAATTTTTGAAATGCTTATTTCTCCTCTAAAAA

mRNA sequence

TTACTGAAAATCCATATCCTTAACCCCAAACCAACAATTCTCGATTTCATTCCCCCATTTTAGCTTTCGTCTTCTCTTTTGCAGAGCTACTGCCACTCCCATGGCTCAAAAGCACTTACACGAGCTTTTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCTGACAGACGCTCCCTTCTCAAACGCCCTTCTCTCAAATCCCATTTCCATCTTAACAATTCAAAACCCATCTCCCATTCCTCTGGTTTTCCAACTAAATTTTGCAGGAGCGCTTGTTTTTTCTCTTTCAATCACTCCCCTGATCTCGTAAACTCATCTCCGCTCTTTGGATTTCAGTCGCCGGTCAAAACCCCTTGTCGAAACCCCAATCCCATTTTCCTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCCTCGGGAAATCAAATGGATTAGGGCTTCTGGGTTCTTTTCTTAAGCGTTTGACTCTTCGTGGCCGTGCTCGGAAGCGAGAGATCGACGGCGATGGTCGGAGAAATGACCCCCGCGATGGCCCGCCATTGCCGGCGAAAATGGCGATTCAGGAGAACGAGAACGGGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATGTATGCGATAGCCCTTTTCGATTTGTGCTTCAATCGAGCCCTTCCCCCGGTCACCGGACGCCGGACCTCGCTTCGCCGGCGTCTTCTCCGGCTCGCCTAGACCATCAGGTTTTTCTTAGTTTTATTACTGTTTTTGGGTTTCATCTTTCAATGTCTTGGCGTAGAAACTGCATTTTTATGAACTGGGTGTCACCGGCAAAAACCAACTGTCGGCAGTGTTATTTTGTGAACTTTTCTTTGCCTTGTTTTCACCCAAATTTCAAGGTCAATGATGTAGAGAGCTTGAAAAAATTGCCGGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCGGTGTTGGATCCTCCGTTTGAGGACGACGACGAAGGACATTACAAGGATGGTGAGGATGAGGATGATTACAATTTGGAACGCAGCTTTGCCATTGTACAAAAGGCGAGGCATCAACTACTGAAAAAACTTCGAAGATTCGAGAGACTAGCAGAACTAGACCCAGTAGAACTCGAGACATTTCTACTAAAAGACGAGGACGAAGACGAACTCGATGATGACGATGAAATCAATCATCTCAAGGAAGAAGAAGACTACGAAAAAGACATCAAACAAAACAACACAGAAGCCAATGACAATTCAAGGTTCCAAATTCCTCACCGACCCGCAAGAGATATGAAGACACTCCTCTGCAATCTCATTACTGAGGAAGAGAGAGACTTGGTTGGGATTGACAAGAGAGAAGAGACGATGAAGAGAGTATACATGAGATCGGATTTGTGGAAACGGGTGGACTCGAGCACAATCGACATGATGGTGGGGCAAGATTTGAAAGCAGAGATTGATGGGTGGAACATAAATAAGGAGCAGAGAGGAGAAATAGGCATAGAGATAGAGTTTGCAATCTTCAACTTGTTGGTGGAGGAAATGCAAACTGAACTTCATTGCTTAACTCATTAACTGCAAACAATATTTTAAATTTCCAGAAAATAATCTCTAGATTTTGAATTACTTTTAGGAATGTATAATCTAACTTTAAGAGCATAGAAGTAGAAAGATTAGGATGAAAGGGACATCATTATGATTTGTAAATTCATTATCCCTCCATATATATTATTATCATCTACCATTTTAATTTTTGAAATGCTTATTTCTCCTCTAAAAA

Coding sequence (CDS)

ATGGCTCAAAAGCACTTACACGAGCTTTTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCTGACAGACGCTCCCTTCTCAAACGCCCTTCTCTCAAATCCCATTTCCATCTTAACAATTCAAAACCCATCTCCCATTCCTCTGGTTTTCCAACTAAATTTTGCAGGAGCGCTTGTTTTTTCTCTTTCAATCACTCCCCTGATCTCGTAAACTCATCTCCGCTCTTTGGATTTCAGTCGCCGGTCAAAACCCCTTGTCGAAACCCCAATCCCATTTTCCTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCCTCGGGAAATCAAATGGATTAGGGCTTCTGGGTTCTTTTCTTAAGCGTTTGACTCTTCGTGGCCGTGCTCGGAAGCGAGAGATCGACGGCGATGGTCGGAGAAATGACCCCCGCGATGGCCCGCCATTGCCGGCGAAAATGGCGATTCAGGAGAACGAGAACGGGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATGTATGCGATAGCCCTTTTCGATTTGTGCTTCAATCGAGCCCTTCCCCCGGTCACCGGACGCCGGACCTCGCTTCGCCGGCGTCTTCTCCGGCTCGCCTAGACCATCAGGTTTTTCTTAGTTTTATTACTGTTTTTGGGTTTCATCTTTCAATGTCTTGGCGTAGAAACTGCATTTTTATGAACTGGGTGTCACCGGCAAAAACCAACTGTCGGCAGTGTTATTTTGTGAACTTTTCTTTGCCTTGTTTTCACCCAAATTTCAAGGTCAATGATGTAGAGAGCTTGAAAAAATTGCCGGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCGGTGTTGGATCCTCCGTTTGAGGACGACGACGAAGGACATTACAAGGATGGTGAGGATGAGGATGATTACAATTTGGAACGCAGCTTTGCCATTGTACAAAAGGCGAGGCATCAACTACTGAAAAAACTTCGAAGATTCGAGAGACTAGCAGAACTAGACCCAGTAGAACTCGAGACATTTCTACTAAAAGACGAGGACGAAGACGAACTCGATGATGACGATGAAATCAATCATCTCAAGGAAGAAGAAGACTACGAAAAAGACATCAAACAAAACAACACAGAAGCCAATGACAATTCAAGGTTCCAAATTCCTCACCGACCCGCAAGAGATATGAAGACACTCCTCTGCAATCTCATTACTGAGGAAGAGAGAGACTTGGTTGGGATTGACAAGAGAGAAGAGACGATGAAGAGAGTATACATGAGATCGGATTTGTGGAAACGGGTGGACTCGAGCACAATCGACATGATGGTGGGGCAAGATTTGAAAGCAGAGATTGATGGGTGGAACATAAATAAGGAGCAGAGAGGAGAAATAGGCATAGAGATAGAGTTTGCAATCTTCAACTTGTTGGTGGAGGAAATGCAAACTGAACTTCATTGCTTAACTCATTAA

Protein sequence

MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENENGNDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKDEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNSRFQIPHRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH
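
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. The sketch below is a minimal Python example that assumes the standard nuclear genetic code; `translate`, `CDS_START`, and `CDS_END` are illustrative names, and the two fragments are the first 30 and last 18 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Fragments copied from the CDS above (assumed to be in frame).
CDS_START = "ATGGCTCAAAAGCACTTACACGAGCTTTTG"
CDS_END = "CATTGCTTAACTCATTAA"

print(translate(CDS_START))  # first ten residues
print(translate(CDS_END))    # last five residues before the stop codon
```

The two fragments translate to MAQKHLHELL and HCLTH, matching the start and end of the protein sequence; the final TAA codon is the stop and is not represented in the protein.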
Homology
BLAST of CmUC10G200320 vs. NCBI nr
Match: XP_038903007.1 (uncharacterized protein LOC120089713 [Benincasa hispida])

HSP 1 Score: 785.8 bits (2028), Expect = 2.3e-223
Identity = 410/514 (79.77%), Positives = 439/514 (85.41%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCRS 60
           MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNN KPISHSS FP KFCRS
Sbjct: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNPKPISHSSDFPAKFCRS 60

Query: 61  ACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTA 120
           ACFFSFNHSPDL+NSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 
Sbjct: 61  ACFFSFNHSPDLINSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTV 120

Query: 121 ARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENENG 180
           ARSKSLGKSNGLG+LGSFLKRLT RGRARKREIDGDGR+NDPRDGPPLPAKMAI+ENEN 
Sbjct: 121 ARSKSLGKSNGLGVLGSFLKRLTHRGRARKREIDGDGRKNDPRDGPPLPAKMAIEENENE 180

Query: 181 NDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLSF 240
           NDSV RLSNVTGFDFC+SN+CDSPFRFVLQSSPSPGH+TP+LASPASSPARLDHQ     
Sbjct: 181 NDSVSRLSNVTGFDFCDSNLCDSPFRFVLQSSPSPGHQTPELASPASSPARLDHQ----- 240

Query: 241 ITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPVE 300
                                                           NDVE LKKLPVE
Sbjct: 241 -----------------------------------------------ANDVEGLKKLPVE 300

Query: 301 DEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRFE 360
           DEEEEKEQSSPVSVLDPPFEDDDEGHY+DGEDEDDYNLERSFAIVQ+A+HQLLKKLRRFE
Sbjct: 301 DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQQAKHQLLKKLRRFE 360

Query: 361 RLAELDPVELETFLLKDEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNSRFQIPHR 420
           RLAELDPVELETFLLKDEDEDE +DDD+I+HLKEEEDY+KDIK+++ EAND+SRFQIPHR
Sbjct: 361 RLAELDPVELETFLLKDEDEDEDEDDDDIDHLKEEEDYKKDIKEHDIEANDSSRFQIPHR 420

Query: 421 PARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEI 480
           PARDM TL+CNL+TEEERDLV I+KREE MK +Y+RSDLWKRVDS+ I++MVGQDLK E+
Sbjct: 421 PARDMTTLVCNLVTEEERDLVVIEKREEMMKGMYVRSDLWKRVDSNAINVMVGQDLKEEV 462

Query: 481 DGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELH 515
           DGW  NKEQR EI IEIE AIF+LLVEEMQ ELH
Sbjct: 481 DGWKRNKEQRREIAIEIEVAIFSLLVEEMQPELH 462
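
The summary lines that precede each alignment (score, E-value, identity, and positives) can be extracted and cross-checked with a few regular expressions. The following Python sketch is only an illustration for the text layout shown on this page; `parse_hsp_header` is a hypothetical helper, not part of any BLAST toolkit, and the sample lines are copied from the first hit above.

```python
import re

# Header lines of the first HSP (Benincasa hispida hit).
HSP_HEADER = [
    "HSP 1 Score: 785.8 bits (2028), Expect = 2.3e-223",
    "Identity = 410/514 (79.77%), Positives = 439/514 (85.41%), Query Frame = 0",
]

def parse_hsp_header(lines):
    """Extract scores and recompute percentages from HSP summary lines."""
    text = " ".join(lines)
    stats = {}
    score = re.search(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\)", text)
    if score:
        stats["bit_score"] = float(score.group(1))
        stats["raw_score"] = int(score.group(2))
    expect = re.search(r"Expect\s*=\s*([\d.]+(?:e[+-]?\d+)?)", text, re.IGNORECASE)
    if expect:
        stats["expect"] = float(expect.group(1))
    # The pattern also accepts the spelling "Postives".
    for label, matched, total in re.findall(
        r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)", text
    ):
        key = "identity" if label == "Identity" else "positives"
        stats[key + "_pct"] = round(100 * int(matched) / int(total), 2)
    return stats

stats = parse_hsp_header(HSP_HEADER)
print(stats)
```

Recomputing the ratios gives 79.77% identity and 85.41% positives, in agreement with the percentages printed in the header.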

BLAST of CmUC10G200320 vs. NCBI nr
Match: XP_011651995.1 (uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical protein Csa_002656 [Cucumis sativus])

HSP 1 Score: 750.4 bits (1936), Expect = 1.1e-212
Identity = 397/524 (75.76%), Positives = 429/524 (81.87%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCR 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL N KPI HSS F  KFCR
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60

Query: 61  SACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
           S CFFSFNHSPDL NSSP FGFQSPVKTPCRNPNP+F HVPARTAGLLLEAALRIQKQST
Sbjct: 61  STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120

Query: 121 AARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENEN 180
           AARSKS GKSNGLGLLGSFLKRLT R RARKREI GDGR NDPRDGPPLPAKMAI+ENE 
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENET 180

Query: 181 GNDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLS 240
            NDSVFRLSNVTGFDFCESN+CDSPFRFVLQSSPSPGHRTP+L+SPASSPARLDHQ    
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQ---- 240

Query: 241 FITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPV 300
                                                            NDVESL+KLP 
Sbjct: 241 ------------------------------------------------ANDVESLQKLPA 300

Query: 301 EDEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRF 360
           EDEEEEKEQSSPVSVLDPPFEDDDEGH++DGEDEDDYNLERSFAIVQKA+HQLLKKLRRF
Sbjct: 301 EDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRF 360

Query: 361 ERLAELDPVELETFLLKDEDEDELD----DDDEINHLKEE-EDYEKDIKQNNTEANDNSR 420
           ERLAELDP+ELETFLL DED+DE +    D D+I+HLKEE E YEKDIKQ+N E ND+SR
Sbjct: 361 ERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVEQYEKDIKQHNKEGNDSSR 420

Query: 421 FQIPHRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQ 480
           FQIP+RP+RD KTL+CNLIT+EER+LV I+K EETMKRVYMR DLWKRVDS+ ID+MVG+
Sbjct: 421 FQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMRQDLWKRVDSNAIDLMVGK 472

Query: 481 DLKAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           DLK E+DGWNINKE RGEI +EIE AIF+LLVEEMQ+ELHCLTH
Sbjct: 481 DLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCLTH 472

BLAST of CmUC10G200320 vs. NCBI nr
Match: KAA0043909.1 (histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa] >TYK25228.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 734.6 bits (1895), Expect = 6.0e-208
Identity = 393/522 (75.29%), Positives = 426/522 (81.61%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCR 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL N KPISHS  F  KFCR
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60

Query: 61  SACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
           S CFFSFNHSPDL NSSPLFGFQSPVKTPCR+PNP+F HVPARTAGLLLEAALRIQKQST
Sbjct: 61  STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120

Query: 121 AARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENEN 180
           AARSKS GKSNGLGLLGSFLKRLT R R+RKREI GDGR NDPRDGPPLPAKMAI+ENE 
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180

Query: 181 GNDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLS 240
            NDSVFRLSNVTGFDFCESN+CDSPFRFVLQSS SPGHRTP+L+SP SSPARLDHQ    
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQ---- 240

Query: 241 FITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPV 300
                                                            NDVESL+KLP 
Sbjct: 241 ------------------------------------------------ANDVESLQKLPA 300

Query: 301 EDEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRF 360
           EDEEEEKEQSSPVSVLDPPFEDDDEG+++DGEDEDDYNLERSFAIVQKA+HQLLKKLRRF
Sbjct: 301 EDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRF 360

Query: 361 ERLAELDPVELETFLLKDE--DEDELDDDDEINHLKEE-EDYEKDIKQNNTEANDNSRFQ 420
           ERLAELDP+ELETFLL DE  DEDEL D D+I+HLKEE E+YEKDIKQ+N E ND+SRFQ
Sbjct: 361 ERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQ 420

Query: 421 IPHRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDL 480
             +RP+RD K L+CNLITEEER++V I+KREETMKRVYMR DLWKRVDS+ ID+MVG+DL
Sbjct: 421 --NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDVMVGKDL 468

Query: 481 KAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           K E+DGWN NKE RGEIGIEIE AIF+LLVEEMQ+ELHCL H
Sbjct: 481 KEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 468

BLAST of CmUC10G200320 vs. NCBI nr
Match: XP_022144766.1 (uncharacterized protein LOC111014376 [Momordica charantia])

HSP 1 Score: 619.8 bits (1597), Expect = 2.2e-173
Identity = 351/522 (67.24%), Positives = 387/522 (74.14%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCRS 60
           M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPS KS+ HL   KPIS +  FP KFC+S
Sbjct: 2   MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCKS 61

Query: 61  ACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTA 120
           ACFFSF+ SPDL   SPLF FQSPV    RNPN IFLHVPARTAG+LLEAALRIQKQSTA
Sbjct: 62  ACFFSFHESPDL-RKSPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQSTA 121

Query: 121 ARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENE-- 180
           ARSK  GK+NGLGLLGSFLKRLT RGRARKREIDGDGRRND   G PLPAKMAI+ENE  
Sbjct: 122 ARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENEDE 181

Query: 181 --NGNDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQV 240
             N N SV   +N+T F FCESN CDSPFRFVLQSSPS GHRTP+ +SPA+SP R DHQ 
Sbjct: 182 NVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ- 241

Query: 241 FLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKK 300
                                                               NDVESLKK
Sbjct: 242 ---------------------------------------------------DNDVESLKK 301

Query: 301 LPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKL 360
           LPVEDEEEEKEQSSPVS+LDPPFEDDDEGHY+DGEDED Y+LERS+ IVQKA+HQLLKKL
Sbjct: 302 LPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQKAKHQLLKKL 361

Query: 361 RRFERLAELDPVELETFLLKDEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNSRFQ 420
           RRFE+LAELDPVELE+FLLK E EDELDDDD+I+HLKEEE    + +Q++ EAN +S FQ
Sbjct: 362 RRFEKLAELDPVELESFLLKGE-EDELDDDDDIDHLKEEEYESHNFEQHDVEANGSSSFQ 421

Query: 421 IPHRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDL 480
           IPHR       L+ N IT E+RD    D REE  K VY+RSDLWKRVDS+ ID  VGQDL
Sbjct: 422 IPHR-------LVRNRITGEQRDQAVTDNREEMTKGVYVRSDLWKRVDSNAIDATVGQDL 458

Query: 481 KAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           K E+DGWN N++QRGE+ IEIE AIF+LLV EMQTEL CLTH
Sbjct: 482 KTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458

BLAST of CmUC10G200320 vs. NCBI nr
Match: XP_023526007.1 (uncharacterized protein LOC111789613 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 568.2 bits (1463), Expect = 7.5e-158
Identity = 333/518 (64.29%), Positives = 367/518 (70.85%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCRS 60
           MAQKHLHELLKEDQ PFLL NFIADRRSLLKRPS KS F LN  KPIS SS F   FCRS
Sbjct: 1   MAQKHLHELLKEDQHPFLLANFIADRRSLLKRPSPKSLFQLNRPKPISDSSDFHRNFCRS 60

Query: 61  ACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTA 120
           ACFFSF HSPDL+ SSPLF FQSPVKTPCRN N IFLHVPA TAGLLLEAALRIQKQSTA
Sbjct: 61  ACFFSFTHSPDLITSSPLFEFQSPVKTPCRNHNGIFLHVPATTAGLLLEAALRIQKQSTA 120

Query: 121 ARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENENG 180
           A+S+SLGKSNGLG LGSFLKRLT RGR RKREI  DGR+N  R  PPLPA     ENEN 
Sbjct: 121 AKSRSLGKSNGLGFLGSFLKRLTHRGRIRKREICSDGRKNGDRGSPPLPA----NENENE 180

Query: 181 NDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLSF 240
           NDSV R          +SN+C SPFRFVLQSSPSPGHRTP+ +SP SSPAR +HQ     
Sbjct: 181 NDSVSR----------QSNLCHSPFRFVLQSSPSPGHRTPEFSSPTSSPARRNHQ----- 240

Query: 241 ITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPVE 300
                                                          V D ESLKK  VE
Sbjct: 241 -----------------------------------------------VKDAESLKKFAVE 300

Query: 301 DEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRFE 360
           DEEEEKEQSSPVSVLDPPFE+ DEGHY     EDDYNL+RS+AIVQKA+HQLLKKLRRFE
Sbjct: 301 DEEEEKEQSSPVSVLDPPFEEYDEGHY-----EDDYNLDRSYAIVQKAKHQLLKKLRRFE 360

Query: 361 RLAELDPVELETFLLKDEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNSRFQIPHR 420
           RLAELD VELETFLLKDEDEDEL+DD  I HL ++E +  DI ++N   N +SRFQIP  
Sbjct: 361 RLAELDVVELETFLLKDEDEDELNDDANIAHLDDDESH--DIIEHN---NGSSRFQIPR- 420

Query: 421 PARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEI 480
                K L+ NL+T++ERD+V I+      KRV +RS LWK VD++ ID++  QDLK E+
Sbjct: 421 -----KRLIYNLVTKDERDVVVIE------KRVLVRSKLWKGVDTNAIDVITRQDLKGEV 430

Query: 481 DGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           DGW+ N EQRGEI IEIE AIF+LLVEEMQTELHCL H
Sbjct: 481 DGWSRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLAH 430

BLAST of CmUC10G200320 vs. ExPASy TrEMBL
Match: A0A0A0LAR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1)

HSP 1 Score: 750.4 bits (1936), Expect = 5.1e-213
Identity = 397/524 (75.76%), Positives = 429/524 (81.87%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCR 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL N KPI HSS F  KFCR
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60

Query: 61  SACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
           S CFFSFNHSPDL NSSP FGFQSPVKTPCRNPNP+F HVPARTAGLLLEAALRIQKQST
Sbjct: 61  STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120

Query: 121 AARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENEN 180
           AARSKS GKSNGLGLLGSFLKRLT R RARKREI GDGR NDPRDGPPLPAKMAI+ENE 
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENET 180

Query: 181 GNDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLS 240
            NDSVFRLSNVTGFDFCESN+CDSPFRFVLQSSPSPGHRTP+L+SPASSPARLDHQ    
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQ---- 240

Query: 241 FITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPV 300
                                                            NDVESL+KLP 
Sbjct: 241 ------------------------------------------------ANDVESLQKLPA 300

Query: 301 EDEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRF 360
           EDEEEEKEQSSPVSVLDPPFEDDDEGH++DGEDEDDYNLERSFAIVQKA+HQLLKKLRRF
Sbjct: 301 EDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRF 360

Query: 361 ERLAELDPVELETFLLKDEDEDELD----DDDEINHLKEE-EDYEKDIKQNNTEANDNSR 420
           ERLAELDP+ELETFLL DED+DE +    D D+I+HLKEE E YEKDIKQ+N E ND+SR
Sbjct: 361 ERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVEQYEKDIKQHNKEGNDSSR 420

Query: 421 FQIPHRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQ 480
           FQIP+RP+RD KTL+CNLIT+EER+LV I+K EETMKRVYMR DLWKRVDS+ ID+MVG+
Sbjct: 421 FQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMRQDLWKRVDSNAIDLMVGK 472

Query: 481 DLKAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           DLK E+DGWNINKE RGEI +EIE AIF+LLVEEMQ+ELHCLTH
Sbjct: 481 DLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCLTH 472

BLAST of CmUC10G200320 vs. ExPASy TrEMBL
Match: A0A5D3DNQ5 (Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003580 PE=4 SV=1)

HSP 1 Score: 734.6 bits (1895), Expect = 2.9e-208
Identity = 393/522 (75.29%), Positives = 426/522 (81.61%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCR 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL N KPISHS  F  KFCR
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60

Query: 61  SACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
           S CFFSFNHSPDL NSSPLFGFQSPVKTPCR+PNP+F HVPARTAGLLLEAALRIQKQST
Sbjct: 61  STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120

Query: 121 AARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENEN 180
           AARSKS GKSNGLGLLGSFLKRLT R R+RKREI GDGR NDPRDGPPLPAKMAI+ENE 
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180

Query: 181 GNDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLS 240
            NDSVFRLSNVTGFDFCESN+CDSPFRFVLQSS SPGHRTP+L+SP SSPARLDHQ    
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQ---- 240

Query: 241 FITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPV 300
                                                            NDVESL+KLP 
Sbjct: 241 ------------------------------------------------ANDVESLQKLPA 300

Query: 301 EDEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRF 360
           EDEEEEKEQSSPVSVLDPPFEDDDEG+++DGEDEDDYNLERSFAIVQKA+HQLLKKLRRF
Sbjct: 301 EDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRF 360

Query: 361 ERLAELDPVELETFLLKDE--DEDELDDDDEINHLKEE-EDYEKDIKQNNTEANDNSRFQ 420
           ERLAELDP+ELETFLL DE  DEDEL D D+I+HLKEE E+YEKDIKQ+N E ND+SRFQ
Sbjct: 361 ERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQ 420

Query: 421 IPHRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDL 480
             +RP+RD K L+CNLITEEER++V I+KREETMKRVYMR DLWKRVDS+ ID+MVG+DL
Sbjct: 421 --NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDVMVGKDL 468

Query: 481 KAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           K E+DGWN NKE RGEIGIEIE AIF+LLVEEMQ+ELHCL H
Sbjct: 481 KEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 468

BLAST of CmUC10G200320 vs. ExPASy TrEMBL
Match: A0A6J1CUE0 (uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014376 PE=4 SV=1)

HSP 1 Score: 619.8 bits (1597), Expect = 1.0e-173
Identity = 351/522 (67.24%), Positives = 387/522 (74.14%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCRS 60
           M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPS KS+ HL   KPIS +  FP KFC+S
Sbjct: 2   MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCKS 61

Query: 61  ACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTA 120
           ACFFSF+ SPDL   SPLF FQSPV    RNPN IFLHVPARTAG+LLEAALRIQKQSTA
Sbjct: 62  ACFFSFHESPDL-RKSPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQSTA 121

Query: 121 ARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENE-- 180
           ARSK  GK+NGLGLLGSFLKRLT RGRARKREIDGDGRRND   G PLPAKMAI+ENE  
Sbjct: 122 ARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENEDE 181

Query: 181 --NGNDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQV 240
             N N SV   +N+T F FCESN CDSPFRFVLQSSPS GHRTP+ +SPA+SP R DHQ 
Sbjct: 182 NVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ- 241

Query: 241 FLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKK 300
                                                               NDVESLKK
Sbjct: 242 ---------------------------------------------------DNDVESLKK 301

Query: 301 LPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKL 360
           LPVEDEEEEKEQSSPVS+LDPPFEDDDEGHY+DGEDED Y+LERS+ IVQKA+HQLLKKL
Sbjct: 302 LPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQKAKHQLLKKL 361

Query: 361 RRFERLAELDPVELETFLLKDEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNSRFQ 420
           RRFE+LAELDPVELE+FLLK E EDELDDDD+I+HLKEEE    + +Q++ EAN +S FQ
Sbjct: 362 RRFEKLAELDPVELESFLLKGE-EDELDDDDDIDHLKEEEYESHNFEQHDVEANGSSSFQ 421

Query: 421 IPHRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDL 480
           IPHR       L+ N IT E+RD    D REE  K VY+RSDLWKRVDS+ ID  VGQDL
Sbjct: 422 IPHR-------LVRNRITGEQRDQAVTDNREEMTKGVYVRSDLWKRVDSNAIDATVGQDL 458

Query: 481 KAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           K E+DGWN N++QRGE+ IEIE AIF+LLV EMQTEL CLTH
Sbjct: 482 KTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458

BLAST of CmUC10G200320 vs. ExPASy TrEMBL
Match: A0A6J1FAX4 (uncharacterized protein LOC111442411 OS=Cucurbita moschata OX=3662 GN=LOC111442411 PE=4 SV=1)

HSP 1 Score: 555.4 bits (1430), Expect = 2.4e-154
Identity = 332/520 (63.85%), Positives = 366/520 (70.38%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCRS 60
           MAQKHLHELLKEDQ PFLL NFIADRRSLLKRPS KS F LN SKPIS SS     FCRS
Sbjct: 1   MAQKHLHELLKEDQHPFLLANFIADRRSLLKRPSPKSLFQLNRSKPISDSS----DFCRS 60

Query: 61  ACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTA 120
           ACFFSF HSPDL  SSPLF F SPVKTPCRN N IFLHVPA TAGLLLEAALRIQKQSTA
Sbjct: 61  ACFFSFTHSPDLTTSSPLFEFHSPVKTPCRNHNGIFLHVPATTAGLLLEAALRIQKQSTA 120

Query: 121 ARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENENG 180
           A+SKSLGKSN LG LGSFLKRLT RGR RKREI  DGR+N  R  PPLP       NEN 
Sbjct: 121 AKSKSLGKSNALGFLGSFLKRLTHRGRIRKREICSDGRKNGYRGSPPLPT------NENE 180

Query: 181 NDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLSF 240
           NDSV R          +SN+C+SPFRFVLQSSPSPGHRTP+ +SP SSPAR +HQ     
Sbjct: 181 NDSVSR----------QSNLCNSPFRFVLQSSPSPGHRTPEFSSPTSSPARRNHQ----- 240

Query: 241 ITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPVE 300
                                                          V D ESLKKL VE
Sbjct: 241 -----------------------------------------------VKDAESLKKLAVE 300

Query: 301 DEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRFE 360
           DEEEEKEQSSPVSVLDPPFE+ DEGHY     EDDYNL+RS+AIVQKA+HQLLKKLRRFE
Sbjct: 301 DEEEEKEQSSPVSVLDPPFEEYDEGHY-----EDDYNLDRSYAIVQKAKHQLLKKLRRFE 360

Query: 361 RLAELDPVELETFLLK--DEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNSRFQIP 420
           RLAELD VELETFLLK  DEDEDELDDD +I HL ++E +  DI ++N   N +SRFQIP
Sbjct: 361 RLAELDVVELETFLLKDEDEDEDELDDDADIAHLDDDESH--DIIEHN---NGSSRFQIP 420

Query: 421 HRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKA 480
                  K L+ NL+T+EERD+V I+      KRV +RS+LWK VD++ IDM+  QDLK 
Sbjct: 421 ------PKRLIYNLVTKEERDVVVIE------KRVLVRSELWKGVDTNAIDMITRQDLKG 426

Query: 481 EIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           E+DGW+ N EQRGEI I++E AIF+LLVEEMQTELHCL H
Sbjct: 481 EVDGWSRNGEQRGEIAIDVELAIFSLLVEEMQTELHCLAH 426

BLAST of CmUC10G200320 vs. ExPASy TrEMBL
Match: A0A6J1J5Y5 (uncharacterized protein LOC111481647 OS=Cucurbita maxima OX=3661 GN=LOC111481647 PE=4 SV=1)

HSP 1 Score: 539.7 bits (1389), Expect = 1.4e-149
Identity = 323/520 (62.12%), Positives = 362/520 (69.62%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCRS 60
           MAQKHLHELLKEDQ PFLL NFIADRRSLLK P+ KS F LN SKPIS SS F   FCRS
Sbjct: 1   MAQKHLHELLKEDQHPFLLANFIADRRSLLKLPTPKSLFQLNRSKPISDSSDFRRNFCRS 60

Query: 61  ACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTA 120
           ACFFSF HSPDL+ SSPLF F SPVKTPC N N  FLHVPA TAGLLLEAALRIQKQSTA
Sbjct: 61  ACFFSFTHSPDLITSSPLFEFHSPVKTPCPNHNGTFLHVPATTAGLLLEAALRIQKQSTA 120

Query: 121 ARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPLPAKMAIQENENG 180
           A SKSLGKSNGLG LGSFLKRLT RGR RKREI  DGR+N  R  PPLPA        N 
Sbjct: 121 ANSKSLGKSNGLGFLGSFLKRLTHRGRIRKREICSDGRKNGYRGSPPLPA--------NE 180

Query: 181 NDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSPSPGHRTPDLASPASSPARLDHQVFLSF 240
           NDSV R          +SN+C+SPFRFVLQSSPS GHRTP+ +SP SSPAR +HQ     
Sbjct: 181 NDSVSR----------QSNLCNSPFRFVLQSSPSSGHRTPEFSSPTSSPARRNHQ----- 240

Query: 241 ITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKLPVE 300
                                                          V D ESLKKL VE
Sbjct: 241 -----------------------------------------------VKDAESLKKLAVE 300

Query: 301 DEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLKKLRRFE 360
           DEEEEKEQSSPVSVLDPPFE+ +EGHY     EDDYNL+RS+AIVQKA+HQLLKKLRRFE
Sbjct: 301 DEEEEKEQSSPVSVLDPPFEEYEEGHY-----EDDYNLDRSYAIVQKAKHQLLKKLRRFE 360

Query: 361 RLAELDPVELETFLLK--DEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNSRFQIP 420
           RLAELD VELETFLLK  DEDEDEL+DD +I HL ++E +  DI ++    N +SRFQIP
Sbjct: 361 RLAELDVVELETFLLKDEDEDEDELNDDADIAHLDDDESH--DIMEHK---NGSSRFQIP 420

Query: 421 HRPARDMKTLLCNLITEEERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKA 480
                  K L+ NL+T++ERD+V I+      KRV +RS+LWK VD++ ID+++ QDLK 
Sbjct: 421 ------PKRLISNLVTKDERDVVVIE------KRVLVRSELWKGVDTNAIDVIMKQDLKG 428

Query: 481 EIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH 519
           E+DGW+ N EQRGEI I+IE AIF+LLVEEMQTELH L H
Sbjct: 481 EVDGWSRNGEQRGEIAIDIELAIFSLLVEEMQTELHFLAH 428

BLAST of CmUC10G200320 vs. TAIR 10
Match: AT5G03670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins in 104 species: Archae - 0; Bacteria - 18; Metazoa - 333; Fungi - 60; Plants - 73; Viruses - 24; Other Eukaryotes - 192 (source: NCBI BLink). )

HSP 1 Score: 234.2 bits (596), Expect = 2.4e-61
Identity = 204/582 (35.05%), Positives = 289/582 (49.66%), Query Frame = 0

Query: 2   AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKFCRSA 61
           +Q+HL +LL+EDQEPF L ++I+DRR  +   +  +H  +   +PIS ++G P++FCR+A
Sbjct: 3   SQRHLKDLLEEDQEPFQLQSYISDRRCQIN--AHVTHLQVKKRRPISQNAGLPSRFCRNA 62

Query: 62  CFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST-A 121
           CFFS   SPD    SPLF     +K+P R+ N IF+++PARTA +LLEAA+RIQKQS+  
Sbjct: 63  CFFSLRESPD-PKKSPLF----ELKSPNRSQNAIFVNIPARTASILLEAAVRIQKQSSEV 122

Query: 122 ARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDG------------------------- 181
           +++++    N  G+ GS LK+LT R   +KREI G                         
Sbjct: 123 SKTRTRNAGNAFGIFGSVLKKLTNR---KKREISGGKEAGRVSSSSVKDMLRWESPVVRK 182

Query: 182 ----DGRRNDPRDGPPLPAKMAIQ-----------------------------------E 241
                 +RN+  +      K+A +                                    
Sbjct: 183 IVTRKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFETSISTSS 242

Query: 242 NENGNDSVFRLSNVTGFDFCE-SNVCDSPFRFVLQSSPS-PGHRTPDLASPASSPARLDH 301
             NG+D    + N  G D  E    C+SPF FVLQ+ PS  G RTP+ +SPA+SP     
Sbjct: 243 RSNGSDEFAMMMN--GQDLSEDKRFCESPFHFVLQTMPSNGGFRTPNFSSPAASP----- 302

Query: 302 QVFLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESL 361
                             R +C  M          ++ Y                +VE L
Sbjct: 303 ------------------RHDCHEME---------KESY----------------EVEKL 362

Query: 362 KKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLLK 421
           KKL +E+EEEEKEQSSPVSVLDPPF+DDDE  +      DD N+  SF  VQKA+H LL+
Sbjct: 363 KKLEMEEEEEEKEQSSPVSVLDPPFQDDDEDIHM-----DDNNIPSSFRSVQKAKHLLLQ 422

Query: 422 KLRRFERLAELDPVELETFLLKDEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNSR 481
           KL RFE+LA LDP+ELE  +   E E+E ++++E   +K     E  I Q   +      
Sbjct: 423 KLCRFEQLAGLDPMELEKRMSDQETEEEEEEEEE--EMKSLYHCE-IITQRVLKTYFEEM 482

Query: 482 FQIPHRPARDMKTLLCNLITEE-ERDLVGIDKREETMKRVYMRSDLWKRVDSSTIDMMVG 514
            ++P      ++ L+ +L  EE   D+ G  +     KRV  R   W+ V+S+TIDMMV 
Sbjct: 483 VEVP----EGVEALISDLAAEELPSDIDGEAEAAIVAKRVCERLRSWRDVESNTIDMMVE 512

BLAST of CmUC10G200320 vs. TAIR 10
Match: AT2G36420.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606 proteins in 440 species: Archaea - 8; Bacteria - 365; Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses - 212; Other Eukaryotes - 4176 (source: NCBI BLink).)

HSP 1 Score: 181.4 bits (459), Expect = 1.8e-45
Identity = 167/524 (31.87%), Positives = 254/524 (48.47%), Query Frame = 0

Query: 3   QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPTKF-CRSA 62
           +KHLHE L++DQEPF L ++I + RS +      S   +   K  + ++  P  F C ++
Sbjct: 7   KKHLHEFLEDDQEPFHLNHYIGNLRSQMG----CSDMRVKKRKSDNVATFPPGLFSCENS 66

Query: 63  CFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST-- 122
           CFF+ + SPD    SPLF  +SP K   R+   +FL +PARTA +LL+AA RIQKQ +  
Sbjct: 67  CFFAAHKSPD-PRKSPLFELRSPGKKKIRD-GRVFLQIPARTAAILLDAAARIQKQQSEK 126

Query: 123 AARSKSLGKSNGLGLLGSFLKRLTLR-GRARKREIDGDGRRNDPRDGPPLPAKMAIQENE 182
           A  +K+  + NG G+ GS LK LT R  + R    DG+               ++++   
Sbjct: 127 AKTNKARTRGNGFGMFGSVLKLLTYRITKPRLDNADGNA--------------VSLERGS 186

Query: 183 NGNDSVFRLSNVTGFDFCESNVCDSPFRFVLQSSP-SPGHRTPDLASPASSPARLDHQVF 242
               S  R   V   D C    C+SPF FVLQ++P S GH+TP   S A+SPAR   +  
Sbjct: 187 EPTSSSRRERIVEISDKC---FCESPFHFVLQTTPSSSGHQTPHFTSTATSPARRSTE-- 246

Query: 243 LSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNFSLPCFHPNFKVNDVESLKKL 302
                                                          +   ++ ESL+K+
Sbjct: 247 -----------------------------------------------DEDSDETESLEKV 306

Query: 303 PVED----EEEEKEQSSPVSVLDPPFEDDDEGHYKDGEDEDDYNLERSFAIVQKARHQLL 362
             ++    EEE+KEQ SPVSVLDP  E++++  +   E +   NL  SF IVQ+A+ +LL
Sbjct: 307 RGQEEEDKEEEDKEQCSPVSVLDPLEEEEEDEDHHQHEPDPPNNLSCSFEIVQRAKRRLL 366

Query: 363 KKLRRFERLAELDPVELETFLLKDEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDNS 422
           KKLRRFE+LA LDPVELE  + ++EDE             EEE+YE+      +E +DN 
Sbjct: 367 KKLRRFEKLAGLDPVELEGKMSEEEDE-------------EEEEYEE------SEEDDNI 426

Query: 423 RFQIPHRPARDMKTLLC--NLITEEERDLVGIDKREETMKRVYMRSDLWK--RVDSSTID 482
           R         D+   +   +   E+E+      K+ +  ++ +   + W+        +D
Sbjct: 427 RIYDSDEEYEDVDEAMARESRCAEDEK-----RKKNDERQKKWRMMNAWRVGLGAEEDVD 434

Query: 483 MMVGQDLKAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTEL 514
            +V +DL+ E   W  +  +  E   ++E +IF +L++E   EL
Sbjct: 487 AVVRKDLREEAGEWTRHGGEVEEAVSDLEHSIFFVLIDEFSREL 434
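The identity and positives percentages in the HSP headers above are the matched-position counts divided by the alignment length. The following is a minimal sketch, not part of the database record, that parses such a header line and recomputes both percentages. The function names `parse_hsp_stats` and `pct` are hypothetical, and the regular expression is an assumption about the header layout shown on this page; it accepts both the "Positives" and "Postives" spellings.

```python
import re

# Assumed layout of an HSP header line as displayed on this page.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def pct(matched, length):
    """Return matched/length as a percentage rounded to two decimals."""
    return round(100.0 * matched / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        "positives_length": int(pos_len),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 204/582 (35.05%), Positives = 289/582 (49.66%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    # Recompute the percentages from the raw counts and compare.
    print(pct(stats["identical"], stats["length"]), stats["identity_pct_reported"])
    print(pct(stats["positives"], stats["length"]), stats["positives_pct_reported"])
```

Recomputing from the counts is a cheap consistency check when these headers are scraped from a page rather than read from the original BLAST output.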

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038903007.1 | 2.3e-223 | 79.77 | uncharacterized protein LOC120089713 [Benincasa hispida]
XP_011651995.1 | 1.1e-212 | 75.76 | uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical ...
KAA0043909.1 | 6.0e-208 | 75.29 | histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. mak...
XP_022144766.1 | 2.2e-173 | 67.24 | uncharacterized protein LOC111014376 [Momordica charantia]
XP_023526007.1 | 7.5e-158 | 64.29 | uncharacterized protein LOC111789613 [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity (%) | Description
A0A0A0LAR8 | 5.1e-213 | 75.76 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1
A0A5D3DNQ5 | 2.9e-208 | 75.29 | Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. m...
A0A6J1CUE0 | 1.0e-173 | 67.24 | uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A6J1FAX4 | 2.4e-154 | 63.85 | uncharacterized protein LOC111442411 OS=Cucurbita moschata OX=3662 GN=LOC1114424...
A0A6J1J5Y5 | 1.4e-149 | 62.12 | uncharacterized protein LOC111481647 OS=Cucurbita maxima OX=3661 GN=LOC111481647...
Match Name | E-value | Identity (%) | Description
AT5G03670.1 | 2.4e-61 | 35.05 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G36420.1 | 1.8e-45 | 31.87 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 298..333
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 149..171
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 149..165
None | No IPR available | PANTHER | PTHR33623 | OS04G0572500 PROTEIN | coord: 286..516
None | No IPR available | PANTHER | PTHR33623:SF5 | HISTONE-LYSINE N-METHYLTRANSFERASE SETD1B-LIKE PROTEIN | coord: 180..235
None | No IPR available | PANTHER | PTHR33623 | OS04G0572500 PROTEIN | coord: 1..161
None | No IPR available | PANTHER | PTHR33623 | OS04G0572500 PROTEIN | coord: 180..235
None | No IPR available | PANTHER | PTHR33623:SF5 | HISTONE-LYSINE N-METHYLTRANSFERASE SETD1B-LIKE PROTEIN | coord: 1..161; coord: 286..516
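The `coord: start..end` entries in the InterPro table give residue ranges on the protein. As a hedged illustration, the sketch below parses those entries, merges overlapping ranges, and sums the number of residues covered. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper names `parse_coords`, `merge_ranges` and `covered_length` are hypothetical.

```python
import re

# Matches entries such as "coord: 286..516" as they appear in the table.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in a block of table text."""
    return [(int(a), int(b)) for a, b in COORD.findall(text)]


def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_length(ranges):
    """Count residues covered by at least one range (inclusive ends assumed)."""
    return sum(end - start + 1 for start, end in merge_ranges(ranges))


if __name__ == "__main__":
    disorder = parse_coords("coord: 298..333 coord: 149..171 coord: 149..165")
    panther = parse_coords("coord: 286..516 coord: 1..161 coord: 180..235")
    print(merge_ranges(disorder), covered_length(disorder))
    print(merge_ranges(panther), covered_length(panther))
```

Merging before summing avoids double-counting rows that report the same region twice, such as the two disorder predictions starting at residue 149.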

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmUC10G200320.1 | CmUC10G200320.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0032259 | methylation
molecular_function | GO:0008168 | methyltransferase activity