HG10019711 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10019711
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: PHD-type domain-containing protein
Location: Chr04: 24772360 .. 24774473 (+)
RNA-Seq Expression: HG10019711
Synteny: HG10019711
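
The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state it). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr04: 24772360 .. 24774473 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr04: 24772360 .. 24774473 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr04 + 2114
```

Under that assumption the gene spans 2114 bp on the plus strand of Chr04.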
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTCTGGAAACCAACGAAGAAAGAAGTCCTGAAGATCCATCGATCGACCTCGCTCCTCCTGCCGCTCAAGGTTTTTATTTTATTTTACTTTACTTTCTTCTTATTTTTAAGTATGATTGAATCATTTGAAACTGATTGGCATGGTTCAAATTCTATTGTATATAGCCTCAATTTCTTCGTATGTGATCTATTATATATGTTCAAACAATTTCTATTTAAATTAGAGTAGTCTCTAGAAGAACACAAATGTTATTGGCTAATAGTGAAGTTTTCTTCAAAGGAAGAAAAATGAATCAACCTAAAGAGACATAGTGTGCCTTTTACTTTATGTCTCAATCTTTGGAGTAGAGAGATTTCACTTACAAGTTTTCTTTCTGAGAGGGAAAACAAAAAAATCGTCCCACAAATTTGAAAATAAATGTTGAAGCTCGACCTTCAATCTTCATGTTCATGGCGGCTTGGCGAAAATCTAGGAGACGTGTTGGAAAGTCCTATTGCTGATTTCTAGGCTTCGTTATTTTTTGTTCTCTCCTTCGAATCTGCCTCTCTCTGGTATTGGTTTGAAATAGATCCGTATTGGTTATCAATTGGTGAGTTCCTTCCCCTTGGGCATCGTGGCAATGCCTGAGTGCTCAATATTTGGTTCATTGGATTATGTAATTGGATGAAGAAGAGGATCTTCGAGTTTTCCTTTACCATATAACAAGTTTATGTACGGGTAGGTAGATTATTAGTCTAGGGCGATGTTTTTGTTATCAATTTTAATTGTGATTTTTATCATGTGATTGTCGTCGGGATCAGTTTTGTCGTAATTGAAGCAGAGTGGTAATCGGTCTGGGTCTAGGCTTAAGAAGCACAAGAGGCTTGATGCCATATGCGAGAAAGAGTATAGTCGAAACCATGGCTATGTGAATGAGAATGTCAGTGGGTTGGGGACTGTGGAGGCTGATCTTGGGCTTAGGCAGAGCAACCATGTTCGTCGGCCCCAGTCCTGCTGGATGCTTGTTCTATGTCAAAGAAGAATCAGGAAAAGGAAACTTTTTGATGAGACGCATGGGAATTGGAGATCAAGGAACAGAAATTTGGGGACTAGAGTGGACAAAGGTACTCGGGAAAGCAGGAAAAGGAAACTTTTTGATGAAATTATTGTTGTGAAAGTAAGAAACAATGGAATGAGGATGGATTTGGCTGAGGAAAAAGGGAAAATGGAATACGTGGAATCTATGGTTGGAAGGTCTGTCTAACAGATCGAGGAGGAGGTTTGGGGTGACGAATGATCCAATTAAAATAGAAAAGGTGGTAAAGTCTCCTCAAATTAAGGATGATTGTTGCAGGGAAGACATGTTGGCAATCAATAATGAAGATGAGGAGGAGGAAGAAGTAGAAGAAGTAGAAGAAGAAGAAGGGGAAGAAGGAGGAGGAGGAGGTGGTGGTGGTGGAGGGCAAAGTAGTCATGACTGCGAAGAATGAGGAGGAGGTGGTGGTGGAGGGCTTGTAGTGGTGATCATAATGAAGAACCAACCAATGTGGTGGAGAATGCAAACGATGGTGAGATACAGTTGGTAGAATTAACACAGCTACATGAAAGTACAAATGAAATTCATGATGTAGAAGCTGCCTTAGTTTCAACAAATGAAGTGGTAGGTGGAAGGTCTTGCAGTGAGAAAGCTGTTGATTTGGGTAAGTTTGCTGAAAAGTCTAGGCAACATGGTGGCTATTTAAATTTAAAGAAGTTTACAGACAGTTCCATAGGTACTTTGGGTAAGGCTCGCATTCAAGAGGGCAGAAGGTGTGGATTGTGTGGAGGAGGAATTAATGGTAACCTCTCAAGAAGTTGGTTCAGGATTCGGGTGAGAGTGGAAATGAAGCTTGTAGTGGCTCTTCAGCTTCAGAGGAACAAAATTATGACAAGTGGGATGGTTTTAGGTCGAATTAATGATCGTTATGACATTGCTGAAATATGGATCCATCGACACTATGCAGTTTGGAGCTCCGAG
GTTTATTTTGCTGGATTGGGATGCTTGAAAAATGTAAGGGCTGCTCTTTGCAGGGGAAGAGCATTGAAGTGCACTCGGTGTGGGAGACCTGGTGCAACCATCGGATCGTGTTGA

mRNA sequence

ATGCCTCTGGAAACCAACGAAGAAAGAAGTCCTGAAGATCCATCGATCGACCTCGCTCCTCCTGCCGCTCAAGGCTTCGTTATTTTTTGTTCTCTCCTTCGAATCTGCCTCTCTCTGAGTGGTAATCGGTCTGGGTCTAGGCTTAAGAAGCACAAGAGGCTTGATGCCATATGCGAGAAAGAGTATAGTCGAAACCATGGCTATGTGAATGAGAATGTCAGTGGGTTGGGGACTGTGGAGGCTGATCTTGGGCTTAGGCAGAGCAACCATGTTCGTCGGCCCCAGTCCTGCTGGATGCTTGTTCTATGTCAAAGAAGAATCAGGAAAAGGAAACTTTTTGATGAGACGCATGGGAATTGGAGATCAAGGAACAGAAATTTGGGGACTAGAGTGGACAAAGGTACTCGGGAAAGCAGGAAAAGGAAACTTTTTGATGAAATTATTGTTGTGAAAGTAAGAAACAATGGAATGAGGATGGATTTGGCTGAGGAAAAAGGGAAAATGGAATACGTGGAATCTATGGTTGGAAGGGAAGACATGTTGGCAATCAATAATGAAGATGAGGAGGAGGAAGAAGTAGAAGAAGTAGAAGAAGAAGAAGGGGAAGAAGGAGGAGGAGGAGGTGGTGGTGGTGGAGGGCAAAAAGCTGCCTTAGTTTCAACAAATGAAGTGGTAGGTGGAAGGTCTTGCAGTGAGAAAGCTGTTGATTTGGGTAAGTTTGCTGAAAAGTCTAGGCAACATGGTGGCTATTTAAATTTAAAGAAGTTTACAGACAGTTCCATAGGTACTTTGGGTAAGGCTCGCATTCAAGAGGGCAGAAGGTGTGGATTGTGTGGAGGAGGAATTAATGGTAACCTCTCAAGAAGTTGGTTCAGGATTCGGGTGAGAGTGGAAATGAAGCTTGTAGTGGCTCTTCAGCTTCAGAGGAACAAAATTATGACAAGTGGGATGGTTTTAGGTCGAATTAATGATCGTTATGACATTGCTGAAATATGGATCCATCGACACTATGCAGTTTGGAGCTCCGAGGTTTATTTTGCTGGATTGGGATGCTTGAAAAATGTAAGGGCTGCTCTTTGCAGGGGAAGAGCATTGAAGTGCACTCGGTGTGGGAGACCTGGTGCAACCATCGGATCGTGTTGA

Coding sequence (CDS)

ATGCCTCTGGAAACCAACGAAGAAAGAAGTCCTGAAGATCCATCGATCGACCTCGCTCCTCCTGCCGCTCAAGGCTTCGTTATTTTTTGTTCTCTCCTTCGAATCTGCCTCTCTCTGAGTGGTAATCGGTCTGGGTCTAGGCTTAAGAAGCACAAGAGGCTTGATGCCATATGCGAGAAAGAGTATAGTCGAAACCATGGCTATGTGAATGAGAATGTCAGTGGGTTGGGGACTGTGGAGGCTGATCTTGGGCTTAGGCAGAGCAACCATGTTCGTCGGCCCCAGTCCTGCTGGATGCTTGTTCTATGTCAAAGAAGAATCAGGAAAAGGAAACTTTTTGATGAGACGCATGGGAATTGGAGATCAAGGAACAGAAATTTGGGGACTAGAGTGGACAAAGGTACTCGGGAAAGCAGGAAAAGGAAACTTTTTGATGAAATTATTGTTGTGAAAGTAAGAAACAATGGAATGAGGATGGATTTGGCTGAGGAAAAAGGGAAAATGGAATACGTGGAATCTATGGTTGGAAGGGAAGACATGTTGGCAATCAATAATGAAGATGAGGAGGAGGAAGAAGTAGAAGAAGTAGAAGAAGAAGAAGGGGAAGAAGGAGGAGGAGGAGGTGGTGGTGGTGGAGGGCAAAAAGCTGCCTTAGTTTCAACAAATGAAGTGGTAGGTGGAAGGTCTTGCAGTGAGAAAGCTGTTGATTTGGGTAAGTTTGCTGAAAAGTCTAGGCAACATGGTGGCTATTTAAATTTAAAGAAGTTTACAGACAGTTCCATAGGTACTTTGGGTAAGGCTCGCATTCAAGAGGGCAGAAGGTGTGGATTGTGTGGAGGAGGAATTAATGGTAACCTCTCAAGAAGTTGGTTCAGGATTCGGGTGAGAGTGGAAATGAAGCTTGTAGTGGCTCTTCAGCTTCAGAGGAACAAAATTATGACAAGTGGGATGGTTTTAGGTCGAATTAATGATCGTTATGACATTGCTGAAATATGGATCCATCGACACTATGCAGTTTGGAGCTCCGAGGTTTATTTTGCTGGATTGGGATGCTTGAAAAATGTAAGGGCTGCTCTTTGCAGGGGAAGAGCATTGAAGTGCACTCGGTGTGGGAGACCTGGTGCAACCATCGGATCGTGTTGA

Protein sequence

MPLETNEERSPEDPSIDLAPPAAQGFVIFCSLLRICLSLSGNRSGSRLKKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRKLFDETHGNWRSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGREDMLAINNEDEEEEEVEEVEEEEGEEGGGGGGGGGGQKAALVSTNEVVGGRSCSEKAVDLGKFAEKSRQHGGYLNLKKFTDSSIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKLVVALQLQRNKIMTSGMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC
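
The protein sequence above should follow from the CDS by ordinary codon translation. The sketch below assumes the standard nuclear genetic code (the page does not say which table the annotation pipeline used) and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last four codons of the CDS listed above.
print(translate("ATGCCTCTGGAAACCAACGAAGAAAGAAGT"))  # MPLETNEERS
print(translate("GGATCGTGTTGA"))                    # GSC
```

Both fragments agree with the start (MPLETNEERS) and end (GSC) of the listed protein sequence.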
Homology
BLAST of HG10019711 vs. NCBI nr
Match: XP_038898386.1 (uncharacterized protein LOC120086038 [Benincasa hispida])

HSP 1 Score: 335.9 bits (860), Expect = 4.6e-88
Identity = 240/528 (45.45%), Positives = 261/528 (49.43%), Query Frame = 0

Query: 37  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQ 96
           L  SGNRSGSRL KKHKRLDAICEKEYSRNHG VNENVS L TVE DLGLR+S+ VRR  
Sbjct: 12  LKQSGNRSGSRLKKKHKRLDAICEKEYSRNHGDVNENVSRLATVEPDLGLRRSSRVRRAP 71

Query: 97  SCWMLVLCQRRIRKR---------------------KLFDETHGNW----RSRNRNLGTR 156
                    R+ R+                       L DET GNW    RSRNRNLG R
Sbjct: 72  VLLDASPMPRKKRRMVHGNGTLGVKTSASTLPQLRDDLNDETQGNWRSRLRSRNRNLGIR 131

Query: 157 VDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVG-------------- 216
           V+KGTR SRKRKLFDEII VKVR++GMRM L E KG+MEY ESMVG              
Sbjct: 132 VEKGTRTSRKRKLFDEIIDVKVRSSGMRMVLDEGKGRMEYGESMVGGSNRSGRRFGVTSD 191

Query: 217 -------------------REDMLAINNEDEEEEEVEEVEEEEGEEGGGGGGGGGGQ--- 276
                              REDML+INNEDEEEEE  E EEEE EE   G     G+   
Sbjct: 192 WIKIEKEVKSSPQHKDDCCREDMLSINNEDEEEEEEVEEEEEEDEEEEEGEEEEEGEEEE 251

Query: 277 ------------------------------------------------------------ 336
                                                                       
Sbjct: 252 EEEEEEEEEEEEGEEEKVVEGKEVMTAKNERGEGVLPLENEMDDENVKAVDDVIPQVVEK 311

Query: 337 -----------------------------------------------------KAALVST 379
                                                                +AA+ ST
Sbjct: 312 LDQETSSSLHVDEACSGDHNEEPANVIKNANNGEIQVEELTRLNEGVNEIHDVEAAIFST 371

BLAST of HG10019711 vs. NCBI nr
Match: XP_008456208.1 (PREDICTED: uncharacterized protein LOC103496212 [Cucumis melo])

HSP 1 Score: 333.2 bits (853), Expect = 3.0e-87
Identity = 246/570 (43.16%), Positives = 275/570 (48.25%), Query Frame = 0

Query: 16  IDLAPPAAQGFVIFCSLLRI-------CLSLSGNRSGSRL-KKHKRLDAICEKEYSRNHG 75
           I L P     F+    ++R+        L  SGNRSG RL KKHKRLDAICEKEYSRNHG
Sbjct: 28  ICLGPRLCSRFLFLIFVMRLSSGSVSSSLKQSGNRSGPRLKKKHKRLDAICEKEYSRNHG 87

Query: 76  YVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK---------------- 135
            VNENV+ LGT+EAD GLR+S+ VRR     +L+      RK++                
Sbjct: 88  DVNENVTRLGTLEADPGLRRSSRVRRAP---VLLDASPMPRKKRRIVRGNGTLGVKTSAN 147

Query: 136 --------LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRM 195
                   L  ET GNW    RSRNRNLG RVDKG R SRKRKLFDEII VKVRN GMR+
Sbjct: 148 TLPLFSDDLKGETEGNWRSRLRSRNRNLGIRVDKGARASRKRKLFDEIIDVKVRNGGMRI 207

Query: 196 DLAEEKGKMEYVESMVGR--------------------------------EDMLAINNED 255
           DL EEK KME+ ESMVGR                                E+ML I+ +D
Sbjct: 208 DLDEEKRKMEFGESMVGRSNRTSRRFGVTNDPIKIEEEVKSPRIKDDYCKEEMLIIDIDD 267

Query: 256 EEE------EEVEEVEEEEGEEGGGGGGGGGGQ--------------------------- 315
           EEE      EE EE EEEE EE   GGGGGGG+                           
Sbjct: 268 EEEEGEGEGEEEEEEEEEEEEERERGGGGGGGEXEEEEEEEEEEEEEEEEEEEEEEEEEE 327

Query: 316 ------------------------------------------------------------ 375
                                                                       
Sbjct: 328 EEEEEAVEGKEVVTAKDEKGEDVLPLENEMDEENVKVVDDVTPQVVEKLDKETSSSLHVD 387

Query: 376 -----------------------------------KAALVSTNEVVGGRSCSEKAVDLGK 379
                                              +AA+VSTNEVVGGRSC+EKAVDLGK
Sbjct: 388 EACSGDHNEELANAGEIQLEESTQLNEGVNETQDVEAAVVSTNEVVGGRSCNEKAVDLGK 447

BLAST of HG10019711 vs. NCBI nr
Match: XP_031739139.1 (uncharacterized protein LOC101208571 [Cucumis sativus] >KAE8650542.1 hypothetical protein Csa_011617 [Cucumis sativus])

HSP 1 Score: 327.0 bits (837), Expect = 2.1e-85
Identity = 236/526 (44.87%), Positives = 264/526 (50.19%), Query Frame = 0

Query: 37  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQ 96
           L+ SGNRSG RL KKHKRLDAICEKEYSRNHG VNENVSGLGT+EAD GLR+S+ VRR  
Sbjct: 56  LNQSGNRSGPRLKKKHKRLDAICEKEYSRNHGDVNENVSGLGTLEADPGLRRSSRVRRAP 115

Query: 97  SCWMLVLCQRRIRK---------------------RKLFDETHGNWRSR----NRNLGTR 156
                    R+ R+                       L DET GNWRSR    +RNLG R
Sbjct: 116 VLLDASPIPRKKRRIVQGNGTLGVRTSANTLPLFSDDLKDETEGNWRSRLRSSSRNLGIR 175

Query: 157 VDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------- 216
           VDKG R SRKRKLFDEI+ VKVRN GMR+DL EEKG+ME+ ES+VGR             
Sbjct: 176 VDKGARASRKRKLFDEIVDVKVRNGGMRIDLDEEKGRMEFGESLVGRSNRTRRRFGVIND 235

Query: 217 -------------------EDMLAIN------------------------------NEDE 276
                              +DML I+                               E+E
Sbjct: 236 PIKIEEEVKSPRIKDDCCKKDMLVIDIDDEEEGEGEGEGEEEEEEEEEEEEEEEEEEEEE 295

Query: 277 EEEEVEEVEEEEGEEGGGG----------GGG---------------------------- 336
           EEEE EE EEEEGEE   G          G G                            
Sbjct: 296 EEEEEEEEEEEEGEEEVEGKEVVTAKDERGDGVLPLENEMDEENVKVVDDVTPQVVEKLD 355

Query: 337 ----------------------------GGGQ-------------------KAALVSTNE 379
                                         G+                    AA+VSTNE
Sbjct: 356 KETSSSLHVDEACRADHNEELANAVENANNGEIRLEESKQLNEGVNETQDVAAAVVSTNE 415

BLAST of HG10019711 vs. NCBI nr
Match: KAA0058834.1 (Tat-binding-7-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 324.7 bits (831), Expect = 1.1e-84
Identity = 237/521 (45.49%), Positives = 261/521 (50.10%), Query Frame = 0

Query: 37  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQ 96
           L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR  
Sbjct: 12  LKQSGNRSGPRLKKKHKRLDAICEKEYSRNHGDVNENVTRLGTLEADPGLRRSSRVRRAP 71

Query: 97  SCWMLVLCQRRIRKRK------------------------LFDETHGNW----RSRNRNL 156
              +L+      RK++                        L  ET GNW    RSRNRNL
Sbjct: 72  ---VLLDASPMPRKKRRIVRGNGTLGVKTSANTLPLFSDDLKGETEGNWRSRLRSRNRNL 131

Query: 157 GTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR---------- 216
           G RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR          
Sbjct: 132 GIRVDKGARASRKRKLFDEIIDVKVRNGGMRIDLDEEKRKMEFGESMVGRSNRTSRRFGV 191

Query: 217 ----------------------EDMLAIN----------------------------NED 276
                                 E+ML I+                             E+
Sbjct: 192 TNDPIKIEEEVKSPRIKDDYCKEEMLIIDIDDEEEEGEGEGEEEEEEEEEEEEEEEEEEE 251

Query: 277 EEEEEVEEVEEEEGEEGGGG---------------------------------------- 336
           EEEEE EE EEEE EE   G                                        
Sbjct: 252 EEEEEEEEEEEEEEEEAVEGKEVVTAKDEKGEDVLPLENEMDEENVKVVDDVTPQVVEKL 311

Query: 337 ----------------------GGGGGGQ-----------------KAALVSTNEVVGGR 379
                                    G  Q                 +AA+VSTNEVVGGR
Sbjct: 312 DKETSSSLHVDEACSGDHNEELANAGEIQLEESTQLNEGVNETQDVEAAVVSTNEVVGGR 371

BLAST of HG10019711 vs. NCBI nr
Match: TYK11250.1 (Tat-binding-7-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 323.2 bits (827), Expect = 3.1e-84
Identity = 237/525 (45.14%), Positives = 261/525 (49.71%), Query Frame = 0

Query: 37  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQ 96
           L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR  
Sbjct: 12  LKQSGNRSGPRLKKKHKRLDAICEKEYSRNHGDVNENVTRLGTLEADPGLRRSSRVRRAP 71

Query: 97  SCWMLVLCQRRIRKRK------------------------LFDETHGNW----RSRNRNL 156
              +L+      RK++                        L  ET GNW    RSRNRNL
Sbjct: 72  ---VLLDASPMPRKKRRIVRGNGTLGVKTSANTLPLFSDDLKGETEGNWRSRLRSRNRNL 131

Query: 157 GTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR---------- 216
           G RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR          
Sbjct: 132 GIRVDKGARASRKRKLFDEIIDVKVRNGGMRIDLDEEKRKMEFGESMVGRSNRTSRRFGV 191

Query: 217 ----------------------EDMLAIN------------------------------- 276
                                 E+ML I+                               
Sbjct: 192 TNDPIKIEEEVKSPRIKDDYCKEEMLIIDIDDEEEEGEGEGEEEEEEEEEEEEEEEEEEE 251

Query: 277 -NEDEEEEEVEEVEEEEGEEGGGG------------------------------------ 336
             E+EEEEE EE EEEE EE   G                                    
Sbjct: 252 EEEEEEEEEEEEEEEEEEEEAVEGKEVVTAKDEKGEDVLPLENEMDEENVKVVDDVTPQV 311

Query: 337 --------------------------GGGGGGQ-----------------KAALVSTNEV 379
                                        G  Q                 +AA+VSTNEV
Sbjct: 312 VEKLDKETSSSLHVDEACSGDHNEELANAGEIQLEESTQLNEGVNETQDVEAAVVSTNEV 371

BLAST of HG10019711 vs. ExPASy Swiss-Prot
Match: Q9UMN6 (Histone-lysine N-methyltransferase 2B OS=Homo sapiens OX=9606 GN=KMT2B PE=1 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 1.9e-07
Identity = 25/49 (51.02%), Positives = 34/49 (69.39%), Query Frame = 0

Query: 332  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC 381
            W H + A+WS+EV+    G LKNV AA+ RGR ++C  C +PGAT+G C
Sbjct: 1606 WTHVNCAIWSAEVFEENDGSLKNVHAAVARGRQMRCELCLKPGATVGCC 1654

BLAST of HG10019711 vs. ExPASy Swiss-Prot
Match: O08550 (Histone-lysine N-methyltransferase 2B OS=Mus musculus OX=10090 GN=Kmt2b PE=1 SV=3)

HSP 1 Score: 58.5 bits (140), Expect = 1.9e-07
Identity = 25/49 (51.02%), Positives = 34/49 (69.39%), Query Frame = 0

Query: 332  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC 381
            W H + A+WS+EV+    G LKNV AA+ RGR ++C  C +PGAT+G C
Sbjct: 1612 WTHVNCAIWSAEVFEENDGSLKNVHAAVARGRQMRCELCLKPGATVGCC 1660

BLAST of HG10019711 vs. ExPASy Swiss-Prot
Match: P20659 (Histone-lysine N-methyltransferase trithorax OS=Drosophila melanogaster OX=7227 GN=trx PE=1 SV=4)

HSP 1 Score: 56.6 bits (135), Expect = 7.0e-07
Identity = 24/49 (48.98%), Positives = 35/49 (71.43%), Query Frame = 0

Query: 330  EIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG 379
            + W+H + A+WS+EV+    G L+NV +A+ RGR +KCT CG  GAT+G
Sbjct: 1760 DCWVHTNCAMWSAEVFEEIDGSLQNVHSAVARGRMIKCTVCGNRGATVG 1808

BLAST of HG10019711 vs. ExPASy Swiss-Prot
Match: Q24742 (Histone-lysine N-methyltransferase trithorax OS=Drosophila virilis OX=7244 GN=trx PE=3 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 1.2e-06
Identity = 24/49 (48.98%), Positives = 35/49 (71.43%), Query Frame = 0

Query: 330  EIWIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIG 379
            + W+H + A+WS+EV+    G L+NV +A+ RGR +KCT CG  GAT+G
Sbjct: 1734 DCWVHINCAMWSAEVFEEIDGSLQNVHSAVARGRMIKCTVCGNRGATVG 1782

BLAST of HG10019711 vs. ExPASy Swiss-Prot
Match: Q03164 (Histone-lysine N-methyltransferase 2A OS=Homo sapiens OX=9606 GN=KMT2A PE=1 SV=5)

HSP 1 Score: 54.7 bits (130), Expect = 2.7e-06
Identity = 24/49 (48.98%), Positives = 33/49 (67.35%), Query Frame = 0

Query: 332  WIHRHYAVWSSEVYFAGLGCLKNVRAALCRGRALKCTRCGRPGATIGSC 381
            W H + A+WS+EV+    G LKNV  A+ RG+ L+C  C +PGAT+G C
Sbjct: 1898 WTHVNCALWSAEVFEDDDGSLKNVHMAVIRGKQLRCEFCQKPGATVGCC 1946

BLAST of HG10019711 vs. ExPASy TrEMBL
Match: A0A1S3C2T2 (uncharacterized protein LOC103496212 OS=Cucumis melo OX=3656 GN=LOC103496212 PE=4 SV=1)

HSP 1 Score: 333.2 bits (853), Expect = 1.4e-87
Identity = 246/570 (43.16%), Positives = 275/570 (48.25%), Query Frame = 0

Query: 16  IDLAPPAAQGFVIFCSLLRI-------CLSLSGNRSGSRL-KKHKRLDAICEKEYSRNHG 75
           I L P     F+    ++R+        L  SGNRSG RL KKHKRLDAICEKEYSRNHG
Sbjct: 28  ICLGPRLCSRFLFLIFVMRLSSGSVSSSLKQSGNRSGPRLKKKHKRLDAICEKEYSRNHG 87

Query: 76  YVNENVSGLGTVEADLGLRQSNHVRRPQSCWMLVLCQRRIRKRK---------------- 135
            VNENV+ LGT+EAD GLR+S+ VRR     +L+      RK++                
Sbjct: 88  DVNENVTRLGTLEADPGLRRSSRVRRAP---VLLDASPMPRKKRRIVRGNGTLGVKTSAN 147

Query: 136 --------LFDETHGNW----RSRNRNLGTRVDKGTRESRKRKLFDEIIVVKVRNNGMRM 195
                   L  ET GNW    RSRNRNLG RVDKG R SRKRKLFDEII VKVRN GMR+
Sbjct: 148 TLPLFSDDLKGETEGNWRSRLRSRNRNLGIRVDKGARASRKRKLFDEIIDVKVRNGGMRI 207

Query: 196 DLAEEKGKMEYVESMVGR--------------------------------EDMLAINNED 255
           DL EEK KME+ ESMVGR                                E+ML I+ +D
Sbjct: 208 DLDEEKRKMEFGESMVGRSNRTSRRFGVTNDPIKIEEEVKSPRIKDDYCKEEMLIIDIDD 267

Query: 256 EEE------EEVEEVEEEEGEEGGGGGGGGGGQ--------------------------- 315
           EEE      EE EE EEEE EE   GGGGGGG+                           
Sbjct: 268 EEEEGEGEGEEEEEEEEEEEEERERGGGGGGGEXEEEEEEEEEEEEEEEEEEEEEEEEEE 327

Query: 316 ------------------------------------------------------------ 375
                                                                       
Sbjct: 328 EEEEEAVEGKEVVTAKDEKGEDVLPLENEMDEENVKVVDDVTPQVVEKLDKETSSSLHVD 387

Query: 376 -----------------------------------KAALVSTNEVVGGRSCSEKAVDLGK 379
                                              +AA+VSTNEVVGGRSC+EKAVDLGK
Sbjct: 388 EACSGDHNEELANAGEIQLEESTQLNEGVNETQDVEAAVVSTNEVVGGRSCNEKAVDLGK 447

BLAST of HG10019711 vs. ExPASy TrEMBL
Match: A0A0A0L9H9 (PHD-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G236020 PE=4 SV=1)

HSP 1 Score: 326.6 bits (836), Expect = 1.3e-85
Identity = 236/527 (44.78%), Positives = 264/527 (50.09%), Query Frame = 0

Query: 37  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQ 96
           L+ SGNRSG RL KKHKRLDAICEKEYSRNHG VNENVSGLGT+EAD GLR+S+ VRR  
Sbjct: 56  LNQSGNRSGPRLKKKHKRLDAICEKEYSRNHGDVNENVSGLGTLEADPGLRRSSRVRRAP 115

Query: 97  SCWMLVLCQRRIRK---------------------RKLFDETHGNWRSR----NRNLGTR 156
                    R+ R+                       L DET GNWRSR    +RNLG R
Sbjct: 116 VLLDASPIPRKKRRIVQGNGTLGVRTSANTLPLFSDDLKDETEGNWRSRLRSSSRNLGIR 175

Query: 157 VDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR------------- 216
           VDKG R SRKRKLFDEI+ VKVRN GMR+DL EEKG+ME+ ES+VGR             
Sbjct: 176 VDKGARASRKRKLFDEIVDVKVRNGGMRIDLDEEKGRMEFGESLVGRSNRTRRRFGVIND 235

Query: 217 -------------------EDMLAIN-------------------------------NED 276
                              +DML I+                                E+
Sbjct: 236 PIKIEEEVKSPRIKDDCCKKDMLVIDIDDEEEGEGEGEGEEEEEEEEEEEEEEEEEEEEE 295

Query: 277 EEEEEVEEVEEEEGEEGGGG----------GGG--------------------------- 336
           EEEEE EE EEEEGEE   G          G G                           
Sbjct: 296 EEEEEEEEEEEEEGEEEVEGKEVVTAKDERGDGVLPLENEMDEENVKVVDDVTPQVVEKL 355

Query: 337 -----------------------------GGGQ-------------------KAALVSTN 379
                                          G+                    AA+VSTN
Sbjct: 356 DKETSSSLHVDEACRADHNEELANAVENANNGEIRLEESKQLNEGVNETQDVAAAVVSTN 415

BLAST of HG10019711 vs. ExPASy TrEMBL
Match: A0A5A7UUP2 (Tat-binding-7-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold803G00320 PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 5.1e-85
Identity = 237/521 (45.49%), Positives = 261/521 (50.10%), Query Frame = 0

Query: 37  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQ 96
           L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR  
Sbjct: 12  LKQSGNRSGPRLKKKHKRLDAICEKEYSRNHGDVNENVTRLGTLEADPGLRRSSRVRRAP 71

Query: 97  SCWMLVLCQRRIRKRK------------------------LFDETHGNW----RSRNRNL 156
              +L+      RK++                        L  ET GNW    RSRNRNL
Sbjct: 72  ---VLLDASPMPRKKRRIVRGNGTLGVKTSANTLPLFSDDLKGETEGNWRSRLRSRNRNL 131

Query: 157 GTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR---------- 216
           G RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR          
Sbjct: 132 GIRVDKGARASRKRKLFDEIIDVKVRNGGMRIDLDEEKRKMEFGESMVGRSNRTSRRFGV 191

Query: 217 ----------------------EDMLAIN----------------------------NED 276
                                 E+ML I+                             E+
Sbjct: 192 TNDPIKIEEEVKSPRIKDDYCKEEMLIIDIDDEEEEGEGEGEEEEEEEEEEEEEEEEEEE 251

Query: 277 EEEEEVEEVEEEEGEEGGGG---------------------------------------- 336
           EEEEE EE EEEE EE   G                                        
Sbjct: 252 EEEEEEEEEEEEEEEEAVEGKEVVTAKDEKGEDVLPLENEMDEENVKVVDDVTPQVVEKL 311

Query: 337 ----------------------GGGGGGQ-----------------KAALVSTNEVVGGR 379
                                    G  Q                 +AA+VSTNEVVGGR
Sbjct: 312 DKETSSSLHVDEACSGDHNEELANAGEIQLEESTQLNEGVNETQDVEAAVVSTNEVVGGR 371

BLAST of HG10019711 vs. ExPASy TrEMBL
Match: A0A5D3CIS0 (Tat-binding-7-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold227G001060 PE=4 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 1.5e-84
Identity = 237/525 (45.14%), Positives = 261/525 (49.71%), Query Frame = 0

Query: 37  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQ 96
           L  SGNRSG RL KKHKRLDAICEKEYSRNHG VNENV+ LGT+EAD GLR+S+ VRR  
Sbjct: 12  LKQSGNRSGPRLKKKHKRLDAICEKEYSRNHGDVNENVTRLGTLEADPGLRRSSRVRRAP 71

Query: 97  SCWMLVLCQRRIRKRK------------------------LFDETHGNW----RSRNRNL 156
              +L+      RK++                        L  ET GNW    RSRNRNL
Sbjct: 72  ---VLLDASPMPRKKRRIVRGNGTLGVKTSANTLPLFSDDLKGETEGNWRSRLRSRNRNL 131

Query: 157 GTRVDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVGR---------- 216
           G RVDKG R SRKRKLFDEII VKVRN GMR+DL EEK KME+ ESMVGR          
Sbjct: 132 GIRVDKGARASRKRKLFDEIIDVKVRNGGMRIDLDEEKRKMEFGESMVGRSNRTSRRFGV 191

Query: 217 ----------------------EDMLAIN------------------------------- 276
                                 E+ML I+                               
Sbjct: 192 TNDPIKIEEEVKSPRIKDDYCKEEMLIIDIDDEEEEGEGEGEEEEEEEEEEEEEEEEEEE 251

Query: 277 -NEDEEEEEVEEVEEEEGEEGGGG------------------------------------ 336
             E+EEEEE EE EEEE EE   G                                    
Sbjct: 252 EEEEEEEEEEEEEEEEEEEEAVEGKEVVTAKDEKGEDVLPLENEMDEENVKVVDDVTPQV 311

Query: 337 --------------------------GGGGGGQ-----------------KAALVSTNEV 379
                                        G  Q                 +AA+VSTNEV
Sbjct: 312 VEKLDKETSSSLHVDEACSGDHNEELANAGEIQLEESTQLNEGVNETQDVEAAVVSTNEV 371

BLAST of HG10019711 vs. ExPASy TrEMBL
Match: A0A6J1CP50 (uncharacterized protein LOC111012888 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111012888 PE=4 SV=1)

HSP 1 Score: 315.5 bits (807), Expect = 3.1e-82
Identity = 234/527 (44.40%), Positives = 256/527 (48.58%), Query Frame = 0

Query: 37  LSLSGNRSGSRL-KKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQ 96
           L  SGNRSGSRL KKHKRLDAICEKEYSRNHG VNEN SGLGT E D GLR+SN VRR  
Sbjct: 12  LKQSGNRSGSRLKKKHKRLDAICEKEYSRNHGDVNENGSGLGTAEVDFGLRRSNRVRRAP 71

Query: 97  SCWMLVLCQRRIRKR---------------------KLFDETHGNW----RSRNRNLGTR 156
                    R+ R++                      L DE  GNW    R+RN NLG R
Sbjct: 72  VLLDASPSPRKKRRKIHGNGTLGIKKSAETLAQLSDDLNDEAQGNWGTRLRARNSNLGLR 131

Query: 157 VDKGTRESRKRKLFDEIIVVKVRNNGMRMDLAEEKGKMEYVESMVG-------------- 216
           VDKG R SRKRKLFD I  VKV+++GM+MDL E+KGK+E  ESMVG              
Sbjct: 132 VDKGARASRKRKLFDAISDVKVKDSGMKMDLDEKKGKLEDGESMVGRSNRSRRRRFGAMN 191

Query: 217 -------------------REDMLAINNED------EEEEEVEEVEEEEGEEGG-----G 276
                              RE  LAIN ED      EEEEEVEE EEEE EE G     G
Sbjct: 192 GPIRTEKEVKSPEIKDDYDREHKLAINIEDEEQEEEEEEEEVEEEEEEEEEEEGEEEKEG 251

Query: 277 GG-----GGGGGQ----------------------------------------------- 336
            G     G G G+                                               
Sbjct: 252 EGEEEEEGEGEGEEEEVLERKEVMIAKEERREDVLPLEDEVDDENVKAADNIFPQFIEKL 311

Query: 337 ----------------------------------------------------KAALVSTN 379
                                                               +AA VSTN
Sbjct: 312 EKETLSHLHIDEACSADHNKEPANAVDNSNNGEIQVEKLMFLHDGENEIHDVEAAGVSTN 371

BLAST of HG10019711 vs. TAIR 10
Match: AT3G15120.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )

HSP 1 Score: 114.8 bits (286), Expect = 1.5e-25
Identity = 65/139 (46.76%), Positives = 85/139 (61.15%), Query Frame = 0

Query: 254 KKFTDS---SIGTLGKARIQEGRRCGLCGGGINGNLSRSWFRIRVRVEMKL---VVALQL 313
           KK  DS   S   LGK   ++ RRCGLCG G +G L +   +     +++      + + 
Sbjct: 467 KKAVDSVSTSSDRLGKPLFKQTRRCGLCGVGTDGKLPKKLMQDNGDSDVEAPSGSSSSEE 526

Query: 314 QRNKIMTS--------GMVLGRINDRYDIAEIWIHRHYAVWSSEVYFAGLGCLKNVRAAL 373
           Q+  I+          G +LG INDRY I+  W+H++ AVWS EVYFAG+GCLKN+RAAL
Sbjct: 527 QKYDILDGFGDDPGWLGRLLGPINDRYGISGTWVHQNCAVWSPEVYFAGVGCLKNIRAAL 586

Query: 374 CRGRALKCTRCGRPGATIG 379
            RGR+LKCTRC RPGAT G
Sbjct: 587 FRGRSLKCTRCDRPGATTG 605


HSP 2 Score: 37.0 bits (84), Expect = 4.1e-02
Identity = 50/183 (27.32%), Positives = 83/183 (45.36%), Query Frame = 0

Query: 40  SGNRSGSRLKKHKRLDAICEKEYSRNHGYVNENVSGLGTVEADLGLRQSNHVRRPQSCWM 99
           SG+ SG   KK K+L AICE+EY +NHG   +   G G   AD  LR+S+ VR+  S  +
Sbjct: 12  SGSPSG---KKSKKLAAICEEEYKKNHGESQDRDGGSGLACADSELRRSSRVRKIPS--I 71

Query: 100 LVLCQRRIRKRKLFDETHGNWRSRNRNLGTRVD-----KGTRESRKRKLFDEIIVVKVRN 159
           L       +KR+ F+++  +     RN     D     K    SR++K       V  + 
Sbjct: 72  LDASPPPPKKRQRFNKSSSSIEKGKRNEDGDSDAPDGWKSRLRSRRKK------NVGFQA 131

Query: 160 NGMRMDLAEEKGKMEYVESMVGREDMLAINNEDEEEEEVE-----------EVEEEEGEE 207
           +G +  + + K K+ +        +    ++ +EE+  ++           +V+E E  E
Sbjct: 132 SGRQRRVVKGKRKLVFRNRACELSEKAEASDREEEKGALKGGKLNKAKKPVDVKESESSE 183
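
The percentages in each HSP header are the identity and positive counts divided by the alignment length (gaps included). A small sketch of how one might re-derive them from a header line follows; the helper name `hsp_percentages` is hypothetical, and the pattern tolerates both "Positives" and the "Postives" spelling that some BLAST report renderers emit.

```python
import re

# Matches the count/length pairs in an HSP header line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

line = "Identity = 240/528 (45.45%), Positives = 261/528 (49.43%), Query Frame = 0"
print(hsp_percentages(line))  # (45.45, 49.43)
```

For the first NCBI nr hit this gives 240/528 = 45.45% identity and 261/528 = 49.43% positives, matching the reported values.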

The following BLAST results are available for this feature:
BLAST vs. NCBI nr
Match Name | E-value | Identity | Description
XP_038898386.1 | 4.6e-88 | 45.45 | uncharacterized protein LOC120086038 [Benincasa hispida]
XP_008456208.1 | 3.0e-87 | 43.16 | PREDICTED: uncharacterized protein LOC103496212 [Cucumis melo]
XP_031739139.1 | 2.1e-85 | 44.87 | uncharacterized protein LOC101208571 [Cucumis sativus] >KAE8650542.1 hypothetica...
KAA0058834.1 | 1.1e-84 | 45.49 | Tat-binding-7-like protein [Cucumis melo var. makuwa]
TYK11250.1 | 3.1e-84 | 45.14 | Tat-binding-7-like protein [Cucumis melo var. makuwa]

BLAST vs. ExPASy Swiss-Prot
Match Name | E-value | Identity | Description
Q9UMN6 | 1.9e-07 | 51.02 | Histone-lysine N-methyltransferase 2B OS=Homo sapiens OX=9606 GN=KMT2B PE=1 SV=1
O08550 | 1.9e-07 | 51.02 | Histone-lysine N-methyltransferase 2B OS=Mus musculus OX=10090 GN=Kmt2b PE=1 SV=...
P20659 | 7.0e-07 | 48.98 | Histone-lysine N-methyltransferase trithorax OS=Drosophila melanogaster OX=7227 ...
Q24742 | 1.2e-06 | 48.98 | Histone-lysine N-methyltransferase trithorax OS=Drosophila virilis OX=7244 GN=tr...
Q03164 | 2.7e-06 | 48.98 | Histone-lysine N-methyltransferase 2A OS=Homo sapiens OX=9606 GN=KMT2A PE=1 SV=5

BLAST vs. ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A1S3C2T2 | 1.4e-87 | 43.16 | uncharacterized protein LOC103496212 OS=Cucumis melo OX=3656 GN=LOC103496212 PE=...
A0A0A0L9H9 | 1.3e-85 | 44.78 | PHD-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G236020 PE...
A0A5A7UUP2 | 5.1e-85 | 45.49 | Tat-binding-7-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff...
A0A5D3CIS0 | 1.5e-84 | 45.14 | Tat-binding-7-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff...
A0A6J1CP50 | 3.1e-82 | 44.40 | uncharacterized protein LOC111012888 isoform X2 OS=Momordica charantia OX=3673 G...

BLAST vs. TAIR 10
Match Name | E-value | Identity | Description
AT3G15120.1 | 1.5e-25 | 46.76 | P-loop containing nucleoside triphosphate hydrolases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 179..199
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 185..216
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 185..204
None | No IPR available | PANTHER | PTHR23069 | AAA DOMAIN-CONTAINING | coord: 83..378
None | No IPR available | PANTHER | PTHR23069:SF7 | P-LOOP CONTAINING NUCLEOSIDE TRIPHOSPHATE HYDROLASES SUPERFAMILY PROTEIN | coord: 83..378
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 308..380; e-value: 5.2E-8; score: 35.0
IPR034732 | Extended PHD (ePHD) domain | PROSITE | PS51805 | EPHD | coord: 302..380; score: 12.248079
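
To pull the residues for one of these annotations out of the protein sequence, a coordinate string can be sliced directly. This is a sketch under the assumption that `coord` values are 1-based and inclusive on the protein, which the page does not state; `slice_feature` is a hypothetical helper, not part of any InterPro tooling.

```python
def slice_feature(protein, coord):
    """Return the residues covered by a 'start..end' coordinate string.

    Assumption: coordinates are 1-based and inclusive. An optional
    leading 'coord:' label is ignored.
    """
    start_s, end_s = coord.replace("coord:", "").strip().split("..")
    start, end = int(start_s), int(end_s)
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return protein[start - 1:end]

# Toy sequence for illustration; substitute the protein sequence listed
# earlier to extract, for example, the ePHD region at 'coord: 302..380'.
print(slice_feature("ABCDEFGHIJ", "coord: 3..5"))  # CDE
```

The bounds check makes an out-of-range annotation fail loudly instead of silently returning a truncated slice.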

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10019711.1 | HG10019711.1 | mRNA