HG10017614 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10017614
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCCHC-type domain-containing protein
LocationChr03: 16904165 .. 16905961 (-)
RNA-Seq ExpressionHG10017614
SyntenyHG10017614
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAGATAAGATCACTGCTGAATTGGAGTTACTGAATCTATCAGGGGAAGAAGAAAGAACTTCAATTCCTCTTGATCTTGGGATTGCTCAAGAGATTAATGGTCATCTGAACCACTGTTTAATTGGGAAGCTTCTTCCACCAAAGATTATCGCTGGAAATGCTTTAAGGAATGCGTTCTTTAACGCGTGGAGGATGGATCATGGGATTAATATCTCGGTGATTGGAAAGAATATTTTTTTTATTTGTTTTTTAGAAACTTTTTGACAAGGAATGGGTGATGAGTGCTGGCCCATGGTTATTTGATAAACATTTGCTGATTTTAGAGGAATTGGGGGTTGATAGGCAGTTATCTGATTTAAATTTTAATAAGGTTGCTTTTTGGATTCGTTTGCTGAATTTGCCATTCGGATTCCAGAACAGACTAATGACCAGGAGAATTGGCGAAGAAATTGGGGGTTTTGTTGAAGTTGAGTGCGATCAGGAAGGCCTTTGCTGGGGAGATTCAATGAGAATTAAGGTTCACCTAGATATCTTGAAACCTTTACGCAGGGGTATCAAAGTGTTGTCGTCAAACGGGTTGGAAGAAAAATGGGTGGCGATTCGTTACGAAAGATTGCCTGATTTTTGTTACTCATGCGGTCGTATAGGTCATGTAACTAAGGATTGTGTAAATATTCCTGTCAACCCAGATACATTAAGTGGTAGTTCTGGATAATTTGGTCCATGGCTAAGGTTCCAAGGTGTACCGAGATCCCCCAAGAAACCCTCCAATCCTCCTAAAGATTCTGACACTAGTAAAGAGGTTGAAGAGGAGGATATTTTGCAACCTTCTGTTCCAGAAAACAAGGCGGCTCGATCTGAATTTCAAGAGGAATAGAGGAATCATGGTTCTTTAAATCTTGACCTAAATCAAGCTCAAGAGTTGGCTGACCAAATGGAGACTGATTTCGAAAGAACTTATTTTGTGAATCATACGTCTCAACTACCTCTTCAACAGGTTTCTGAGCTTGTAACTTCTACAGGTCATCCGGACCCAACTTTTAAAGATCCAAATGTAAACCTTTTGGTTTTCAAAACTCCTTTGAAAGGAGGTAACATGGAGGAAGTAAAATCCAGAAAGGCTACTAGGACCTGGAAGAACCGTATGAGATCATTAACAGATTCTCCTCTATAGGTTAAGCGGATTTATCCGCCAAATAGGAAATTAGGTGATGGGGAGAGTTCAAAGATTGGAGAAAGCAAACGTCTTCGAGCTGCTTCTTTAGAAACTTTGCTGAATGAGGCGATGCAGCCCTGCCCAAAACCATGAAGATCATTTGCTGGAATGCTAGGGGCTTGGGCAATCCTCGAGCATTCCATTCTTTGTGTAACCTTGTTCACACGACTAGTCCTGATATTTTGTTCGTTGCAGAAACTAAAGGTGGTTCTGATCTTTGCAATAGAGTCAAGCTTCGATGTAAATTCACTGGTTGCTTACCAGTGAATAGTGTGAGAGCTAAGGGTGGTTTATGTATTTTCTGGAGTGATACCGTGGTTTCAGAGATCAAATCTTTCTCTTCTCATCATATAGATGCGGTGTTTACATGGAAGAATAGTCATTGGAGATTTACAGGAGTGTATGGGTATCCTGAATCAGCTAGAAAGCATTTGACTTGGGAGTTGCTGAATAGGTTGAATAATAGAGAGGATGTTCCATGGTTGTTAGGGGGTGACTTCAACAATATTCTTTATGATAATGAAAAGTCTTGGGATCCCTCCACCCCCCCCCCCCCCCCGTGCTGTTCAGGCTCTTAG

mRNA sequence

ATGACAGATAAGATCACTGCTGAATTGGAGTTACTGAATCTATCAGGGGAAGAAGAAAGAACTTCAATTCCTCTTGATCTTGGGATTGCTCAAGAGATTAATGGTCATCTGAACCACTGTTTAATTGGGAAGCTTCTTCCACCAAAGATTATCGCTGGAAATGCTTTAAGGAATGCGTTCTTTAACGCGTGGAGGATGGATCATGGGATTAATATCTCGAAACTTTTTGACAAGGAATGGGTGATGAGTGCTGGCCCATGGTTATTTGATAAACATTTGCTGATTTTAGAGGAATTGGGGGTTGATAGGCAGTTATCTGATTTAAATTTTAATAAGGTTGCTTTTTGGATTCGTTTGCTGAATTTGCCATTCGGATTCCAGAACAGACTAATGACCAGGAGAATTGGCGAAGAAATTGGGGGTTTTGTTGAAGTTGAGTGCGATCAGGAAGGCCTTTGCTGGGGAGATTCAATGAGAATTAAGGTTCACCTAGATATCTTGAAACCTTTACGCAGGGGTATCAAAGTGTTGTCGTCAAACGGGTTGGAAGAAAAATGGGTGGCGATTCGTTACGAAAGATTGCCTGATTTTTGTTACTCATGCGGTCGTATAGGTCATGTAACTAAGGATTGTGTAAATATTCCTGTCAACCCAGATACATTAAGTGGTCATCCGGACCCAACTTTTAAAGATCCAAATGTAAACCTTTTGGTTTTCAAAACTCCTTTGAAAGGAGGTAACATGGAGGAAGTAAAATCCAGAAAGGCTACTAGGACCTGGAAGAACCCCCTGCCCAAAACCATGAAGATCATTTGCTGGAATGCTAGGGGCTTGGGCAATCCTCGAGCATTCCATTCTTTGTGTAACCTTGTTCACACGACTAGTCCTGATATTTTGTTCGTTGCAGAAACTAAAGGTGGTTCTGATCTTTGCAATAGAGTCAAGCTTCGATGTAAATTCACTGGTTGCTTACCAGTGAATAGTGTGAGAGCTAAGGGTGGTTTATGTATTTTCTGGAGTGATACCGTGGTTTCAGAGATCAAATCTTTCTCTTCTCATCATATAGATGCGGTGTTTACATGGAAGAATAGTCATTGGAGATTTACAGGAGTGTATGGGTATCCTGAATCAGCTAGAAAGCATTTGACTTGGGAGTTGCTGAATAGGTTGAATAATAGAGAGGATGTTCCATGGTTGTTAGGGGGTGACTTCAACAATATTCTTTATGATAATGAAAAGTCTTGGGATCCCTCCACCCCCCCCCCCCCCCCGTGCTGTTCAGGCTCTTAG

Coding sequence (CDS)

ATGACAGATAAGATCACTGCTGAATTGGAGTTACTGAATCTATCAGGGGAAGAAGAAAGAACTTCAATTCCTCTTGATCTTGGGATTGCTCAAGAGATTAATGGTCATCTGAACCACTGTTTAATTGGGAAGCTTCTTCCACCAAAGATTATCGCTGGAAATGCTTTAAGGAATGCGTTCTTTAACGCGTGGAGGATGGATCATGGGATTAATATCTCGAAACTTTTTGACAAGGAATGGGTGATGAGTGCTGGCCCATGGTTATTTGATAAACATTTGCTGATTTTAGAGGAATTGGGGGTTGATAGGCAGTTATCTGATTTAAATTTTAATAAGGTTGCTTTTTGGATTCGTTTGCTGAATTTGCCATTCGGATTCCAGAACAGACTAATGACCAGGAGAATTGGCGAAGAAATTGGGGGTTTTGTTGAAGTTGAGTGCGATCAGGAAGGCCTTTGCTGGGGAGATTCAATGAGAATTAAGGTTCACCTAGATATCTTGAAACCTTTACGCAGGGGTATCAAAGTGTTGTCGTCAAACGGGTTGGAAGAAAAATGGGTGGCGATTCGTTACGAAAGATTGCCTGATTTTTGTTACTCATGCGGTCGTATAGGTCATGTAACTAAGGATTGTGTAAATATTCCTGTCAACCCAGATACATTAAGTGGTCATCCGGACCCAACTTTTAAAGATCCAAATGTAAACCTTTTGGTTTTCAAAACTCCTTTGAAAGGAGGTAACATGGAGGAAGTAAAATCCAGAAAGGCTACTAGGACCTGGAAGAACCCCCTGCCCAAAACCATGAAGATCATTTGCTGGAATGCTAGGGGCTTGGGCAATCCTCGAGCATTCCATTCTTTGTGTAACCTTGTTCACACGACTAGTCCTGATATTTTGTTCGTTGCAGAAACTAAAGGTGGTTCTGATCTTTGCAATAGAGTCAAGCTTCGATGTAAATTCACTGGTTGCTTACCAGTGAATAGTGTGAGAGCTAAGGGTGGTTTATGTATTTTCTGGAGTGATACCGTGGTTTCAGAGATCAAATCTTTCTCTTCTCATCATATAGATGCGGTGTTTACATGGAAGAATAGTCATTGGAGATTTACAGGAGTGTATGGGTATCCTGAATCAGCTAGAAAGCATTTGACTTGGGAGTTGCTGAATAGGTTGAATAATAGAGAGGATGTTCCATGGTTGTTAGGGGGTGACTTCAACAATATTCTTTATGATAATGAAAAGTCTTGGGATCCCTCCACCCCCCCCCCCCCCCCGTGCTGTTCAGGCTCTTAG

Protein sequence

MTDKITAELELLNLSGEEERTSIPLDLGIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWRMDHGINISKLFDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKVAFWIRLLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRGIKVLSSNGLEEKWVAIRYERLPDFCYSCGRIGHVTKDCVNIPVNPDTLSGHPDPTFKDPNVNLLVFKTPLKGGNMEEVKSRKATRTWKNPLPKTMKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCLPVNSVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWKNSHWRFTGVYGYPESARKHLTWELLNRLNNREDVPWLLGGDFNNILYDNEKSWDPSTPPPPPCCSGS
Homology
BLAST of HG10017614 vs. NCBI nr
Match: GAU14523.1 (hypothetical protein TSUD_250650 [Trifolium subterraneum])

HSP 1 Score: 235.0 bits (598), Expect = 1.2e-57
Identity = 137/444 (30.86%), Postives = 217/444 (48.87%), Query Frame = 0

Query: 17  EEERTSIPLDLGIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWRMDHGINISKL- 76
           EEE  S   D G  ++++  L  CL+G+ L  K I  N+++      W    G+ I++  
Sbjct: 15  EEEGFSFDFDEGGDEQVD--LRWCLVGRFLCEKAIHFNSMKLRMAELWTPVKGVTINETP 74

Query: 77  -----------FDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKVAFWIRLLNLPF 136
                       D E V+  GPW+FD + L+LE++ +  Q+ ++    V  W+++ +LP 
Sbjct: 75  SGKFLFHFAHPLDMEAVLKGGPWIFDNNTLLLEQVPLGMQVENIPLLHVNLWVQIHDLPT 134

Query: 137 GFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRGIKVLSSNGLEE 196
           G     +  ++   IG FVE + +     W   MRI+V +DI +PL++  KV +  G E 
Sbjct: 135 GLMKENVGIKLANYIGEFVEYDKNNNSSFWRQYMRIRVKVDIRQPLKKDTKVKNRVG-EW 194

Query: 197 KWVAIRYERLPDFCYSCGRIGHVTKDC-VNIPVNPDT-LSGHPDPTFKDPNVN----LLV 256
             V  +YE+L  FC+ CG +GH    C V   +  D  + G       DP       +  
Sbjct: 195 CMVKFKYEKLGIFCFVCGIMGHAENKCEVRYSMEQDDGIRGWSAEIRADPRRQGGRPVSR 254

Query: 257 FKTPLKGGNMEEVKSRKAT-------------RTWKNPLPKTMKIICWNARGLGNPRAFH 316
           +    KGG +E+    +A              ++W+  LP  MKI+ WN RGL  P A  
Sbjct: 255 WLREEKGGRVEKHGGDQAAQPTFTGGVRVWVPQSWQPGLPGPMKILSWNCRGLSTPSAIP 314

Query: 317 SLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCLPVNSVRAKGGLCIFWSDTVVS 376
           +L N+  +  PDILF++ET   +    RV++  KF  CL V+     GGL + W DT+  
Sbjct: 315 NLRNIAQSHKPDILFLSETLSKAQAMERVRVDLKFNSCLSVDVEGRSGGLSVMWRDTINC 374

Query: 377 EIKSFSSHHIDAVFTWK-NSHWRFTGVYGYPESARKHLTWELLNRLNNREDVPWLLGGDF 429
            + ++S + I+ +   K    WR T  YGYPE  R+   W+LL +L +  D+PW + GDF
Sbjct: 375 RVMNYSRNFINLIVEEKEQEEWRLTCYYGYPERGRRKQAWDLLRQLRDMSDLPWCIVGDF 434

BLAST of HG10017614 vs. NCBI nr
Match: MCH82853.1 (hypothetical protein [Trifolium medium])

HSP 1 Score: 227.6 bits (579), Expect = 2.0e-55
Identity = 126/447 (28.19%), Postives = 205/447 (45.86%), Query Frame = 0

Query: 11  LLNLSGEEERTSIPLDLGIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWRMDHGI 70
           L  LS  EE      D    ++    L  CL+G+ +  + I  N++     + W+   G+
Sbjct: 6   LEGLSLHEEEEGFRFDFEADEDEQVDLRWCLVGRFICERSIHFNSMSIRMADLWKPVRGV 65

Query: 71  NISKL------------FDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKVAFWIR 130
            I +              D E V++  PW+FD ++LILE++ +  Q+  +    V  W++
Sbjct: 66  TIKEASAGKILFHFAHPLDMEAVLNGSPWIFDNNMLILEQVQLGMQIEHIPLFHVNMWVQ 125

Query: 131 LLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRGIKVLS 190
           + +LP G     +   +   IG FVE +       W + MRI+V +D+  PL++  KV++
Sbjct: 126 VHDLPMGLMKEKVGIPLANYIGSFVEYDKKNNSTFWREFMRIRVKIDVRLPLKKDTKVMN 185

Query: 191 SNGLEEKW--VAIRYERLPDFCYSCGRIGHVTKDC----------------VNIPVNPDT 250
             G   KW  V  +YE+L  FC+ CG +GH    C                  +  +P T
Sbjct: 186 KEG---KWCTVKFKYEKLGTFCFVCGIMGHSENKCEVRFSMEQDDGTREWSAELRADPRT 245

Query: 251 LSGHPDPTF--KDPNVNLLVFKTPLKGGNMEEVKSR-----------KATRTWKNPLPKT 310
             G P   +  +D    +      + G +     S             A ++W+  LP  
Sbjct: 246 RGGRPVSRWLREDRGGPMRHHGGDVAGQSNPPANSNSVDPTVAELALNAQQSWQPGLPGP 305

Query: 311 MKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCLPVN 370
           MKI+ WN RGL    A  +L N+     PDILF+AET   +    R+++  KF  CL V+
Sbjct: 306 MKILSWNCRGLSTSSAIPNLSNIAQGYQPDILFLAETLSKNHTMERIRVNLKFQSCLSVD 365

Query: 371 SVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWK-NSHWRFTGVYGYPESARKHLTWEL 414
           +    GGL + W D++   + ++S + I+ +   K    WR T  YGYPE  R+   W+L
Sbjct: 366 AEGRSGGLSVMWRDSISCRVMNYSRNFINLIVREKEEEEWRLTCYYGYPERGRRRQAWDL 425

BLAST of HG10017614 vs. NCBI nr
Match: PNY16372.1 (ribonuclease H, partial [Trifolium pratense])

HSP 1 Score: 227.3 bits (578), Expect = 2.6e-55
Identity = 139/463 (30.02%), Postives = 218/463 (47.08%), Query Frame = 0

Query: 6   TAELELLNLSGEEERTSIPLDLGIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWR 65
           T  L+ L+L  EEE      +    ++    L  CLIG+ L  K I  N+++    + W+
Sbjct: 3   TPNLDGLSLH-EEEEDGFSFEFEEEEDAQVDLRWCLIGRFLCDKAIHVNSMKVRMADLWK 62

Query: 66  MDHGINISKL------------FDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKV 125
              G+ I +              D E V++  PW+FD ++L+LE + +  Q+  +  N V
Sbjct: 63  PVMGVTIKETKRGIFLFHFNHQIDMEEVLNGSPWIFDYNMLVLERVQLGMQIEQIPLNHV 122

Query: 126 AFWIRLLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRG 185
           + W+++ NLP G     +   +   IG F+E + +     W + MRI+V +D+  PL++ 
Sbjct: 123 SLWVQVHNLPTGLMKERVGTTLANYIGEFMEYDKNNNTSFWREFMRIRVKIDVRLPLKKD 182

Query: 186 IKVLSSNGLEEKW--VAIRYERLPDFCYSCGRIGHVTKDC-VNIPV-NPDTLSG-----H 245
            KV +  G   KW  V I+YE+L  FC+ CG +GH    C V   + N D   G      
Sbjct: 183 AKVKNREG---KWCTVNIKYEKLGVFCFVCGIMGHAENKCQVRFAMENDDGRRGWSADLR 242

Query: 246 PDPTFKDPNVNLLVFKTPLKGGNMEEVKS------------------RKATRTWKNPLPK 305
            +P  +   V     K   +GG+ E                        A  +W+  LP 
Sbjct: 243 AEPRRRGGRVTSRWLKE--EGGSGETAMGGHTAVPPNFQPEQSSGGPAYADVSWQPGLPG 302

Query: 306 TMKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCLPV 365
            MKI+ WN RGL  P A  +L N+     PDILF++ET   +    RV++  +++ CL V
Sbjct: 303 PMKILSWNCRGLSTPSAIPNLRNVAQGHQPDILFLSETLSKAQSMERVRVMLQYSSCLSV 362

Query: 366 NSVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWK-NSHWRFTGVYGYPESARKHLTWE 425
           +     GGL + W DT    I ++S + I+ +        WR T  YGYPE +R+   W+
Sbjct: 363 DVEGRSGGLSVMWRDTTNCRIMNYSRNFINLIVDDPVKGEWRLTCYYGYPERSRRRQAWD 422

Query: 426 LLNRLNNREDVPWLLGGDFNNILYDNEKSWDPSTPPPPPCCSG 429
           LL  L +  D+PW + GDFN++L   +K    + P P   C+G
Sbjct: 423 LLRELRDMSDLPWCIVGDFNDLLSQEDKK--GTLPHPNWLCNG 457

BLAST of HG10017614 vs. NCBI nr
Match: PNY17656.1 (ribonuclease H, partial [Trifolium pratense])

HSP 1 Score: 226.9 bits (577), Expect = 3.4e-55
Identity = 135/465 (29.03%), Postives = 219/465 (47.10%), Query Frame = 0

Query: 9   LELLNLSGEEERTSIPLDL---GIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWR 68
           L+ L+L  EEE   +  D    G  +++   L  CL+G+ +  + I  N++     + W+
Sbjct: 6   LDGLSLHEEEEEEGLCFDFEEEGDDEQV--ELRWCLVGRFICERNIHFNSMSVRMADLWK 65

Query: 69  MDHGINISKL------------FDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKV 128
              G+ I +             +D E V++ GPW+FD ++L+LE++ +  Q+  +     
Sbjct: 66  PVRGVTIKEAKPGLFLFHFAHPYDMEAVLNGGPWIFDYNMLLLEQVQLGMQVDHIPLFHA 125

Query: 129 AFWIRLLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRG 188
             W+++ +LP G     +   +   IG FVE + +     W   MR++V +D+ +PL++ 
Sbjct: 126 VMWVQIHDLPMGLMKEKVGIGLANYIGSFVEYDKNNNTSFWRQFMRVRVKIDVRQPLKKD 185

Query: 189 IKVLSSNGLEEKW--VAIRYERLPDFCYSCGRIGHVTKDC----------------VNIP 248
            KV +  G   KW  V  +YE+L  FC+ CG +GH    C                 ++ 
Sbjct: 186 YKVKNKEG---KWCTVNFKYEKLGVFCFVCGIMGHAENKCEVRYSMEQDDGRREWSADLR 245

Query: 249 VNPDTLSGHPDPTF--------KDPNVNLLVFKTPLKGG--NMEEVKSRKATRTWKNP-L 308
             P    G     +        +D      V +   + G  N   + +    +  K P L
Sbjct: 246 AEPKRQGGRQSSRWLKEDKGGREDQGRRDTVVQPNNQPGSSNTGPIGAELDPKIPKEPGL 305

Query: 309 PKTMKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCL 368
           P  MKI+ WN RGL NP A  +L N+ H   PDILF++ET   +    RV++  KF  CL
Sbjct: 306 PGPMKILSWNCRGLSNPSAIPNLHNIAHGHKPDILFLSETLSKAQSMERVRVNLKFQSCL 365

Query: 369 PVNSVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWK-NSHWRFTGVYGYPESARKHLT 428
            V+     GGL + W DT+   + ++S + I+ +   K    WR T  YGYPE  R+   
Sbjct: 366 SVDVEGRSGGLSVMWRDTIKCRVLNYSRNFINLIVEEKEGEEWRLTCYYGYPERGRRRQA 425

BLAST of HG10017614 vs. NCBI nr
Match: MCH80000.1 (hypothetical protein [Trifolium medium])

HSP 1 Score: 221.1 bits (562), Expect = 1.9e-53
Identity = 139/474 (29.32%), Postives = 220/474 (46.41%), Query Frame = 0

Query: 9   LELLNLSGEEERTSIPLDLGIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWRMDH 68
           L+ L+L  EEE  S   +    ++++  L  CLIG+ L  + I  N+++    + W+   
Sbjct: 6   LDGLSLHEEEEGFSFDFEEEGDEQVD--LRWCLIGRFLCDRAIHSNSMKIRMADLWKPVR 65

Query: 69  GINISKL------------FDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKVAFW 128
           G+ I +              D E V++ GPW FD ++LILE++ +  Q+ D+    +  W
Sbjct: 66  GVIIKEARPGTFLFHFDHPLDMEAVLNGGPWTFDNNMLILEQVQLGMQIEDIPLFHINLW 125

Query: 129 IRLLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRGIKV 188
           +++ NLP G     +   +   IG FVE + +     W   MR++V +D+ +PL++  KV
Sbjct: 126 VQIHNLPTGLMKESVGVPLANYIGSFVEYDKNNNTSFWRQFMRVRVRVDVRQPLKKDTKV 185

Query: 189 LSSNGLEEKW--VAIRYERLPDFCYSCGRIGHVTKDC-VNIPVNPDTLSGHPDPTFKDPN 248
            +  G   +W  V  +YE+L  FC+ CG +GH    C V   +  D    +     +  +
Sbjct: 186 KNKKG---EWCVVNFKYEKLGIFCFVCGIMGHTENKCAVRYAMEQDDGRRYWSADIRAES 245

Query: 249 VNLLVFKTPL-----KGGNME------EVK-SRKATRTWKNP------------------ 308
                 +T       KGG  E      EV+ S +A+ +   P                  
Sbjct: 246 SRQGGRQTSRWLREEKGGRKEHEGLEREVRSSTQASSSHAGPSADDVAATVQDARPAGIP 305

Query: 309 --------LPKTMKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVK 368
                   LP  MKI+ WN RGL  P A  +L N+ H   PDILF +ET   + +  RV+
Sbjct: 306 QENITDQGLPGPMKILSWNCRGLSTPSAIPNLRNIAHGHQPDILFFSETLSKAQVMERVR 365

Query: 369 LRCKFTGCLPVNSVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWK-NSHWRFTGVYGY 428
           +   F  CL V+     GGL + W DT+   + ++S + I+ V   +    WR T  YGY
Sbjct: 366 VNLNFNSCLSVDVEGRSGGLSVMWKDTIKCRVLNYSRNFINLVVEEREEGEWRLTCYYGY 425

BLAST of HG10017614 vs. ExPASy TrEMBL
Match: A0A2N9GPY1 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29425 PE=4 SV=1)

HSP 1 Score: 238.0 bits (606), Expect = 7.1e-59
Identity = 132/389 (33.93%), Postives = 189/389 (48.59%), Query Frame = 0

Query: 41  LIGKLLPPKIIAGNALRNAFFNAWRMDHG-----INISKLF-------DKEWVMSAGPWL 100
           L  + L  +I+   ++   F   WR DHG     +N ++L        D+E VM   PW 
Sbjct: 34  LAARFLTRRILNVESVARTFKPLWRTDHGFIVRDMNENRLVFVFEDEADRERVMMGEPWA 93

Query: 101 FDKHLLILEELGVDRQLSDLNFNKVAFWIRLLNLPFGFQNRLMTRRIGEEIGGFVEVECD 160
           +DKHL++L+ +  D  + D+ F K +FW+++  LP    N      IG  +G  V+V   
Sbjct: 94  YDKHLVVLQRIEEDEAIDDVLFCKTSFWVQMHGLPVRRMNHETAVTIGSSLGSIVQVAEG 153

Query: 161 QEGLCWGDSMRIKVHLDILKPLRRGIKVLSSNGLEEKWVAIRYERLPDFCYSCGRIGHVT 220
           +  +  G +MR++V+LDI KPL RG KV       E W+  RYERLP+FCY CG + H  
Sbjct: 154 EANVEGGTAMRLRVNLDITKPLCRGRKVRFEKD-RETWITFRYERLPNFCYWCGHVTHSD 213

Query: 221 KDCVNIPVNPDTLSGHPD---PTFKDPNVNLLVFKTPLKGGNMEEVKSRKATRTWKNPLP 280
           KDC +   N D+L        P  + PN                E   RK     + P  
Sbjct: 214 KDCPHWLRNKDSLRLEEQQFGPWLRAPN----------------ERPWRKLEIKVEVP-Q 273

Query: 281 KTMKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCLP 340
           +TM  + WN RGLGNPR    L  LV    P ++F+ ET        R++ + +F     
Sbjct: 274 RTMNALAWNCRGLGNPRTVQELARLVRAQDPAVVFLIETWQDDGPLERLRCQLQFKNKFV 333

Query: 341 VNSVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWKNSH-WRFTGVYGYPESARKHLTW 400
             S    GGLC+FW   +   + SFS  HIDA+        WR TG YG PE+  +  +W
Sbjct: 334 AKSRNKGGGLCLFWKKEIKLRVHSFSPSHIDAIINENQQDIWRLTGFYGAPETRNREESW 393

Query: 401 ELLNRLNNREDVPWLLGGDFNNILYDNEK 414
            LL RL+++  +PW   GDFN ++   EK
Sbjct: 394 ALLRRLSSQYSIPWCCLGDFNELVRIEEK 404

BLAST of HG10017614 vs. ExPASy TrEMBL
Match: A0A2N9GJ35 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27322 PE=3 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 5.1e-57
Identity = 150/480 (31.25%), Postives = 215/480 (44.79%), Query Frame = 0

Query: 1   MTDKITAELELLNLSGEEERTSIPLDLGIAQEINGHLNHCLIGKLLPPKIIAGNALRNAF 60
           MT+++      + LS E+E   I L      +      H ++ KLL  K     A + + 
Sbjct: 1   MTEELEELCRRMKLS-EKEMLRISLRKDPILKSKKEAQHSILFKLLTTKPFHSEAFKGSI 60

Query: 61  FNAWRMDHGINI----SKLF--------DKEWVMSAGPWLFDKHLLILEELGVDRQLSDL 120
              W    G+ I      LF        D E +    PW FDK L+ +     D Q +++
Sbjct: 61  RALWSGLGGVTIRSIEGNLFMAVFTRRDDMERIFVRSPWTFDKKLIPIVRFEGDLQPTEV 120

Query: 121 NFNKVAFWIRLLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILK 180
            F+  AFWIR+ NLP     R +   IG+EIG  +EV+  + G  WG+ +RI+V +DI +
Sbjct: 121 RFSHTAFWIRVFNLPIKSMIREVGEDIGQEIGRLLEVDVPENGFGWGEYLRIRVEIDIAQ 180

Query: 181 PLRRGIKVLS--SNGLEEKWVAIRYERLPDFCYSCGRIGHVTKDCV-------------- 240
           PL RG  + S  S+G    WV  +YE LP FCY CGR+GH + +CV              
Sbjct: 181 PLLRGCILQSDESDGGGLFWVDFKYEHLPIFCYRCGRLGHGSHECVVGRGGRISEGVSGE 240

Query: 241 ----------NIPVNP---------DTLSGHPDPTFKDPNVNLLVFKTPLKGGN------ 300
                       P  P             G  +  F            P+ GG       
Sbjct: 241 KWGAWLRALAARPAQPRRSREGVFQPDEEGESNMPFDREAATENDPSPPVSGGGCKLWDG 300

Query: 301 ------MEEVK------SRKATRTWKNPLPKTMKIICWNARGLGNPRAFHSLCNLVHTTS 360
                 +EE+        R A  + K+  P TM+ +  N RGLGNP+  + L NLV    
Sbjct: 301 HWLHELLEEIMQLEMHVERPACSSGKDAPPVTMRALSLNCRGLGNPQTVNELHNLVKKEG 360

Query: 361 PDILFVAETKGGSDLCNRVKLRCKFTGCLPVNSVRAKGGLCIFWSDTVVSEIKSFSSHHI 415
           P+I+F+ ET+        +++R    GCL V      GGL + W  +V+  I+S+S HHI
Sbjct: 361 PNIVFLMETRLNVRNLEWLRVRLGMKGCLGVERHGQGGGLALLWDSSVMINIQSYSEHHI 420

BLAST of HG10017614 vs. ExPASy TrEMBL
Match: A0A392M6V1 (Uncharacterized protein (Fragment) OS=Trifolium medium OX=97028 GN=A2U01_0003665 PE=4 SV=1)

HSP 1 Score: 227.6 bits (579), Expect = 9.6e-56
Identity = 126/447 (28.19%), Postives = 205/447 (45.86%), Query Frame = 0

Query: 11  LLNLSGEEERTSIPLDLGIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWRMDHGI 70
           L  LS  EE      D    ++    L  CL+G+ +  + I  N++     + W+   G+
Sbjct: 6   LEGLSLHEEEEGFRFDFEADEDEQVDLRWCLVGRFICERSIHFNSMSIRMADLWKPVRGV 65

Query: 71  NISKL------------FDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKVAFWIR 130
            I +              D E V++  PW+FD ++LILE++ +  Q+  +    V  W++
Sbjct: 66  TIKEASAGKILFHFAHPLDMEAVLNGSPWIFDNNMLILEQVQLGMQIEHIPLFHVNMWVQ 125

Query: 131 LLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRGIKVLS 190
           + +LP G     +   +   IG FVE +       W + MRI+V +D+  PL++  KV++
Sbjct: 126 VHDLPMGLMKEKVGIPLANYIGSFVEYDKKNNSTFWREFMRIRVKIDVRLPLKKDTKVMN 185

Query: 191 SNGLEEKW--VAIRYERLPDFCYSCGRIGHVTKDC----------------VNIPVNPDT 250
             G   KW  V  +YE+L  FC+ CG +GH    C                  +  +P T
Sbjct: 186 KEG---KWCTVKFKYEKLGTFCFVCGIMGHSENKCEVRFSMEQDDGTREWSAELRADPRT 245

Query: 251 LSGHPDPTF--KDPNVNLLVFKTPLKGGNMEEVKSR-----------KATRTWKNPLPKT 310
             G P   +  +D    +      + G +     S             A ++W+  LP  
Sbjct: 246 RGGRPVSRWLREDRGGPMRHHGGDVAGQSNPPANSNSVDPTVAELALNAQQSWQPGLPGP 305

Query: 311 MKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCLPVN 370
           MKI+ WN RGL    A  +L N+     PDILF+AET   +    R+++  KF  CL V+
Sbjct: 306 MKILSWNCRGLSTSSAIPNLSNIAQGYQPDILFLAETLSKNHTMERIRVNLKFQSCLSVD 365

Query: 371 SVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWK-NSHWRFTGVYGYPESARKHLTWEL 414
           +    GGL + W D++   + ++S + I+ +   K    WR T  YGYPE  R+   W+L
Sbjct: 366 AEGRSGGLSVMWRDSISCRVMNYSRNFINLIVREKEEEEWRLTCYYGYPERGRRRQAWDL 425

BLAST of HG10017614 vs. ExPASy TrEMBL
Match: A0A2K3PM58 (Ribonuclease H (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g013091 PE=4 SV=1)

HSP 1 Score: 227.3 bits (578), Expect = 1.3e-55
Identity = 139/463 (30.02%), Postives = 218/463 (47.08%), Query Frame = 0

Query: 6   TAELELLNLSGEEERTSIPLDLGIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWR 65
           T  L+ L+L  EEE      +    ++    L  CLIG+ L  K I  N+++    + W+
Sbjct: 3   TPNLDGLSLH-EEEEDGFSFEFEEEEDAQVDLRWCLIGRFLCDKAIHVNSMKVRMADLWK 62

Query: 66  MDHGINISKL------------FDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKV 125
              G+ I +              D E V++  PW+FD ++L+LE + +  Q+  +  N V
Sbjct: 63  PVMGVTIKETKRGIFLFHFNHQIDMEEVLNGSPWIFDYNMLVLERVQLGMQIEQIPLNHV 122

Query: 126 AFWIRLLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRG 185
           + W+++ NLP G     +   +   IG F+E + +     W + MRI+V +D+  PL++ 
Sbjct: 123 SLWVQVHNLPTGLMKERVGTTLANYIGEFMEYDKNNNTSFWREFMRIRVKIDVRLPLKKD 182

Query: 186 IKVLSSNGLEEKW--VAIRYERLPDFCYSCGRIGHVTKDC-VNIPV-NPDTLSG-----H 245
            KV +  G   KW  V I+YE+L  FC+ CG +GH    C V   + N D   G      
Sbjct: 183 AKVKNREG---KWCTVNIKYEKLGVFCFVCGIMGHAENKCQVRFAMENDDGRRGWSADLR 242

Query: 246 PDPTFKDPNVNLLVFKTPLKGGNMEEVKS------------------RKATRTWKNPLPK 305
            +P  +   V     K   +GG+ E                        A  +W+  LP 
Sbjct: 243 AEPRRRGGRVTSRWLKE--EGGSGETAMGGHTAVPPNFQPEQSSGGPAYADVSWQPGLPG 302

Query: 306 TMKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCLPV 365
            MKI+ WN RGL  P A  +L N+     PDILF++ET   +    RV++  +++ CL V
Sbjct: 303 PMKILSWNCRGLSTPSAIPNLRNVAQGHQPDILFLSETLSKAQSMERVRVMLQYSSCLSV 362

Query: 366 NSVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWK-NSHWRFTGVYGYPESARKHLTWE 425
           +     GGL + W DT    I ++S + I+ +        WR T  YGYPE +R+   W+
Sbjct: 363 DVEGRSGGLSVMWRDTTNCRIMNYSRNFINLIVDDPVKGEWRLTCYYGYPERSRRRQAWD 422

Query: 426 LLNRLNNREDVPWLLGGDFNNILYDNEKSWDPSTPPPPPCCSG 429
           LL  L +  D+PW + GDFN++L   +K    + P P   C+G
Sbjct: 423 LLRELRDMSDLPWCIVGDFNDLLSQEDKK--GTLPHPNWLCNG 457

BLAST of HG10017614 vs. ExPASy TrEMBL
Match: A0A2K3PQU3 (Ribonuclease H (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g014404 PE=4 SV=1)

HSP 1 Score: 226.9 bits (577), Expect = 1.6e-55
Identity = 135/465 (29.03%), Postives = 219/465 (47.10%), Query Frame = 0

Query: 9   LELLNLSGEEERTSIPLDL---GIAQEINGHLNHCLIGKLLPPKIIAGNALRNAFFNAWR 68
           L+ L+L  EEE   +  D    G  +++   L  CL+G+ +  + I  N++     + W+
Sbjct: 6   LDGLSLHEEEEEEGLCFDFEEEGDDEQV--ELRWCLVGRFICERNIHFNSMSVRMADLWK 65

Query: 69  MDHGINISKL------------FDKEWVMSAGPWLFDKHLLILEELGVDRQLSDLNFNKV 128
              G+ I +             +D E V++ GPW+FD ++L+LE++ +  Q+  +     
Sbjct: 66  PVRGVTIKEAKPGLFLFHFAHPYDMEAVLNGGPWIFDYNMLLLEQVQLGMQVDHIPLFHA 125

Query: 129 AFWIRLLNLPFGFQNRLMTRRIGEEIGGFVEVECDQEGLCWGDSMRIKVHLDILKPLRRG 188
             W+++ +LP G     +   +   IG FVE + +     W   MR++V +D+ +PL++ 
Sbjct: 126 VMWVQIHDLPMGLMKEKVGIGLANYIGSFVEYDKNNNTSFWRQFMRVRVKIDVRQPLKKD 185

Query: 189 IKVLSSNGLEEKW--VAIRYERLPDFCYSCGRIGHVTKDC----------------VNIP 248
            KV +  G   KW  V  +YE+L  FC+ CG +GH    C                 ++ 
Sbjct: 186 YKVKNKEG---KWCTVNFKYEKLGVFCFVCGIMGHAENKCEVRYSMEQDDGRREWSADLR 245

Query: 249 VNPDTLSGHPDPTF--------KDPNVNLLVFKTPLKGG--NMEEVKSRKATRTWKNP-L 308
             P    G     +        +D      V +   + G  N   + +    +  K P L
Sbjct: 246 AEPKRQGGRQSSRWLKEDKGGREDQGRRDTVVQPNNQPGSSNTGPIGAELDPKIPKEPGL 305

Query: 309 PKTMKIICWNARGLGNPRAFHSLCNLVHTTSPDILFVAETKGGSDLCNRVKLRCKFTGCL 368
           P  MKI+ WN RGL NP A  +L N+ H   PDILF++ET   +    RV++  KF  CL
Sbjct: 306 PGPMKILSWNCRGLSNPSAIPNLHNIAHGHKPDILFLSETLSKAQSMERVRVNLKFQSCL 365

Query: 369 PVNSVRAKGGLCIFWSDTVVSEIKSFSSHHIDAVFTWK-NSHWRFTGVYGYPESARKHLT 428
            V+     GGL + W DT+   + ++S + I+ +   K    WR T  YGYPE  R+   
Sbjct: 366 SVDVEGRSGGLSVMWRDTIKCRVLNYSRNFINLIVEEKEGEEWRLTCYYGYPERGRRRQA 425

BLAST of HG10017614 vs. TAIR 10
Match: AT3G42140.1 (zinc ion binding;nucleic acid binding )

HSP 1 Score: 56.2 bits (134), Expect = 7.4e-08
Identity = 31/131 (23.66%), Postives = 54/131 (41.22%), Query Frame = 0

Query: 81  VMSAGPWLFDKHLLILEELGVDRQLSDLNFNKVAFWIRLLNLPFGFQNRLMTRRIGEEIG 140
           ++  GPW F+  + +++     +  SD  F ++ FWI++  +P  F    +   IGE +G
Sbjct: 75  ILRRGPWSFNDWMCVIQRW--TKLHSDAEFKRIPFWIQIRGIPLRFLTARIITSIGERMG 134

Query: 141 GFVEVECDQEGLCWGDSMRIKVHLDILKPLRRGIKVLSSNGLEEKWVAIRYERLPDFCYS 200
            F+E                         L R + VL            +YE+L +FC +
Sbjct: 135 LFLETN-----------------------LGRDVSVLK----------FQYEKLKNFCTT 170

Query: 201 CGRIGHVTKDC 212
           CG + H   +C
Sbjct: 195 CGMLSHDASEC 170

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GAU14523.11.2e-5730.86hypothetical protein TSUD_250650 [Trifolium subterraneum][more]
MCH82853.12.0e-5528.19hypothetical protein [Trifolium medium][more]
PNY16372.12.6e-5530.02ribonuclease H, partial [Trifolium pratense][more]
PNY17656.13.4e-5529.03ribonuclease H, partial [Trifolium pratense][more]
MCH80000.11.9e-5329.32hypothetical protein [Trifolium medium][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A2N9GPY17.1e-5933.93Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A2N9GJ355.1e-5731.25Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27322 PE=3 SV=1[more]
A0A392M6V19.6e-5628.19Uncharacterized protein (Fragment) OS=Trifolium medium OX=97028 GN=A2U01_0003665... [more]
A0A2K3PM581.3e-5530.02Ribonuclease H (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g013091 PE=4 SV... [more]
A0A2K3PQU31.6e-5529.03Ribonuclease H (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g014404 PE=4 SV... [more]
Match NameE-valueIdentityDescription
AT3G42140.17.4e-0823.66zinc ion binding;nucleic acid binding [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025836Zinc knuckle CX2CX4HX4CPFAMPF14392zf-CCHC_4coord: 164..211
e-value: 1.9E-15
score: 56.3
IPR025558Domain of unknown function DUF4283PFAMPF14111DUF4283coord: 31..162
e-value: 8.4E-17
score: 61.1
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 265..422
e-value: 5.1E-22
score: 80.8
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 266..414
IPR040256Uncharacterized protein At4g02000-likePANTHERPTHR31286GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8-LIKEcoord: 14..212
NoneNo IPR availablePANTHERPTHR31286:SF84SUBFAMILY NOT NAMEDcoord: 14..212
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 198..211
score: 10.279065

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10017614.1HG10017614.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding