Tan0010202 (gene) Snake gourd v1

Overview
NameTan0010202
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonuclease H domain
LocationLG07: 4279324 .. 4283000 (-)
RNA-Seq ExpressionTan0010202
SyntenyTan0010202
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGCCTCGCGCCTGCAACCCGATTCTGTAATCACTTCTTTAATTGATTAATTAACTAATTAATTTGTTAGTAAAATCACGCTTTAAATATCATTTAATTGTGGTCGCTTTCTATTCTATACCTCTCTACGCTGACGTGGACCTTGAAATGGACGTCTCCGTTTACCTTTGTTTTATTTTTCATTAAAAAAAAGGTTTAACCGGTGAAGTATTTAATTGATATTATATATATATATATATATATATATATATAATTTATCTGATACTTTTATATATAACAAGTTTAGTTTTTGAAGCTTCACCCTGGTTTCAAATTTTTTAAAAATATTTAATAGATTTCAAATTTAAAAGTATGAGATAAATTCTTAAATTTTTAATTTGTTTCTAATGAATTCTTCAGTTTTTAATTCTGTAATTGATGTCGAGTAGTTCTTTAGTACATTTAATATTTTAAATAAATTTACGAGTCTATTTAGATATAAAATTATTTTTTTTTTGTCTAATTCTAATAGCATTTATAAACTCTAAGGCTTTGGGGTGTATGTTATAATATCTTGTGTAAAGACATCTTGAATATGTTATATGGCAGTTCGAATGACCAAGTACGGAGGTCTCGTTGTTATATTTCGTATTTTTGGCCTATTGGATCGACTCTATATTAGGGTTTAGGAGAGGTGTCGTATATAAACTCCCAACTCCTATATGAAATAAGCTGCTTTTGTGTAATTTTATAATCTCGTTTTTCCTTAGTTAATAAAATTACTCTCTTTCTCTGCCTGTGGATGCTTTAGTGAACCACGTAAATTCTGTGTGTTGATTTTTTTTTTTTTTTTGTGACTCTTCGGTTATCTACTGTTTTCGTTTTAACCTCTAGATCTTGTGTTATATAGTCTATGATAAACTTTGCACCTCAAATACAGACTATTATAACCTACATTAAATAACTCTGCATTGTAAATACAAATTATATAACTTCAAACTATTATAATCCGATAAGTGTCCCAAATAGTCCCTAAATGTTTGTACTAAAACAAAAATAAACTCTTAATTTTATGTCTAATATTCATCAATTTTTTTAAAAAAATGTTGGATATTTTAAAGTTTAGAATTTATTAGACACAACTATAAAATTTAGAGACTAAACATGTAATTTAATCCTCCATTTTTCCTTATTTTAATCCTCCATTTTTCCTTTTGCTTTTTAGGGTTTTCTGTAATAGTGGGAAAGATTTTGAGATATATGCATAAATTTGATCGATAATATCAAAATCTAACAAATCCATTTAAGATTTTTGTCTTACTAATGTGATGCTCCAATTCGTCTTCATAGGAACTAGTATCGAGTTTCTTGAAAAGCATTATTTATTAGAAACATCTGATTCTATGCTATTGAAGGATTTAGTCACACACTTTTAAGCCTAAATCGGTCATTGAACAAATCATTTTTTTAAAAAAAAATTACTAAATCATCATGGTTGTTGTATAAATCTAAGATTTTAATGGATTTATGGAGTCTTTACCGGAGAATTCTTATCAACAACAATGAAAAAAAGAGTAATTACGCATAAGTAAAATTACATTACATTCACACACCATATATTCATCGATTAAAGAGAGTTTCACTAATTTATTTTTTGCTGATGACAATATTTTTTTTCTTTAATGATTCTGTGCAGGAATGCCAAAACATCAAGATGGTTCTCCAAGACTACGAGCGAGCTTCAGGCCCGTCCATAAACAAAGAGAAGTCAAAATTCATGATCAGCAGCAATGCGAACCAAGAAACCATTACCAGAATTCAAGTCTATCTCAGAGTGAATCATGAGGGCAATTTTGGCTCTTATCTTAGCCTCCTTTCAAAAAATACACGAAACAAGTCCCAGATTTTTCAAAAGGTGAAAAATCGCATTTGAAAGGCAATCCAAGGCTGGAAAGACAGGTTTTTCTCAGTAGGAGGAAAATAAGTCATTATCAAAGCTGTGGCTCAGGCTATCCCCGTGTATACCATGAGTTGCTTCCAACTCTCGGATAGGTTATGCAAGGATCTTAGTTCTATCTGCAACAAGTTTTGGTGGGGCTCGTCCGGGAAAAAAAGAAAATTCATTGGAGAAATTGGGATCGTTTCTATTGTAGCAAAAGAGATGGGGGGCTAGCTTTAGGGACTTCAAAATCTTTAATCAAGTTATGCTTGCCAAGCATAATTGGAGGTTAGTCAGGAATCCCGATAGGCTCCTTGCTAAGATTCTTAAATGCATGTACTTTAAAGACAATGATTTTCTCCATGCTACTCTTGGTCCTAACCTTTCCTTCACTTGGCGAGGCTATCGCTAGAAGATTGGCAACAGTTGTCATGTTTTTATCAACGAAGACCTCTAGATTCCAAATTTAAGCAATCTCAAACCTGTGTGGACCCATGAGAACTTCAAAGGTAAAAGAGTAGTCGAGCTTATTACTCAAGATGGCGGTTGGGATGAAAGCTGTGTCTGAGAAGCTTTTCTTCATCACGATGCAACTGATATCTTGAACATTCCCCTAGGAGGTCTGCGGAGGAAAGATGAAATCATTTGGAATGGTGTTAAAAAAAAAGAAGTTTTCGGTAAAAAATGCTTATCTTTTGGGTTTCTCTCTTTTTATTTCCTCTCGCCTGTCGCATTCTGATGCGATCGCCCTTAATTTGTGCTTGGAATGACTTTTGGAATATTGATGTGAGACCAAATATCAAAATTGTGTGCTGAAAAATTCTTAATAATATTATTCCCACTGTTTTTTTAATCTTTCTAAAAAAGGTGTACAAGTTGATGATACTTGCTCTTTTTGCAAGAAACACATGGAGACAACTACCCATCTGCTATGGGAGTGTAAGGTTGCTAAACGTGTTTGGAACTTTTTCCTTCCTACTTATGCATCTGTTTTTTCATCTGTCAGGAACCTCTGGGAACCGTGGGATTACTGGACTTGCTCCAAGGCTGAGGAAATCCAAGCAGATCGTCATGTTATTCATTTGGTGTTGTGGAACTTGTGGAGTCAAAGGAACCAAATCCATTTTAATGCAGGAATGTCGAGTCCTGATGATCTTGTCCTGAAAGTTGTTTCTGCCCTTCGCATACACCAAAGAGCTTCTGAAGCTTCTACCTCTCCAATTTGCCCCCTTACCCAAAAAAGGTCCTGGCAACCTTCGAAGGCGAATGTTTGGAAACTCAACACAGATGTCGCTTGGTCTTGTAAATTGAATCGTGGTGGTTTGGGGTGGATTGTTCGGGATGCTGCAGGAAAATTGATCTTGGAGGGATGCAAAATCCTCTCTTCTAGATGGCCTGTTAAACTGCTTAAAGCGCTTGCTATTCTGGAAGGTCTGAAGGAAATCCTAGCTTGGAAAATGAGTCAACTCCCGTCTTTGATCGTTGAAACTGACTCGCTGGAGGTAATCTTTCTTTTGAATGGAGTCACTGCCGACTTTTCTAAAATTTCTTTTGTTATTGGCGATATTCTTTCGCTTGTTAAGGATTTTGGGTCCATTATTTTTTTGTAAAGTTCCTAGAGAGGAGAACTCTCGGGTCTGGAGTCACTCACGCTCTTGCAGCTTTGACATCCTCTTTAGGAGACTCTCGGGTCTGGAATGAGGGCTTTCTAGAGGACATTATCTCTCTCATTTTTGAGATGGGTGTGGATGTTTGA

mRNA sequence

ATGGCGCCTCGCGCCTGCAACCCGATTCTGAACCTCTGGGAACCGTGGGATTACTGGACTTGCTCCAAGGCTGAGGAAATCCAAGCAGATCGTCATGTTATTCATTTGGTGTTGTGGAACTTGTGGAGTCAAAGGAACCAAATCCATTTTAATGCAGGAATGTCGAGTCCTGATGATCTTGTCCTGAAAGTTGTTTCTGCCCTTCGCATACACCAAAGAGCTTCTGAAGCTTCTACCTCTCCAATTTGCCCCCTTACCCAAAAAAGGTCCTGGCAACCTTCGAAGGCGAATGTTTGGAAACTCAACACAGATGTCGCTTGGTCTTGTAAATTGAATCGTGGTGGTTTGGGGTGGATTGTTCGGGATGCTGCAGGAAAATTGATCTTGGAGGGATGCAAAATCCTCTCTTCTAGATGGCCTGTTAAACTGCTTAAAGCGCTTGCTATTCTGGAAGGTCTGAAGGAAATCCTAGCTTGGAAAATGAGTCAACTCCCGTCTTTGATCGTTGAAACTGACTCGCTGGAGTTCCTAGAGAGGAGAACTCTCGGGTCTGGAGTCACTCACGCTCTTGCAGCTTTGACATCCTCTTTAGGAGACTCTCGGGTCTGGAATGAGGGCTTTCTAGAGGACATTATCTCTCTCATTTTTGAGATGGGTGTGGATGTTTGA

Coding sequence (CDS)

ATGGCGCCTCGCGCCTGCAACCCGATTCTGAACCTCTGGGAACCGTGGGATTACTGGACTTGCTCCAAGGCTGAGGAAATCCAAGCAGATCGTCATGTTATTCATTTGGTGTTGTGGAACTTGTGGAGTCAAAGGAACCAAATCCATTTTAATGCAGGAATGTCGAGTCCTGATGATCTTGTCCTGAAAGTTGTTTCTGCCCTTCGCATACACCAAAGAGCTTCTGAAGCTTCTACCTCTCCAATTTGCCCCCTTACCCAAAAAAGGTCCTGGCAACCTTCGAAGGCGAATGTTTGGAAACTCAACACAGATGTCGCTTGGTCTTGTAAATTGAATCGTGGTGGTTTGGGGTGGATTGTTCGGGATGCTGCAGGAAAATTGATCTTGGAGGGATGCAAAATCCTCTCTTCTAGATGGCCTGTTAAACTGCTTAAAGCGCTTGCTATTCTGGAAGGTCTGAAGGAAATCCTAGCTTGGAAAATGAGTCAACTCCCGTCTTTGATCGTTGAAACTGACTCGCTGGAGTTCCTAGAGAGGAGAACTCTCGGGTCTGGAGTCACTCACGCTCTTGCAGCTTTGACATCCTCTTTAGGAGACTCTCGGGTCTGGAATGAGGGCTTTCTAGAGGACATTATCTCTCTCATTTTTGAGATGGGTGTGGATGTTTGA

Protein sequence

MAPRACNPILNLWEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPICPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLEFLERRTLGSGVTHALAALTSSLGDSRVWNEGFLEDIISLIFEMGVDV
Homology
BLAST of Tan0010202 vs. NCBI nr
Match: XP_022155262.1 (uncharacterized protein LOC111022403 [Momordica charantia])

HSP 1 Score: 90.9 bits (224), Expect = 1.5e-14
Identity = 51/147 (34.69%), Postives = 79/147 (53.74%), Query Frame = 0

Query: 30  DRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPI-CPLTQK 89
           D  V+ +  W +W+ RN + F    SS   ++ ++   +      SE S S +   L  K
Sbjct: 9   DLDVLLIGSWVIWNHRNYVIFRGEHSSFSTMIQQLTKFVTESSYQSETSLSMLHKTLNNK 68

Query: 90  RSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALA 149
             W+P   ++W LN D +WS   +RGG+GWI+R   G ++L G + + +   VKLL+A A
Sbjct: 69  LKWEPPPMHIWTLNADASWSDSTHRGGIGWIIRSWDGDIVLAGNRFVEACNNVKLLEASA 128

Query: 150 ILEGLKEILAWKMSQLPSLIVETDSLE 176
           ILEGL+ +    +  L  L +ETDS E
Sbjct: 129 ILEGLRNLT--NLGVLRPLHIETDSAE 153

BLAST of Tan0010202 vs. NCBI nr
Match: XP_021722108.1 (uncharacterized protein LOC110689646 [Chenopodium quinoa])

HSP 1 Score: 87.0 bits (214), Expect = 2.2e-13
Identity = 57/189 (30.16%), Postives = 94/189 (49.74%), Query Frame = 0

Query: 21  CSKAEEIQADRH---VIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEA 80
           C     +  D H   ++  +LW +W +RN   +        D++ K VS +  +++A +A
Sbjct: 67  CGSIRVLHKDPHWWNILFAILWGIWLRRNVWSYENRKKDLMDVIQKAVSVVGDYEQAQQA 126

Query: 81  -STSPICPLTQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGCKILS 140
            S+S       +  W+P    + K+N+D A     +  GLG ++RDA G++++  C  L 
Sbjct: 127 LSSSEGRHQLLESKWKPPTQGMIKINSDAA-IFDNSAVGLGGVMRDALGEVVVATCLCLR 186

Query: 141 SRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLEFLERRTLGSGVTHALAALTSS 200
           S++ V + +ALA+   L+  L    S+   L     S  F   +  G+G  HALA L+SS
Sbjct: 187 SKYEVDVAEALALRHSLRIALE---SEFRKLARGCQSCSFSFVKRSGNGAAHALAKLSSS 246

Query: 201 LGDSRVWNE 206
            GD RVW E
Sbjct: 247 YGDLRVWME 251

BLAST of Tan0010202 vs. NCBI nr
Match: XP_022154990.1 (uncharacterized protein LOC111022134 isoform X1 [Momordica charantia])

HSP 1 Score: 84.7 bits (208), Expect = 1.1e-12
Identity = 48/171 (28.07%), Postives = 80/171 (46.78%), Query Frame = 0

Query: 13  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VS 72
           W   +YW     +  + +R    ++   +W  RN+  F    S   D+ L +       +
Sbjct: 169 WTTKEYWEWLMDKAGEEERRRSMIIACQIWEMRNKSIFKGVHSETRDIQLAIDRYIINSA 228

Query: 73  ALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAA 132
               + +       PI  +    +  W+P  +N WKLNTD AW    N  G+GWI+RD  
Sbjct: 229 GQDTNLKRKSKDFHPIRRIGDNTRARWKPPTSNSWKLNTDAAWRADTNTDGIGWILRDEK 288

Query: 133 GKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE 176
           G++I  GC+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Sbjct: 289 GEVIKTGCRIIRAERNITYLEVMAICEGLRAI---RQEHCRPIHLESDSLE 336

BLAST of Tan0010202 vs. NCBI nr
Match: XP_022154991.1 (uncharacterized protein LOC111022134 isoform X2 [Momordica charantia])

HSP 1 Score: 84.7 bits (208), Expect = 1.1e-12
Identity = 48/171 (28.07%), Postives = 80/171 (46.78%), Query Frame = 0

Query: 13  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VS 72
           W   +YW     +  + +R    ++   +W  RN+  F    S   D+ L +       +
Sbjct: 134 WTTKEYWEWLMDKAGEEERRRSMIIACQIWEMRNKSIFKGVHSETRDIQLAIDRYIINSA 193

Query: 73  ALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAA 132
               + +       PI  +    +  W+P  +N WKLNTD AW    N  G+GWI+RD  
Sbjct: 194 GQDTNLKRKSKDFHPIRRIGDNTRARWKPPTSNSWKLNTDAAWRADTNTDGIGWILRDEK 253

Query: 133 GKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE 176
           G++I  GC+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Sbjct: 254 GEVIKTGCRIIRAERNITYLEVMAICEGLRAI---RQEHCRPIHLESDSLE 301

BLAST of Tan0010202 vs. NCBI nr
Match: XP_022143535.1 (uncharacterized protein LOC111013412 [Momordica charantia])

HSP 1 Score: 82.8 bits (203), Expect = 4.1e-12
Identity = 49/163 (30.06%), Postives = 81/163 (49.69%), Query Frame = 0

Query: 23  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLK----VVSALRIHQRASEAS 82
           KA E +  R +I  + W +W  RN+  F        D+ L     ++++   +      S
Sbjct: 3   KAGEEERRRSMI--IAWQIWEMRNKSIFKGVHPETRDIQLAIDRYIINSAGRNTNLKGKS 62

Query: 83  TSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGC 142
           T+    L ++        W+P  +N WKLNT+ AW    N GG+GWI+RD  G++I   C
Sbjct: 63  TNKDLHLIRRIEDNTGAQWKPPTSNSWKLNTNAAWRADTNTGGIGWILRDEKGEVIKASC 122

Query: 143 KILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE 176
           +I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Sbjct: 123 RIIRAERNITYLEVMAICEGLRAI---RQEHCRPIHLESDSLE 160

BLAST of Tan0010202 vs. ExPASy TrEMBL
Match: A0A6J1DNV9 (uncharacterized protein LOC111022403 OS=Momordica charantia OX=3673 GN=LOC111022403 PE=4 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 7.3e-15
Identity = 51/147 (34.69%), Postives = 79/147 (53.74%), Query Frame = 0

Query: 30  DRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRASEASTSPI-CPLTQK 89
           D  V+ +  W +W+ RN + F    SS   ++ ++   +      SE S S +   L  K
Sbjct: 9   DLDVLLIGSWVIWNHRNYVIFRGEHSSFSTMIQQLTKFVTESSYQSETSLSMLHKTLNNK 68

Query: 90  RSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALA 149
             W+P   ++W LN D +WS   +RGG+GWI+R   G ++L G + + +   VKLL+A A
Sbjct: 69  LKWEPPPMHIWTLNADASWSDSTHRGGIGWIIRSWDGDIVLAGNRFVEACNNVKLLEASA 128

Query: 150 ILEGLKEILAWKMSQLPSLIVETDSLE 176
           ILEGL+ +    +  L  L +ETDS E
Sbjct: 129 ILEGLRNLT--NLGVLRPLHIETDSAE 153

BLAST of Tan0010202 vs. ExPASy TrEMBL
Match: A0A6J1DL64 (uncharacterized protein LOC111022134 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022134 PE=4 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 5.2e-13
Identity = 48/171 (28.07%), Postives = 80/171 (46.78%), Query Frame = 0

Query: 13  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VS 72
           W   +YW     +  + +R    ++   +W  RN+  F    S   D+ L +       +
Sbjct: 169 WTTKEYWEWLMDKAGEEERRRSMIIACQIWEMRNKSIFKGVHSETRDIQLAIDRYIINSA 228

Query: 73  ALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAA 132
               + +       PI  +    +  W+P  +N WKLNTD AW    N  G+GWI+RD  
Sbjct: 229 GQDTNLKRKSKDFHPIRRIGDNTRARWKPPTSNSWKLNTDAAWRADTNTDGIGWILRDEK 288

Query: 133 GKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE 176
           G++I  GC+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Sbjct: 289 GEVIKTGCRIIRAERNITYLEVMAICEGLRAI---RQEHCRPIHLESDSLE 336

BLAST of Tan0010202 vs. ExPASy TrEMBL
Match: A0A6J1DQC9 (uncharacterized protein LOC111022134 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022134 PE=4 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 5.2e-13
Identity = 48/171 (28.07%), Postives = 80/171 (46.78%), Query Frame = 0

Query: 13  WEPWDYWTCSKAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLKV------VS 72
           W   +YW     +  + +R    ++   +W  RN+  F    S   D+ L +       +
Sbjct: 134 WTTKEYWEWLMDKAGEEERRRSMIIACQIWEMRNKSIFKGVHSETRDIQLAIDRYIINSA 193

Query: 73  ALRIHQRASEASTSPICPL--TQKRSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAA 132
               + +       PI  +    +  W+P  +N WKLNTD AW    N  G+GWI+RD  
Sbjct: 194 GQDTNLKRKSKDFHPIRRIGDNTRARWKPPTSNSWKLNTDAAWRADTNTDGIGWILRDEK 253

Query: 133 GKLILEGCKILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE 176
           G++I  GC+I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Sbjct: 254 GEVIKTGCRIIRAERNITYLEVMAICEGLRAI---RQEHCRPIHLESDSLE 301

BLAST of Tan0010202 vs. ExPASy TrEMBL
Match: A0A6J1CP26 (uncharacterized protein LOC111013412 OS=Momordica charantia OX=3673 GN=LOC111013412 PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 2.0e-12
Identity = 49/163 (30.06%), Postives = 81/163 (49.69%), Query Frame = 0

Query: 23  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVLK----VVSALRIHQRASEAS 82
           KA E +  R +I  + W +W  RN+  F        D+ L     ++++   +      S
Sbjct: 3   KAGEEERRRSMI--IAWQIWEMRNKSIFKGVHPETRDIQLAIDRYIINSAGRNTNLKGKS 62

Query: 83  TSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGC 142
           T+    L ++        W+P  +N WKLNT+ AW    N GG+GWI+RD  G++I   C
Sbjct: 63  TNKDLHLIRRIEDNTGAQWKPPTSNSWKLNTNAAWRADTNTGGIGWILRDEKGEVIKASC 122

Query: 143 KILSSRWPVKLLKALAILEGLKEILAWKMSQLPSLIVETDSLE 176
           +I+ +   +  L+ +AI EGL+ I   +      + +E+DSLE
Sbjct: 123 RIIRAERNITYLEVMAICEGLRAI---RQEHCRPIHLESDSLE 160

BLAST of Tan0010202 vs. ExPASy TrEMBL
Match: A0A6J1DSV1 (uncharacterized protein LOC111023608 OS=Momordica charantia OX=3673 GN=LOC111023608 PE=4 SV=1)

HSP 1 Score: 79.3 bits (194), Expect = 2.2e-11
Identity = 45/144 (31.25%), Postives = 71/144 (49.31%), Query Frame = 0

Query: 23  KAEEIQADRHVIHLVLWNLWSQRNQIHFNAGMSSPDDLVL----KVVSALRIHQRASEAS 82
           KA E +  R +I  + W +W  RN+  F    S   D+ L     ++++          S
Sbjct: 3   KAGEEERRRSMI--IAWQIWEMRNKSIFKGVHSETRDIQLVIDRYIINSAGRDTNLKGKS 62

Query: 83  TSPICPLTQK------RSWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGC 142
            +    L ++        W+P  +N WKLNTD AW    N GG+GWI+RD  G++I   C
Sbjct: 63  ANKDLHLIRRIGDNTGARWKPPTSNSWKLNTDAAWRADTNTGGIGWILRDEKGEVIKADC 122

Query: 143 KILSSRWPVKLLKALAILEGLKEI 157
           +I+ +   +  L+ +AI EGL+ I
Sbjct: 123 RIIRTERNITYLEVMAICEGLRAI 144

BLAST of Tan0010202 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 58.2 bits (139), Expect = 1.0e-08
Identity = 39/150 (26.00%), Postives = 64/150 (42.67%), Query Frame = 0

Query: 37  VLWNLWSQRNQIHFNAGMSSPDDLVLKVVSAL---RIHQRASEASTSPICPLTQKRSWQP 96
           +LW LW  RN++ F     +  +++ +    L   RI   A    T P    +    W+P
Sbjct: 363 LLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNRSSCGRWRP 422

Query: 97  SKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEGCKILSSRWPVKLLKALAILEGL 156
                 K NTD  W+    R G+GW++R+  G++   G + L         K  ++LE  
Sbjct: 423 PPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALP--------KLKSVLEAE 482

Query: 157 KEILAWKMSQLPS-----LIVETDSLEFLE 179
            E + W +  L       +I E+DS   +E
Sbjct: 483 LEAMRWAVLSLSRFQYNYVIFESDSQVLIE 504

BLAST of Tan0010202 vs. TAIR 10
Match: AT2G34320.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 51.6 bits (122), Expect = 9.4e-07
Identity = 35/148 (23.65%), Postives = 58/148 (39.19%), Query Frame = 0

Query: 18  YWTCSKAEEIQADRHVIHLV---LWNLWSQRNQIHFNAGMSSPDDLVLKVVSALRIHQRA 77
           YW  +   EI     + +LV   LW LW  RN++ F        +++ + +         
Sbjct: 58  YWVLNLEVEIPKLGKIGNLVPWLLWRLWKSRNELMFKGKEYDAPEVLRRAMEDFEEWSTR 117

Query: 78  SEASTSPICPLTQKR---SWQPSKANVWKLNTDVAWSCKLNRGGLGWIVRDAAGKLILEG 137
            E       P  ++     W+       K NTD  W  +  R G+GWI+R+ +G ++  G
Sbjct: 118 RELEGKASGPQVERNLSVQWKAPPYQWVKCNTDATWQLENPRCGIGWILRNESGGVLWMG 177

Query: 138 CKILSSRWPVKLLKALAILEGLKEILAW 160
            + L         +   +LE   E L W
Sbjct: 178 ARALP--------RTKNVLEAELEALRW 197

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022155262.11.5e-1434.69uncharacterized protein LOC111022403 [Momordica charantia][more]
XP_021722108.12.2e-1330.16uncharacterized protein LOC110689646 [Chenopodium quinoa][more]
XP_022154990.11.1e-1228.07uncharacterized protein LOC111022134 isoform X1 [Momordica charantia][more]
XP_022154991.11.1e-1228.07uncharacterized protein LOC111022134 isoform X2 [Momordica charantia][more]
XP_022143535.14.1e-1230.06uncharacterized protein LOC111013412 [Momordica charantia][more]
Match NameE-valueIdentityDescription
A0A6J1DNV97.3e-1534.69uncharacterized protein LOC111022403 OS=Momordica charantia OX=3673 GN=LOC111022... [more]
A0A6J1DL645.2e-1328.07uncharacterized protein LOC111022134 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1DQC95.2e-1328.07uncharacterized protein LOC111022134 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A6J1CP262.0e-1230.06uncharacterized protein LOC111013412 OS=Momordica charantia OX=3673 GN=LOC111013... [more]
A0A6J1DSV12.2e-1131.25uncharacterized protein LOC111023608 OS=Momordica charantia OX=3673 GN=LOC111023... [more]
Match NameE-valueIdentityDescription
AT4G29090.11.0e-0826.00Ribonuclease H-like superfamily protein [more]
AT2G34320.19.4e-0723.65Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 102..175
e-value: 9.3E-11
score: 41.6
NoneNo IPR availablePANTHERPTHR47074BNAC02G40300D PROTEINcoord: 88..175

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0010202.1Tan0010202.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity