Tan0019989 (gene) Snake gourd v1

Overview
NameTan0019989
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDDE Tnp4 domain-containing protein
LocationLG09: 63489201 .. 63491727 (-)
RNA-Seq ExpressionTan0019989
SyntenyTan0019989
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAACGCGCCTAGCGGTCCGCGTGAACGTGTTTCCTGTGTAACCAAAAATCAAAAACCGGAACCTTCGGCTCCATCTTCCTCCACTCGGCTAAACGTTTTCCCCCCAGTCCTTCACCGCGTCCAATTCTCCTTTTAATTAATCAATTTTATCCAAAAAATAATAATTATTCACTTTAAAAAACAAACAAAAAAAAAAAGAAATGGGTCCAATTCCTAGTCACCGTCAAACAATCCTCTACGCGTTCCCACTTACCCACACGTTTCCCCAATTTCAATTTCGTTTCTCTGTTCGTATAAATATCATCCCTTTCCCCATTTCCCATTCTCACTCCCATCTTCTCTTTTCACATTTGGAAGATCAAGATCGACTAATAAATCCTCTTTACTTTCATACCCACTTCCTTAATTTTCACCTTTCATGGAAATTAGCTCGTTCCCATTTCTTAATCAGGACGATTTATTACCCATCTTCAATCTCTTCTCTGATATGGATAACAGTTTCAGTGTGAATCAGAGCCCCCGAAAGCGGCGACGACAAGACGACGACCAGAGTCAATTCAACGATGGCGCCAACGACTTACTAAAGCTCCCCTTCTGGTTCGATGACACCCACGATCAGAAACAACACCACTGGATTATGGATTCCGACGCCGATCACAAGCGCGAATTCCACCTTTCCGATGACAATTTCCCCCAATTCACCGCTAAAAAATCCCGCCGTACAACGGCAGAAAATGCCAGTTCGTCTCCGGCGAAGGGCGGTGCCGGCGCCGGCACCGGAGCACAGCAGCGACGGCTGTGGGTGAAGGACCGATCCAAGGACTGGTGGGACCAATGCAACCACCCGGATTTCCCCGACGAGGAATTCCGGCGGGCTTTCAGAATGAGTAAAGCGACATTCGATATGATCTGTAAGGAATTGGACTCGACGGTGATGAAAAAGGACACGATGCTTCGTGTCGCGATTCCGGTCCGGCAGCGGGTCGCCGTCTGTATATGGCGGTTGGCCACCGGCGAGCCGCTCCGGCTCGTGTCGAAACGGTTCGGGCTCGGAATCTCGACCTGTCACAAATTAGTTCTGGAAGTCTGCTCTGCAATTCGGAAAGTTCTAATGCCGAAATTCCTGCAATGGCCGGAGGATTCAAAATTAGCGAAAATCAAGCAAGAATTCGAGTCGATTTCAGGAATTCCGAGAGTGGGTGGCTCGATTTACACGACCCACATCCCAATAATCGCACCTAAAAACAACGTCGCCGCTTATTTCAACAAACGCCACACAGAACGCAACCAAAAAACTTCGTATTCCATCACCGTGCAAGGCGTCGTCGATCCGGCCGGCGTCTTCACCGACGTCTGCATCGGGTGGCCGGGATCCATGCCGGACGATCAAGTCCTGGAAAAATCAGCCCTTTTCGAAAGGGCAAACATGGGTCTTTTGAACGACGTCTTCATCGTCGGAAATTCAGGGTACCCATTAATGGACTGGCTTCTGGTGCCGTATTCAGTGCAGAATTTGACATGGACGCAACACGCTTTCAACGAGAAGGTTTCGGAGATTCAGTCGGCGGCGAAGGCGGCGTTCGGGCGGCTGAAGGGGCGGTGGTCGTGCCTGCAGAAAAGGACGGAAGTGAAGCTGCAGGAGCTGCCGGTGGTGCTCGGAGCTTGCTGTGTTCTTCATAATATATGCGAGATGAGGAAAGAAAGCTTCGATCCAGAGCTGAAATTTGAGCTTTTTGATGATGAAATGGTGCCTGAAAATAATGGGCTTAGATCTGTGAGTGCCATTCAAGCTAGGGATCATATTGCTCATAATCTTCTTCACCATGGGCTTGCTGGAACTGGATTTCTTTAGGGAGCTTCAATTTATTTTGGTGAAATGAAGTTTATATATATCTATATATGTGTTAAAAAAAAAAACATTGTTTTTGGCTATAGATTGGTATAGTCATGGTTAAAATAAGTGACTTTTAAACTATACTTGTAATTTGGTGAGGATTTTCCCATTGGAAAAAGAAAAAAGAAAAAATTTGACAGTTTTTTAGATGATTTTAGTGGTTAGTGATAGTGTTAGACCTTGGAAGGGGAAGGGGAAGGGATGTAATCAAACAATCATCTTTTGTATCTTTTGAAGGGAAGATTATTACTTTTTTAATTCGTACTTTTTTTTTAGGACTCGAATCTGGTCGTCTATTATGTACATGATTGTTAAAATAGTTGCGAGTTGCAACATAAACTTGATACTTGGTTACATGGGAAGAAAATGTTGTGTGGCTTCCATTAGATAAACTTTCATTTTCAACTAAAGATCTAGTAGTTTGTGGGCAACTACATTTAGAATTCTTAAAACGTATGAAAAAGAGATCAATTGGAAATTTCTTGTTATCGAGGGGAACGGTAGTTGTCTTTGTCGTAGTTGAAAAATACGTACTTGCTATCAAAGAGAGTTGAAACTCCGACGTGACCAATCAAAGACATTTGAGAATGACAATGTTGGTTGGTATGTTTGCTAACTCTATGGATTAGGAAC

mRNA sequence

CAACGCGCCTAGCGGTCCGCGTGAACGTGTTTCCTGTGTAACCAAAAATCAAAAACCGGAACCTTCGGCTCCATCTTCCTCCACTCGGCTAAACGTTTTCCCCCCAGTCCTTCACCGCGTCCAATTCTCCTTTTAATTAATCAATTTTATCCAAAAAATAATAATTATTCACTTTAAAAAACAAACAAAAAAAAAAAGAAATGGGTCCAATTCCTAGTCACCGTCAAACAATCCTCTACGCGTTCCCACTTACCCACACGTTTCCCCAATTTCAATTTCGTTTCTCTGTTCGTATAAATATCATCCCTTTCCCCATTTCCCATTCTCACTCCCATCTTCTCTTTTCACATTTGGAAGATCAAGATCGACTAATAAATCCTCTTTACTTTCATACCCACTTCCTTAATTTTCACCTTTCATGGAAATTAGCTCGTTCCCATTTCTTAATCAGGACGATTTATTACCCATCTTCAATCTCTTCTCTGATATGGATAACAGTTTCAGTGTGAATCAGAGCCCCCGAAAGCGGCGACGACAAGACGACGACCAGAGTCAATTCAACGATGGCGCCAACGACTTACTAAAGCTCCCCTTCTGGTTCGATGACACCCACGATCAGAAACAACACCACTGGATTATGGATTCCGACGCCGATCACAAGCGCGAATTCCACCTTTCCGATGACAATTTCCCCCAATTCACCGCTAAAAAATCCCGCCGTACAACGGCAGAAAATGCCAGTTCGTCTCCGGCGAAGGGCGGTGCCGGCGCCGGCACCGGAGCACAGCAGCGACGGCTGTGGGTGAAGGACCGATCCAAGGACTGGTGGGACCAATGCAACCACCCGGATTTCCCCGACGAGGAATTCCGGCGGGCTTTCAGAATGAGTAAAGCGACATTCGATATGATCTGTAAGGAATTGGACTCGACGGTGATGAAAAAGGACACGATGCTTCGTGTCGCGATTCCGGTCCGGCAGCGGGTCGCCGTCTGTATATGGCGGTTGGCCACCGGCGAGCCGCTCCGGCTCGTGTCGAAACGGTTCGGGCTCGGAATCTCGACCTGTCACAAATTAGTTCTGGAAGTCTGCTCTGCAATTCGGAAAGTTCTAATGCCGAAATTCCTGCAATGGCCGGAGGATTCAAAATTAGCGAAAATCAAGCAAGAATTCGAGTCGATTTCAGGAATTCCGAGAGTGGGTGGCTCGATTTACACGACCCACATCCCAATAATCGCACCTAAAAACAACGTCGCCGCTTATTTCAACAAACGCCACACAGAACGCAACCAAAAAACTTCGTATTCCATCACCGTGCAAGGCGTCGTCGATCCGGCCGGCGTCTTCACCGACGTCTGCATCGGGTGGCCGGGATCCATGCCGGACGATCAAGTCCTGGAAAAATCAGCCCTTTTCGAAAGGGCAAACATGGGTCTTTTGAACGACGTCTTCATCGTCGGAAATTCAGGGTACCCATTAATGGACTGGCTTCTGGTGCCGTATTCAGTGCAGAATTTGACATGGACGCAACACGCTTTCAACGAGAAGGTTTCGGAGATTCAGTCGGCGGCGAAGGCGGCGTTCGGGCGGCTGAAGGGGCGGTGGTCGTGCCTGCAGAAAAGGACGGAAGTGAAGCTGCAGGAGCTGCCGGTGGTGCTCGGAGCTTGCTGTGTTCTTCATAATATATGCGAGATGAGGAAAGAAAGCTTCGATCCAGAGCTGAAATTTGAGCTTTTTGATGATGAAATGGTGCCTGAAAATAATGGGCTTAGATCTGTGAGTGCCATTCAAGCTAGGGATCATATTGCTCATAATCTTCTTCACCATGGGCTTGCTGGAACTGGATTTCTTTAGGGAGCTTCAATTTATTTTGGTGAAATGAAGTTTATATATATCTATATATGTGTTAAAAAAAAAAACATTGTTTTTGGCTATAGATTGGTATAGTCATGGTTAAAATAAGTGACTTTTAAACTATACTTGTAATTTGGTGAGGATTTTCCCATTGGAAAAAGAAAAAAGAAAAAATTTGACAGTTTTTTAGATGATTTTAGTGGTTAGTGATAGTGTTAGACCTTGGAAGGGGAAGGGGAAGGGATGTAATCAAACAATCATCTTTTGTATCTTTTGAAGGGAAGATTATTACTTTTTTAATTCGTACTTTTTTTTTAGGACTCGAATCTGGTCGTCTATTATGTACATGATTGTTAAAATAGTTGCGAGTTGCAACATAAACTTGATACTTGGTTACATGGGAAGAAAATGTTGTGTGGCTTCCATTAGATAAACTTTCATTTTCAACTAAAGATCTAGTAGTTTGTGGGCAACTACATTTAGAATTCTTAAAACGTATGAAAAAGAGATCAATTGGAAATTTCTTGTTATCGAGGGGAACGGTAGTTGTCTTTGTCGTAGTTGAAAAATACGTACTTGCTATCAAAGAGAGTTGAAACTCCGACGTGACCAATCAAAGACATTTGAGAATGACAATGTTGGTTGGTATGTTTGCTAACTCTATGGATTAGGAAC

Coding sequence (CDS)

ATGGAAATTAGCTCGTTCCCATTTCTTAATCAGGACGATTTATTACCCATCTTCAATCTCTTCTCTGATATGGATAACAGTTTCAGTGTGAATCAGAGCCCCCGAAAGCGGCGACGACAAGACGACGACCAGAGTCAATTCAACGATGGCGCCAACGACTTACTAAAGCTCCCCTTCTGGTTCGATGACACCCACGATCAGAAACAACACCACTGGATTATGGATTCCGACGCCGATCACAAGCGCGAATTCCACCTTTCCGATGACAATTTCCCCCAATTCACCGCTAAAAAATCCCGCCGTACAACGGCAGAAAATGCCAGTTCGTCTCCGGCGAAGGGCGGTGCCGGCGCCGGCACCGGAGCACAGCAGCGACGGCTGTGGGTGAAGGACCGATCCAAGGACTGGTGGGACCAATGCAACCACCCGGATTTCCCCGACGAGGAATTCCGGCGGGCTTTCAGAATGAGTAAAGCGACATTCGATATGATCTGTAAGGAATTGGACTCGACGGTGATGAAAAAGGACACGATGCTTCGTGTCGCGATTCCGGTCCGGCAGCGGGTCGCCGTCTGTATATGGCGGTTGGCCACCGGCGAGCCGCTCCGGCTCGTGTCGAAACGGTTCGGGCTCGGAATCTCGACCTGTCACAAATTAGTTCTGGAAGTCTGCTCTGCAATTCGGAAAGTTCTAATGCCGAAATTCCTGCAATGGCCGGAGGATTCAAAATTAGCGAAAATCAAGCAAGAATTCGAGTCGATTTCAGGAATTCCGAGAGTGGGTGGCTCGATTTACACGACCCACATCCCAATAATCGCACCTAAAAACAACGTCGCCGCTTATTTCAACAAACGCCACACAGAACGCAACCAAAAAACTTCGTATTCCATCACCGTGCAAGGCGTCGTCGATCCGGCCGGCGTCTTCACCGACGTCTGCATCGGGTGGCCGGGATCCATGCCGGACGATCAAGTCCTGGAAAAATCAGCCCTTTTCGAAAGGGCAAACATGGGTCTTTTGAACGACGTCTTCATCGTCGGAAATTCAGGGTACCCATTAATGGACTGGCTTCTGGTGCCGTATTCAGTGCAGAATTTGACATGGACGCAACACGCTTTCAACGAGAAGGTTTCGGAGATTCAGTCGGCGGCGAAGGCGGCGTTCGGGCGGCTGAAGGGGCGGTGGTCGTGCCTGCAGAAAAGGACGGAAGTGAAGCTGCAGGAGCTGCCGGTGGTGCTCGGAGCTTGCTGTGTTCTTCATAATATATGCGAGATGAGGAAAGAAAGCTTCGATCCAGAGCTGAAATTTGAGCTTTTTGATGATGAAATGGTGCCTGAAAATAATGGGCTTAGATCTGTGAGTGCCATTCAAGCTAGGGATCATATTGCTCATAATCTTCTTCACCATGGGCTTGCTGGAACTGGATTTCTTTAG

Protein sequence

MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRRQDDDQSQFNDGANDLLKLPFWFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAKGGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVLGACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAGTGFL
Homology
BLAST of Tan0019989 vs. ExPASy Swiss-Prot
Match: Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 2.8e-37
Identity = 118/397 (29.72%), Postives = 185/397 (46.60%), Query Frame = 0

Query: 97  KKSRRTTAENASSSPAKGGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDF----PDEEFRR 156
           KK  R     A+++     A A             +S DWWD  +   +      + F  
Sbjct: 15  KKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSRRIYGGSTDPKTFES 74

Query: 157 AFRMSKATFDMICKELDSTVMKKDTMLRVA----IPVRQRVAVCIWRLATGEPLRLVSKR 216
            F++S+ TFD IC  + +    K      +    + +  RVAV + RL +GE L ++ + 
Sbjct: 75  VFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAVALRRLGSGESLSVIGET 134

Query: 217 FGLGISTCHKLVLEVCSAIRKVLMPKFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTH 276
           FG+  ST  ++      ++ +  +   L WP  SKL +IK +FE ISG+P   G+I  TH
Sbjct: 135 FGMNQSTVSQITWRFVESMEERAI-HHLSWP--SKLDEIKSKFEKISGLPNCCGAIDITH 194

Query: 277 I----PIIAPKNNVAAYFNKRHTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQ 336
           I    P + P N V           + + ++S+T+Q VVDP   F DV  GWPGS+ DD 
Sbjct: 195 IVMNLPAVEPSNKVWL---------DGEKNFSMTLQAVVDPDMRFLDVIAGWPGSLNDDV 254

Query: 337 VLEKSALFERANMG-LLND------------VFIVGNSGYPLMDWLLVPYSVQNLTWTQH 396
           VL+ S  ++    G  LN              +IVG+SG+PL+ WLL PY  +  +  Q 
Sbjct: 255 VLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGKPTSLPQT 314

Query: 397 AFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEV-KLQELPVVLGACCVLHN-ICEMRKES 456
            FN++ SE   AA+ A  +LK RW  +     +     LP ++  CC+LHN I +M  ++
Sbjct: 315 EFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNIIIDMEDQT 374

Query: 457 FDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNL 467
            D +   +  D      +  L   ++   RD ++  L
Sbjct: 375 LDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQL 399

BLAST of Tan0019989 vs. ExPASy Swiss-Prot
Match: Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 2.9e-34
Identity = 94/314 (29.94%), Postives = 156/314 (49.68%), Query Frame = 0

Query: 135 DWWD----QCNHPDFPDEE---FRRAFRMSKATFDMICKELDSTVMKKDTMLRVAI---- 194
           DWWD    + + P  P +E   F+  FR SK TF  IC  +   ++ +     + I    
Sbjct: 43  DWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRL 102

Query: 195 -PVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWPEDS 254
             V ++VA+ + RLA+G+    V   FG+G ST  ++      A+ +      L+WP+  
Sbjct: 103 LSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE-RAKHHLRWPDSD 162

Query: 255 KLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTSYSITVQGV 314
           ++ +IK +FE + G+P   G+I TTHI +  P    +  +       +Q+ +YS+ +QGV
Sbjct: 163 RIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDW------CDQEKNYSMFLQGV 222

Query: 315 VDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLND-------------VFIVGNS 374
            D    F ++  GWPG M   ++L+ S  F+      + D              ++VG  
Sbjct: 223 FDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGI 282

Query: 375 GYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQK-RTEVKLQE 423
            YPL+ WL+ P+   + + +  AFNE+  +++S A  AF +LKG W  L K       ++
Sbjct: 283 SYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRK 342

BLAST of Tan0019989 vs. ExPASy Swiss-Prot
Match: Q96MB7 (Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 1.1e-17
Identity = 70/261 (26.82%), Postives = 120/261 (45.98%), Query Frame = 0

Query: 167 ELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSA 226
           EL    + + T    AI    +V   +    +G     +    G+  ++  + V  V  A
Sbjct: 51  ELLGANLSRPTQRSRAISPETQVLAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEA 110

Query: 227 IRKVLMPKFLQWPED-SKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKR 286
           + +    +F+++P D + +  +K EF  ++G+P V G +   H+ I AP     +Y N+ 
Sbjct: 111 LVE-RASQFIRFPADEASIQALKDEFYGLAGMPGVMGVVDCIHVAIKAPNAEDLSYVNR- 170

Query: 287 HTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFI 346
                 K  +S+    V D  G    V   WPGS+ D  VL++S+L  +   G+  D ++
Sbjct: 171 ------KGLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSLSSQFEAGMHKDSWL 230

Query: 347 VGNSGYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVK 406
           +G+S + L  WL+ P  +   T  ++ +N   S   S  +  F  L  R+ CL   ++  
Sbjct: 231 LGDSSFFLRTWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLD-GSKGA 290

Query: 407 LQELPV----VLGACCVLHNI 423
           LQ  P     ++ ACCVLHNI
Sbjct: 291 LQYSPEKSSHIILACCVLHNI 301

BLAST of Tan0019989 vs. ExPASy Swiss-Prot
Match: Q17QR8 (Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 4.2e-17
Identity = 69/261 (26.44%), Postives = 119/261 (45.59%), Query Frame = 0

Query: 167 ELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSA 226
           EL    + + T    AI    ++   +    +G     +    G+  ++  + V  V  A
Sbjct: 51  ELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEA 110

Query: 227 IRKVLMPKFLQWPED-SKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKR 286
           + +    +F+ +P D + +  +K EF  ++GIP V G +   H+ I AP     +Y N+ 
Sbjct: 111 LVE-RASQFIHFPADEASVQALKDEFYGLAGIPGVIGVVDCMHVAIKAPNAEDLSYVNR- 170

Query: 287 HTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFI 346
                 K  +S+    V D  G    V   WPGS+ D  VL++S+L  +   G+  + ++
Sbjct: 171 ------KGLHSLNCLMVCDIRGALMTVETSWPGSLQDCVVLQQSSLSSQFEAGMHKESWL 230

Query: 347 VGNSGYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVK 406
           +G+S + L  WL+ P  +   T  ++ +N   S   S  +  F  L  R+ CL   ++  
Sbjct: 231 LGDSSFFLRTWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLD-GSKGA 290

Query: 407 LQELPV----VLGACCVLHNI 423
           LQ  P     ++ ACCVLHNI
Sbjct: 291 LQYSPEKSSHIILACCVLHNI 301

BLAST of Tan0019989 vs. ExPASy Swiss-Prot
Match: B0BN95 (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 4.2e-17
Identity = 69/261 (26.44%), Postives = 119/261 (45.59%), Query Frame = 0

Query: 167 ELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSA 226
           EL    + + T    AI    ++   +    +G     +    G+  ++  + V  V  A
Sbjct: 51  ELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEA 110

Query: 227 IRKVLMPKFLQWPED-SKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKR 286
           + +    +F+ +P D + +  +K EF  ++G+P V G++   H+ I AP     +Y N+ 
Sbjct: 111 LVE-RASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCIHVAIKAPNAEDLSYVNR- 170

Query: 287 HTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFI 346
                 K  +S+    V D  G    V   WPGS+ D  VL++S+L  +   G+  D ++
Sbjct: 171 ------KGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSLSSQFETGMPKDSWL 230

Query: 347 VGNSGYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVK 406
           +G+S + L  WLL P  +   T  ++ +N   S   S  +     L  R+ CL   ++  
Sbjct: 231 LGDSSFFLHTWLLTPLHIPE-TPAEYRYNRAHSATHSVIEKTLRTLCCRFRCLD-GSKGA 290

Query: 407 LQELPV----VLGACCVLHNI 423
           LQ  P     ++ ACCVLHNI
Sbjct: 291 LQYSPEKSSHIILACCVLHNI 301

BLAST of Tan0019989 vs. NCBI nr
Match: XP_023529395.1 (protein ALP1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 846.7 bits (2186), Expect = 1.0e-241
Identity = 424/484 (87.60%), Postives = 438/484 (90.50%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRRQ----DDDQSQFNDGA---ND 60
           MEISSFPFLNQDDLLPIFNLFS+MD++FSVN SP+KRRRQ    DDDQ+QFN  +   ++
Sbjct: 1   MEISSFPFLNQDDLLPIFNLFSEMDDNFSVNHSPKKRRRQDNDDDDDQTQFNKTSFDDDE 60

Query: 61  LLKLPFWFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAK 120
           LLKLPFWFD   D KQ HWIM    + K EF +SD+N  QF  KK RR T EN  SSP K
Sbjct: 61  LLKLPFWFD---DDKQQHWIM----EQKPEFQVSDENLTQFVPKKPRRATPENTHSSPVK 120

Query: 121 GGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVM 180
           GGA    G Q RRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVM
Sbjct: 121 GGA----GTQHRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVM 180

Query: 181 KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP 240
           KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP
Sbjct: 181 KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP 240

Query: 241 KFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT 300
           KFLQWPEDSKLAKIKQEFESISGIP+VGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT
Sbjct: 241 KFLQWPEDSKLAKIKQEFESISGIPKVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT 300

Query: 301 SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPL 360
           SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLL DV IVGNSGYPL
Sbjct: 301 SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLKDVSIVGNSGYPL 360

Query: 361 MDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVL 420
            DWLLVPYS QNLTWTQHAFNEKVSEIQ AAKAAFGRLKGRW+CLQKRTEVKLQELPVVL
Sbjct: 361 TDWLLVPYSAQNLTWTQHAFNEKVSEIQGAAKAAFGRLKGRWTCLQKRTEVKLQELPVVL 420

Query: 421 GACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAG 478
           GACCVLHNICEMRKE FDPELKFE FDDEMVPENNG+RS SAIQARDHIAHNLLHHGLAG
Sbjct: 421 GACCVLHNICEMRKERFDPELKFEFFDDEMVPENNGVRSASAIQARDHIAHNLLHHGLAG 473

BLAST of Tan0019989 vs. NCBI nr
Match: XP_022933888.1 (protein ALP1-like [Cucurbita moschata] >KAG7021950.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 845.1 bits (2182), Expect = 2.9e-241
Identity = 423/484 (87.40%), Postives = 440/484 (90.91%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRRQDDD----QSQFNDGA---ND 60
           MEISSFPFLNQDDLLPIFNLFS+MD++FSVNQSP+KRRRQ+DD    Q+QFN  +   ++
Sbjct: 1   MEISSFPFLNQDDLLPIFNLFSEMDDNFSVNQSPKKRRRQNDDDDDHQTQFNKTSFDDDE 60

Query: 61  LLKLPFWFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAK 120
           LLKLPFWFD   D KQ HWIM    + K EF +SD+N  QF  KK RR T EN  SSPAK
Sbjct: 61  LLKLPFWFD---DDKQQHWIM----EQKPEFQVSDENLTQFVPKKPRRATPENTHSSPAK 120

Query: 121 GGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVM 180
           GGA    G Q RRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMIC+ELDSTVM
Sbjct: 121 GGA----GTQHRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICQELDSTVM 180

Query: 181 KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP 240
           KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP
Sbjct: 181 KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP 240

Query: 241 KFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT 300
           KFLQWPE+SKLAKIKQEFESISGIP+VGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT
Sbjct: 241 KFLQWPEESKLAKIKQEFESISGIPKVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT 300

Query: 301 SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPL 360
           SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLL DV IVGNSGYPL
Sbjct: 301 SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLKDVSIVGNSGYPL 360

Query: 361 MDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVL 420
            DWLLVPYS QNLTWTQHAFNEKVSEIQ AAKAAFGRLKGRW+CLQKRTEVKLQELPVVL
Sbjct: 361 TDWLLVPYSAQNLTWTQHAFNEKVSEIQGAAKAAFGRLKGRWTCLQKRTEVKLQELPVVL 420

Query: 421 GACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAG 478
           GACCVLHNICEMRKE FDPELKFE FDDEMVPENNG+RS SAIQARDHIAHNLLHHGLAG
Sbjct: 421 GACCVLHNICEMRKERFDPELKFEFFDDEMVPENNGVRSASAIQARDHIAHNLLHHGLAG 473

BLAST of Tan0019989 vs. NCBI nr
Match: KAG6588056.1 (Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 844.0 bits (2179), Expect = 6.5e-241
Identity = 422/484 (87.19%), Postives = 440/484 (90.91%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRRQDDD----QSQFNDGA---ND 60
           MEISSFPFLNQDDL+PIFNLFS+MD++FSVNQSP+KRRRQ+DD    Q+QFN  +   ++
Sbjct: 1   MEISSFPFLNQDDLVPIFNLFSEMDDNFSVNQSPKKRRRQNDDDDDHQTQFNKTSFDDDE 60

Query: 61  LLKLPFWFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAK 120
           LLKLPFWFD   D KQ HWIM    + K EF +SD+N  QF  KK RR T EN  SSPAK
Sbjct: 61  LLKLPFWFD---DDKQQHWIM----EQKPEFQVSDENLTQFVPKKPRRATPENTHSSPAK 120

Query: 121 GGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVM 180
           GGA    G Q RRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMIC+ELDSTVM
Sbjct: 121 GGA----GTQHRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICQELDSTVM 180

Query: 181 KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP 240
           KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP
Sbjct: 181 KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP 240

Query: 241 KFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT 300
           KFLQWPE+SKLAKIKQEFESISGIP+VGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT
Sbjct: 241 KFLQWPEESKLAKIKQEFESISGIPKVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT 300

Query: 301 SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPL 360
           SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLL DV IVGNSGYPL
Sbjct: 301 SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLKDVSIVGNSGYPL 360

Query: 361 MDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVL 420
            DWLLVPYS QNLTWTQHAFNEKVSEIQ AAKAAFGRLKGRW+CLQKRTEVKLQELPVVL
Sbjct: 361 TDWLLVPYSAQNLTWTQHAFNEKVSEIQGAAKAAFGRLKGRWTCLQKRTEVKLQELPVVL 420

Query: 421 GACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAG 478
           GACCVLHNICEMRKE FDPELKFE FDDEMVPENNG+RS SAIQARDHIAHNLLHHGLAG
Sbjct: 421 GACCVLHNICEMRKERFDPELKFEFFDDEMVPENNGVRSASAIQARDHIAHNLLHHGLAG 473

BLAST of Tan0019989 vs. NCBI nr
Match: XP_022967341.1 (protein ALP1-like [Cucurbita maxima])

HSP 1 Score: 842.4 bits (2175), Expect = 1.9e-240
Identity = 422/483 (87.37%), Postives = 438/483 (90.68%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRRQ---DDDQSQFNDGA---NDL 60
           MEISSFPFLNQDDLLPIFNLFS+MD++FSVN SP+KRRRQ   DDDQ+QFN  +   ++L
Sbjct: 1   MEISSFPFLNQDDLLPIFNLFSEMDDNFSVNYSPKKRRRQDDDDDDQTQFNKTSFDDDEL 60

Query: 61  LKLPFWFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAKG 120
           LKLPFWFD   D KQ HWIM    + K EF + D+N  QF AKK RR T EN  SSPAKG
Sbjct: 61  LKLPFWFD---DDKQQHWIM----EQKPEFQVFDENLTQFVAKKPRRATPENTHSSPAKG 120

Query: 121 GAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVMK 180
           GA    G Q RRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMIC+ELDSTVMK
Sbjct: 121 GA----GTQHRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICQELDSTVMK 180

Query: 181 KDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPK 240
           KDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPK
Sbjct: 181 KDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPK 240

Query: 241 FLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTS 300
           FLQWPEDSKLAKIKQEFESISGIP+VGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTS
Sbjct: 241 FLQWPEDSKLAKIKQEFESISGIPKVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTS 300

Query: 301 YSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPLM 360
           YSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLL DV IVGNSGYPL 
Sbjct: 301 YSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLKDVSIVGNSGYPLT 360

Query: 361 DWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVLG 420
           DWLLVPYS QNLTWTQHAFNEKVSEIQ AAKAAFGRLKGRW+CLQKRTEVKLQELPVVLG
Sbjct: 361 DWLLVPYSAQNLTWTQHAFNEKVSEIQGAAKAAFGRLKGRWTCLQKRTEVKLQELPVVLG 420

Query: 421 ACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAGT 478
           ACCVLHNICEMRKE FDPELKFE FDDEMVPENNG+RS SAI ARDHI+HNLLHHGLAGT
Sbjct: 421 ACCVLHNICEMRKERFDPELKFEFFDDEMVPENNGVRSASAILARDHISHNLLHHGLAGT 472

BLAST of Tan0019989 vs. NCBI nr
Match: XP_038880089.1 (protein ALP1-like [Benincasa hispida])

HSP 1 Score: 824.3 bits (2128), Expect = 5.3e-235
Identity = 422/493 (85.60%), Postives = 440/493 (89.25%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDN-------SFSVNQSPRKRRRQD---DDQSQFND- 60
           MEISSFPFLNQ++LLPIFNLFSDMDN       +FSVNQSP+KRRR D   DD SQFN+ 
Sbjct: 1   MEISSFPFLNQEELLPIFNLFSDMDNNHNNTNATFSVNQSPKKRRRSDENGDDHSQFNNI 60

Query: 61  --GAND--LLKLPFWFDDTHDQKQHHWIMDSDADHKR-EFHLSDDNFPQFTAKKSRRTTA 120
               ND  L KLP WF     + Q +WIMDS+    R EFHLSD N  QF +KK RRTTA
Sbjct: 61  SFTENDEALQKLPCWF-----ESQENWIMDSEEPKPRNEFHLSDQNPTQF-SKKPRRTTA 120

Query: 121 ENASSSPAKGGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMI 180
           EN   SPAK   G   GAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSK+TFDMI
Sbjct: 121 EN--GSPAKNPTGG--GAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKSTFDMI 180

Query: 181 CKELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVC 240
           CKELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVC
Sbjct: 181 CKELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVC 240

Query: 241 SAIRKVLMPKFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNK 300
           SAIRKVLMPKFLQWP++SKL KIKQEFESISGIP+VGGSIYTTHIPIIAPKNNVAAYFNK
Sbjct: 241 SAIRKVLMPKFLQWPDESKLTKIKQEFESISGIPKVGGSIYTTHIPIIAPKNNVAAYFNK 300

Query: 301 RHTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVF 360
           RHTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKS LFERANMGLLNDVF
Sbjct: 301 RHTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSVLFERANMGLLNDVF 360

Query: 361 IVGNSGYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEV 420
           IVGNSGYPLMDWLLVPY+VQNLTWTQH FNEKV EIQ+AAK AFGRLKGRWSCLQKRTEV
Sbjct: 361 IVGNSGYPLMDWLLVPYTVQNLTWTQHGFNEKVGEIQAAAKTAFGRLKGRWSCLQKRTEV 420

Query: 421 KLQELPVVLGACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAH 478
           KLQELPVVLGACCVLHNICE+RKE FDP+LKFEL+DDEMVPENNGLRSVSAIQARDHIAH
Sbjct: 421 KLQELPVVLGACCVLHNICEIRKEKFDPDLKFELYDDEMVPENNGLRSVSAIQARDHIAH 480

BLAST of Tan0019989 vs. ExPASy TrEMBL
Match: A0A6J1F642 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111441167 PE=3 SV=1)

HSP 1 Score: 845.1 bits (2182), Expect = 1.4e-241
Identity = 423/484 (87.40%), Postives = 440/484 (90.91%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRRQDDD----QSQFNDGA---ND 60
           MEISSFPFLNQDDLLPIFNLFS+MD++FSVNQSP+KRRRQ+DD    Q+QFN  +   ++
Sbjct: 1   MEISSFPFLNQDDLLPIFNLFSEMDDNFSVNQSPKKRRRQNDDDDDHQTQFNKTSFDDDE 60

Query: 61  LLKLPFWFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAK 120
           LLKLPFWFD   D KQ HWIM    + K EF +SD+N  QF  KK RR T EN  SSPAK
Sbjct: 61  LLKLPFWFD---DDKQQHWIM----EQKPEFQVSDENLTQFVPKKPRRATPENTHSSPAK 120

Query: 121 GGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVM 180
           GGA    G Q RRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMIC+ELDSTVM
Sbjct: 121 GGA----GTQHRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICQELDSTVM 180

Query: 181 KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP 240
           KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP
Sbjct: 181 KKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMP 240

Query: 241 KFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT 300
           KFLQWPE+SKLAKIKQEFESISGIP+VGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT
Sbjct: 241 KFLQWPEESKLAKIKQEFESISGIPKVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKT 300

Query: 301 SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPL 360
           SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLL DV IVGNSGYPL
Sbjct: 301 SYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLKDVSIVGNSGYPL 360

Query: 361 MDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVL 420
            DWLLVPYS QNLTWTQHAFNEKVSEIQ AAKAAFGRLKGRW+CLQKRTEVKLQELPVVL
Sbjct: 361 TDWLLVPYSAQNLTWTQHAFNEKVSEIQGAAKAAFGRLKGRWTCLQKRTEVKLQELPVVL 420

Query: 421 GACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAG 478
           GACCVLHNICEMRKE FDPELKFE FDDEMVPENNG+RS SAIQARDHIAHNLLHHGLAG
Sbjct: 421 GACCVLHNICEMRKERFDPELKFEFFDDEMVPENNGVRSASAIQARDHIAHNLLHHGLAG 473

BLAST of Tan0019989 vs. ExPASy TrEMBL
Match: A0A6J1HRQ5 (protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111466889 PE=3 SV=1)

HSP 1 Score: 842.4 bits (2175), Expect = 9.2e-241
Identity = 422/483 (87.37%), Postives = 438/483 (90.68%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRRQ---DDDQSQFNDGA---NDL 60
           MEISSFPFLNQDDLLPIFNLFS+MD++FSVN SP+KRRRQ   DDDQ+QFN  +   ++L
Sbjct: 1   MEISSFPFLNQDDLLPIFNLFSEMDDNFSVNYSPKKRRRQDDDDDDQTQFNKTSFDDDEL 60

Query: 61  LKLPFWFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAKG 120
           LKLPFWFD   D KQ HWIM    + K EF + D+N  QF AKK RR T EN  SSPAKG
Sbjct: 61  LKLPFWFD---DDKQQHWIM----EQKPEFQVFDENLTQFVAKKPRRATPENTHSSPAKG 120

Query: 121 GAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVMK 180
           GA    G Q RRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMIC+ELDSTVMK
Sbjct: 121 GA----GTQHRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICQELDSTVMK 180

Query: 181 KDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPK 240
           KDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPK
Sbjct: 181 KDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPK 240

Query: 241 FLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTS 300
           FLQWPEDSKLAKIKQEFESISGIP+VGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTS
Sbjct: 241 FLQWPEDSKLAKIKQEFESISGIPKVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTS 300

Query: 301 YSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPLM 360
           YSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLL DV IVGNSGYPL 
Sbjct: 301 YSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLKDVSIVGNSGYPLT 360

Query: 361 DWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVLG 420
           DWLLVPYS QNLTWTQHAFNEKVSEIQ AAKAAFGRLKGRW+CLQKRTEVKLQELPVVLG
Sbjct: 361 DWLLVPYSAQNLTWTQHAFNEKVSEIQGAAKAAFGRLKGRWTCLQKRTEVKLQELPVVLG 420

Query: 421 ACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAGT 478
           ACCVLHNICEMRKE FDPELKFE FDDEMVPENNG+RS SAI ARDHI+HNLLHHGLAGT
Sbjct: 421 ACCVLHNICEMRKERFDPELKFEFFDDEMVPENNGVRSASAILARDHISHNLLHHGLAGT 472

BLAST of Tan0019989 vs. ExPASy TrEMBL
Match: A0A6J1HAH8 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111461585 PE=3 SV=1)

HSP 1 Score: 801.2 bits (2068), Expect = 2.3e-228
Identity = 401/478 (83.89%), Postives = 429/478 (89.75%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRR-QDDDQSQFNDGANDLLKLPF 60
           MEI+SFPFLNQD+ LPI N FS+MDN FSVNQSP+KRRR  D+      +G NDLL  PF
Sbjct: 1   MEITSFPFLNQDEFLPISNFFSEMDN-FSVNQSPKKRRRPTDNGDDGSGNGGNDLLNHPF 60

Query: 61  WFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAKGGAGAG 120
           W ++  D+++ HW+MDSD     EF        Q  +KK RR T +N +SSPAKGGA   
Sbjct: 61  WVEN-RDEQREHWVMDSD-----EF--------QVLSKKPRRGTPQNDNSSPAKGGA--- 120

Query: 121 TGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVMKKDTML 180
            GA QRRLWVK+RSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMIC+ELDSTVMKKDTML
Sbjct: 121 -GAHQRRLWVKNRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICRELDSTVMKKDTML 180

Query: 181 RVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWP 240
           RVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWP
Sbjct: 181 RVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWP 240

Query: 241 EDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTSYSITV 300
           E+SKL KIKQEFESISGIP+VGG+IYTTHIPIIAPKN+VAAYFNKRHTERNQKTSYSITV
Sbjct: 241 EESKLTKIKQEFESISGIPKVGGAIYTTHIPIIAPKNSVAAYFNKRHTERNQKTSYSITV 300

Query: 301 QGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPLMDWLLV 360
           QGVVDPAGVFTDVCIGWPGSMPDD VLEKSALFERANMGLLNDVFIVGNSGYPLMDWLLV
Sbjct: 301 QGVVDPAGVFTDVCIGWPGSMPDDHVLEKSALFERANMGLLNDVFIVGNSGYPLMDWLLV 360

Query: 361 PYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVLGACCVL 420
           PY++QNLTWTQHAFNEKV EIQ+AAKAAFGRLKGRW+CLQKRTEVKLQELPVVLGACCVL
Sbjct: 361 PYTLQNLTWTQHAFNEKVGEIQAAAKAAFGRLKGRWTCLQKRTEVKLQELPVVLGACCVL 420

Query: 421 HNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAGTGFL 478
           HNICEMRKE FD ELKFE+FDDEMVPE+NG+RSVSAIQ RDHIAHNLLHHGLAGTGFL
Sbjct: 421 HNICEMRKERFDSELKFEVFDDEMVPESNGVRSVSAIQGRDHIAHNLLHHGLAGTGFL 459

BLAST of Tan0019989 vs. ExPASy TrEMBL
Match: A0A6J1JHN7 (protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111485185 PE=3 SV=1)

HSP 1 Score: 799.3 bits (2063), Expect = 8.9e-228
Identity = 401/478 (83.89%), Postives = 429/478 (89.75%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNSFSVNQSPRKRRR-QDDDQSQFNDGANDLLKLPF 60
           MEI+S PFLNQD+ LPI N FS++DN FSVNQS +KRRR  D+      +G NDLL LPF
Sbjct: 1   MEITSVPFLNQDEFLPISNFFSEIDN-FSVNQSLKKRRRPTDNGDDGSGNGGNDLLNLPF 60

Query: 61  WFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTAKKSRRTTAENASSSPAKGGAGAG 120
           W ++  D++Q HW+MDSD     EF        Q  +KK RR T +N +SSPAKGGA   
Sbjct: 61  WVEN-RDEQQEHWVMDSD-----EF--------QVLSKKPRRGTPQNDNSSPAKGGA--- 120

Query: 121 TGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVMKKDTML 180
            GA QRRLWVK+RSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMIC+ELDSTVMKKDTML
Sbjct: 121 -GAHQRRLWVKNRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICRELDSTVMKKDTML 180

Query: 181 RVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWP 240
           RVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWP
Sbjct: 181 RVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWP 240

Query: 241 EDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTSYSITV 300
           E+SKL KIKQEFESISGIP+VGG+IYTTHIPIIAPKN+VAAYFNKRHTERNQKTSYSITV
Sbjct: 241 EESKLTKIKQEFESISGIPKVGGAIYTTHIPIIAPKNSVAAYFNKRHTERNQKTSYSITV 300

Query: 301 QGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPLMDWLLV 360
           QGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPLMDWLLV
Sbjct: 301 QGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFIVGNSGYPLMDWLLV 360

Query: 361 PYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVLGACCVL 420
           PY++QNLTWTQHAFNEKV EIQ+AAKAAFGRLKGRW+CLQKRTEVKLQELPVVLGACCVL
Sbjct: 361 PYTLQNLTWTQHAFNEKVGEIQAAAKAAFGRLKGRWTCLQKRTEVKLQELPVVLGACCVL 420

Query: 421 HNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNLLHHGLAGTGFL 478
           HNICEMRKE FD ELKFE+FDDEMVPE+NG+RSVSAIQ RDHIAHNLLHHGLAGTGFL
Sbjct: 421 HNICEMRKERFDSELKFEVFDDEMVPESNGVRSVSAIQGRDHIAHNLLHHGLAGTGFL 459

BLAST of Tan0019989 vs. ExPASy TrEMBL
Match: A0A1S3BPN5 (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103492343 PE=3 SV=1)

HSP 1 Score: 794.7 bits (2051), Expect = 2.2e-226
Identity = 399/492 (81.10%), Postives = 427/492 (86.79%), Query Frame = 0

Query: 1   MEISSFPFLNQDDLLPIFNLFSDMDNS----FSVNQSPRKRRRQDDDQSQFND----GAN 60
           MEISSFPFLNQ++ LPIFNLFSDMDN+    F+VN +P+KRRR D +   FN+      N
Sbjct: 1   MEISSFPFLNQEEFLPIFNLFSDMDNNPTTPFNVNPTPKKRRRSDPNSDDFNNFSFTDEN 60

Query: 61  D------LLKLPFWFDDTHDQKQHHWIMDSDADH-KREFHLSDDNFPQFTAKKSRRTTAE 120
           D      LLKLP WFD   +  Q  W+MDS       +FHLSD        KK RR + E
Sbjct: 61  DEPTDDPLLKLPCWFDPQPESPQ-SWLMDSQKPKPTNDFHLSDQ-----IPKKPRRASPE 120

Query: 121 NASSSPAKGGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMIC 180
           N   SP K     G G QQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSK+TFDMIC
Sbjct: 121 N--PSPVKNNPAGGGGTQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKSTFDMIC 180

Query: 181 KELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCS 240
           KELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCS
Sbjct: 181 KELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCS 240

Query: 241 AIRKVLMPKFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKR 300
           AIRKVLMPKFLQWP++SKL KIKQEFESISGIP+VGGSIYTTHIPIIAP+NNVAAYFNKR
Sbjct: 241 AIRKVLMPKFLQWPDESKLTKIKQEFESISGIPKVGGSIYTTHIPIIAPRNNVAAYFNKR 300

Query: 301 HTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLNDVFI 360
           HTERNQKTSYSITVQGVVDP+GVFTDVCIGWPGSMPDDQVLEKS L+ERA+MGLLNDVF+
Sbjct: 301 HTERNQKTSYSITVQGVVDPSGVFTDVCIGWPGSMPDDQVLEKSLLYERASMGLLNDVFV 360

Query: 361 VGNSGYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVK 420
           VGNSGYPLMDWLLVPY+VQNLTWTQH FNEKV EIQ+AAKAAFGRLKGRW+CLQKRTEVK
Sbjct: 361 VGNSGYPLMDWLLVPYTVQNLTWTQHGFNEKVGEIQAAAKAAFGRLKGRWTCLQKRTEVK 420

Query: 421 LQELPVVLGACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHN 478
           LQELPVVLGACCVLHNICEMRKE FDPELKFE++DDEM+PENNGLRSVSAIQARDHIAHN
Sbjct: 421 LQELPVVLGACCVLHNICEMRKEKFDPELKFEVYDDEMLPENNGLRSVSAIQARDHIAHN 480

BLAST of Tan0019989 vs. TAIR 10
Match: AT5G12010.1 (unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, plasma membrane, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G29780.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 567.8 bits (1462), Expect = 8.4e-162
Identity = 283/445 (63.60%), Postives = 349/445 (78.43%), Query Frame = 0

Query: 37  RRRQDDDQSQFNDGANDLLKLPFWFDDTHDQKQHHWIMDSDADHKREFHLSDDNFPQFTA 96
           RR   D QS +   A  +     ++ D +D         +DA+   + +L      +  A
Sbjct: 69  RREMSDFQSNYRKRARTMSD---YYSDLNDYY-------ADAEESGDINLKKSRVSRAVA 128

Query: 97  KKSRRTTAENASSSPAKGGAGA--GTGA-QQRRLWVKDRSKDWWDQCNHPDFPDEEFRRA 156
             +    +E  + S    G+G+  GTG+ QQRRLWVKDRS+ WW++C+  D+P+E+F++A
Sbjct: 129 SVAVAAASEIEAESSEITGSGSVRGTGSGQQRRLWVKDRSRAWWEECSRLDYPEEDFKKA 188

Query: 157 FRMSKATFDMICKELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGI 216
           FRMSK+TF++IC EL+S V K+DT LR AIPVRQRVAVCIWRLATGEPLRLVSK+FGLGI
Sbjct: 189 FRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAVCIWRLATGEPLRLVSKKFGLGI 248

Query: 217 STCHKLVLEVCSAIRKVLMPKFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIA 276
           STCHKLVLEVC AI+ VLMPK+LQWP+D  L  I++ FES+SGIP V GS+YTTHIPIIA
Sbjct: 249 STCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERFESVSGIPNVVGSMYTTHIPIIA 308

Query: 277 PKNNVAAYFNKRHTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALFE 336
           PK +VA+YFNKRHTERNQKTSYSIT+Q VV+P GVFTD+CIGWPGSMPDD+VLEKS L++
Sbjct: 309 PKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLYQ 368

Query: 337 RANM-GLLNDVFIVGNSGYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLK 396
           RAN  GLL  +++ G  G+PL+DW+LVPY+ QNLTWTQHAFNEK+SE+Q  AK AFGRLK
Sbjct: 369 RANNGGLLKGMWVAGGPGHPLLDWVLVPYTQQNLTWTQHAFNEKMSEVQGVAKEAFGRLK 428

Query: 397 GRWSCLQKRTEVKLQELPVVLGACCVLHNICEMRKESFDPELKFELFDDEMVPENNGLRS 456
           GRW+CLQKRTEVKLQ+LP VLGACCVLHNICEMR+E  +PEL  E+ DDE++PE N LRS
Sbjct: 429 GRWACLQKRTEVKLQDLPTVLGACCVLHNICEMREEKMEPELMVEVIDDEVLPE-NVLRS 488

Query: 457 VSAIQARDHIAHNLLHHGLAGTGFL 478
           V+A++ARD I+HNLLHHGLAGT FL
Sbjct: 489 VNAMKARDTISHNLLHHGLAGTSFL 502

BLAST of Tan0019989 vs. TAIR 10
Match: AT4G29780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G12010.1); Has 945 Blast hits to 944 proteins in 87 species: Archae - 0; Bacteria - 0; Metazoa - 519; Fungi - 43; Plants - 365; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 557.0 bits (1434), Expect = 1.5e-158
Identity = 301/538 (55.95%), Postives = 364/538 (67.66%), Query Frame = 0

Query: 1   MEISSFPF--LNQDDLLPIFNLFSDMDNSFSV-----------NQSPRKRRRQDDDQSQF 60
           MEISSFPF  L  D+      LF DMD+S S            N + +KR R+DD+    
Sbjct: 1   MEISSFPFPYLQDDECSHFLGLFQDMDSSPSTFGLEGFNSNDNNTNQKKRPRKDDEGGGG 60

Query: 61  NDGA---------------NDLLKLPFWFDDTHDQKQHHWIMD-------SDADHKREFH 120
             G                 D+L      D+   Q+Q  W  +        +A+HK++  
Sbjct: 61  GGGGTEVLGAVNGNNKAAFGDILATLLLLDEEAKQQQEQWDFEFIKEKSLLEANHKKKVK 120

Query: 121 LSDDNFPQF-------------TAKKSRRTTAENASSSPAKG---------------GAG 180
             D  + Q               +K++R+T      S+ A G                +G
Sbjct: 121 TMDGYYNQMQDHYSAAGETDGSRSKRARKTAVAAVVSAVASGADTTGLAAPVPTADIASG 180

Query: 181 AGTGAQQRRLWVKDRSKDWWDQCNHPDFPDEEFRRAFRMSKATFDMICKELDSTVMKKDT 240
           +G+G   RRLWVK+R+ DWWD+ + PDFP++EFRR FRMSK+TF++IC+ELD+TV KK+T
Sbjct: 181 SGSGPSHRRLWVKERTTDWWDRVSRPDFPEDEFRREFRMSKSTFNLICEELDTTVTKKNT 240

Query: 241 MLRVAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQ 300
           MLR AIP  +RV VC+WRLATG PLR VS+RFGLGISTCHKLV+EVC AI  VLMPK+L 
Sbjct: 241 MLRDAIPAPKRVGVCVWRLATGAPLRHVSERFGLGISTCHKLVIEVCRAIYDVLMPKYLL 300

Query: 301 WPEDSKLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTSYSI 360
           WP DS++   K +FES+  IP V GSIYTTHIPIIAPK +VAAYFNKRHTERNQKTSYSI
Sbjct: 301 WPSDSEINSTKAKFESVHKIPNVVGSIYTTHIPIIAPKVHVAAYFNKRHTERNQKTSYSI 360

Query: 361 TVQGVVDPAGVFTDVCIGWPGSMPDDQVLEKSALF-ERANMGLLNDVFIVGNSGYPLMDW 420
           TVQGVV+  G+FTDVCIG PGS+ DDQ+LEKS+L  +RA  G+L D +IVGNSG+PL D+
Sbjct: 361 TVQGVVNADGIFTDVCIGNPGSLTDDQILEKSSLSRQRAARGMLRDSWIVGNSGFPLTDY 420

Query: 421 LLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVLGAC 475
           LLVPY+ QNLTWTQHAFNE + EIQ  A AAF RLKGRW+CLQKRTEVKLQ+LP VLGAC
Sbjct: 421 LLVPYTRQNLTWTQHAFNESIGEIQGIATAAFERLKGRWACLQKRTEVKLQDLPYVLGAC 480

BLAST of Tan0019989 vs. TAIR 10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 157.9 bits (398), Expect = 2.0e-38
Identity = 118/397 (29.72%), Postives = 185/397 (46.60%), Query Frame = 0

Query: 97  KKSRRTTAENASSSPAKGGAGAGTGAQQRRLWVKDRSKDWWDQCNHPDF----PDEEFRR 156
           KK  R     A+++     A A             +S DWWD  +   +      + F  
Sbjct: 15  KKVDRNVLLAATAAATSASAAAALNNNDDDDDSSSQSLDWWDGFSRRIYGGSTDPKTFES 74

Query: 157 AFRMSKATFDMICKELDSTVMKKDTMLRVA----IPVRQRVAVCIWRLATGEPLRLVSKR 216
            F++S+ TFD IC  + +    K      +    + +  RVAV + RL +GE L ++ + 
Sbjct: 75  VFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLNDRVAVALRRLGSGESLSVIGET 134

Query: 217 FGLGISTCHKLVLEVCSAIRKVLMPKFLQWPEDSKLAKIKQEFESISGIPRVGGSIYTTH 276
           FG+  ST  ++      ++ +  +   L WP  SKL +IK +FE ISG+P   G+I  TH
Sbjct: 135 FGMNQSTVSQITWRFVESMEERAI-HHLSWP--SKLDEIKSKFEKISGLPNCCGAIDITH 194

Query: 277 I----PIIAPKNNVAAYFNKRHTERNQKTSYSITVQGVVDPAGVFTDVCIGWPGSMPDDQ 336
           I    P + P N V           + + ++S+T+Q VVDP   F DV  GWPGS+ DD 
Sbjct: 195 IVMNLPAVEPSNKVWL---------DGEKNFSMTLQAVVDPDMRFLDVIAGWPGSLNDDV 254

Query: 337 VLEKSALFERANMG-LLND------------VFIVGNSGYPLMDWLLVPYSVQNLTWTQH 396
           VL+ S  ++    G  LN              +IVG+SG+PL+ WLL PY  +  +  Q 
Sbjct: 255 VLKNSGFYKLVEKGKRLNGEKLPLSERTELREYIVGDSGFPLLPWLLTPYQGKPTSLPQT 314

Query: 397 AFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEV-KLQELPVVLGACCVLHN-ICEMRKES 456
            FN++ SE   AA+ A  +LK RW  +     +     LP ++  CC+LHN I +M  ++
Sbjct: 315 EFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFVCCLLHNIIIDMEDQT 374

Query: 457 FDPELKFELFDDEMVPENNGLRSVSAIQARDHIAHNL 467
            D +   +  D      +  L   ++   RD ++  L
Sbjct: 375 LDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQL 399

BLAST of Tan0019989 vs. TAIR 10
Match: AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 147.9 bits (372), Expect = 2.1e-35
Identity = 94/314 (29.94%), Postives = 156/314 (49.68%), Query Frame = 0

Query: 135 DWWD----QCNHPDFPDEE---FRRAFRMSKATFDMICKELDSTVMKKDTMLRVAI---- 194
           DWWD    + + P  P +E   F+  FR SK TF  IC  +   ++ +     + I    
Sbjct: 43  DWWDTFWLRNSSPSVPSDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRL 102

Query: 195 -PVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWPEDS 254
             V ++VA+ + RLA+G+    V   FG+G ST  ++      A+ +      L+WP+  
Sbjct: 103 LSVEKQVAIALRRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEE-RAKHHLRWPDSD 162

Query: 255 KLAKIKQEFESISGIPRVGGSIYTTHIPIIAPKNNVAAYFNKRHTERNQKTSYSITVQGV 314
           ++ +IK +FE + G+P   G+I TTHI +  P    +  +       +Q+ +YS+ +QGV
Sbjct: 163 RIEEIKSKFEEMYGLPNCCGAIDTTHIIMTLPAVQASDDW------CDQEKNYSMFLQGV 222

Query: 315 VDPAGVFTDVCIGWPGSMPDDQVLEKSALFERANMGLLND-------------VFIVGNS 374
            D    F ++  GWPG M   ++L+ S  F+      + D              ++VG  
Sbjct: 223 FDHEMRFLNMVTGWPGGMTVSKLLKFSGFFKLCENAQILDGNPKTLSQGAQIREYVVGGI 282

Query: 375 GYPLMDWLLVPYSVQNLTWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQK-RTEVKLQE 423
            YPL+ WL+ P+   + + +  AFNE+  +++S A  AF +LKG W  L K       ++
Sbjct: 283 SYPLLPWLITPHDSDHPSDSMVAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRK 342

BLAST of Tan0019989 vs. TAIR 10
Match: AT3G19120.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 107.1 bits (266), Expect = 4.1e-23
Identity = 81/309 (26.21%), Postives = 143/309 (46.28%), Query Frame = 0

Query: 147 DEEFRRAFRMSKATFDMICKELDSTVMKKDTMLRVAIPVRQRVAVCIWRLATGEPLRLVS 206
           D  +R  + +S   F  +  +L   +    T   +++P    VA+ + RLA G   + ++
Sbjct: 114 DARWRSLYGLSYPVFITVVDKLKPFI----TASNLSLPADYAVAMVLSRLAHGCSAKTLA 173

Query: 207 KRFGLGISTCHKLVLEVCSAIRKVLMPKFLQWPEDS-KLAKIKQEFESISGIPRVGGSIY 266
            R+ L      K+   V   +   L P+F++ P    +L +  Q FE ++ +P + G+I 
Sbjct: 174 SRYSLDPYLISKITNMVTRLLATKLYPEFIKIPVGKRRLIETTQGFEELTSLPNICGAID 233

Query: 267 TTHIPIIAPKNNVAAYFNKRHTERNQKTSY-------SITVQGVVDPAGVFTDVCIGWPG 326
           +T + +            +R T+ N +  Y       ++ +Q V D   +F DVC+  PG
Sbjct: 234 STPVKL------------RRRTKLNPRNIYGCKYGYDAVLLQVVADHKKIFWDVCVKAPG 293

Query: 327 SMPDDQVLEKSALFERANMG------LLN------DVFIVGNSGYPLMDWLLVPYSVQNL 386
              D      S L++R   G      ++N        +IVG+  YPL+ +L+ P+S    
Sbjct: 294 GEDDSSHFRDSLLYKRLTSGDIVWEKVINIRGHHVRPYIVGDWCYPLLSFLMTPFSPNGS 353

Query: 387 -TWTQHAFNEKVSEIQSAAKAAFGRLKGRWSCLQKRTEVKLQELPVVLGACCVLHNICEM 435
            T  ++ F+  + + +S    A G LK RW  LQ    V +   P  + ACCVLHN+C++
Sbjct: 354 GTPPENLFDGMLMKGRSVVVEAIGLLKARWKILQS-LNVGVNHAPQTIVACCVLHNLCQI 404

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9M2U32.8e-3729.72Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1[more]
Q94K492.9e-3429.94Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=... [more]
Q96MB71.1e-1726.82Putative nuclease HARBI1 OS=Homo sapiens OX=9606 GN=HARBI1 PE=1 SV=1[more]
Q17QR84.2e-1726.44Putative nuclease HARBI1 OS=Bos taurus OX=9913 GN=HARBI1 PE=2 SV=1[more]
B0BN954.2e-1726.44Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
XP_023529395.11.0e-24187.60protein ALP1-like [Cucurbita pepo subsp. pepo][more]
XP_022933888.12.9e-24187.40protein ALP1-like [Cucurbita moschata] >KAG7021950.1 Protein ALP1-like protein, ... [more]
KAG6588056.16.5e-24187.19Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022967341.11.9e-24087.37protein ALP1-like [Cucurbita maxima][more]
XP_038880089.15.3e-23585.60protein ALP1-like [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1F6421.4e-24187.40protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111441167 PE=3 SV=1[more]
A0A6J1HRQ59.2e-24187.37protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111466889 PE=3 SV=1[more]
A0A6J1HAH82.3e-22883.89protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111461585 PE=3 SV=1[more]
A0A6J1JHN78.9e-22883.89protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111485185 PE=3 SV=1[more]
A0A1S3BPN52.2e-22681.10putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103492343 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G12010.18.4e-16263.60unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, ... [more]
AT4G29780.11.5e-15855.95unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G55350.12.0e-3829.72PIF / Ping-Pong family of plant transposases [more]
AT3G63270.12.1e-3529.94CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int... [more]
AT3G19120.14.1e-2326.21PIF / Ping-Pong family of plant transposases [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR027806Harbinger transposase-derived nuclease domainPFAMPF13359DDE_Tnp_4coord: 264..421
e-value: 2.7E-36
score: 124.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 95..110
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 32..46
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 95..124
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 29..49
NoneNo IPR availablePANTHERPTHR22930:SF199NUCLEASEcoord: 61..476
NoneNo IPR availablePANTHERPTHR22930UNCHARACTERIZEDcoord: 61..476

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0019989.1Tan0019989.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0046872 metal ion binding