Tan0007856 (gene) Snake gourd v1

Overview
NameTan0007856
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDDE Tnp4 domain-containing protein
LocationLG08: 2752323 .. 2754476 (+)
RNA-Seq ExpressionTan0007856
SyntenyTan0007856
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCGCCGGAGGACTCGGCGGCGATAAGAGAACCACCAGAAGCTCCGCCATGAACGCTACCGCCGTCACTACCAGAAGTCAGGCCAAGAAGTCCGACCGGCAGAGCCATCTCAAACACCAACTGGTAACTCTTATCGAAACCACCATTTCTTCCGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTTCTTCCCTCTCAAACCCTCGCCCTCGAATCCCTCATCTGTTCCACTGCATCCTCTCTTCATGGTCTCTCTCCTCGTCTCCCCAAACTTGCCCTACATCCACCGCCGCCGCCGCGGCAATGCTGGTTCCAACGCTTCCTTTCTGCGACGGCGGAGGTCGATTGCGATCCGAGATGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTCCGTCTCCTTTCCCCGATTCAGAGCTCCTCATCCTCTTCCGTTCCGCCCGATTGCGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGTTACAAGGCGGTTGGGAGGCGGTTCGGAATCGATTCTGCTGATGCTTGCCGCTCGTTTTATGCGGTTTGTAAGGCCATCAATGAGAAATTGGGGCATTTGCTTGAGCTTCGGTCTGATATTGATCGGATTGTGGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGGCTTAGAAGATTTGGGGTTGAAGGCGATCTTCTTGGTAAAAGCGAATCGCTTCTGGTTCAGGCATTGGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGATGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAGCTGTACGCAGAAATTGAGAAATCTGGTGAATTACTCAAAGGCCCTGTTTATAATCTGGATGATGAAAAGCCCATTTCCCAATACTTGATTGGTGATTCTTGCTTCCCTCTTTTGCCATGGCTTTTGACACCATACATGAAACTGAACGAGGAAGATAGCTCTGGTTTTCCTGAAAAAGCGTTCAATTCCACACATAACCGTGCAATGGGGTTGGTTAACACAGCATATTGCAGACTCCGAGCTCGGTGGAAGCTTCTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTCTTCCCATTTATTGTATTGACTGGGTGTCTGCTGCATAATTTCCTCATTAAATGCAGTGAGAAACTTGATGAAGAACAAGGTGAAGACGAAGGAGCAAGTTGTTCAAGTGAGGAACAGAAGTTTCCTCTTTATGATGGTGAGATAGGAGATGATAGAGGAAAGGATATCAGAGATGCGCTTGCCTTGCACTTGAGTAGGCTGAGCTTCAGAAGATGATTGCTTTCCACATCTTGGTAATTTTCATCTTTCTTCACCTGTATATATTCCTTTCCAATCTTTTAGAACATTTTGTCAAAACTGTAGGTTGGAAGTCCTAGTTCTCTGGAAGAAGATCTATTTTCTTTTGTCAACCCACTAGTTTCGTAATTATTTGATACTTCTTATAGTTAGAGAAGAGATTTTTACACTTATCTAACATGAAAGTGACTTTGAAATCATTCATTCTATACATAAAAATTTCAAAGAATCTACAGTTCATGATTTAGGTCCTTTAGAATGCAATGATTTTGACTTGCAATTATTCTATTAAGTGAATAGGAATCTCCTTTTGAATCAAATCATCTTAGTAAATTAATTTGAACCCCATTTGGGATTGATCTAGTAAAACTTGATATTGAATTTTAGTTGAAGTGAAATAACTATGGTTGGTCTAATAATAATTCTTATACAAAAAGGACAAAGAAGTAATGAGCTTTCCTTGGAAGTTAGGACTTGCCGACTTGGTCCACTCCTACAGGTAATGCAGGGGTTTAAGCATCACAAAACCCCCCGAGGATGAGTCAGTGAGTCACTACAATCTTTTACTTGGTGAGTTTTTACCTTTTTGGTATAGCTTCTCGAGAGTGCTTTTAATAAGCACTTTTAGAGGGGAAAAGCTCATTAAGAGTAAATTAGGCATGGATTTTAGCAAAGTTAAAAGTGTCTTTTAACTTATTACTAACTTTGACATTTGCCTACCTAACCAAAAGTGCTATTGAAAATTCCAAAAATACATTTTAAGTACTTTAAAATATTATTTGATGAAAG

mRNA sequence

ATGGCCGCCGGAGGACTCGGCGGCGATAAGAGAACCACCAGAAGCTCCGCCATGAACGCTACCGCCGTCACTACCAGAAGTCAGGCCAAGAAGTCCGACCGGCAGAGCCATCTCAAACACCAACTGGTAACTCTTATCGAAACCACCATTTCTTCCGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTTCTTCCCTCTCAAACCCTCGCCCTCGAATCCCTCATCTGTTCCACTGCATCCTCTCTTCATGGTCTCTCTCCTCGTCTCCCCAAACTTGCCCTACATCCACCGCCGCCGCCGCGGCAATGCTGGTTCCAACGCTTCCTTTCTGCGACGGCGGAGGTCGATTGCGATCCGAGATGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTCCGTCTCCTTTCCCCGATTCAGAGCTCCTCATCCTCTTCCGTTCCGCCCGATTGCGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGTTACAAGGCGGTTGGGAGGCGGTTCGGAATCGATTCTGCTGATGCTTGCCGCTCGTTTTATGCGGTTTGTAAGGCCATCAATGAGAAATTGGGGCATTTGCTTGAGCTTCGGTCTGATATTGATCGGATTGTGGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGGCTTAGAAGATTTGGGGTTGAAGGCGATCTTCTTGGTAAAAGCGAATCGCTTCTGGTTCAGGCATTGGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGATGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAGCTGTACGCAGAAATTGAGAAATCTGGTGAATTACTCAAAGGCCCTGTTTATAATCTGGATGATGAAAAGCCCATTTCCCAATACTTGATTGGTGATTCTTGCTTCCCTCTTTTGCCATGGCTTTTGACACCATACATGAAACTGAACGAGGAAGATAGCTCTGGTTTTCCTGAAAAAGCGTTCAATTCCACACATAACCGTGCAATGGGGTTGGTTAACACAGCATATTGCAGACTCCGAGCTCGGTGGAAGCTTCTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTCTTCCCATTTATTGTATTGACTGGGTGTCTGCTGCATAATTTCCTCATTAAATGCAGTGAGAAACTTGATGAAGAACAAGGTGAAGACGAAGGAGCAAGTTGTTCAAGTGAGGAACAGAAGTTTCCTCTTTATGATGGTGAGATAGGAGATGATAGAGGAAAGGATATCAGAGATGCGCTTGCCTTGCACTTGAGTAGGCTGAGCTTCAGAAGATGATTGCTTTCCACATCTTGGACAAAGAAGTAATGAGCTTTCCTTGGAAGTTAGGACTTGCCGACTTGGTCCACTCCTACAGGTAATGCAGGGGTTTAAGCATCACAAAACCCCCCGAGGATGAGTCAGTGAGTCACTACAATCTTTTACTTGGTGAGTTTTTACCTTTTTGGTATAGCTTCTCGAGAGTGCTTTTAATAAGCACTTTTAGAGGGGAAAAGCTCATTAAGAGTAAATTAGGCATGGATTTTAGCAAAGTTAAAAGTGTCTTTTAACTTATTACTAACTTTGACATTTGCCTACCTAACCAAAAGTGCTATTGAAAATTCCAAAAATACATTTTAAGTACTTTAAAATATTATTTGATGAAAG

Coding sequence (CDS)

ATGGCCGCCGGAGGACTCGGCGGCGATAAGAGAACCACCAGAAGCTCCGCCATGAACGCTACCGCCGTCACTACCAGAAGTCAGGCCAAGAAGTCCGACCGGCAGAGCCATCTCAAACACCAACTGGTAACTCTTATCGAAACCACCATTTCTTCCGCTCACTCCTTTCTCTCTCTCAACGATCTCCACCTTCTTCCCTCTCAAACCCTCGCCCTCGAATCCCTCATCTGTTCCACTGCATCCTCTCTTCATGGTCTCTCTCCTCGTCTCCCCAAACTTGCCCTACATCCACCGCCGCCGCCGCGGCAATGCTGGTTCCAACGCTTCCTTTCTGCGACGGCGGAGGTCGATTGCGATCCGAGATGGAATCTCTCCTTCCGTATGTCGAAATCGTCCTTCTCCCTCCTCCTCCGTCTCCTTTCCCCGATTCAGAGCTCCTCATCCTCTTCCGTTCCGCCCGATTGCGCTTTAGCCGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGTTACAAGGCGGTTGGGAGGCGGTTCGGAATCGATTCTGCTGATGCTTGCCGCTCGTTTTATGCGGTTTGTAAGGCCATCAATGAGAAATTGGGGCATTTGCTTGAGCTTCGGTCTGATATTGATCGGATTGTGGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGGCTTAGAAGATTTGGGGTTGAAGGCGATCTTCTTGGTAAAAGCGAATCGCTTCTGGTTCAGGCATTGGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGATGGCCGAGCTCCATGAAACCTGAAACAATCTTGCGGCAGAGCAAGCTGTACGCAGAAATTGAGAAATCTGGTGAATTACTCAAAGGCCCTGTTTATAATCTGGATGATGAAAAGCCCATTTCCCAATACTTGATTGGTGATTCTTGCTTCCCTCTTTTGCCATGGCTTTTGACACCATACATGAAACTGAACGAGGAAGATAGCTCTGGTTTTCCTGAAAAAGCGTTCAATTCCACACATAACCGTGCAATGGGGTTGGTTAACACAGCATATTGCAGACTCCGAGCTCGGTGGAAGCTTCTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTCTTCCCATTTATTGTATTGACTGGGTGTCTGCTGCATAATTTCCTCATTAAATGCAGTGAGAAACTTGATGAAGAACAAGGTGAAGACGAAGGAGCAAGTTGTTCAAGTGAGGAACAGAAGTTTCCTCTTTATGATGGTGAGATAGGAGATGATAGAGGAAAGGATATCAGAGATGCGCTTGCCTTGCACTTGAGTAGGCTGAGCTTCAGAAGATGA

Protein sequence

MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLLGKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSRLSFRR
Homology
BLAST of Tan0007856 vs. ExPASy Swiss-Prot
Match: Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 7.7e-37
Identity = 103/356 (28.93%), Postives = 169/356 (47.47%), Query Frame = 0

Query: 105 WFQRFLSATAEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC-- 164
           W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P     
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLND 113

Query: 165 ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 224
            +A AL RL  G S   +G  FG++ +   +  +   +++ E+  H L   S +D I   
Sbjct: 114 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPSKLDEIKSK 173

Query: 225 FGWIS-LPNCCGVLGL-------------RRFGVEGDLLGKSESLLVQALVDAEGRFLDV 284
           F  IS LPNCCG + +              +  ++G+   K+ S+ +QA+VD + RFLDV
Sbjct: 174 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGE---KNFSMTLQAVVDPDMRFLDV 233

Query: 285 SAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSCFPLLPWLL 344
            AGWP S+  + +L+ S  Y  +EK G+ L G    L +   + +Y++GDS FPLLPWLL
Sbjct: 234 IAGWPGSLNDDVVLKNSGFYKLVEK-GKRLNGEKLPLSERTELREYIVGDSGFPLLPWLL 293

Query: 345 TPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRARWKLLSKPWKEGCRDFFPFIV 404
           TPY    +   +  P+  FN  H+ A      A  +L+ RW++++       R+  P I+
Sbjct: 294 TPY----QGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRII 353

Query: 405 LTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLYDGEIGDDRGKDIRDALA 438
              CLLHN +I       E+Q  D+       +  +     ++ D+    +RD L+
Sbjct: 354 FVCCLLHNIIIDM-----EDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELS 396

BLAST of Tan0007856 vs. ExPASy Swiss-Prot
Match: Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 5.3e-30
Identity = 103/357 (28.85%), Postives = 162/357 (45.38%), Query Frame = 0

Query: 111 SATAEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAAL 170
           S +   D D  +   FR SK++FS +  L+         S + +     +  +  +A AL
Sbjct: 54  SPSVPSDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAL 113

Query: 171 FRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GW 230
            RLA G S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I   F   
Sbjct: 114 RRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEM 173

Query: 231 ISLPNCCG-------VLGLRRFGVEGDLLG--KSESLLVQALVDAEGRFLDVSAGWPSSM 290
             LPNCCG       ++ L       D     K+ S+ +Q + D E RFL++  GWP  M
Sbjct: 174 YGLPNCCGAIDTTHIIMTLPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGM 233

Query: 291 KPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSCFPLLPWLLTPYMKLNE 350
               +L+ S  + ++ ++ ++L G    L     I +Y++G   +PLLPWL+TP+   + 
Sbjct: 234 TVSKLLKFSGFF-KLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHP 293

Query: 351 EDSSGFPEKAFNSTHNRAMGLVNTAYCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHN 410
            DS      AFN  H +   +  TA+ +L+  W++LSK      R   P I+L  CLLHN
Sbjct: 294 SDSM----VAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHN 353

Query: 411 FLIKCSEKLDEE----QGEDEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSR 443
            +I C + L E+       D G +    +Q  PL         G ++R  L  HL R
Sbjct: 354 IIIDCGDYLQEDVPLSGHHDSGYADRYCKQTEPL---------GSELRGCLTEHLLR 396

BLAST of Tan0007856 vs. ExPASy Swiss-Prot
Match: B0BN95 (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 7.2e-11
Identity = 63/262 (24.05%), Postives = 104/262 (39.69%), Query Frame = 0

Query: 148 SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRS 207
           S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +   +
Sbjct: 64  SRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHFPA 123

Query: 208 DIDRIV----VGFGWISLPNCCGVLGLRRFGVEG----DLLGKS----ESLLVQALVDAE 267
           D   I       +G   +P   G +      ++     DL   +     SL    + D  
Sbjct: 124 DEAAIQSLKDEFYGLAGMPGVIGAVDCIHVAIKAPNAEDLSYVNRKGLHSLNCLVVCDIR 183

Query: 268 GRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSCFP 327
           G  + V   WP S++   +L+QS L ++ E                 P   +L+GDS F 
Sbjct: 184 GALMTVETSWPGSLQDCAVLQQSSLSSQFETG--------------MPKDSWLLGDSSFF 243

Query: 328 LLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLR----ARWKLLSKPWKE 387
           L  WLLTP + + E  +     +A ++TH+     + T  CR R    ++  L   P K 
Sbjct: 244 LHTWLLTP-LHIPETPAEYRYNRAHSATHSVIEKTLRTLCCRFRCLDGSKGALQYSPEKS 303

Query: 388 GCRDFFPFIVLTGCLLHNFLIK 394
                   I+L  C+LHN  ++
Sbjct: 304 S------HIILACCVLHNISLE 304

BLAST of Tan0007856 vs. ExPASy Swiss-Prot
Match: Q8BR93 (Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 4.7e-10
Identity = 62/260 (23.85%), Postives = 101/260 (38.85%), Query Frame = 0

Query: 148 SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRS 207
           S ++ P+  + AAL     G+    +G   GI  A   R    V +A+ E+    +    
Sbjct: 64  SRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVTEALVERASQFIHF-- 123

Query: 208 DIDRIVVG------FGWISLPNCCGVLGLRRFGVEG----DLLGKS----ESLLVQALVD 267
            +D   V       +G   +P   GV       ++     DL   +     SL    + D
Sbjct: 124 PVDEAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDLSYVNRKGLHSLNCLVVCD 183

Query: 268 AEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSC 327
             G  + V   WP S++   +L++S L ++ E                 P   +L+GDS 
Sbjct: 184 IRGALMTVETSWPGSLQDCAVLQRSSLTSQFETG--------------MPKDSWLLGDSS 243

Query: 328 FPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRARWKLLSKPWKEGC 387
           F L  WLLTP + + E  +     +A ++TH+     + T  CR R           +G 
Sbjct: 244 FFLRSWLLTP-LPIPETAAEYRYNRAHSATHSVIERTLQTLCCRFRC------LDGSKGA 300

Query: 388 RDFFP----FIVLTGCLLHN 390
             + P     I+L  C+LHN
Sbjct: 304 LQYSPEKCSHIILACCVLHN 300

BLAST of Tan0007856 vs. NCBI nr
Match: KAG6586365.1 (Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 833.2 bits (2151), Expect = 1.1e-237
Identity = 416/447 (93.06%), Postives = 430/447 (96.20%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLN 60
           MAAGG  GDKRTTRSSA+NA AVTTRS+AKKSDR++HLKHQLVTLIETTISSAHSFLSLN
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSLN 60

Query: 61  DLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDP 120
           DLHLLPSQTLALES I ST+SSL  LSP LPKL+LH  PPPRQCWFQRFLSATAEVDCDP
Sbjct: 61  DLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH--PPPRQCWFQRFLSATAEVDCDP 120

Query: 121 RWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGID 180
           RWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGID
Sbjct: 121 RWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFGID 180

Query: 181 SADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240
           SADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL
Sbjct: 181 SADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240

Query: 241 GKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300
           GK  SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD
Sbjct: 241 GKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300

Query: 301 EKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRA 360
            KPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPE+AFNSTHNRAMGLVNTA+C++RA
Sbjct: 301 GKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKVRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLY 420
           RWKLLSKPWKE CRDFFPFIVLTGCLLHNFLIKCSEKL+EEQ ED+GASCSSEEQKFPLY
Sbjct: 361 RWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLEEEQDEDDGASCSSEEQKFPLY 420

Query: 421 DGEIGDDRGKDIRDALALHLSRLSFRR 448
           DGE GDDRGKDIRDALALHLSRLSFRR
Sbjct: 421 DGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of Tan0007856 vs. NCBI nr
Match: XP_022938170.1 (protein ALP1-like [Cucurbita moschata] >XP_022938171.1 protein ALP1-like [Cucurbita moschata] >KAG7021213.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 830.1 bits (2143), Expect = 9.1e-237
Identity = 415/447 (92.84%), Postives = 429/447 (95.97%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLN 60
           MAAGG  GDKRTTRSSA+NA AVTTRS+AKKSDR++HLKHQLVTLIETTISSAHSFLSLN
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSLN 60

Query: 61  DLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDP 120
           DLHLLPSQTLALES I ST+SSL  LSP LPKL+LH  PPPRQCWFQRFLSATAEVDCDP
Sbjct: 61  DLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH--PPPRQCWFQRFLSATAEVDCDP 120

Query: 121 RWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGID 180
           RWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGID
Sbjct: 121 RWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFGID 180

Query: 181 SADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240
           SADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL
Sbjct: 181 SADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240

Query: 241 GKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300
           GK  SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD
Sbjct: 241 GKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300

Query: 301 EKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRA 360
            KPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPE+AFNSTHNRAMGLVNTA+C++RA
Sbjct: 301 GKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKVRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLY 420
           RWKLLSKPWKE CRDFFPFIVLTGCLLHNFLIKCSEKL+EEQ ED+GASCSSEEQKF LY
Sbjct: 361 RWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLEEEQDEDDGASCSSEEQKFALY 420

Query: 421 DGEIGDDRGKDIRDALALHLSRLSFRR 448
           DGE GDDRGKDIRDALALHLSRLSFRR
Sbjct: 421 DGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of Tan0007856 vs. NCBI nr
Match: XP_023536803.1 (protein ALP1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 829.3 bits (2141), Expect = 1.6e-236
Identity = 415/447 (92.84%), Postives = 428/447 (95.75%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLN 60
           MAAGG  GDKRTTRSSA+NA AVTTRS+AKKSDR++HLKHQLVTLIETTISSAHSFLSLN
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSLN 60

Query: 61  DLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDP 120
           DLHLLPSQTLALES I ST+SSL  LSP LPKL+LH  PPPRQCWFQRFLSATAEVDCDP
Sbjct: 61  DLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH--PPPRQCWFQRFLSATAEVDCDP 120

Query: 121 RWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGID 180
           RWNL FRMSKSSFSLLLRLLSPIQS SS+SVPPDCALAAALFRLAHGASYKAVGRRFGID
Sbjct: 121 RWNLFFRMSKSSFSLLLRLLSPIQSCSSTSVPPDCALAAALFRLAHGASYKAVGRRFGID 180

Query: 181 SADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240
           SADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL
Sbjct: 181 SADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240

Query: 241 GKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300
           GK  SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD
Sbjct: 241 GKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300

Query: 301 EKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRA 360
            KPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPE+AFNSTHNRAMGLVNTA+C++RA
Sbjct: 301 GKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKVRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLY 420
           RWKLLSKPWKE CRDFFPFIVLTGCLLHNFLIKCSEKL EEQ ED+GASCSSEEQKFPLY
Sbjct: 361 RWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLVEEQDEDDGASCSSEEQKFPLY 420

Query: 421 DGEIGDDRGKDIRDALALHLSRLSFRR 448
           DGE GDDRGKDIRDALALHLSRLSFRR
Sbjct: 421 DGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of Tan0007856 vs. NCBI nr
Match: XP_022965738.1 (protein ALP1-like [Cucurbita maxima] >XP_022965739.1 protein ALP1-like [Cucurbita maxima])

HSP 1 Score: 827.4 bits (2136), Expect = 5.9e-236
Identity = 413/447 (92.39%), Postives = 429/447 (95.97%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLN 60
           MAAGG  GDKRTTRSSA+NA AVTTRS+AKKSDR++HLKHQLVTLIETTISSAHSFLSLN
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSLN 60

Query: 61  DLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDP 120
           DLHLLPSQTLALES I ST+SSL  LSP LPKL+LH  PPPRQCWFQRFLSATAEVDCDP
Sbjct: 61  DLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH--PPPRQCWFQRFLSATAEVDCDP 120

Query: 121 RWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGID 180
           RWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGID
Sbjct: 121 RWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFGID 180

Query: 181 SADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240
           SADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL
Sbjct: 181 SADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240

Query: 241 GKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300
           GK  SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD
Sbjct: 241 GKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300

Query: 301 EKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRA 360
            KPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPE+AFNSTHNRAMGLVNTA+C++RA
Sbjct: 301 GKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKVRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLY 420
           RWKLLSKPWKE CRDFFPF+VLTGCLLHNFLIKCSEKL+EEQ E++GAS SSEEQKFPLY
Sbjct: 361 RWKLLSKPWKEECRDFFPFVVLTGCLLHNFLIKCSEKLEEEQDEEDGASSSSEEQKFPLY 420

Query: 421 DGEIGDDRGKDIRDALALHLSRLSFRR 448
           DGE GDDRGKDIRDALALHLSRLSFRR
Sbjct: 421 DGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of Tan0007856 vs. NCBI nr
Match: XP_038890100.1 (protein ALP1-like [Benincasa hispida])

HSP 1 Score: 806.2 bits (2081), Expect = 1.4e-229
Identity = 405/456 (88.82%), Postives = 426/456 (93.42%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMN----ATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSF 60
           MA  G+GGDKRTTRSS++N    AT  TTRS+AKK DR+SHL HQLVTLI+TTISSAHSF
Sbjct: 1   MATRGIGGDKRTTRSSSINAVAAATVATTRSKAKKLDRESHLYHQLVTLIQTTISSAHSF 60

Query: 61  LSLNDLHLLPSQTLALESLICSTASSLHGLSPRLPKLAL-----HPPPPPRQCWFQRFLS 120
           LSLNDLHLLPSQTLALESL+ ST+SSL+ LSPRLPKL L      PPPPPRQCWFQRFLS
Sbjct: 61  LSLNDLHLLPSQTLALESLLGSTSSSLYALSPRLPKLTLPPPPPPPPPPPRQCWFQRFLS 120

Query: 121 ATAEVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYK 180
           AT++VDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYK
Sbjct: 121 ATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYK 180

Query: 181 AVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLR 240
           AVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLR
Sbjct: 181 AVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLR 240

Query: 241 RFGVEGDLLGKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELL 300
           RFGVE +LLGK+ SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLY EIEKS ELL
Sbjct: 241 RFGVESELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYEEIEKSNELL 300

Query: 301 KGPVYNLDDEKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLV 360
           KGPVYNLDD+KPI QYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPE+AFNSTHNRAM LV
Sbjct: 301 KGPVYNLDDDKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALV 360

Query: 361 NTAYCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCS 420
           NTA+ RLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQ ++E A CS
Sbjct: 361 NTAFGRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQDQEEEAICS 420

Query: 421 SEEQKFPLYDGEIGDDRGKDIRDALALHLSRLSFRR 448
           SE+QKFPLYDG+IGDDRGKDIRDALALHLS LS+RR
Sbjct: 421 SEDQKFPLYDGKIGDDRGKDIRDALALHLSSLSYRR 456

BLAST of Tan0007856 vs. ExPASy TrEMBL
Match: A0A6J1FIY2 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111444336 PE=3 SV=1)

HSP 1 Score: 830.1 bits (2143), Expect = 4.4e-237
Identity = 415/447 (92.84%), Postives = 429/447 (95.97%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLN 60
           MAAGG  GDKRTTRSSA+NA AVTTRS+AKKSDR++HLKHQLVTLIETTISSAHSFLSLN
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSLN 60

Query: 61  DLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDP 120
           DLHLLPSQTLALES I ST+SSL  LSP LPKL+LH  PPPRQCWFQRFLSATAEVDCDP
Sbjct: 61  DLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH--PPPRQCWFQRFLSATAEVDCDP 120

Query: 121 RWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGID 180
           RWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGID
Sbjct: 121 RWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFGID 180

Query: 181 SADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240
           SADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL
Sbjct: 181 SADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240

Query: 241 GKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300
           GK  SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD
Sbjct: 241 GKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300

Query: 301 EKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRA 360
            KPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPE+AFNSTHNRAMGLVNTA+C++RA
Sbjct: 301 GKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKVRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLY 420
           RWKLLSKPWKE CRDFFPFIVLTGCLLHNFLIKCSEKL+EEQ ED+GASCSSEEQKF LY
Sbjct: 361 RWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLEEEQDEDDGASCSSEEQKFALY 420

Query: 421 DGEIGDDRGKDIRDALALHLSRLSFRR 448
           DGE GDDRGKDIRDALALHLSRLSFRR
Sbjct: 421 DGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of Tan0007856 vs. ExPASy TrEMBL
Match: A0A6J1HRT9 (protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111465544 PE=3 SV=1)

HSP 1 Score: 827.4 bits (2136), Expect = 2.9e-236
Identity = 413/447 (92.39%), Postives = 429/447 (95.97%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLN 60
           MAAGG  GDKRTTRSSA+NA AVTTRS+AKKSDR++HLKHQLVTLIETTISSAHSFLSLN
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVTTRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSLN 60

Query: 61  DLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDP 120
           DLHLLPSQTLALES I ST+SSL  LSP LPKL+LH  PPPRQCWFQRFLSATAEVDCDP
Sbjct: 61  DLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLH--PPPRQCWFQRFLSATAEVDCDP 120

Query: 121 RWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGID 180
           RWNL FRMSKSSFSLLLRLLSPIQSSSS+SVPPDCALAAALFRLAHGASYKAVGRRFGID
Sbjct: 121 RWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFGID 180

Query: 181 SADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240
           SADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL
Sbjct: 181 SADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLL 240

Query: 241 GKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300
           GK  SLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD
Sbjct: 241 GKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDD 300

Query: 301 EKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRA 360
            KPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPE+AFNSTHNRAMGLVNTA+C++RA
Sbjct: 301 GKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKVRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLY 420
           RWKLLSKPWKE CRDFFPF+VLTGCLLHNFLIKCSEKL+EEQ E++GAS SSEEQKFPLY
Sbjct: 361 RWKLLSKPWKEECRDFFPFVVLTGCLLHNFLIKCSEKLEEEQDEEDGASSSSEEQKFPLY 420

Query: 421 DGEIGDDRGKDIRDALALHLSRLSFRR 448
           DGE GDDRGKDIRDALALHLSRLSFRR
Sbjct: 421 DGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of Tan0007856 vs. ExPASy TrEMBL
Match: A0A0A0LFB5 (DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G008700 PE=3 SV=1)

HSP 1 Score: 795.8 bits (2054), Expect = 9.2e-227
Identity = 399/449 (88.86%), Postives = 421/449 (93.76%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVT-TRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSL 60
           MA  GL GDKRTTRSSAMNA A   TRS+AKK D+++HL HQL+TLIETTISSAHSFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPP-PPPRQCWFQRFLSATAEVDC 120
           NDLHLLPSQTLALESL+CST+SSLH LSPRLPKL+L PP PPPRQCWFQRFLSAT++VDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGD 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LLGKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNL 300
           L  K+ SLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Sbjct: 241 L--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 300

Query: 301 DDEKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRL 360
           D+EKPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  +AFNSTH RAM LVNTA+CRL
Sbjct: 301 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 360

Query: 361 RARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFP 420
           RARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSEKLDEEQ ++EGASCSSEEQKFP
Sbjct: 361 RARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP 420

Query: 421 LYDGEIGDDRGKDIRDALALHLSRLSFRR 448
           L+DGEIGD RGKDIRDALALHLS L++RR
Sbjct: 421 LFDGEIGDGRGKDIRDALALHLSSLNYRR 447

BLAST of Tan0007856 vs. ExPASy TrEMBL
Match: A0A5D3BH79 (Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold285G00450 PE=3 SV=1)

HSP 1 Score: 792.3 bits (2045), Expect = 1.0e-225
Identity = 396/448 (88.39%), Postives = 417/448 (93.08%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLN 60
           MA  GL GDKRTTRSSAMNA A  TRS+AKK D+++HL HQL+TLIETTISSA SFLSLN
Sbjct: 1   MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLN 60

Query: 61  DLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPP-PPPRQCWFQRFLSATAEVDCD 120
           DLHLLPSQTLALESL+CST+SSLH LSPRLPKL+L  P PPPRQCWFQRFLSAT++VDCD
Sbjct: 61  DLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDCD 120

Query: 121 PRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGI 180
           PRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGI
Sbjct: 121 PRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGI 180

Query: 181 DSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDL 240
           DSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L
Sbjct: 181 DSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL 240

Query: 241 LGKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLD 300
             K+ SLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS ELLKGPVYNLD
Sbjct: 241 --KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD 300

Query: 301 DEKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLR 360
           DEKPI QYLIGDSCFPL PWLLTPY++LNEEDSSGF E+AFNSTH RAM LVNTA+CRLR
Sbjct: 301 DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLR 360

Query: 361 ARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPL 420
           ARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSEKLDEEQ ++EGASCSSEEQKFP 
Sbjct: 361 ARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPP 420

Query: 421 YDGEIGDDRGKDIRDALALHLSRLSFRR 448
           +DGEIGD RGKDIRDALALHLS LS+RR
Sbjct: 421 FDGEIGDGRGKDIRDALALHLSSLSYRR 446

BLAST of Tan0007856 vs. ExPASy TrEMBL
Match: A0A1S3C5W6 (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103497032 PE=3 SV=1)

HSP 1 Score: 792.3 bits (2045), Expect = 1.0e-225
Identity = 396/448 (88.39%), Postives = 417/448 (93.08%), Query Frame = 0

Query: 1   MAAGGLGGDKRTTRSSAMNATAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLN 60
           MA  GL GDKRTTRSSAMNA A  TRS+AKK D+++HL HQL+TLIETTISSA SFLSLN
Sbjct: 1   MATRGLAGDKRTTRSSAMNAAAAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSLN 60

Query: 61  DLHLLPSQTLALESLICSTASSLHGLSPRLPKLALHPP-PPPRQCWFQRFLSATAEVDCD 120
           DLHLLPSQTLALESL+CST+SSLH LSPRLPKL+L  P PPPRQCWFQRFLSAT++VDCD
Sbjct: 61  DLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDCD 120

Query: 121 PRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGI 180
           PRWNLSFRMSKSSFSLLLRLLSPIQS SSSSVPPDCALAAALFRLAHGASYKAVGRRFGI
Sbjct: 121 PRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFGI 180

Query: 181 DSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDL 240
           DSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+L
Sbjct: 181 DSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGEL 240

Query: 241 LGKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLD 300
             K+ SLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS ELLKGPVYNLD
Sbjct: 241 --KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLD 300

Query: 301 DEKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLR 360
           DEKPI QYLIGDSCFPL PWLLTPY++LNEEDSSGF E+AFNSTH RAM LVNTA+CRLR
Sbjct: 301 DEKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLR 360

Query: 361 ARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPL 420
           ARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSEKLDEEQ ++EGASCSSEEQKFP 
Sbjct: 361 ARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPP 420

Query: 421 YDGEIGDDRGKDIRDALALHLSRLSFRR 448
           +DGEIGD RGKDIRDALALHLS LS+RR
Sbjct: 421 FDGEIGDGRGKDIRDALALHLSSLSYRR 446

BLAST of Tan0007856 vs. TAIR 10
Match: AT1G72270.2 (LOCATED IN: mitochondrion; EXPRESSED IN: shoot apex, embryo, flower, seed; EXPRESSED DURING: petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1). )

HSP 1 Score: 294.3 bits (752), Expect = 1.7e-79
Identity = 187/435 (42.99%), Postives = 253/435 (58.16%), Query Frame = 0

Query: 30  KKSDR-------QSHLKHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLICSTASS 89
           +KSDR       +  LK  L+  + +  +  +SFL  NDL L PSQTL LESLI S    
Sbjct: 6   RKSDRNPTNVSDKLGLKDPLLRRLSSAAAVTNSFLQANDLFLSPSQTLRLESLISSL--- 65

Query: 90  LHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDPRWNLSFRMSKSSFSLLLRLLSP 149
                P  P  +          WF RFL++  E + DPRW L FRMSKS+F  L  +L  
Sbjct: 66  -----PISPSPSSSSSAITTTTWFNRFLTSATEDEDDPRWCLYFRMSKSTFFSLYSIL-- 125

Query: 150 IQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGH 209
               S SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL  
Sbjct: 126 ----SHSSLP---SFAATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEKLS- 185

Query: 210 LLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLLGKSESLLVQALVDAEGRFLDV 269
                  +D     F    LPNC GV+G  RF V+G LLG   S+LVQALVD+ GRF+D+
Sbjct: 186 -----QQLDDPKPDFSPNLLPNCYGVVGFGRFEVKGKLLGAKGSILVQALVDSNGRFVDI 245

Query: 270 SAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSCFPLLPWLL 329
           SAGWPS+MKPE I RQ+KL++  E   E+L G    L +   + +Y++GDSC PLLPWL+
Sbjct: 246 SAGWPSTMKPEAIFRQTKLFSIAE---EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLV 305

Query: 330 TPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRARWKLLSKPWKEGCRDFFPFIV 389
           TPY   ++E+S  F E+ FN+  +  +  V  A+ ++RARW++L K WK    +F PF++
Sbjct: 306 TPYDLTSDEES--FREE-FNNVVHTGLHSVEIAFAKVRARWRILDKKWKPETIEFMPFVI 365

Query: 390 LTGCLLHNFLIKCSEKLDE--------EQGED-EGASCSSEEQKFPLYDGEIGDDRGKDI 448
            TGCLLHNFL+   +  D         E G++ E      +E++   ++GE   +  K I
Sbjct: 366 TTGCLLHNFLVNSGDDDDSVEECVNGCEAGDNGEMRKDDDKEEETRSFEGEAYRE-SKRI 410

BLAST of Tan0007856 vs. TAIR 10
Match: AT1G72270.1 (CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G27010.1); Has 772 Blast hits to 657 proteins in 120 species: Archae - 0; Bacteria - 0; Metazoa - 344; Fungi - 94; Plants - 322; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )

HSP 1 Score: 292.0 bits (746), Expect = 8.3e-79
Identity = 185/430 (43.02%), Postives = 250/430 (58.14%), Query Frame = 0

Query: 30  KKSDR-------QSHLKHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLICSTASS 89
           +KSDR       +  LK  L+  + +  +  +SFL  NDL L PSQTL LESLI S    
Sbjct: 6   RKSDRNPTNVSDKLGLKDPLLRRLSSAAAVTNSFLQANDLFLSPSQTLRLESLISSL--- 65

Query: 90  LHGLSPRLPKLALHPPPPPRQCWFQRFLSATAEVDCDPRWNLSFRMSKSSFSLLLRLLSP 149
                P  P  +          WF RFL++  E + DPRW L FRMSKS+F  L  +L  
Sbjct: 66  -----PISPSPSSSSSAITTTTWFNRFLTSATEDEDDPRWCLYFRMSKSTFFSLYSIL-- 125

Query: 150 IQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGH 209
               S SS+P   + AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL  
Sbjct: 126 ----SHSSLP---SFAATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEKLS- 185

Query: 210 LLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGDLLGKSESLLVQALVDAEGRFLDV 269
                  +D     F    LPNC GV+G  RF V+G LLG   S+LVQALVD+ GRF+D+
Sbjct: 186 -----QQLDDPKPDFSPNLLPNCYGVVGFGRFEVKGKLLGAKGSILVQALVDSNGRFVDI 245

Query: 270 SAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSCFPLLPWLL 329
           SAGWPS+MKPE I RQ+KL++  E   E+L G    L +   + +Y++GDSC PLLPWL+
Sbjct: 246 SAGWPSTMKPEAIFRQTKLFSIAE---EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLV 305

Query: 330 TPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRARWKLLSKPWKEGCRDFFPFIV 389
           TPY   ++E+S  F E+ FN+  +  +  V  A+ ++RARW++L K WK    +F PF++
Sbjct: 306 TPYDLTSDEES--FREE-FNNVVHTGLHSVEIAFAKVRARWRILDKKWKPETIEFMPFVI 365

Query: 390 LTGCLLHNFLIKCSEKLDE--------EQGED-EGASCSSEEQKFPLYDGEIGDDRGKDI 443
            TGCLLHNFL+   +  D         E G++ E      +E++   ++GE   +  K I
Sbjct: 366 TTGCLLHNFLVNSGDDDDSVEECVNGCEAGDNGEMRKDDDKEEETRSFEGEAYRE-SKRI 405

BLAST of Tan0007856 vs. TAIR 10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 156.4 bits (394), Expect = 5.5e-38
Identity = 103/356 (28.93%), Postives = 169/356 (47.47%), Query Frame = 0

Query: 105 WFQRFLSATAEVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSSSSSVPPDC-- 164
           W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P     
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLND 113

Query: 165 ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 224
            +A AL RL  G S   +G  FG++ +   +  +   +++ E+  H L   S +D I   
Sbjct: 114 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPSKLDEIKSK 173

Query: 225 FGWIS-LPNCCGVLGL-------------RRFGVEGDLLGKSESLLVQALVDAEGRFLDV 284
           F  IS LPNCCG + +              +  ++G+   K+ S+ +QA+VD + RFLDV
Sbjct: 174 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGE---KNFSMTLQAVVDPDMRFLDV 233

Query: 285 SAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSCFPLLPWLL 344
            AGWP S+  + +L+ S  Y  +EK G+ L G    L +   + +Y++GDS FPLLPWLL
Sbjct: 234 IAGWPGSLNDDVVLKNSGFYKLVEK-GKRLNGEKLPLSERTELREYIVGDSGFPLLPWLL 293

Query: 345 TPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAYCRLRARWKLLSKPWKEGCRDFFPFIV 404
           TPY    +   +  P+  FN  H+ A      A  +L+ RW++++       R+  P I+
Sbjct: 294 TPY----QGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRII 353

Query: 405 LTGCLLHNFLIKCSEKLDEEQGEDEGASCSSEEQKFPLYDGEIGDDRGKDIRDALA 438
              CLLHN +I       E+Q  D+       +  +     ++ D+    +RD L+
Sbjct: 354 FVCCLLHNIIIDM-----EDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELS 396

BLAST of Tan0007856 vs. TAIR 10
Match: AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 133.7 bits (335), Expect = 3.8e-31
Identity = 103/357 (28.85%), Postives = 162/357 (45.38%), Query Frame = 0

Query: 111 SATAEVDCDPRWNLSFRMSKSSFSLLLRLL---------SPIQSSSSSSVPPDCALAAAL 170
           S +   D D  +   FR SK++FS +  L+         S + +     +  +  +A AL
Sbjct: 54  SPSVPSDEDYAFKHFFRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIAL 113

Query: 171 FRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GW 230
            RLA G S  +VG  FG+  +   +  +   +A+ E+  H L       I+ I   F   
Sbjct: 114 RRLASGDSQVSVGAAFGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEM 173

Query: 231 ISLPNCCG-------VLGLRRFGVEGDLLG--KSESLLVQALVDAEGRFLDVSAGWPSSM 290
             LPNCCG       ++ L       D     K+ S+ +Q + D E RFL++  GWP  M
Sbjct: 174 YGLPNCCGAIDTTHIIMTLPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGM 233

Query: 291 KPETILRQSKLYAEIEKSGELLKGPVYNLDDEKPISQYLIGDSCFPLLPWLLTPYMKLNE 350
               +L+ S  + ++ ++ ++L G    L     I +Y++G   +PLLPWL+TP+   + 
Sbjct: 234 TVSKLLKFSGFF-KLCENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHP 293

Query: 351 EDSSGFPEKAFNSTHNRAMGLVNTAYCRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHN 410
            DS      AFN  H +   +  TA+ +L+  W++LSK      R   P I+L  CLLHN
Sbjct: 294 SDSM----VAFNERHEKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHN 353

Query: 411 FLIKCSEKLDEE----QGEDEGASCSSEEQKFPLYDGEIGDDRGKDIRDALALHLSR 443
            +I C + L E+       D G +    +Q  PL         G ++R  L  HL R
Sbjct: 354 IIIDCGDYLQEDVPLSGHHDSGYADRYCKQTEPL---------GSELRGCLTEHLLR 396

BLAST of Tan0007856 vs. TAIR 10
Match: AT3G19120.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 97.8 bits (242), Expect = 2.3e-20
Identity = 123/457 (26.91%), Postives = 200/457 (43.76%), Query Frame = 0

Query: 21  TAVTTRSQAKKSDRQSHLKHQLVTLIETTISSAHSFLSLNDLHLLPSQTLALESLICSTA 80
           ++ +T SQ+  +            L+  T++S  SFL++N                 ST 
Sbjct: 27  SSASTSSQSSTTPSSLLSTSSAAPLLFFTLASLLSFLAVNR---------------SSTE 86

Query: 81  SSLHGLSPRLPKLALHPPPPPRQCWFQ--RFLSATAE----VDC---DPRWNLSFRMSKS 140
           SS    SP     +  PPPP     +    F + T +    +D    D RW   + +S  
Sbjct: 87  SSSSSESP-----SPSPPPPLADGDYSVAAFRALTTDHIWSLDAPLRDARWRSLYGLSYP 146

Query: 141 SFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAV 200
            F  ++  L P  ++S+ S+P D A+A  L RLAHG S K +  R+ +D     +    V
Sbjct: 147 VFITVVDKLKPFITASNLSLPADYAVAMVLSRLAHGCSAKTLASRYSLDPYLISKITNMV 206

Query: 201 CKAINEKL-GHLLELRSDIDRIV---VGFGWI-SLPNCCGVLG-----LRR------FGV 260
            + +  KL    +++     R++    GF  + SLPN CG +      LRR        +
Sbjct: 207 TRLLATKLYPEFIKIPVGKRRLIETTQGFEELTSLPNICGAIDSTPVKLRRRTKLNPRNI 266

Query: 261 EGDLLGKSESLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPV 320
            G   G  +++L+Q + D +  F DV    P      +  R S LY  +  SG+++   V
Sbjct: 267 YGCKYG-YDAVLLQVVADHKKIFWDVCVKAPGGEDDSSHFRDSLLYKRL-TSGDIVWEKV 326

Query: 321 YNLDDEKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPEKAFNSTHNRAMGLVNTAY 380
            N+     +  Y++GD C+PLL +L+TP+   +   S   PE  F+    +   +V  A 
Sbjct: 327 INIRGHH-VRPYIVGDWCYPLLSFLMTPF---SPNGSGTPPENLFDGMLMKGRSVVVEAI 386

Query: 381 CRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQGEDEG-----ASC 440
             L+ARWK+L +    G  +  P  ++  C+LHN L + + + + E  +D       A  
Sbjct: 387 GLLKARWKIL-QSLNVGV-NHAPQTIVACCVLHN-LCQIAREPEPEIWKDPDEAGTPARV 446

Query: 441 SSEEQKFPLYDGEIGDDRGKDIRDALALHL-SRLSFR 447
              E++F  Y        G+ +R ALA  L  RLS R
Sbjct: 447 LESERQFYYY--------GESLRQALAEDLHQRLSSR 446

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9M2U37.7e-3728.93Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1[more]
Q94K495.3e-3028.85Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=... [more]
B0BN957.2e-1124.05Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1[more]
Q8BR934.7e-1023.85Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
KAG6586365.11.1e-23793.06Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022938170.19.1e-23792.84protein ALP1-like [Cucurbita moschata] >XP_022938171.1 protein ALP1-like [Cucurb... [more]
XP_023536803.11.6e-23692.84protein ALP1-like [Cucurbita pepo subsp. pepo][more]
XP_022965738.15.9e-23692.39protein ALP1-like [Cucurbita maxima] >XP_022965739.1 protein ALP1-like [Cucurbit... [more]
XP_038890100.11.4e-22988.82protein ALP1-like [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1FIY24.4e-23792.84protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111444336 PE=3 SV=1[more]
A0A6J1HRT92.9e-23692.39protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111465544 PE=3 SV=1[more]
A0A0A0LFB59.2e-22788.86DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G008700 PE... [more]
A0A5D3BH791.0e-22588.39Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol... [more]
A0A1S3C5W61.0e-22588.39putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103497032 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G72270.21.7e-7942.99LOCATED IN: mitochondrion; EXPRESSED IN: shoot apex, embryo, flower, seed; EXPRE... [more]
AT1G72270.18.3e-7943.02CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR0217... [more]
AT3G55350.15.5e-3828.93PIF / Ping-Pong family of plant transposases [more]
AT3G63270.13.8e-3128.85CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int... [more]
AT3G19120.12.3e-2026.91PIF / Ping-Pong family of plant transposases [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR027806Harbinger transposase-derived nuclease domainPFAMPF13359DDE_Tnp_4coord: 243..389
e-value: 5.8E-16
score: 58.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..35
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 404..427
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 12..29
NoneNo IPR availablePANTHERPTHR22930UNCHARACTERIZEDcoord: 10..444
NoneNo IPR availablePANTHERPTHR22930:SF186LOW PROTEIN: NUCLEASE-LIKE PROTEINcoord: 10..444

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0007856.1Tan0007856.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0035098 ESC/E(Z) complex
cellular_component GO:0035102 PRC1 complex
molecular_function GO:0003682 chromatin binding
molecular_function GO:0046872 metal ion binding