CsGy2G002290 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy2G002290
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: DDE Tnp4 domain-containing protein
Location: Gy14Chr2: 1506280 .. 1508749 (+)
RNA-Seq Expression: CsGy2G002290
Synteny: CsGy2G002290
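
The length of the feature can be derived from the Location field. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the common GFF convention; this page does not state it). The helper names `parse_location` and `span_length` are hypothetical, and the string format is inferred from this record only.

```python
import re

def parse_location(text):
    """Split a location string such as 'Gy14Chr2: 1506280 .. 1508749 (+)'.

    Returns (sequence name, start, end, strand). The pattern is inferred
    from this record; other records may be formatted differently.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    name, start, end, strand = match.groups()
    return name, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1

name, start, end, strand = parse_location("Gy14Chr2: 1506280 .. 1508749 (+)")
print(name, strand, span_length(start, end))
```

Under that assumption the feature spans 2470 bp on the plus strand of Gy14Chr2.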
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCGTCTAATATCTAATGTATGCTGACATGGTGAAAAATTCAAAATAATTGATAAATGGAAAATTTTAATGTTCTAAAAAAATCTCTTTAGTTTATTTTTGTTAAGAATGAAGGAGAAAAGAAATCTATTTTATGAGTTTTAAAAGTCCTTAGTTGTGTAAATTTCCTATAACACTTCCTACAAAGGGACAACACCTCAACAAAGTAAGTTGAACAAGTTCAATTTATTATGGTATCGTTTGAATAATCCTCCCGAGAGTTTCTTGGCTCCTTCTCAATTTGACCATTGCTATCCTACCAAAACCTTCTAAGTCCAATTTACTACCAAACTCATCTTTATTATTTATATTTTCTACTAAAATTGATAGCTATATATATATATATTTTATTAATTTTCGTAAATATATCGAAATATTTAAGATCAAAGTCTACTAAACTATGGTTTCTAAATATAAAGTAGGGTTATCAAATCTAAAACTATAGTTTTTCAAATAAAAACATAAATCCAAGAATTGGAGTTTTTTCCATTTTTGAAATGGTTGGGGTAAATATTTTGGTTGTATTTTTTAAATAAAGAGAATTGTTTTTTCTAATAATTTAGAAAAAAAAAAAAAAACTTATTTTTGTTCACAAAGACAATGATGGCATTCACAGTGTGTGCTCCCCTACCCTGTTCGCCATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGTTCCGCCATGAACGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAAGAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTTCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAATCCCTACTTTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCACTACCTCCACCGCGCCAATGCTGGTTCCAGCGCTTCCTATCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTCCTTTCTCCGATTCAAAGCTCCCCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGGAGACGGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCTAGTGAATTACTGAAGGGTCCTGTTTATAATCTCGACAATGAAAAACCCATTCCCCAATACTTGATCGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAAGATAGTTCTGGCTTTTGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAAAAGTTTCCTCTTTTCGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAG
TAGCCTGAACTACAGAAGATGATTGCTTTGAACACCTTGGTAATTTTAATCTTTCATCAGCTGTATATATTCCTCTCTGATCCTTTAGAACATTTTGTTAAAACTGCAGATTGTAAGCCCTAATTCTCTAGAAGTAGATCTTTTTTTTTTTTTTGTCAGCCATTAGTTTCCTAGTGATTTGATTCTTATTGTTAGAGAAGGGATGTTTACACTTATCGAACATGAAAGTGACTTGAAATCATTCACTCACTATGTAAATTTAAAAGATCTAGTTCATGATTTGGGTCCCTCTGAATGCAATGATTTTGACATATAAATATTCTATCAATTGAGTAGGAACTGCCTTTTGAATCAAGTCATCTTAATAAATTAATCTGAATCCCATTTTTGGAATTGTTCTAGTATAACTCGAACATGAATTTCAGTTGAAGTGAAATAACTATGGATGGTCATGCTTATAGAAAAAGCTC

mRNA sequence

TCGTCTAATATCTAATGTATGCTGACATGGTGAAAAATTCAAAATAATTGATAAATGGAAAATTTTAATGTTCTAAAAAAATCTCTTTAGTTTATTTTTGTTAAGAATGAAGGAGAAAAGAAATCTATTTTATGAGTTTTAAAAGTCCTTAGTTGTGTAAATTTCCTATAACACTTCCTACAAAGGGACAACACCTCAACAAAGTAAGTTGAACAAGTTCAATTTATTATGGTATCGTTTGAATAATCCTCCCGAGAGTTTCTTGGCTCCTTCTCAATTTGACCATTGCTATCCTACCAAAACCTTCTAAGTCCAATTTACTACCAAACTCATCTTTATTATTTATATTTTCTACTAAAATTGATAGCTATATATATATATATTTTATTAATTTTCGTAAATATATCGAAATATTTAAGATCAAAGTCTACTAAACTATGGTTTCTAAATATAAAGTAGGGTTATCAAATCTAAAACTATAGTTTTTCAAATAAAAACATAAATCCAAGAATTGGAGTTTTTTCCATTTTTGAAATGGTTGGGGTAAATATTTTGGTTGTATTTTTTAAATAAAGAGAATTGTTTTTTCTAATAATTTAGAAAAAAAAAAAAAAACTTATTTTTGTTCACAAAGACAATGATGGCATTCACAGTGTGTGCTCCCCTACCCTGTTCGCCATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGTTCCGCCATGAACGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAAGAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTTCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAATCCCTACTTTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCACTACCTCCACCGCGCCAATGCTGGTTCCAGCGCTTCCTATCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTCCTTTCTCCGATTCAAAGCTCCCCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGGAGACGGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCTAGTGAATTACTGAAGGGTCCTGTTTATAATCTCGACAATGAAAAACCCATTCCCCAATACTTGATCGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAAGATAGTTCTGGCTTTTGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAAAAGTTTCCTCTTTTCGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAG
TAGCCTGAACTACAGAAGATGATTGCTTTGAACACCTTGGTAATTTTAATCTTTCATCAGCTGTATATATTCCTCTCTGATCCTTTAGAACATTTTGTTAAAACTGCAGATTGTAAGCCCTAATTCTCTAGAAGTAGATCTTTTTTTTTTTTTTGTCAGCCATTAGTTTCCTAGTGATTTGATTCTTATTGTTAGAGAAGGGATGTTTACACTTATCGAACATGAAAGTGACTTGAAATCATTCACTCACTATGTAAATTTAAAAGATCTAGTTCATGATTTGGGTCCCTCTGAATGCAATGATTTTGACATATAAATATTCTATCAATTGAGTAGGAACTGCCTTTTGAATCAAGTCATCTTAATAAATTAATCTGAATCCCATTTTTGGAATTGTTCTAGTATAACTCGAACATGAATTTCAGTTGAAGTGAAATAACTATGGATGGTCATGCTTATAGAAAAAGCTC

Coding sequence (CDS)

ATGGCCACCAGAGGACTCGCCGGGGACAAGAGAACCACCAGAAGTTCCGCCATGAACGCTGCCGCCGCCGCCATTACCAGAAGCAAGGCCAAGAAACTCGATCAAGAGAACCATCTTAACCATCAACTGATAACCCTCATCGAAACCACCATTTCTTCTGCTCACTCCTTTCTCTCTCTCAACGATCTTCACCTTCTTCCCTCTCAAACCCTCGCCCTTGAATCCCTACTTTGTTCCACTTCATCTTCTCTTCACGCTCTTTCTCCTCGTCTCCCAAAACTTTCCCTACCTCCGCCACTACCTCCACCGCGCCAATGCTGGTTCCAGCGCTTCCTATCCGCGACATCGGACGTCGATTGCGATCCGAGATGGAATCTCTCTTTCCGTATGTCGAAATCCTCTTTCTCCCTCCTCCTTCGTCTCCTTTCTCCGATTCAAAGCTCCCCATCCTCTTCAGTTCCTCCCGATTGTGCTTTAGCTGCTGCGCTTTTCCGATTGGCGCATGGCGCGAGCTACAAGGCGGTTGGGAGACGGTTTGGGATCGATTCTGCTGATGCTTGTCGGTCGTTTTATGCTGTTTGTAAAGCTATTAATGAGAAATTGGGGCATTTGCTTGAGTTACGGTCTGACATTGATCGGATTGTTGTGGGATTTGGGTGGATTTCGCTTCCGAATTGTTGTGGGGTTTTAGGTCTAAGAAGATTTGGGTTTGAAGGTGAGCTGAAAAATGGATCGCTTCTGGTTCAAGCATTAGTCGATGCTGAAGGGAGGTTTCTGGATGTCTCTGCTGGTTGGCCGAGCTCCATGAAACCTGCAACAATCTTGCGGCAGAGCAAACTATATGCAGAAATTGAGAAATCTAGTGAATTACTGAAGGGTCCTGTTTATAATCTCGACAATGAAAAACCCATTCCCCAATACTTGATCGGTGATTCTTGCTTCCCCCTTTTGCCATGGCTTTTGACACCATATATGGAACTGAATGAAGAAGATAGTTCTGGCTTTTGTGGGAGAGCATTCAATTCCACACATGGCCGTGCAATGGCGTTGGTTAACACAGCATTTTGCAGACTCCGAGCTCGGTGGAAGCTCTTGTCAAAACCATGGAAGGAAGGATGTAGAGATTTTTTCCCATTTATTATATTGACTGGATGTCTGCTGCAGAATTTCCTGATTAAATGCAGTGAGAAACTAGATGAAGAGCAAGATCAAGAAGAAGGAGCAAGTTGCTCAAGTGAGGAGCAAAAGTTTCCTCTTTTCGATGGTGAGATAGGAGATGGTAGAGGAAAGGATATCAGAGATGCCCTTGCCTTGCACTTGAGTAGCCTGAACTACAGAAGATGA

Protein sequence

MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR*
Homology
BLAST of CsGy2G002290 vs. ExPASy Swiss-Prot
Match: Q9M2U3 (Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.0e-33
Identity = 104/361 (28.81%), Positives = 161/361 (44.60%), Query Frame = 0

Query: 107 WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSPSSSVPPDC-- 166
           W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P     
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLND 113

Query: 167 ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 226
            +A AL RL  G S   +G  FG++ +   +  +   +++ E+  H L   S +D I   
Sbjct: 114 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPSKLDEIKSK 173

Query: 227 FGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSA 286
           F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV A
Sbjct: 174 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGE-KNFSMTLQAVVDPDMRFLDVIA 233

Query: 287 GWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTP 346
           GWP S+    +L+ S  Y  +EK    L G    L     + +Y++GDS FPLLPWLLTP
Sbjct: 234 GWPGSLNDDVVLKNSGFYKLVEKGKR-LNGEKLPLSERTELREYIVGDSGFPLLPWLLTP 293

Query: 347 YMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT 406
           Y    +   +      FN  H  A      A  +L+ RW++++       R+  P II  
Sbjct: 294 Y----QGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFV 353

Query: 407 GCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSL 445
            CLL N +I       E+Q  ++       +  +     ++ D     +RD L+  L   
Sbjct: 354 CCLLHNIIIDM-----EDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 403

BLAST of CsGy2G002290 vs. ExPASy Swiss-Prot
Match: Q94K49 (Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=3702 GN=ALP1 PE=1 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 2.7e-29
Identity = 92/297 (30.98%), Positives = 143/297 (48.15%), Query Frame = 0

Query: 128 FRMSKSSFSLLLRLLSP--IQSSPSSSVPPDCAL-------AAALFRLAHGASYKAVGRR 187
           FR SK++FS +  L+    I   PS  +  +  L       A AL RLA G S  +VG  
Sbjct: 69  FRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAA 128

Query: 188 FGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRF 247
           FG+  +   +  +   +A+ E+  H L       I+ I   F     LPNCCG +     
Sbjct: 129 FGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEMYGLPNCCGAIDTTHI 188

Query: 248 -----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEI 307
                       +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  + ++
Sbjct: 189 IMTLPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFF-KL 248

Query: 308 EKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTH 367
            +++++L G    L     I +Y++G   +PLLPWL+TP+   +  DS      AFN  H
Sbjct: 249 CENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSM----VAFNERH 308

Query: 368 GRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE 402
            +  ++  TAF +L+  W++LSK      R   P IIL  CLL N +I C + L E+
Sbjct: 309 EKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQED 360

BLAST of CsGy2G002290 vs. ExPASy Swiss-Prot
Match: B0BN95 (Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 1.2e-10
Identity = 67/277 (24.19%), Positives = 104/277 (37.55%), Query Frame = 0

Query: 138 LLRLLSPIQSSP---SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC 197
           L+ LL    S P   S ++ P+  + AAL     G+    +G   GI  A   R    V 
Sbjct: 49  LVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVT 108

Query: 198 KAINEKLGHLLELRSDIDRIV----VGFGWISLPNCCGVL----------GLRRFGFEGE 257
           +A+ E+    +   +D   I       +G   +P   G +                +   
Sbjct: 109 EALVERASQFIHFPADEAAIQSLKDEFYGLAGMPGVIGAVDCIHVAIKAPNAEDLSYVNR 168

Query: 258 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 317
               SL    + D  G  + V   WP S++   +L+QS L ++ E               
Sbjct: 169 KGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQQSSLSSQFETG------------- 228

Query: 318 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLR- 377
             P   +L+GDS F L  WLLTP + + E  +     RA ++TH      + T  CR R 
Sbjct: 229 -MPKDSWLLGDSSFFLHTWLLTP-LHIPETPAEYRYNRAHSATHSVIEKTLRTLCCRFRC 288

Query: 378 ---ARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIK 394
              ++  L   P K         IIL  C+L N  ++
Sbjct: 289 LDGSKGALQYSPEKSS------HIILACCVLHNISLE 304

BLAST of CsGy2G002290 vs. ExPASy Swiss-Prot
Match: Q8BR93 (Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 1.0e-09
Identity = 66/275 (24.00%), Positives = 101/275 (36.73%), Query Frame = 0

Query: 138 LLRLLSPIQSSP---SSSVPPDCALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVC 197
           L+ LL    S P   S ++ P+  + AAL     G+    +G   GI  A   R    V 
Sbjct: 49  LVELLGASLSRPTQRSRAISPETQILAALGFYTSGSFQTRMGDAIGISQASMSRCVANVT 108

Query: 198 KAINEKLGHLLELRSDIDRIVVG------FGWISLPNCCGVL----------GLRRFGFE 257
           +A+ E+    +     +D   V       +G   +P   GV                 + 
Sbjct: 109 EALVERASQFIHF--PVDEAAVQSLKDEFYGLAGMPGVIGVADCIHVAIKAPNAEDLSYV 168

Query: 258 GELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 317
                 SL    + D  G  + V   WP S++   +L++S L ++ E             
Sbjct: 169 NRKGLHSLNCLVVCDIRGALMTVETSWPGSLQDCAVLQRSSLTSQFETG----------- 228

Query: 318 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 377
               P   +L+GDS F L  WLLTP + + E  +     RA ++TH      + T  CR 
Sbjct: 229 ---MPKDSWLLGDSSFFLRSWLLTP-LPIPETAAEYRYNRAHSATHSVIERTLQTLCCRF 288

Query: 378 RARWKLLSKPWKEGCRDFFP----FIILTGCLLQN 390
           R           +G   + P     IIL  C+L N
Sbjct: 289 RC------LDGSKGALQYSPEKCSHIILACCVLHN 300

BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_004139403.1 (protein ALP1-like [Cucumis sativus] >KGN60730.1 hypothetical protein Csa_019280 [Cucumis sativus])

HSP 1 Score: 895 bits (2314), Expect = 0.0
Identity = 447/447 (100.00%), Positives = 447/447 (100.00%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447
           DGEIGDGRGKDIRDALALHLSSLNYRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447

BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_008457314.1 (PREDICTED: putative nuclease HARBI1 [Cucumis melo] >KAA0063508.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] >TYJ97835.1 putative nuclease HARBI1 [Cucumis melo var. makuwa])

HSP 1 Score: 859 bits (2220), Expect = 5.51e-314
Identity = 434/447 (97.09%), Positives = 437/447 (97.76%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMNAA AAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMNAA-AAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP F
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPPF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447
           DGEIGDGRGKDIRDALALHLSSL+YRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLSYRR 446

BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_038890100.1 (protein ALP1-like [Benincasa hispida])

HSP 1 Score: 803 bits (2075), Expect = 9.87e-292
Identity = 410/456 (89.91%), Positives = 428/456 (93.86%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAI---TRSKAKKLDQENHLNHQLITLIETTISSAHSF 60
           MATRG+ GDKRTTRSS++NA AAA    TRSKAKKLD+E+HL HQL+TLI+TTISSAHSF
Sbjct: 1   MATRGIGGDKRTTRSSSINAVAAATVATTRSKAKKLDRESHLYHQLVTLIQTTISSAHSF 60

Query: 61  LSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPP----RQCWFQRFLS 120
           LSLNDLHLLPSQTLALESLL STSSSL+ALSPRLPKL+LPPP PPP    RQCWFQRFLS
Sbjct: 61  LSLNDLHLLPSQTLALESLLGSTSSSLYALSPRLPKLTLPPPPPPPPPPPRQCWFQRFLS 120

Query: 121 ATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYK 180
           ATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSS SSSVPPDCALAAALFRLAHGASYK
Sbjct: 121 ATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSSSSSVPPDCALAAALFRLAHGASYK 180

Query: 181 AVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLR 240
           AVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLR
Sbjct: 181 AVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLR 240

Query: 241 RFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELL 300
           RFG E EL  KNGSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLY EIEKS+ELL
Sbjct: 241 RFGVESELLGKNGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYEEIEKSNELL 300

Query: 301 KGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALV 360
           KGPVYNLD++KPIPQYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAMALV
Sbjct: 301 KGPVYNLDDDKPIPQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMALV 360

Query: 361 NTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCS 420
           NTAF RLRARWKLLSKPWKEGCRDFFPFI+LTGCLL NFLIKCSEKLDEEQDQEE A CS
Sbjct: 361 NTAFGRLRARWKLLSKPWKEGCRDFFPFIVLTGCLLHNFLIKCSEKLDEEQDQEEEAICS 420

Query: 421 SEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYRR 447
           SE+QKFPL+DG+IGD RGKDIRDALALHLSSL+YRR
Sbjct: 421 SEDQKFPLYDGKIGDDRGKDIRDALALHLSSLSYRR 456

BLAST of CsGy2G002290 vs. NCBI nr
Match: KAG6586365.1 (Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 771 bits (1990), Expect = 5.79e-279
Identity = 390/449 (86.86%), Positives = 414/449 (92.20%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSL
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVT-TRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP   PRQCWFQRFLSAT++VDC
Sbjct: 61  NDLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLHPP---PRQCWFQRFLSATAEVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+
Sbjct: 181 IDSADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGD 240

Query: 241 L--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 300
           L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Sbjct: 241 LLGKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNL 300

Query: 301 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 360
           D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++
Sbjct: 301 DDGKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKV 360

Query: 361 RARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP 420
           RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL+EEQD+++GASCSSEEQKFP
Sbjct: 361 RARWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLEEEQDEDDGASCSSEEQKFP 420

Query: 421 LFDGEIGDGRGKDIRDALALHLSSLNYRR 447
           L+DGE GD RGKDIRDALALHLS L++RR
Sbjct: 421 LYDGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of CsGy2G002290 vs. NCBI nr
Match: XP_022938170.1 (protein ALP1-like [Cucurbita moschata] >XP_022938171.1 protein ALP1-like [Cucurbita moschata] >KAG7021213.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 768 bits (1982), Expect = 9.58e-278
Identity = 389/449 (86.64%), Positives = 413/449 (91.98%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSL
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVT-TRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP   PRQCWFQRFLSAT++VDC
Sbjct: 61  NDLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLHPP---PRQCWFQRFLSATAEVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+
Sbjct: 181 IDSADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGD 240

Query: 241 L--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 300
           L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Sbjct: 241 LLGKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNL 300

Query: 301 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 360
           D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++
Sbjct: 301 DDGKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKV 360

Query: 361 RARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP 420
           RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL+EEQD+++GASCSSEEQKF 
Sbjct: 361 RARWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLEEEQDEDDGASCSSEEQKFA 420

Query: 421 LFDGEIGDGRGKDIRDALALHLSSLNYRR 447
           L+DGE GD RGKDIRDALALHLS L++RR
Sbjct: 421 LYDGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of CsGy2G002290 vs. ExPASy TrEMBL
Match: A0A0A0LFB5 (DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G008700 PE=3 SV=1)

HSP 1 Score: 895 bits (2314), Expect = 0.0
Identity = 447/447 (100.00%), Positives = 447/447 (100.00%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447
           DGEIGDGRGKDIRDALALHLSSLNYRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447

BLAST of CsGy2G002290 vs. ExPASy TrEMBL
Match: A0A5D3BH79 (Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold285G00450 PE=3 SV=1)

HSP 1 Score: 859 bits (2220), Expect = 2.67e-314
Identity = 434/447 (97.09%), Positives = 437/447 (97.76%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMNAA AAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMNAA-AAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP F
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPPF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447
           DGEIGDGRGKDIRDALALHLSSL+YRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLSYRR 446

BLAST of CsGy2G002290 vs. ExPASy TrEMBL
Match: A0A1S3C5W6 (putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103497032 PE=3 SV=1)

HSP 1 Score: 859 bits (2220), Expect = 2.67e-314
Identity = 434/447 (97.09%), Positives = 437/447 (97.76%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MATRGLAGDKRTTRSSAMNAA AAITRSKAKKLDQENHLNHQLITLIETTISSA SFLSL
Sbjct: 1   MATRGLAGDKRTTRSSAMNAA-AAITRSKAKKLDQENHLNHQLITLIETTISSARSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLP PLPPPRQCWFQRFLSATSDVDC
Sbjct: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPSPLPPPRQCWFQRFLSATSDVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNLSFRMSKSSFSLLLRLLSPIQS  SSSVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSPSSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE
Sbjct: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240

Query: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDN 300
           LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY EIEKSSELLKGPVYNLD+
Sbjct: 241 LKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYEEIEKSSELLKGPVYNLDD 300

Query: 301 EKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRA 360
           EKPIPQYLIGDSCFPL PWLLTPY+ELNEEDSSGF  RAFNSTHGRAMALVNTAFCRLRA
Sbjct: 301 EKPIPQYLIGDSCFPLFPWLLTPYIELNEEDSSGFRERAFNSTHGRAMALVNTAFCRLRA 360

Query: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLF 420
           RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP F
Sbjct: 361 RWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPPF 420

Query: 421 DGEIGDGRGKDIRDALALHLSSLNYRR 447
           DGEIGDGRGKDIRDALALHLSSL+YRR
Sbjct: 421 DGEIGDGRGKDIRDALALHLSSLSYRR 446

BLAST of CsGy2G002290 vs. ExPASy TrEMBL
Match: A0A6J1HRT9 (protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111465544 PE=3 SV=1)

HSP 1 Score: 768 bits (1982), Expect = 4.64e-278
Identity = 389/449 (86.64%), Positives = 413/449 (91.98%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSL
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVT-TRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP   PRQCWFQRFLSAT++VDC
Sbjct: 61  NDLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLHPP---PRQCWFQRFLSATAEVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+
Sbjct: 181 IDSADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGD 240

Query: 241 L--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 300
           L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Sbjct: 241 LLGKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNL 300

Query: 301 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 360
           D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++
Sbjct: 301 DDGKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKV 360

Query: 361 RARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP 420
           RARWKLLSKPWKE CRDFFPF++LTGCLL NFLIKCSEKL+EEQD+E+GAS SSEEQKFP
Sbjct: 361 RARWKLLSKPWKEECRDFFPFVVLTGCLLHNFLIKCSEKLEEEQDEEDGASSSSEEQKFP 420

Query: 421 LFDGEIGDGRGKDIRDALALHLSSLNYRR 447
           L+DGE GD RGKDIRDALALHLS L++RR
Sbjct: 421 LYDGETGDDRGKDIRDALALHLSRLSFRR 445

BLAST of CsGy2G002290 vs. ExPASy TrEMBL
Match: A0A6J1FIY2 (protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111444336 PE=3 SV=1)

HSP 1 Score: 768 bits (1982), Expect = 4.64e-278
Identity = 389/449 (86.64%), Positives = 413/449 (91.98%), Query Frame = 0

Query: 1   MATRGLAGDKRTTRSSAMNAAAAAITRSKAKKLDQENHLNHQLITLIETTISSAHSFLSL 60
           MA  G +GDKRTTRSSA+NA A   TRSKAKK D++NHL HQL+TLIETTISSAHSFLSL
Sbjct: 1   MAAGGFSGDKRTTRSSAINAGAVT-TRSKAKKSDRKNHLKHQLVTLIETTISSAHSFLSL 60

Query: 61  NDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPPPLPPPRQCWFQRFLSATSDVDC 120
           NDLHLLPSQTLALES + STSSSL ALSP LPKLSL PP   PRQCWFQRFLSAT++VDC
Sbjct: 61  NDLHLLPSQTLALESFIYSTSSSLQALSPCLPKLSLHPP---PRQCWFQRFLSATAEVDC 120

Query: 121 DPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFG 180
           DPRWNL FRMSKSSFSLLLRLLSPIQSS S+SVPPDCALAAALFRLAHGASYKAVGRRFG
Sbjct: 121 DPRWNLFFRMSKSSFSLLLRLLSPIQSSSSTSVPPDCALAAALFRLAHGASYKAVGRRFG 180

Query: 181 IDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGFEGE 240
           IDSADACRSFYAVCKAIN+KLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFG EG+
Sbjct: 181 IDSADACRSFYAVCKAINDKLGHLLELRSDIDRIVVGFGWISLPNCCGVLGLRRFGVEGD 240

Query: 241 L--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNL 300
           L  K+GSLLVQALVDAEGRFLDVSAGWPSSMKP TILRQSKLYAEIEKS ELLKGPVYNL
Sbjct: 241 LLGKDGSLLVQALVDAEGRFLDVSAGWPSSMKPETILRQSKLYAEIEKSGELLKGPVYNL 300

Query: 301 DNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRL 360
           D+ KPI QYLIGDSCFPLLPWLLTPYM+LNEEDSSGF  RAFNSTH RAM LVNTAFC++
Sbjct: 301 DDGKPISQYLIGDSCFPLLPWLLTPYMKLNEEDSSGFPERAFNSTHNRAMGLVNTAFCKV 360

Query: 361 RARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFP 420
           RARWKLLSKPWKE CRDFFPFI+LTGCLL NFLIKCSEKL+EEQD+++GASCSSEEQKF 
Sbjct: 361 RARWKLLSKPWKEECRDFFPFIVLTGCLLHNFLIKCSEKLEEEQDEDDGASCSSEEQKFA 420

Query: 421 LFDGEIGDGRGKDIRDALALHLSSLNYRR 447
           L+DGE GD RGKDIRDALALHLS L++RR
Sbjct: 421 LYDGETGDDRGKDIRDALALHLSRLSFRR 445
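
The HSP summary for this hit reports identity as a count of matching positions over the alignment length, followed by a percentage. As a minimal sketch (the function name `parse_hsp_fraction` and the regular expression are illustrative assumptions, not part of the database's own tooling), the percentage can be recomputed from the counts; the other count on the same summary line follows the same `label = matches/length` pattern.

```python
import re

def parse_hsp_fraction(line: str, label: str) -> tuple[int, int, float]:
    """Return (matches, alignment_length, percent) for a field such as 'Identity'.

    Hypothetical helper: it only assumes the 'Label = N/M' layout seen in the
    HSP summary lines of this page.
    """
    m = re.search(rf"{re.escape(label)}\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"{label!r} not found in line")
    matches, length = int(m.group(1)), int(m.group(2))
    # Percent is rounded to two decimals, as displayed on the page.
    return matches, length, round(100.0 * matches / length, 2)

summary = "Identity = 389/449 (86.64%)"
print(parse_hsp_fraction(summary, "Identity"))  # (389, 449, 86.64)
```

Note that the denominator (449) is the alignment length including gap columns, so it can exceed the 447 query residues covered by the alignment.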

BLAST of CsGy2G002290 vs. TAIR 10
Match: AT1G72270.2 (LOCATED IN: mitochondrion; EXPRESSED IN: shoot apex, embryo, flower, seed; EXPRESSED DURING: petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1). )

HSP 1 Score: 280.0 bits (715), Expect = 3.3e-75
Identity = 178/421 (42.28%), Positives = 239/421 (56.77%), Query Frame = 0

Query: 39  LNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPP 98
           L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S   S    SP     ++  
Sbjct: 21  LKDPLLRRLSSAAAVTNSFLQANDLFLSPSQTLRLESLISSLPIS---PSPSSSSSAI-- 80

Query: 99  PLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCA 158
                   WF RFL++ ++ + DPRW L FRMSKS+F  L  +LS       SS+P   +
Sbjct: 81  ----TTTTWFNRFLTSATEDEDDPRWCLYFRMSKSTFFSLYSILS------HSSLP---S 140

Query: 159 LAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 218
            AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     
Sbjct: 141 FAATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEKLS------QQLDDPKPD 200

Query: 219 FGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATIL 278
           F    LPNC GV+G  RF  +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I 
Sbjct: 201 FSPNLLPNCYGVVGFGRFEVKGKLLGAKGSILVQALVDSNGRFVDISAGWPSTMKPEAIF 260

Query: 279 RQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF 338
           RQ+KL++  E   E+L G    L N   +P+Y++GDSC PLLPWL+TPY   ++E+S   
Sbjct: 261 RQTKLFSIAE---EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLVTPYDLTSDEES--- 320

Query: 339 CGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCS 398
               FN+     +  V  AF ++RARW++L K WK    +F PF+I TGCLL NFL+   
Sbjct: 321 FREEFNNVVHTGLHSVEIAFAKVRARWRILDKKWKPETIEFMPFVITTGCLLHNFLVNSG 380

Query: 399 EKLD---------EEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSLNYR 448
           +  D         E  D  E      +E++   F+GE      K IRDA+A +LS ++  
Sbjct: 381 DDDDSVEECVNGCEAGDNGEMRKDDDKEEETRSFEGE-AYRESKRIRDAIAENLSRVSSL 410

BLAST of CsGy2G002290 vs. TAIR 10
Match: AT1G72270.1 (CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G27010.1); Has 772 Blast hits to 657 proteins in 120 species: Archae - 0; Bacteria - 0; Metazoa - 344; Fungi - 94; Plants - 322; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )

HSP 1 Score: 278.9 bits (712), Expect = 7.3e-75
Identity = 177/415 (42.65%), Positives = 236/415 (56.87%), Query Frame = 0

Query: 39  LNHQLITLIETTISSAHSFLSLNDLHLLPSQTLALESLLCSTSSSLHALSPRLPKLSLPP 98
           L   L+  + +  +  +SFL  NDL L PSQTL LESL+ S   S    SP     ++  
Sbjct: 21  LKDPLLRRLSSAAAVTNSFLQANDLFLSPSQTLRLESLISSLPIS---PSPSSSSSAI-- 80

Query: 99  PLPPPRQCWFQRFLSATSDVDCDPRWNLSFRMSKSSFSLLLRLLSPIQSSPSSSVPPDCA 158
                   WF RFL++ ++ + DPRW L FRMSKS+F  L  +LS       SS+P   +
Sbjct: 81  ----TTTTWFNRFLTSATEDEDDPRWCLYFRMSKSTFFSLYSILS------HSSLP---S 140

Query: 159 LAAALFRLAHGASYKAVGRRFGIDS-ADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 218
            AA +FRLAHGASY+ +  RFG DS + A RSF+ VCK INEKL         +D     
Sbjct: 141 FAATIFRLAHGASYECLVHRFGFDSTSQASRSFFTVCKLINEKLS------QQLDDPKPD 200

Query: 219 FGWISLPNCCGVLGLRRFGFEGEL--KNGSLLVQALVDAEGRFLDVSAGWPSSMKPATIL 278
           F    LPNC GV+G  RF  +G+L    GS+LVQALVD+ GRF+D+SAGWPS+MKP  I 
Sbjct: 201 FSPNLLPNCYGVVGFGRFEVKGKLLGAKGSILVQALVDSNGRFVDISAGWPSTMKPEAIF 260

Query: 279 RQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGF 338
           RQ+KL++  E   E+L G    L N   +P+Y++GDSC PLLPWL+TPY   ++E+S   
Sbjct: 261 RQTKLFSIAE---EVLSGAPTKLGNGVLVPRYILGDSCLPLLPWLVTPYDLTSDEES--- 320

Query: 339 CGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCS 398
               FN+     +  V  AF ++RARW++L K WK    +F PF+I TGCLL NFL+   
Sbjct: 321 FREEFNNVVHTGLHSVEIAFAKVRARWRILDKKWKPETIEFMPFVITTGCLLHNFLVNSG 380

Query: 399 EKLD---------EEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLS 442
           +  D         E  D  E      +E++   F+GE      K IRDA+A +LS
Sbjct: 381 DDDDSVEECVNGCEAGDNGEMRKDDDKEEETRSFEGE-AYRESKRIRDAIAENLS 404

BLAST of CsGy2G002290 vs. TAIR 10
Match: AT3G55350.1 (PIF / Ping-Pong family of plant transposases )

HSP 1 Score: 146.0 bits (367), Expect = 7.4e-35
Identity = 104/361 (28.81%), Positives = 161/361 (44.60%), Query Frame = 0

Query: 107 WFQRFLSATSDVDCDPR-WNLSFRMSKSSFSLLLRLL------SPIQSSPSSSVPPDC-- 166
           W+  F         DP+ +   F++S+ +F  +  L+       P   S S+  P     
Sbjct: 54  WWDGFSRRIYGGSTDPKTFESVFKISRKTFDYICSLVKADFTAKPANFSDSNGNPLSLND 113

Query: 167 ALAAALFRLAHGASYKAVGRRFGIDSADACRSFYAVCKAINEKLGHLLELRSDIDRIVVG 226
            +A AL RL  G S   +G  FG++ +   +  +   +++ E+  H L   S +D I   
Sbjct: 114 RVAVALRRLGSGESLSVIGETFGMNQSTVSQITWRFVESMEERAIHHLSWPSKLDEIKSK 173

Query: 227 FGWIS-LPNCCGVLGL-------------RRFGFEGELKNGSLLVQALVDAEGRFLDVSA 286
           F  IS LPNCCG + +              +   +GE KN S+ +QA+VD + RFLDV A
Sbjct: 174 FEKISGLPNCCGAIDITHIVMNLPAVEPSNKVWLDGE-KNFSMTLQAVVDPDMRFLDVIA 233

Query: 287 GWPSSMKPATILRQSKLYAEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTP 346
           GWP S+    +L+ S  Y  +EK    L G    L     + +Y++GDS FPLLPWLLTP
Sbjct: 234 GWPGSLNDDVVLKNSGFYKLVEKGKR-LNGEKLPLSERTELREYIVGDSGFPLLPWLLTP 293

Query: 347 YMELNEEDSSGFCGRAFNSTHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILT 406
           Y    +   +      FN  H  A      A  +L+ RW++++       R+  P II  
Sbjct: 294 Y----QGKPTSLPQTEFNKRHSEATKAAQMALSKLKDRWRIINGVMWMPDRNRLPRIIFV 353

Query: 407 GCLLQNFLIKCSEKLDEEQDQEEGASCSSEEQKFPLFDGEIGDGRGKDIRDALALHLSSL 445
            CLL N +I       E+Q  ++       +  +     ++ D     +RD L+  L   
Sbjct: 354 CCLLHNIIIDM-----EDQTLDDQPLSQQHDMNYRQRSCKLADEASSVLRDELSDQLCGK 403

BLAST of CsGy2G002290 vs. TAIR 10
Match: AT3G63270.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: PIF / Ping-Pong family of plant transposases (TAIR:AT3G55350.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 131.3 bits (329), Expect = 1.9e-30
Identity = 92/297 (30.98%), Positives = 143/297 (48.15%), Query Frame = 0

Query: 128 FRMSKSSFSLLLRLLSP--IQSSPSSSVPPDCAL-------AAALFRLAHGASYKAVGRR 187
           FR SK++FS +  L+    I   PS  +  +  L       A AL RLA G S  +VG  
Sbjct: 69  FRASKTTFSYICSLVREDLISRPPSGLINIEGRLLSVEKQVAIALRRLASGDSQVSVGAA 128

Query: 188 FGIDSADACRSFYAVCKAINEKLGHLLELRSD--IDRIVVGF-GWISLPNCCGVLGLRRF 247
           FG+  +   +  +   +A+ E+  H L       I+ I   F     LPNCCG +     
Sbjct: 129 FGVGQSTVSQVTWRFIEALEERAKHHLRWPDSDRIEEIKSKFEEMYGLPNCCGAIDTTHI 188

Query: 248 -----------GFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLYAEI 307
                       +  + KN S+ +Q + D E RFL++  GWP  M  + +L+ S  + ++
Sbjct: 189 IMTLPAVQASDDWCDQEKNYSMFLQGVFDHEMRFLNMVTGWPGGMTVSKLLKFSGFF-KL 248

Query: 308 EKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFNSTH 367
            +++++L G    L     I +Y++G   +PLLPWL+TP+   +  DS      AFN  H
Sbjct: 249 CENAQILDGNPKTLSQGAQIREYVVGGISYPLLPWLITPHDSDHPSDSM----VAFNERH 308

Query: 368 GRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE 402
            +  ++  TAF +L+  W++LSK      R   P IIL  CLL N +I C + L E+
Sbjct: 309 EKVRSVAATAFQQLKGSWRILSKVMWRPDRRKLPSIILVCCLLHNIIIDCGDYLQED 360

BLAST of CsGy2G002290 vs. TAIR 10
Match: AT5G12010.1 (unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, plasma membrane, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G29780.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 94.7 bits (234), Expect = 2.0e-19
Identity = 77/300 (25.67%), Positives = 126/300 (42.00%), Query Frame = 0

Query: 127 SFRMSKSSFSLLLRLLSPI----QSSPSSSVPPDCALAAALFRLAHGASYKAVGRRFGID 186
           +FRMSKS+F L+   L+       ++  +++P    +A  ++RLA G   + V ++FG+ 
Sbjct: 178 AFRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAVCIWRLATGEPLRLVSKKFGLG 237

Query: 187 SADACRSFYAVCKAINE----------------KLGHLLELRSDIDRIVVGFGWISLPNC 246
            +   +    VCKAI +                 +    E  S I  +V       +P  
Sbjct: 238 ISTCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERFESVSGIPNVVGSMYTTHIPII 297

Query: 247 CGVLGL-----RRFGFEGELKNGSLLVQALVDAEGRFLDVSAGWPSSMKPATILRQSKLY 306
              + +     +R     +  + S+ +QA+V+ +G F D+  GWP SM    +L +S LY
Sbjct: 298 APKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTDLCIGWPGSMPDDKVLEKSLLY 357

Query: 307 AEIEKSSELLKGPVYNLDNEKPIPQYLIGDSCFPLLPWLLTPYMELNEEDSSGFCGRAFN 366
                   LLKG             ++ G    PLL W+L PY + N      +   AFN
Sbjct: 358 QRANNGG-LLKG------------MWVAGGPGHPLLDWVLVPYTQQN----LTWTQHAFN 417

Query: 367 STHGRAMALVNTAFCRLRARWKLLSKPWKEGCRDFFPFIILTGCLLQNFLIKCSEKLDEE 402
                   +   AF RL+ RW  L K  +   +D  P ++   C+L N      EK++ E
Sbjct: 418 EKMSEVQGVAKEAFGRLKGRWACLQKRTEVKLQD-LPTVLGACCVLHNICEMREEKMEPE 459

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)  Description
Q9M2U3          1.0e-33    28.81         Protein ALP1-like OS=Arabidopsis thaliana OX=3702 GN=At3g55350 PE=2 SV=1
Q94K49          2.7e-29    30.98         Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 OS=Arabidopsis thaliana OX=...
B0BN95          1.2e-10    24.19         Putative nuclease HARBI1 OS=Rattus norvegicus OX=10116 GN=Harbi1 PE=2 SV=1
Q8BR93          1.0e-09    24.00         Putative nuclease HARBI1 OS=Mus musculus OX=10090 GN=Harbi1 PE=2 SV=1

Match Name      E-value    Identity (%)  Description
XP_004139403.1  0.0        100.00        protein ALP1-like [Cucumis sativus] >KGN60730.1 hypothetical protein Csa_019280 ...
XP_008457314.1  5.51e-314  97.09         PREDICTED: putative nuclease HARBI1 [Cucumis melo] >KAA0063508.1 putative nuclea...
XP_038890100.1  9.87e-292  89.91         protein ALP1-like [Benincasa hispida]
KAG6586365.1    5.79e-279  86.86         Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]
XP_022938170.1  9.58e-278  86.64         protein ALP1-like [Cucurbita moschata] >XP_022938171.1 protein ALP1-like [Cucurb...

Match Name      E-value    Identity (%)  Description
A0A0A0LFB5      0.0        100.00        DDE Tnp4 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G008700 PE...
A0A5D3BH79      2.67e-314  97.09         Putative nuclease HARBI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A1S3C5W6      2.67e-314  97.09         putative nuclease HARBI1 OS=Cucumis melo OX=3656 GN=LOC103497032 PE=3 SV=1
A0A6J1HRT9      4.64e-278  86.64         protein ALP1-like OS=Cucurbita maxima OX=3661 GN=LOC111465544 PE=3 SV=1
A0A6J1FIY2      4.64e-278  86.64         protein ALP1-like OS=Cucurbita moschata OX=3662 GN=LOC111444336 PE=3 SV=1

Match Name      E-value    Identity (%)  Description
AT1G72270.2     3.3e-75    42.28         LOCATED IN: mitochondrion; EXPRESSED IN: shoot apex, embryo, flower, seed; EXPRE...
AT1G72270.1     7.3e-75    42.65         CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR0217...
AT3G55350.1     7.4e-35    28.81         PIF / Ping-Pong family of plant transposases
AT3G63270.1     1.9e-30    30.98         CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G12010.1     2.0e-19    25.67         unknown protein; INVOLVED IN: response to salt stress; LOCATED IN: chloroplast, ...
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                Source       Source Term      Source Description                  Alignment
IPR027806  Harbinger transposase-derived nuclease domain  PFAM         PF13359          DDE_Tnp_4                           coord: 240..389; e-value: 2.6E-15; score: 56.4
None       No IPR available                               MOBIDB_LITE  mobidb-lite      disorder_prediction                 coord: 404..427
None       No IPR available                               MOBIDB_LITE  mobidb-lite      disorder_prediction                 coord: 1..21
None       No IPR available                               PANTHER      PTHR22930        UNCHARACTERIZED                     coord: 9..443
None       No IPR available                               PANTHER      PTHR22930:SF186  LOW PROTEIN: NUCLEASE-LIKE PROTEIN  coord: 9..443
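
The alignment coordinates above are residue ranges on the protein. A minimal sketch, assuming the ranges are 1-based and inclusive (an assumption about the coordinate convention, not something this page states) and using hypothetical helper names, shows how to get a feature's length and check whether the Pfam DDE_Tnp_4 domain overlaps either predicted disordered region:

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range (assumed convention)."""
    return end - start + 1

def overlaps(a: tuple[int, int], b: tuple[int, int]) -> bool:
    """True if two inclusive ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]

# Coordinates copied from the InterPro annotation rows.
pfam_dde_tnp_4 = (240, 389)
disorder_regions = [(1, 21), (404, 427)]

print(span_length(*pfam_dde_tnp_4))                              # 150
print([overlaps(pfam_dde_tnp_4, r) for r in disorder_regions])   # [False, False]
```

Under that assumption the domain spans 150 residues and lies entirely between the two disordered segments.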

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy2G002290.1  CsGy2G002290.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0035098      ESC/E(Z) complex
cellular_component  GO:0035102      PRC1 complex
molecular_function  GO:0003682      chromatin binding
molecular_function  GO:0046872      metal ion binding