Chrysanthemum transcriptome database

GenBank BLAST output for UN67737


BLASTX 7.6.2

Query= UN67737 /QuerySize=975
        (974 letters)

Database: GenBank nr;
          20,571,509 sequences; 7,061,663,739 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|255551795|ref|XP_002516943.1| conserved hypothetical protein ...    152   1e-034
gi|356519956|ref|XP_003528634.1| PREDICTED: uncharacterized prot...     93   7e-017
gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT...     92   2e-016
gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]               88   3e-015
gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsi...     87   4e-015
gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabid...     78   3e-012
gi|357478805|ref|XP_003609688.1| hypothetical protein MTR_4g1200...     73   9e-011
gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis...     71   4e-010
gi|356564599|ref|XP_003550539.1| PREDICTED: uncharacterized prot...     64   6e-008

>gi|255551795|ref|XP_002516943.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 450

 Score =  152 bits (383), Expect = 1e-034
 Identities = 85/171 (49%), Positives = 105/171 (61%), Gaps = 15/171 (8%)
 Frame = -2

Query: 553 GSQRSKENEMIIVEENDSP------PQPSFDRKVXXXXXXXXXXXXXXXDFFERISTGFG 392
           G+  SK+++++ VE++DSP         SF+RKV               DFFERISTGFG
Sbjct: 256 GTSLSKKSDIVAVEDDDSPNSQATASASSFERKVSRSRSVGCGSRSFSGDFFERISTGFG 315

Query: 391 DCTLRRVESNREGK-KGPNLRAVGHDGACMKERVRCGGLFSGFMITXXXXXXXXXXYWVA 215
           DCTLRRVES REGK KGP   A  H    MKERV+CGG+F GFMIT          YWV+
Sbjct: 316 DCTLRRVESQREGKPKGPG--AASH----MKERVKCGGIFGGFMITSSSSSSSSSSYWVS 369

Query: 214 SNPND-VIGGNSMNGKMMTSTGVGMTSAHGRSKSWGWALASPMRAFSKTST 65
           S+  +  + G S +G +  + G G   AHGRS+SWGWA ASPMRAFSK S+
Sbjct: 370 SSAEEHHMNGKSTHG-VAAAAGAGGPLAHGRSRSWGWAFASPMRAFSKPSS 419

>gi|356519956|ref|XP_003528634.1| PREDICTED: uncharacterized protein
        LOC100802982 [Glycine max]

          Length = 420

 Score =  93 bits (230), Expect = 7e-017
 Identities = 43/89 (48%), Positives = 55/89 (61%), Gaps = 5/89 (5%)
 Frame = -2

Query: 322 HDGACMKERVRCGGLFSGFMITXXXXXXXXXXYWVASNPNDVIGGNSMNGKMMTSTGVGM 143
           H   CMKERVRCGGLFSGFM+T          YWV+S+ +D     ++NGK  T     +
Sbjct: 307 HHHHCMKERVRCGGLFSGFMMTSSSSSSSSSSYWVSSSADDAAAAAAVNGKSAT-----V 361

Query: 142 TSAHGRSKSWGWALASPMRAFSKTSTTKK 56
             +H R +SWGWA ASPMRAFS   ++K+
Sbjct: 362 ALSHNRGRSWGWAFASPMRAFSGKPSSKE 390


 Score =  63 bits (152), Expect = 7e-008
 Identities = 33/73 (45%), Positives = 42/73 (57%), Gaps = 6/73 (8%)
 Frame = -2

Query: 550 SQRSKENEMIIVEENDSPPQ------PSFDRKVXXXXXXXXXXXXXXXDFFERISTGFGD 389
           S  S + ++++ ++N + P        SF+RKV               DFFERISTGFGD
Sbjct: 216 SGSSLKTDIVVEQDNTNSPNTASASASSFERKVSRSRSVGCGSRSFSGDFFERISTGFGD 275

Query: 388 CTLRRVESNREGK 350
           CTLRRVES REGK
Sbjct: 276 CTLRRVESQREGK 288

>gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876
        [Arabidopsis lyrata subsp. lyrata]

          Length = 384

 Score =  92 bits (226), Expect = 2e-016
 Identities = 50/127 (39%), Positives = 66/127 (51%), Gaps = 4/127 (3%)
 Frame = -2

Query: 580 SGISSNMLSGSQRSKENEMIIVEENDSP---PQPSFDRKVXXXXXXXXXXXXXXXDFFER 410
           S +S  ++ GS  ++    +IVEE+ SP     PS +RKV               DFFER
Sbjct: 191 SSMSKRVVGGSNSNRNGIDVIVEEDGSPNIEVTPS-ERKVSRSRSVGCGSRSFSGDFFER 249

Query: 409 ISTGFGDCTLRRVESNREGKKGPNLRAVGHDGACMKERVRCGGLFSGFMITXXXXXXXXX 230
           I+ GFGDCTLRRVES REG      +   +    ++E VRCGG+F GFMI          
Sbjct: 250 ITNGFGDCTLRRVESQREGNNNKGNKVSSNPSNGVREMVRCGGIFGGFMIMTSSSSSSSS 309

Query: 229 XYWVASN 209
             WV+S+
Sbjct: 310 SSWVSSS 316

>gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]

          Length = 388

 Score =  88 bits (216), Expect = 3e-015
 Identities = 48/118 (40%), Positives = 61/118 (51%), Gaps = 4/118 (3%)
 Frame = -2

Query: 553 GSQRSKENEMIIVEENDSP---PQPSFDRKVXXXXXXXXXXXXXXXDFFERISTGFGDCT 383
           GS  ++    +IVEE+ SP     PS +RKV               DFFERI+ GFGDCT
Sbjct: 201 GSSSNRNGIDVIVEEDGSPNIEVTPS-ERKVSRSRSVGCGSRSFSGDFFERITNGFGDCT 259

Query: 382 LRRVESNREGKKGPNLRAVGHDGACMKERVRCGGLFSGFMITXXXXXXXXXXYWVASN 209
           LRRVES REG      +   +    ++E VRCGG+F GFMI            WV+S+
Sbjct: 260 LRRVESQREGNNNKGNKVSSNSSNGVREMVRCGGIFGGFMIMTSSSSSSSSSSWVSSS 317

>gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 396

 Score =  87 bits (215), Expect = 4e-015
 Identities = 48/118 (40%), Positives = 61/118 (51%), Gaps = 4/118 (3%)
 Frame = -2

Query: 553 GSQRSKENEMIIVEENDSP---PQPSFDRKVXXXXXXXXXXXXXXXDFFERISTGFGDCT 383
           GS  ++    +IVEE+ SP     PS +RKV               DFFERI+ GFGDCT
Sbjct: 209 GSSSNRNGIDVIVEEDGSPNIEVTPS-ERKVSRSRSVGCGSRSFSGDFFERITNGFGDCT 267

Query: 382 LRRVESNREGKKGPNLRAVGHDGACMKERVRCGGLFSGFMITXXXXXXXXXXYWVASN 209
           LRRVES REG      +   +    ++E VRCGG+F GFMI            WV+S+
Sbjct: 268 LRRVESQREGNNNKGNKVSSNPSNGVREMVRCGGIFGGFMIMTSSSSSSSSSSWVSSS 325

>gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabidopsis
        thaliana]

          Length = 214

 Score =  78 bits (190), Expect = 3e-012
 Identities = 41/77 (53%), Positives = 47/77 (61%)
 Frame = +3

Query: 258 VIINPLNKPPHLTLSFMHAPSCPTARKLGPFFPSRLDSTRRSVQSPNPVEIRSKKSPEKL 437
           +IINP   PPHLT+S + + + P    L     S  DS     QSPNP+EI SKKSPEKL
Sbjct:   1 MIINPPKMPPHLTISLIASAASPPPPLLMTLVASLCDSILLKAQSPNPLEILSKKSPEKL 60

Query: 438 RLPHPTDLDRETFRSND 488
            LP PTDLD  TF  ND
Sbjct:  61 LLPQPTDLDLNTFLPND 77

>gi|357478805|ref|XP_003609688.1| hypothetical protein MTR_4g120050 [Medicago
        truncatula]

          Length = 445

 Score =  73 bits (177), Expect = 9e-011
 Identities = 38/75 (50%), Positives = 45/75 (60%), Gaps = 6/75 (8%)
 Frame = -2

Query: 556 SGSQRSKENEMIIVEENDSPPQP------SFDRKVXXXXXXXXXXXXXXXDFFERISTGF 395
           SGS   ++N+++IVEE +    P      SF+RKV               DFFERISTGF
Sbjct: 228 SGSSLGRKNDIVIVEEEEEDNSPNSNNTGSFERKVSRSRSVGCGSRSFSGDFFERISTGF 287

Query: 394 GDCTLRRVESNREGK 350
           GDCTLRRVES REGK
Sbjct: 288 GDCTLRRVESQREGK 302

>gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis thaliana]

          Length = 207

 Score =  71 bits (172), Expect = 4e-010
 Identities = 37/69 (53%), Positives = 42/69 (60%)
 Frame = +3

Query: 282 PPHLTLSFMHAPSCPTARKLGPFFPSRLDSTRRSVQSPNPVEIRSKKSPEKLRLPHPTDL 461
           PPHLT+S + + + P    L     S  DS     QSPNP+EI SKKSPEKL LP PTDL
Sbjct:   2 PPHLTISLIASAASPPPPLLMTLVASLCDSILLKAQSPNPLEILSKKSPEKLLLPQPTDL 61

Query: 462 DRETFRSND 488
           D  TF  ND
Sbjct:  62 DLNTFLPND 70

>gi|356564599|ref|XP_003550539.1| PREDICTED: uncharacterized protein
        LOC100798085 [Glycine max]

          Length = 444

 Score =  64 bits (153), Expect = 6e-008
 Identities = 33/73 (45%), Positives = 43/73 (58%), Gaps = 6/73 (8%)
 Frame = -2

Query: 550 SQRSKENEMIIVEENDSPPQP------SFDRKVXXXXXXXXXXXXXXXDFFERISTGFGD 389
           S  S + ++++ ++N++   P      SF+RKV               DFFERISTGFGD
Sbjct: 243 SGSSLKTDIVVEQDNNNSNSPNTASASSFERKVSRSKSVGCGSRSFSGDFFERISTGFGD 302

Query: 388 CTLRRVESNREGK 350
           CTLRRVES REGK
Sbjct: 303 CTLRRVESQREGK 315

  Database: GenBank nr
    Posted date:  Thu Sep 27 19:07:00 2012
  Number of letters in database: 7,061,663,739
  Number of sequences in database:  20,571,509

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,959,113,869,582
Number of Sequences: 20571509
Number of Extensions: 3959113869582
Number of Successful Extensions: 1102425621
Number of sequences better than 0.0: 0