Skip to content
Snippets Groups Projects
  1. May 16, 2018
  2. Apr 04, 2018
  3. Aug 05, 2016
  4. May 23, 2016
  5. May 20, 2016
  6. Apr 05, 2016
    • Steinar H. Gunderson's avatar
      Make heuristic match skipping more aggressive. · d53de187
      Steinar H. Gunderson authored
      This causes compression to be much faster on incompressible inputs
      (such as the jpeg and pdf tests), and is neutral or even positive on the other
      tests. The test set shows only microscopic density regressions; I attempted to
      construct a worst-case test set containing ~1500 different cases of mixed
      plaintext + /dev/urandom, and even those seemed to be only 0.38 percentage
      points less dense on average (the single worst case was 87.8% -> 89.0%), which
      we can live with given that this is already an edge case.
      
      The original idea is by Klaus Post; I only tweaked the implementation.
      Ironically, the new implementation is almost more in line with the
      comment that was there, so I've left that largely alone, albeit
      with a small modification.
      
      Microbenchmark results (opt mode, 64-bit, static linking):
      
      Ivy Bridge:
      
      Benchmark                 Base (ns)  New (ns)                                Improvement
      ----------------------------------------------------------------------------------------
      BM_ZFlat/0                   120284    115480  847.0MB/s  html (22.31 %)        +4.2%
      BM_ZFlat/1                  1527911   1522242  440.7MB/s  urls (47.78 %)        +0.4%
      BM_ZFlat/2                    17591     10582  10.9GB/s  jpg (99.95 %)         +66.2%
      BM_ZFlat/3                      323       322  593.3MB/s  jpg_200 (73.00 %)     +0.3%
      BM_ZFlat/4                    53691     14063  6.8GB/s  pdf (83.30 %)         +281.8%
      BM_ZFlat/5                   495442    492347  794.8MB/s  html4 (22.52 %)       +0.6%
      BM_ZFlat/6                   473523    473622  306.7MB/s  txt1 (57.88 %)        -0.0%
      BM_ZFlat/7                   421406    420120  284.5MB/s  txt2 (61.91 %)        +0.3%
      BM_ZFlat/8                  1265632   1270538  320.8MB/s  txt3 (54.99 %)        -0.4%
      BM_ZFlat/9                  1742688   1737894  264.8MB/s  txt4 (66.26 %)        +0.3%
      BM_ZFlat/10                  107950    103404  1095.1MB/s  pb (19.68 %)         +4.4%
      BM_ZFlat/11                  372660    371818  473.5MB/s  gaviota (37.72 %)     +0.2%
      BM_ZFlat/12                   53239     49528  474.4MB/s  cp (48.12 %)          +7.5%
      BM_ZFlat/13                   18940     17349  613.9MB/s  c (42.47 %)           +9.2%
      BM_ZFlat/14                    5155      5075  700.3MB/s  lsp (48.37 %)         +1.6%
      BM_ZFlat/15                 1474757   1474471  667.2MB/s  xls (41.23 %)         +0.0%
      BM_ZFlat/16                     363       362  528.0MB/s  xls_200 (78.00 %)     +0.3%
      BM_ZFlat/17                  453849    456931  1073.2MB/s  bin (18.11 %)        -0.7%
      BM_ZFlat/18                      90        87  2.1GB/s  bin_200 (7.50 %)        +3.4%
      BM_ZFlat/19                   82163     80498  453.7MB/s  sum (48.96 %)         +2.1%
      BM_ZFlat/20                    7174      7124  566.7MB/s  man (59.21 %)         +0.7%
      Sum of all benchmarks       8694831   8623857                                   +0.8%
      
      Sandy Bridge:
      
      Benchmark                 Base (ns)  New (ns)                                Improvement
      ----------------------------------------------------------------------------------------
      BM_ZFlat/0                   117426    112649  868.2MB/s  html (22.31 %)        +4.2%
      BM_ZFlat/1                  1517095   1498522  447.5MB/s  urls (47.78 %)        +1.2%
      BM_ZFlat/2                    18601     10649  10.8GB/s  jpg (99.95 %)         +74.7%
      BM_ZFlat/3                      359       356  536.0MB/s  jpg_200 (73.00 %)     +0.8%
      BM_ZFlat/4                    60249     13832  6.9GB/s  pdf (83.30 %)         +335.6%
      BM_ZFlat/5                   481246    475571  822.7MB/s  html4 (22.52 %)       +1.2%
      BM_ZFlat/6                   460541    455693  318.8MB/s  txt1 (57.88 %)        +1.1%
      BM_ZFlat/7                   407751    404147  295.8MB/s  txt2 (61.91 %)        +0.9%
      BM_ZFlat/8                  1228255   1222519  333.4MB/s  txt3 (54.99 %)        +0.5%
      BM_ZFlat/9                  1678299   1666379  276.2MB/s  txt4 (66.26 %)        +0.7%
      BM_ZFlat/10                  106499    101715  1113.4MB/s  pb (19.68 %)         +4.7%
      BM_ZFlat/11                  361913    360222  488.7MB/s  gaviota (37.72 %)     +0.5%
      BM_ZFlat/12                   53137     49618  473.6MB/s  cp (48.12 %)          +7.1%
      BM_ZFlat/13                   18801     17812  597.8MB/s  c (42.47 %)           +5.6%
      BM_ZFlat/14                    5394      5383  660.2MB/s  lsp (48.37 %)         +0.2%
      BM_ZFlat/15                 1435411   1432870  686.4MB/s  xls (41.23 %)         +0.2%
      BM_ZFlat/16                     389       395  483.3MB/s  xls_200 (78.00 %)     -1.5%
      BM_ZFlat/17                  447255    445510  1100.4MB/s  bin (18.11 %)        +0.4%
      BM_ZFlat/18                      86        86  2.2GB/s  bin_200 (7.50 %)        +0.0%
      BM_ZFlat/19                   82555     79512  459.3MB/s  sum (48.96 %)         +3.8%
      BM_ZFlat/20                    7527      7553  534.5MB/s  man (59.21 %)         -0.3%
      Sum of all benchmarks       8488789   8360993                                   +1.5%
      
      Haswell:
      
      Benchmark                 Base (ns)  New (ns)                                Improvement
      ----------------------------------------------------------------------------------------
      BM_ZFlat/0                   107512    105621  925.6MB/s  html (22.31 %)        +1.8%
      BM_ZFlat/1                  1344306   1332479  503.1MB/s  urls (47.78 %)        +0.9%
      BM_ZFlat/2                    14752      9471  12.1GB/s  jpg (99.95 %)         +55.8%
      BM_ZFlat/3                      287       275  694.0MB/s  jpg_200 (73.00 %)     +4.4%
      BM_ZFlat/4                    48810     12263  7.8GB/s  pdf (83.30 %)         +298.0%
      BM_ZFlat/5                   443013    442064  884.6MB/s  html4 (22.52 %)       +0.2%
      BM_ZFlat/6                   429239    432124  336.0MB/s  txt1 (57.88 %)        -0.7%
      BM_ZFlat/7                   381765    383681  311.5MB/s  txt2 (61.91 %)        -0.5%
      BM_ZFlat/8                  1136667   1154304  353.0MB/s  txt3 (54.99 %)        -1.5%
      BM_ZFlat/9                  1579925   1592431  288.9MB/s  txt4 (66.26 %)        -0.8%
      BM_ZFlat/10                   98345     92411  1.2GB/s  pb (19.68 %)            +6.4%
      BM_ZFlat/11                  340397    340466  516.8MB/s  gaviota (37.72 %)     -0.0%
      BM_ZFlat/12                   47076     43536  539.5MB/s  cp (48.12 %)          +8.1%
      BM_ZFlat/13                   16680     15637  680.8MB/s  c (42.47 %)           +6.7%
      BM_ZFlat/14                    4616      4539  782.6MB/s  lsp (48.37 %)         +1.7%
      BM_ZFlat/15                 1331231   1334094  736.9MB/s  xls (41.23 %)         -0.2%
      BM_ZFlat/16                     326       322  593.5MB/s  xls_200 (78.00 %)     +1.2%
      BM_ZFlat/17                  404383    400326  1.2GB/s  bin (18.11 %)           +1.0%
      BM_ZFlat/18                      69        69  2.7GB/s  bin_200 (7.50 %)        +0.0%
      BM_ZFlat/19                   74771     71348  511.7MB/s  sum (48.96 %)         +4.8%
      BM_ZFlat/20                    6461      6383  632.2MB/s  man (59.21 %)         +1.2%
      Sum of all benchmarks       7810631   7773844                                   +0.5%
      
      I've done a quick test that there are no performance regressions on external
      GCC (4.9.2, Debian, Haswell, 64-bit), too.
      d53de187
  7. Mar 10, 2016
  8. Jan 08, 2016
  9. Jan 04, 2016
  10. Oct 28, 2015
  11. Aug 26, 2015
  12. Aug 19, 2015
  13. Aug 03, 2015
  14. Jul 31, 2015
  15. Jul 07, 2015
    • Steinar H. Gunderson's avatar
      Release Snappy 1.1.3; getting the new Uncompress variant in a release is nice, · efb39e81
      Steinar H. Gunderson authored
      and it's also good to finally get an official release out after the migration
      to GitHub.
      
      The GitHub releases are basically done by tagging a commit and then uploading
      the .tar.gz file generated by make dist as a binary asset; GitHub will add
      all files on the tagged commit on top of the tarball and recompress, but since
      we don't have any nodist_* files in configure.ac, this works fine for us.
      (As far as I can see, this behavior of GitHub--uncompressing the .tar.gz,
      and the behavior of silently ignoring files in it that are also in the git
      repository--is undocumented, but also seems to be used in some official
      screenshots, so I guess we can rely on it.)
      
      A=sesse
      R=jeff
      1.1.3
      efb39e81
  16. Jul 06, 2015
    • Steinar H. Gunderson's avatar
      Initialized members of SnappyArrayWriter and SnappyDecompressionValidator. · eb66d817
      Steinar H. Gunderson authored
      These members were almost surely initialized before use by other member
      functions, but Coverity was warning about this. Eliminating these warnings
      minimizes clutter in that report and the likelihood of overlooking a real bug.
      
      A=cmumford
      R=jeff
      eb66d817
    • Steinar H. Gunderson's avatar
      Add support for Uncompress(source, sink). Various changes to allow · b2312c4c
      Steinar H. Gunderson authored
      Uncompress(source, sink) to get the same performance as the different
      variants of Uncompress to Cord/DataBuffer/String/FlatBuffer.
      
      Changes to efficiently support Uncompress(source, sink)
      --------
      
      a) For strings - we add support to StringByteSink to do GetAppendBuffer so we
         can write to it without copying.
      b) For flat array buffers, we do GetAppendBuffer and see if we can get a full buffer.
      
      With the above changes we get performance with ByteSource/ByteSink
      that is	very close to directly using flat arrays and strings.
      
      We add various benchmark cases to demonstrate that.
      
      Orthogonal change
      ------------------
      
      Add support for TryFastAppend() for SnappyScatteredWriter.
      
      Benchmark results are below
      
      CPU: Intel Core2 dL1:32KB dL2:4096KB
      Benchmark              Time(ns)    CPU(ns) Iterations
      -----------------------------------------------------
      BM_UFlat/0               109065     108996       6410 896.0MB/s  html
      BM_UFlat/1              1012175    1012343        691 661.4MB/s  urls
      BM_UFlat/2                26775      26771      26149 4.4GB/s  jpg
      BM_UFlat/3                48947      48940      14363 1.8GB/s  pdf
      BM_UFlat/4               441029     440835       1589 886.1MB/s  html4
      BM_UFlat/5                39861      39880      17823 588.3MB/s  cp
      BM_UFlat/6                18315      18300      38126 581.1MB/s  c
      BM_UFlat/7                 5254       5254     100000 675.4MB/s  lsp
      BM_UFlat/8              1568060    1567376        447 626.6MB/s  xls
      BM_UFlat/9               337512     337734       2073 429.5MB/s  txt1
      BM_UFlat/10              287269     287054       2434 415.9MB/s  txt2
      BM_UFlat/11              890098     890219        787 457.2MB/s  txt3
      BM_UFlat/12             1186593    1186863        590 387.2MB/s  txt4
      BM_UFlat/13              573927     573318       1000 853.7MB/s  bin
      BM_UFlat/14               64250      64294      10000 567.2MB/s  sum
      BM_UFlat/15                7301       7300      96153 552.2MB/s  man
      BM_UFlat/16              109617     109636       6375 1031.5MB/s  pb
      BM_UFlat/17              364438     364497       1921 482.3MB/s  gaviota
      BM_UFlatSink/0           108518     108465       6450 900.4MB/s  html
      BM_UFlatSink/1           991952     991997        705 675.0MB/s  urls
      BM_UFlatSink/2            26815      26798      26065 4.4GB/s  jpg
      BM_UFlatSink/3            49127      49122      14255 1.8GB/s  pdf
      BM_UFlatSink/4           436674     436731       1604 894.4MB/s  html4
      BM_UFlatSink/5            39738      39733      17345 590.5MB/s  cp
      BM_UFlatSink/6            18413      18416      37962 577.4MB/s  c
      BM_UFlatSink/7             5677       5676     100000 625.2MB/s  lsp
      BM_UFlatSink/8          1552175    1551026        451 633.2MB/s  xls
      BM_UFlatSink/9           338526     338489       2065 428.5MB/s  txt1
      BM_UFlatSink/10          289387     289307       2420 412.6MB/s  txt2
      BM_UFlatSink/11          893803     893706        783 455.4MB/s  txt3
      BM_UFlatSink/12         1195919    1195459        586 384.4MB/s  txt4
      BM_UFlatSink/13          559637     559779       1000 874.3MB/s  bin
      BM_UFlatSink/14           65073      65094      10000 560.2MB/s  sum
      BM_UFlatSink/15            7618       7614      92823 529.5MB/s  man
      BM_UFlatSink/16          110085     110121       6352 1027.0MB/s  pb
      BM_UFlatSink/17          369196     368915       1896 476.5MB/s  gaviota
      BM_UValidate/0            46954      46957      14899 2.0GB/s  html
      BM_UValidate/1           500621     500868       1000 1.3GB/s  urls
      BM_UValidate/2              283        283    2481447 417.2GB/s  jpg
      BM_UValidate/3            16230      16228      43137 5.4GB/s  pdf
      BM_UValidate/4           189129     189193       3701 2.0GB/s  html4
      
      A=uday
      R=sanjay
      b2312c4c
  17. Jun 22, 2015
    • Steinar H. Gunderson's avatar
      Changes to eliminate compiler warnings on MSVC · b2ad9600
      Steinar H. Gunderson authored
      This code was not compiling under Visual Studio 2013 with warnings being treated
      as errors. Specifically:
      
      1. Changed int -> size_t to eliminate signed/unsigned mismatch warning.
      2. Added some missing return values to functions.
      3. Inserting character instead of integer literals into strings to avoid type
         conversions.
      
      A=cmumford
      R=jeff
      b2ad9600
    • Steinar H. Gunderson's avatar
      Fixed unit tests to compile under MSVC. · e7a897e1
      Steinar H. Gunderson authored
      1. Including config.h in test.
      2. Including windows.h before zippy-test.h.
      3. Removed definition of WIN32_LEAN_AND_MEAN. This caused problems in
         build environments that define WIN32_LEAN_AND_MEAN as our
         definition didn't check for prior existence. This constant is old
         and no longer needed anyhow.
      4. Disable MSVC warning 4722 since ~LogMessageCrash() never returns.
      
      A=cmumford
      R=jeff
      e7a897e1
    • Steinar H. Gunderson's avatar
      Change a few branch annotations that profiling found to be wrong. · 86eb8b15
      Steinar H. Gunderson authored
      Overall performance is neutral or slightly positive.
      
      Westmere (64-bit, opt):
      
      Benchmark               Base (ns)  New (ns)                                Improvement
      --------------------------------------------------------------------------------------
      BM_UFlat/0                  73798     71464  1.3GB/s  html                    +3.3%
      BM_UFlat/1                 715223    704318  953.5MB/s  urls                  +1.5%
      BM_UFlat/2                   8137      8871  13.0GB/s  jpg                    -8.3%
      BM_UFlat/3                    200       204  935.5MB/s  jpg_200               -2.0%
      BM_UFlat/4                  21627     21281  4.5GB/s  pdf                     +1.6%
      BM_UFlat/5                 302806    290350  1.3GB/s  html4                   +4.3%
      BM_UFlat/6                 218920    219017  664.1MB/s  txt1                  -0.0%
      BM_UFlat/7                 190437    191212  626.1MB/s  txt2                  -0.4%
      BM_UFlat/8                 584192    580484  703.4MB/s  txt3                  +0.6%
      BM_UFlat/9                 776537    779055  591.6MB/s  txt4                  -0.3%
      BM_UFlat/10                 76056     72606  1.5GB/s  pb                      +4.8%
      BM_UFlat/11                235962    239043  737.4MB/s  gaviota               -1.3%
      BM_UFlat/12                 28049     28000  840.1MB/s  cp                    +0.2%
      BM_UFlat/13                 12225     12021  886.9MB/s  c                     +1.7%
      BM_UFlat/14                  3362      3544  1004.0MB/s  lsp                  -5.1%
      BM_UFlat/15                937015    939206  1048.9MB/s  xls                  -0.2%
      BM_UFlat/16                   236       233  823.1MB/s  xls_200               +1.3%
      BM_UFlat/17                373170    361947  1.3GB/s  bin                     +3.1%
      BM_UFlat/18                   264       264  725.5MB/s  bin_200               +0.0%
      BM_UFlat/19                 42834     43577  839.2MB/s  sum                   -1.7%
      BM_UFlat/20                  4770      4736  853.6MB/s  man                   +0.7%
      BM_UValidate/0              39671     39944  2.4GB/s  html                    -0.7%
      BM_UValidate/1             443391    443391  1.5GB/s  urls                    +0.0%
      BM_UValidate/2                163       163  703.3GB/s  jpg                   +0.0%
      BM_UValidate/3                113       112  1.7GB/s  jpg_200                 +0.9%
      BM_UValidate/4               7555      7608  12.6GB/s  pdf                    -0.7%
      BM_ZFlat/0                 157616    157568  621.5MB/s  html (22.31 %)        +0.0%
      BM_ZFlat/1                1997290   2014486  333.4MB/s  urls (47.77 %)        -0.9%
      BM_ZFlat/2                  23035     22237  5.2GB/s  jpg (99.95 %)           +3.6%
      BM_ZFlat/3                    539       540  354.5MB/s  jpg_200 (73.00 %)     -0.2%
      BM_ZFlat/4                  80709     81369  1.2GB/s  pdf (81.85 %)           -0.8%
      BM_ZFlat/5                 639059    639220  613.0MB/s  html4 (22.51 %)       -0.0%
      BM_ZFlat/6                 577203    583370  249.3MB/s  txt1 (57.87 %)        -1.1%
      BM_ZFlat/7                 510887    516094  232.0MB/s  txt2 (61.93 %)        -1.0%
      BM_ZFlat/8                1535843   1556973  262.2MB/s  txt3 (54.92 %)        -1.4%
      BM_ZFlat/9                2070068   2102380  219.3MB/s  txt4 (66.22 %)        -1.5%
      BM_ZFlat/10                152396    152148  745.5MB/s  pb (19.64 %)          +0.2%
      BM_ZFlat/11                447367    445859  395.4MB/s  gaviota (37.72 %)     +0.3%
      BM_ZFlat/12                 76375     76797  306.3MB/s  cp (48.12 %)          -0.5%
      BM_ZFlat/13                 31518     31987  333.3MB/s  c (42.40 %)           -1.5%
      BM_ZFlat/14                 10598     10827  328.6MB/s  lsp (48.37 %)         -2.1%
      BM_ZFlat/15               1782243   1802728  546.5MB/s  xls (41.23 %)         -1.1%
      BM_ZFlat/16                   526       539  355.0MB/s  xls_200 (78.00 %)     -2.4%
      BM_ZFlat/17                598141    597311  822.1MB/s  bin (18.11 %)         +0.1%
      BM_ZFlat/18                   121       120  1.6GB/s  bin_200 (7.50 %)        +0.8%
      BM_ZFlat/19                109981    112173  326.0MB/s  sum (48.96 %)         -2.0%
      BM_ZFlat/20                 14355     14575  277.4MB/s  man (59.36 %)         -1.5%
      Sum of all benchmarks    33882722  33879325                                   +0.0%
      
      Sandy Bridge (64-bit, opt):
      
      Benchmark               Base (ns)  New (ns)                                Improvement
      --------------------------------------------------------------------------------------
      BM_UFlat/0                  43764     41600  2.3GB/s  html                    +5.2%
      BM_UFlat/1                 517990    507058  1.3GB/s  urls                    +2.2%
      BM_UFlat/2                   6625      5529  20.8GB/s  jpg                   +19.8%
      BM_UFlat/3                    154       155  1.2GB/s  jpg_200                 -0.6%
      BM_UFlat/4                  12795     11747  8.1GB/s  pdf                     +8.9%
      BM_UFlat/5                 200335    193413  2.0GB/s  html4                   +3.6%
      BM_UFlat/6                 156574    156426  929.2MB/s  txt1                  +0.1%
      BM_UFlat/7                 137574    137464  870.4MB/s  txt2                  +0.1%
      BM_UFlat/8                 422551    421603  967.4MB/s  txt3                  +0.2%
      BM_UFlat/9                 577749    578985  795.6MB/s  txt4                  -0.2%
      BM_UFlat/10                 42329     39362  2.8GB/s  pb                      +7.5%
      BM_UFlat/11                170615    169751  1037.9MB/s  gaviota              +0.5%
      BM_UFlat/12                 12800     12719  1.8GB/s  cp                      +0.6%
      BM_UFlat/13                  6585      6579  1.6GB/s  c                       +0.1%
      BM_UFlat/14                  2066      2044  1.7GB/s  lsp                     +1.1%
      BM_UFlat/15                750861    746911  1.3GB/s  xls                     +0.5%
      BM_UFlat/16                   188       192  996.0MB/s  xls_200               -2.1%
      BM_UFlat/17                271622    264333  1.8GB/s  bin                     +2.8%
      BM_UFlat/18                   208       207  923.6MB/s  bin_200               +0.5%
      BM_UFlat/19                 24667     24845  1.4GB/s  sum                     -0.7%
      BM_UFlat/20                  2663      2662  1.5GB/s  man                     +0.0%
      BM_ZFlat/0                 115173    115624  846.5MB/s  html (22.31 %)        -0.4%
      BM_ZFlat/1                1530331   1537769  436.5MB/s  urls (47.77 %)        -0.5%
      BM_ZFlat/2                  17503     17013  6.8GB/s  jpg (99.95 %)           +2.9%
      BM_ZFlat/3                    385       385  496.3MB/s  jpg_200 (73.00 %)     +0.0%
      BM_ZFlat/4                  61753     61540  1.6GB/s  pdf (81.85 %)           +0.3%
      BM_ZFlat/5                 484806    483356  810.1MB/s  html4 (22.51 %)       +0.3%
      BM_ZFlat/6                 464143    467609  310.9MB/s  txt1 (57.87 %)        -0.7%
      BM_ZFlat/7                 410315    413319  289.5MB/s  txt2 (61.93 %)        -0.7%
      BM_ZFlat/8                1244082   1249381  326.5MB/s  txt3 (54.92 %)        -0.4%
      BM_ZFlat/9                1696914   1709685  269.4MB/s  txt4 (66.22 %)        -0.7%
      BM_ZFlat/10                104148    103372  1096.7MB/s  pb (19.64 %)         +0.8%
      BM_ZFlat/11                363522    359722  489.8MB/s  gaviota (37.72 %)     +1.1%
      BM_ZFlat/12                 47021     50095  469.3MB/s  cp (48.12 %)          -6.1%
      BM_ZFlat/13                 16888     16985  627.4MB/s  c (42.40 %)           -0.6%
      BM_ZFlat/14                  5496      5469  650.3MB/s  lsp (48.37 %)         +0.5%
      BM_ZFlat/15               1460713   1448760  679.5MB/s  xls (41.23 %)         +0.8%
      BM_ZFlat/16                   387       393  486.8MB/s  xls_200 (78.00 %)     -1.5%
      BM_ZFlat/17                457654    451462  1086.6MB/s  bin (18.11 %)        +1.4%
      BM_ZFlat/18                    97        87  2.1GB/s  bin_200 (7.50 %)       +11.5%
      BM_ZFlat/19                 77904     80924  451.7MB/s  sum (48.96 %)         -3.7%
      BM_ZFlat/20                  7648      7663  527.1MB/s  man (59.36 %)         -0.2%
      Sum of all benchmarks    25493635  25482069                                   +0.0%
      
      A=dehao
      R=sesse
      86eb8b15
    • Steinar H. Gunderson's avatar
      Sync with various Google-internal changes. · 11ccdfb8
      Steinar H. Gunderson authored
      Should not mean much for the open-source version.
      11ccdfb8
    • Steinar H. Gunderson's avatar
      Change some internal path names. · 22acaf43
      Steinar H. Gunderson authored
      This is mostly to sync up with some changes from Google's internal
      repositories; it does not affect the open-source distribution in itself.
      22acaf43
  18. Feb 28, 2014
  19. Feb 19, 2014
  20. Oct 25, 2013
  21. Oct 15, 2013
  22. Aug 13, 2013
  23. Jul 29, 2013
    • snappy.mirrorbot@gmail.com's avatar
      When we compare the number of bytes produced with the offset for a · 7c3c01df
      snappy.mirrorbot@gmail.com authored
      backreference, make the signedness of the bytes produced clear,
      by sticking it into a size_t. This avoids a signed/unsigned compare
      warning from MSVC (public issue 71), and also is slightly clearer.
      
      Since the line is now so long the explanatory comment about the -1u
      trick has to go somewhere else anyway, I used the opportunity to
      explain it in slightly more detail.
      
      This is a purely stylistic change; the emitted assembler from GCC
      is identical.
      
      R=jeff
      
      
      git-svn-id: https://snappy.googlecode.com/svn/trunk@79 03e5f5b5-db94-4691-08a0-1a8bf15f6143
      7c3c01df
  24. Jun 30, 2013
    • snappy.mirrorbot@gmail.com's avatar
      In the fast path for decompressing literals, instead of checking · 2f0aaf86
      snappy.mirrorbot@gmail.com authored
      whether there's 16 bytes free and then checking right afterwards
      (when having subtracted the literal size) that there are now 
      5 bytes free, just check once for 21 bytes. This skips a compare
      and a branch; although it is easily predictable, it is still
      a few cycles on a fast path that we would like to get rid of.
      
      Benchmarking this yields very confusing results. On open-source
      GCC 4.8.1 on Haswell, we get exactly the expected results; the
      benchmarks where we hit the fast path for literals (in particular
      the two HTML benchmarks and the protobuf benchmark) give very nice
      speedups, and the others are not really affected.
      
      However, benchmarks with Google's GCC branch on other hardware
      is much less clear. It seems that we have a weak loss in some cases
      (and the win for the “typical” win cases are not nearly as clear),
      but that it depends on microarchitecture and plain luck in how we run
      the benchmark. Looking at the generated assembler, it seems that
      the removal of the if causes other large-scale changes in how the
      function is laid out, which makes it likely that this is just bad luck.
      
      Thus, we should keep this change, even though its exact current impact is
      unclear; it's a sensible change per se, and dropping it on the basis of
      microoptimization for a given compiler (or even branch of a compiler)
      would seem like a bad strategy in the long run.
      
      Microbenchmark results (all in 64-bit, opt mode):
      
        Nehalem, Google GCC:
      
        Benchmark                Base (ns)  New (ns)                       Improvement
        ------------------------------------------------------------------------------
        BM_UFlat/0                   76747     75591  1.3GB/s  html           +1.5%
        BM_UFlat/1                  765756    757040  886.3MB/s  urls         +1.2%
        BM_UFlat/2                   10867     10893  10.9GB/s  jpg           -0.2%
        BM_UFlat/3                     124       131  1.4GB/s  jpg_200        -5.3%
        BM_UFlat/4                   31663     31596  2.8GB/s  pdf            +0.2%
        BM_UFlat/5                  314162    308176  1.2GB/s  html4          +1.9%
        BM_UFlat/6                   29668     29746  790.6MB/s  cp           -0.3%
        BM_UFlat/7                   12958     13386  796.4MB/s  c            -3.2%
        BM_UFlat/8                    3596      3682  966.0MB/s  lsp          -2.3%
        BM_UFlat/9                 1019193   1033493  953.3MB/s  xls          -1.4%
        BM_UFlat/10                    239       247  775.3MB/s  xls_200      -3.2%
        BM_UFlat/11                 236411    240271  606.9MB/s  txt1         -1.6%
        BM_UFlat/12                 206639    209768  571.2MB/s  txt2         -1.5%
        BM_UFlat/13                 627803    635722  641.4MB/s  txt3         -1.2%
        BM_UFlat/14                 845932    857816  538.2MB/s  txt4         -1.4%
        BM_UFlat/15                 402107    391670  1.2GB/s  bin            +2.7%
        BM_UFlat/16                    283       279  683.6MB/s  bin_200      +1.4%
        BM_UFlat/17                  46070     46815  781.5MB/s  sum          -1.6%
        BM_UFlat/18                   5053      5163  782.0MB/s  man          -2.1%
        BM_UFlat/19                  79721     76581  1.4GB/s  pb             +4.1%
        BM_UFlat/20                 251158    252330  697.5MB/s  gaviota      -0.5%
        Sum of all benchmarks      4966150   4980396                          -0.3%
      
      
        Sandy Bridge, Google GCC:
        
        Benchmark                Base (ns)  New (ns)                       Improvement
        ------------------------------------------------------------------------------
        BM_UFlat/0                   42850     42182  2.3GB/s  html           +1.6%
        BM_UFlat/1                  525660    515816  1.3GB/s  urls           +1.9%
        BM_UFlat/2                    7173      7283  16.3GB/s  jpg           -1.5%
        BM_UFlat/3                      92        91  2.1GB/s  jpg_200        +1.1%
        BM_UFlat/4                   15147     14872  5.9GB/s  pdf            +1.8%
        BM_UFlat/5                  199936    192116  2.0GB/s  html4          +4.1%
        BM_UFlat/6                   12796     12443  1.8GB/s  cp             +2.8%
        BM_UFlat/7                    6588      6400  1.6GB/s  c              +2.9%
        BM_UFlat/8                    2010      1951  1.8GB/s  lsp            +3.0%
        BM_UFlat/9                  761124    763049  1.3GB/s  xls            -0.3%
        BM_UFlat/10                    186       189  1016.1MB/s  xls_200     -1.6%
        BM_UFlat/11                 159354    158460  918.6MB/s  txt1         +0.6%
        BM_UFlat/12                 139732    139950  856.1MB/s  txt2         -0.2%
        BM_UFlat/13                 429917    425027  961.7MB/s  txt3         +1.2%
        BM_UFlat/14                 585255    587324  785.8MB/s  txt4         -0.4%
        BM_UFlat/15                 276186    266173  1.8GB/s  bin            +3.8%
        BM_UFlat/16                    205       207  925.5MB/s  bin_200      -1.0%
        BM_UFlat/17                  24925     24935  1.4GB/s  sum            -0.0%
        BM_UFlat/18                   2632      2576  1.5GB/s  man            +2.2%
        BM_UFlat/19                  40546     39108  2.8GB/s  pb             +3.7%
        BM_UFlat/20                 175803    168209  1048.9MB/s  gaviota     +4.5%
        Sum of all benchmarks      3408117   3368361                          +1.2%
      
      
        Haswell, upstream GCC 4.8.1:
      
        Benchmark                Base (ns)  New (ns)                       Improvement
        ------------------------------------------------------------------------------
        BM_UFlat/0                   46308     40641  2.3GB/s  html          +13.9%
        BM_UFlat/1                  513385    514706  1.3GB/s  urls           -0.3%
        BM_UFlat/2                    6197      6151  19.2GB/s  jpg           +0.7%
        BM_UFlat/3                      61        61  3.0GB/s  jpg_200        +0.0%
        BM_UFlat/4                   13551     13429  6.5GB/s  pdf            +0.9%
        BM_UFlat/5                  198317    190243  2.0GB/s  html4          +4.2%
        BM_UFlat/6                   14768     12560  1.8GB/s  cp            +17.6%
        BM_UFlat/7                    6453      6447  1.6GB/s  c              +0.1%
        BM_UFlat/8                    1991      1980  1.8GB/s  lsp            +0.6%
        BM_UFlat/9                  766947    770424  1.2GB/s  xls            -0.5%
        BM_UFlat/10                    170       169  1.1GB/s  xls_200        +0.6%
        BM_UFlat/11                 164350    163554  888.7MB/s  txt1         +0.5%
        BM_UFlat/12                 145444    143830  832.1MB/s  txt2         +1.1%
        BM_UFlat/13                 437849    438413  929.2MB/s  txt3         -0.1%
        BM_UFlat/14                 603587    605309  759.8MB/s  txt4         -0.3%
        BM_UFlat/15                 249799    248067  1.9GB/s  bin            +0.7%
        BM_UFlat/16                    191       188  1011.4MB/s  bin_200     +1.6%
        BM_UFlat/17                  26064     24778  1.4GB/s  sum            +5.2%
        BM_UFlat/18                   2620      2601  1.5GB/s  man            +0.7%
        BM_UFlat/19                  44551     37373  3.0GB/s  pb            +19.2%
        BM_UFlat/20                 165408    164584  1.0GB/s  gaviota        +0.5%
        Sum of all benchmarks      3408011   3385508                          +0.7%
      
      
      git-svn-id: https://snappy.googlecode.com/svn/trunk@78 03e5f5b5-db94-4691-08a0-1a8bf15f6143
      2f0aaf86
  25. Jun 14, 2013
    • snappy.mirrorbot@gmail.com's avatar
      Make the two IncrementalCopy* functions take in an ssize_t instead of a len, · 062bf544
      snappy.mirrorbot@gmail.com authored
      in order to avoid having to do 32-to-64-bit signed conversions on a hot path
      during decompression. (Also fixes some MSVC warnings, mentioned in public
      issue 75, but more of those remain.) They cannot be size_t because we expect
      them to go negative and test for that.
      
      This saves a few movzwl instructions, yielding ~2% speedup in decompression.
      
      
      Sandy Bridge:
      
      Benchmark                          Base (ns)  New (ns)                                Improvement
      -------------------------------------------------------------------------------------------------
      BM_UFlat/0                             48009     41283  2.3GB/s  html                   +16.3%
      BM_UFlat/1                            531274    513419  1.3GB/s  urls                    +3.5%
      BM_UFlat/2                              7378      7062  16.8GB/s  jpg                    +4.5%
      BM_UFlat/3                                92        92  2.0GB/s  jpg_200                 +0.0%
      BM_UFlat/4                             15057     14974  5.9GB/s  pdf                     +0.6%
      BM_UFlat/5                            204323    193140  2.0GB/s  html4                   +5.8%
      BM_UFlat/6                             13282     12611  1.8GB/s  cp                      +5.3%
      BM_UFlat/7                              6511      6504  1.6GB/s  c                       +0.1%
      BM_UFlat/8                              2014      2030  1.7GB/s  lsp                     -0.8%
      BM_UFlat/9                            775909    768336  1.3GB/s  xls                     +1.0%
      BM_UFlat/10                              182       184  1043.2MB/s  xls_200              -1.1%
      BM_UFlat/11                           167352    161630  901.2MB/s  txt1                  +3.5%
      BM_UFlat/12                           147393    142246  842.8MB/s  txt2                  +3.6%
      BM_UFlat/13                           449960    432853  944.4MB/s  txt3                  +4.0%
      BM_UFlat/14                           620497    594845  775.9MB/s  txt4                  +4.3%
      BM_UFlat/15                           265610    267356  1.8GB/s  bin                     -0.7%
      BM_UFlat/16                              206       205  932.7MB/s  bin_200               +0.5%
      BM_UFlat/17                            25561     24730  1.4GB/s  sum                     +3.4%
      BM_UFlat/18                             2620      2644  1.5GB/s  man                     -0.9%
      BM_UFlat/19                            45766     38589  2.9GB/s  pb                     +18.6%
      BM_UFlat/20                           171107    169832  1039.5MB/s  gaviota              +0.8%
      Sum of all benchmarks                3500103   3394565                                   +3.1%
      
      
      Westmere:
      
      Benchmark                          Base (ns)  New (ns)                                Improvement
      -------------------------------------------------------------------------------------------------
      BM_UFlat/0                             72624     71526  1.3GB/s  html                    +1.5%
      BM_UFlat/1                            735821    722917  930.8MB/s  urls                  +1.8%
      BM_UFlat/2                             10450     10172  11.7GB/s  jpg                    +2.7%
      BM_UFlat/3                               117       117  1.6GB/s  jpg_200                 +0.0%
      BM_UFlat/4                             29817     29648  3.0GB/s  pdf                     +0.6%
      BM_UFlat/5                            297126    293073  1.3GB/s  html4                   +1.4%
      BM_UFlat/6                             28252     27994  842.0MB/s  cp                    +0.9%
      BM_UFlat/7                             12672     12391  862.1MB/s  c                     +2.3%
      BM_UFlat/8                              3507      3425  1040.9MB/s  lsp                  +2.4%
      BM_UFlat/9                           1004268    969395  1018.0MB/s  xls                  +3.6%
      BM_UFlat/10                              233       227  844.8MB/s  xls_200               +2.6%
      BM_UFlat/11                           230054    224981  647.8MB/s  txt1                  +2.3%
      BM_UFlat/12                           201229    196447  610.5MB/s  txt2                  +2.4%
      BM_UFlat/13                           609547    596761  685.3MB/s  txt3                  +2.1%
      BM_UFlat/14                           824362    804821  573.8MB/s  txt4                  +2.4%
      BM_UFlat/15                           371095    374899  1.3GB/s  bin                     -1.0%
      BM_UFlat/16                              267       267  717.8MB/s  bin_200               +0.0%
      BM_UFlat/17                            44623     43828  835.9MB/s  sum                   +1.8%
      BM_UFlat/18                             5077      4815  841.0MB/s  man                   +5.4%
      BM_UFlat/19                            74964     73210  1.5GB/s  pb                      +2.4%
      BM_UFlat/20                           237987    236745  746.0MB/s  gaviota               +0.5%
      Sum of all benchmarks                4794092   4697659                                   +2.1%
      
      
      Istanbul:
      
      Benchmark                          Base (ns)  New (ns)                                Improvement
      -------------------------------------------------------------------------------------------------
      BM_UFlat/0                             98614     96376  1020.4MB/s  html                 +2.3%
      BM_UFlat/1                            963740    953241  707.2MB/s  urls                  +1.1%
      BM_UFlat/2                             25042     24769  4.8GB/s  jpg                     +1.1%
      BM_UFlat/3                               180       180  1065.6MB/s  jpg_200              +0.0%
      BM_UFlat/4                             45942     45403  1.9GB/s  pdf                     +1.2%
      BM_UFlat/5                            400135    390226  1008.2MB/s  html4                +2.5%
      BM_UFlat/6                             37768     37392  631.9MB/s  cp                    +1.0%
      BM_UFlat/7                             18585     18200  588.2MB/s  c                     +2.1%
      BM_UFlat/8                              5751      5690  627.7MB/s  lsp                   +1.1%
      BM_UFlat/9                           1543154   1542209  641.4MB/s  xls                   +0.1%
      BM_UFlat/10                              381       388  494.6MB/s  xls_200               -1.8%
      BM_UFlat/11                           339715    331973  440.1MB/s  txt1                  +2.3%
      BM_UFlat/12                           294807    289418  415.4MB/s  txt2                  +1.9%
      BM_UFlat/13                           906160    884094  463.3MB/s  txt3                  +2.5%
      BM_UFlat/14                          1224221   1198435  386.1MB/s  txt4                  +2.2%
      BM_UFlat/15                           516277    502923  979.5MB/s  bin                   +2.7%
      BM_UFlat/16                              405       402  477.2MB/s  bin_200               +0.7%
      BM_UFlat/17                            61640     60621  605.6MB/s  sum                   +1.7%
      BM_UFlat/18                             7326      7383  549.5MB/s  man                   -0.8%
      BM_UFlat/19                            94720     92653  1.2GB/s  pb                      +2.2%
      BM_UFlat/20                           360435    346687  510.6MB/s  gaviota               +4.0%
      Sum of all benchmarks                6944998   6828663                                   +1.7%
      
      
      git-svn-id: https://snappy.googlecode.com/svn/trunk@77 03e5f5b5-db94-4691-08a0-1a8bf15f6143
      062bf544
  26. Jun 13, 2013
  27. Jun 12, 2013
  28. Apr 09, 2013
  29. Feb 05, 2013
  30. Jan 18, 2013
    • snappy.mirrorbot@gmail.com's avatar
      Increase the Zippy block size from 32 kB to 64 kB, winning ~3% density · 27a0cc39
      snappy.mirrorbot@gmail.com authored
      while being effectively performance neutral.
      
      The longer story about density is that we win 3-6% density on the benchmarks 
      where this has any effect at all; many of the benchmarks (cp, c, lsp, man)
      are smaller than 32 kB and thus will have no effect. Binary data also seems
      to win little or nothing; of course, the already-compressed data wins nothing.
      The protobuf benchmark wins as much as ~18% depending on architecture,
      but I wouldn't be too sure that this is representative of protobuf data in
      general.
      
      As of performance, we lose a tiny amount since we get more tags (e.g., a long
      literal might be broken up into literal-copy-literal), but we win it back with
      less clearing of the hash table, and more opportunities to skip incompressible
      data (e.g. in the jpg benchmark). Decompression seems to get ever so slightly
      slower, again due to more tags. The total net change is about as close to zero
      as we can get, so the end effect seems to be simply more density and no
      real performance change.
      
      The comment about not changing kBlockSize, scary as it is, is not really
      relevant, since we're never going to have a block-level decompressor without
      explicitly marked blocks. Replace it with something more appropriate.
      
      This affects the framing format, but it's okay to change it since it basically
      has no users yet.
      
      
      Density (note that cp, c, lsp and man are all smaller than 32 kB):
      
         Benchmark         Description   Base (%)  New (%)  Improvement
         --------------------------------------------------------------
         ZFlat/0           html            22.57    22.31     +5.6%
         ZFlat/1           urls            50.89    47.77     +6.5%
         ZFlat/2           jpg             99.88    99.87     +0.0%
         ZFlat/3           pdf             82.13    82.07     +0.1%
         ZFlat/4           html4           23.55    22.51     +4.6%
         ZFlat/5           cp              48.12    48.12     +0.0%
         ZFlat/6           c               42.40    42.40     +0.0%
         ZFlat/7           lsp             48.37    48.37     +0.0%
         ZFlat/8           xls             41.34    41.23     +0.3%
         ZFlat/9           txt1            59.81    57.87     +3.4%
         ZFlat/10          txt2            64.07    61.93     +3.5%
         ZFlat/11          txt3            57.11    54.92     +4.0%
         ZFlat/12          txt4            68.35    66.22     +3.2%
         ZFlat/13          bin             18.21    18.11     +0.6%
         ZFlat/14          sum             51.88    48.96     +6.0%
         ZFlat/15          man             59.36    59.36     +0.0%
         ZFlat/16          pb              23.15    19.64    +17.9%
         ZFlat/17          gaviota         38.27    37.72     +1.5%
         Geometric mean                    45.51    44.15     +3.1%
      
      
      Microbenchmarks (64-bit, opt):
      
      Westmere 2.8 GHz:
      
         Benchmark                          Base (ns)  New (ns)                                Improvement
         -------------------------------------------------------------------------------------------------
         BM_UFlat/0                             75342     75027  1.3GB/s  html                    +0.4%
         BM_UFlat/1                            723767    744269  899.6MB/s  urls                  -2.8%
         BM_UFlat/2                             10072     10072  11.7GB/s  jpg                    +0.0%
         BM_UFlat/3                             30747     30388  2.9GB/s  pdf                     +1.2%
         BM_UFlat/4                            307353    306063  1.2GB/s  html4                   +0.4%
         BM_UFlat/5                             28593     28743  816.3MB/s  cp                    -0.5%
         BM_UFlat/6                             12958     12998  818.1MB/s  c                     -0.3%
         BM_UFlat/7                              3700      3792  935.8MB/s  lsp                   -2.4%
         BM_UFlat/8                            999685    999905  982.1MB/s  xls                   -0.0%
         BM_UFlat/9                            232954    230079  630.4MB/s  txt1                  +1.2%
         BM_UFlat/10                           200785    201468  592.6MB/s  txt2                  -0.3%
         BM_UFlat/11                           617267    610968  666.1MB/s  txt3                  +1.0%
         BM_UFlat/12                           821595    822475  558.7MB/s  txt4                  -0.1%
         BM_UFlat/13                           377097    377632  1.3GB/s  bin                     -0.1%
         BM_UFlat/14                            45476     45260  805.8MB/s  sum                   +0.5%
         BM_UFlat/15                             4985      5003  805.7MB/s  man                   -0.4%
         BM_UFlat/16                            80813     77494  1.4GB/s  pb                      +4.3%
         BM_UFlat/17                           251792    241553  727.7MB/s  gaviota               +4.2%
         BM_UValidate/0                         40343     40354  2.4GB/s  html                    -0.0%
         BM_UValidate/1                        426890    451574  1.4GB/s  urls                    -5.5%
         BM_UValidate/2                           187       179  661.9GB/s  jpg                   +4.5%
         BM_UValidate/3                         13783     13827  6.4GB/s  pdf                     -0.3%
         BM_UValidate/4                        162393    163335  2.3GB/s  html4                   -0.6%
         BM_UDataBuffer/0                       93756     93302  1046.7MB/s  html                 +0.5%
         BM_UDataBuffer/1                      886714    916292  730.7MB/s  urls                  -3.2%
         BM_UDataBuffer/2                       15861     16401  7.2GB/s  jpg                     -3.3%
         BM_UDataBuffer/3                       38934     39224  2.2GB/s  pdf                     -0.7%
         BM_UDataBuffer/4                      381008    379428  1029.5MB/s  html4                +0.4%
         BM_UCord/0                             92528     91098  1072.0MB/s  html                 +1.6%
         BM_UCord/1                            858421    885287  756.3MB/s  urls                  -3.0%
         BM_UCord/2                             13140     13464  8.8GB/s  jpg                     -2.4%
         BM_UCord/3                             39012     37773  2.3GB/s  pdf                     +3.3%
         BM_UCord/4                            376869    371267  1052.1MB/s  html4                +1.5%
         BM_UCordString/0                       75810     75303  1.3GB/s  html                    +0.7%
         BM_UCordString/1                      735290    753841  888.2MB/s  urls                  -2.5%
         BM_UCordString/2                       11945     13113  9.0GB/s  jpg                     -8.9%
         BM_UCordString/3                       33901     32562  2.7GB/s  pdf                     +4.1%
         BM_UCordString/4                      310985    309390  1.2GB/s  html4                   +0.5%
         BM_UCordValidate/0                     40952     40450  2.4GB/s  html                    +1.2%
         BM_UCordValidate/1                    433842    456531  1.4GB/s  urls                    -5.0%
         BM_UCordValidate/2                      1179      1173  100.8GB/s  jpg                   +0.5%
         BM_UCordValidate/3                     14481     14392  6.1GB/s  pdf                     +0.6%
         BM_UCordValidate/4                    164364    164151  2.3GB/s  html4                   +0.1%
         BM_ZFlat/0                            160610    156601  623.6MB/s  html (22.31 %)        +2.6%
         BM_ZFlat/1                           1995238   1993582  335.9MB/s  urls (47.77 %)        +0.1%
         BM_ZFlat/2                             30133     24983  4.7GB/s  jpg (99.87 %)          +20.6%
         BM_ZFlat/3                             74453     73128  1.2GB/s  pdf (82.07 %)           +1.8%
         BM_ZFlat/4                            647674    633729  616.4MB/s  html4 (22.51 %)       +2.2%
         BM_ZFlat/5                             76259     76090  308.4MB/s  cp (48.12 %)          +0.2%
         BM_ZFlat/6                             31106     31084  342.1MB/s  c (42.40 %)           +0.1%
         BM_ZFlat/7                             10507     10443  339.8MB/s  lsp (48.37 %)         +0.6%
         BM_ZFlat/8                           1811047   1793325  547.6MB/s  xls (41.23 %)         +1.0%
         BM_ZFlat/9                            597903    581793  249.3MB/s  txt1 (57.87 %)        +2.8%
         BM_ZFlat/10                           525320    514522  232.0MB/s  txt2 (61.93 %)        +2.1%
         BM_ZFlat/11                          1596591   1551636  262.3MB/s  txt3 (54.92 %)        +2.9%
         BM_ZFlat/12                          2134523   2094033  219.5MB/s  txt4 (66.22 %)        +1.9%
         BM_ZFlat/13                           593024    587869  832.6MB/s  bin (18.11 %)         +0.9%
         BM_ZFlat/14                           114746    110666  329.5MB/s  sum (48.96 %)         +3.7%
         BM_ZFlat/15                            14376     14485  278.3MB/s  man (59.36 %)         -0.8%
         BM_ZFlat/16                           167908    150070  753.6MB/s  pb (19.64 %)         +11.9%
         BM_ZFlat/17                           460228    442253  397.5MB/s  gaviota (37.72 %)     +4.1%
         BM_ZCord/0                            164896    160241  609.4MB/s  html                  +2.9%
         BM_ZCord/1                           2070239   2043492  327.7MB/s  urls                  +1.3%
         BM_ZCord/2                             54402     47002  2.5GB/s  jpg                    +15.7%
         BM_ZCord/3                             85871     83832  1073.1MB/s  pdf                  +2.4%
         BM_ZCord/4                            664078    648825  602.0MB/s  html4                 +2.4%
         BM_ZDataBuffer/0                      174874    172549  566.0MB/s  html                  +1.3%
         BM_ZDataBuffer/1                     2134410   2139173  313.0MB/s  urls                  -0.2%
         BM_ZDataBuffer/2                       71911     69551  1.7GB/s  jpg                     +3.4%
         BM_ZDataBuffer/3                       98236     99727  902.1MB/s  pdf                   -1.5%
         BM_ZDataBuffer/4                      710776    699104  558.8MB/s  html4                 +1.7%
         Sum of all benchmarks               27358908  27200688                                   +0.6%
      
      
      Sandy Bridge 2.6 GHz:
      
         Benchmark                          Base (ns)  New (ns)                                Improvement
         -------------------------------------------------------------------------------------------------
         BM_UFlat/0                             49356     49018  1.9GB/s  html                    +0.7%
         BM_UFlat/1                            516764    531955  1.2GB/s  urls                    -2.9%
         BM_UFlat/2                              6982      7304  16.2GB/s  jpg                    -4.4%
         BM_UFlat/3                             15285     15598  5.6GB/s  pdf                     -2.0%
         BM_UFlat/4                            206557    206669  1.8GB/s  html4                   -0.1%
         BM_UFlat/5                             13681     13567  1.7GB/s  cp                      +0.8%
         BM_UFlat/6                              6571      6592  1.6GB/s  c                       -0.3%
         BM_UFlat/7                              2008      1994  1.7GB/s  lsp                     +0.7%
         BM_UFlat/8                            775700    773286  1.2GB/s  xls                     +0.3%
         BM_UFlat/9                            165578    164480  881.8MB/s  txt1                  +0.7%
         BM_UFlat/10                           143707    144139  828.2MB/s  txt2                  -0.3%
         BM_UFlat/11                           443026    436281  932.8MB/s  txt3                  +1.5%
         BM_UFlat/12                           603129    595856  771.2MB/s  txt4                  +1.2%
         BM_UFlat/13                           271682    270450  1.8GB/s  bin                     +0.5%
         BM_UFlat/14                            26200     25666  1.4GB/s  sum                     +2.1%
         BM_UFlat/15                             2620      2608  1.5GB/s  man                     +0.5%
         BM_UFlat/16                            48908     47756  2.3GB/s  pb                      +2.4%
         BM_UFlat/17                           174638    170346  1031.9MB/s  gaviota              +2.5%
         BM_UValidate/0                         31922     31898  3.0GB/s  html                    +0.1%
         BM_UValidate/1                        341265    363554  1.8GB/s  urls                    -6.1%
         BM_UValidate/2                           160       151  782.8GB/s  jpg                   +6.0%
         BM_UValidate/3                         10402     10380  8.5GB/s  pdf                     +0.2%
         BM_UValidate/4                        129490    130587  2.9GB/s  html4                   -0.8%
         BM_UDataBuffer/0                       59383     58736  1.6GB/s  html                    +1.1%
         BM_UDataBuffer/1                      619222    637786  1049.8MB/s  urls                 -2.9%
         BM_UDataBuffer/2                       10775     11941  9.9GB/s  jpg                     -9.8%
         BM_UDataBuffer/3                       18002     17930  4.9GB/s  pdf                     +0.4%
         BM_UDataBuffer/4                      259182    259306  1.5GB/s  html4                   -0.0%
         BM_UCord/0                             59379     57814  1.6GB/s  html                    +2.7%
         BM_UCord/1                            598456    615162  1088.4MB/s  urls                 -2.7%
         BM_UCord/2                              8519      8628  13.7GB/s  jpg                    -1.3%
         BM_UCord/3                             18123     17537  5.0GB/s  pdf                     +3.3%
         BM_UCord/4                            252375    252331  1.5GB/s  html4                   +0.0%
         BM_UCordString/0                       49494     49790  1.9GB/s  html                    -0.6%
         BM_UCordString/1                      524659    541803  1.2GB/s  urls                    -3.2%
         BM_UCordString/2                        8206      8354  14.2GB/s  jpg                    -1.8%
         BM_UCordString/3                       17235     16537  5.3GB/s  pdf                     +4.2%
         BM_UCordString/4                      210188    211072  1.8GB/s  html4                   -0.4%
         BM_UCordValidate/0                     31956     31587  3.0GB/s  html                    +1.2%
         BM_UCordValidate/1                    340828    362141  1.8GB/s  urls                    -5.9%
         BM_UCordValidate/2                       783       744  158.9GB/s  jpg                   +5.2%
         BM_UCordValidate/3                     10543     10462  8.4GB/s  pdf                     +0.8%
         BM_UCordValidate/4                    130150    129789  2.9GB/s  html4                   +0.3%
         BM_ZFlat/0                            113873    111200  878.2MB/s  html (22.31 %)        +2.4%
         BM_ZFlat/1                           1473023   1489858  449.4MB/s  urls (47.77 %)        -1.1%
         BM_ZFlat/2                             23569     19486  6.1GB/s  jpg (99.87 %)          +21.0%
         BM_ZFlat/3                             49178     48046  1.8GB/s  pdf (82.07 %)           +2.4%
         BM_ZFlat/4                            475063    469394  832.2MB/s  html4 (22.51 %)       +1.2%
         BM_ZFlat/5                             46910     46816  501.2MB/s  cp (48.12 %)          +0.2%
         BM_ZFlat/6                             16883     16916  628.6MB/s  c (42.40 %)           -0.2%
         BM_ZFlat/7                              5381      5447  651.5MB/s  lsp (48.37 %)         -1.2%
         BM_ZFlat/8                           1466870   1473861  666.3MB/s  xls (41.23 %)         -0.5%
         BM_ZFlat/9                            468006    464101  312.5MB/s  txt1 (57.87 %)        +0.8%
         BM_ZFlat/10                           408157    408957  291.9MB/s  txt2 (61.93 %)        -0.2%
         BM_ZFlat/11                          1253348   1232910  330.1MB/s  txt3 (54.92 %)        +1.7%
         BM_ZFlat/12                          1702373   1702977  269.8MB/s  txt4 (66.22 %)        -0.0%
         BM_ZFlat/13                           439792    438557  1116.0MB/s  bin (18.11 %)        +0.3%
         BM_ZFlat/14                            80766     78851  462.5MB/s  sum (48.96 %)         +2.4%
         BM_ZFlat/15                             7420      7542  534.5MB/s  man (59.36 %)         -1.6%
         BM_ZFlat/16                           112043    100126  1.1GB/s  pb (19.64 %)           +11.9%
         BM_ZFlat/17                           368877    357703  491.4MB/s  gaviota (37.72 %)     +3.1%
         BM_ZCord/0                            116402    113564  859.9MB/s  html                  +2.5%
         BM_ZCord/1                           1507156   1519911  440.5MB/s  urls                  -0.8%
         BM_ZCord/2                             39860     33686  3.5GB/s  jpg                    +18.3%
         BM_ZCord/3                             56211     54694  1.6GB/s  pdf                     +2.8%
         BM_ZCord/4                            485594    479212  815.1MB/s  html4                 +1.3%
         BM_ZDataBuffer/0                      123185    121572  803.3MB/s  html                  +1.3%
         BM_ZDataBuffer/1                     1569111   1589380  421.3MB/s  urls                  -1.3%
         BM_ZDataBuffer/2                       53143     49556  2.4GB/s  jpg                     +7.2%
         BM_ZDataBuffer/3                       65725     66826  1.3GB/s  pdf                     -1.6%
         BM_ZDataBuffer/4                      517871    514750  758.9MB/s  html4                 +0.6%
         Sum of all benchmarks               20258879  20315484                                   -0.3%
      
      
      AMD Instanbul 2.4 GHz:
      
         Benchmark                          Base (ns)  New (ns)                                Improvement
         -------------------------------------------------------------------------------------------------
         BM_UFlat/0                             97120     96585  1011.1MB/s  html                 +0.6%
         BM_UFlat/1                            917473    948016  706.3MB/s  urls                  -3.2%
         BM_UFlat/2                             21496     23938  4.9GB/s  jpg                    -10.2%
         BM_UFlat/3                             44751     45639  1.9GB/s  pdf                     -1.9%
         BM_UFlat/4                            391950    391413  998.0MB/s  html4                 +0.1%
         BM_UFlat/5                             37366     37201  630.7MB/s  cp                    +0.4%
         BM_UFlat/6                             18350     18318  580.5MB/s  c                     +0.2%
         BM_UFlat/7                              5672      5661  626.9MB/s  lsp                   +0.2%
         BM_UFlat/8                           1533390   1529441  642.1MB/s  xls                   +0.3%
         BM_UFlat/9                            335477    336553  431.0MB/s  txt1                  -0.3%
         BM_UFlat/10                           285140    292080  408.7MB/s  txt2                  -2.4%
         BM_UFlat/11                           888507    894758  454.9MB/s  txt3                  -0.7%
         BM_UFlat/12                          1187643   1210928  379.5MB/s  txt4                  -1.9%
         BM_UFlat/13                           493717    507447  964.5MB/s  bin                   -2.7%
         BM_UFlat/14                            61740     60870  599.1MB/s  sum                   +1.4%
         BM_UFlat/15                             7211      7187  560.9MB/s  man                   +0.3%
         BM_UFlat/16                            97435     93100  1.2GB/s  pb                      +4.7%
         BM_UFlat/17                           362662    356395  493.2MB/s  gaviota               +1.8%
         BM_UValidate/0                         47475     47118  2.0GB/s  html                    +0.8%
         BM_UValidate/1                        501304    529741  1.2GB/s  urls                    -5.4%
         BM_UValidate/2                           276       243  486.2GB/s  jpg                  +13.6%
         BM_UValidate/3                         16361     16261  5.4GB/s  pdf                     +0.6%
         BM_UValidate/4                        190741    190353  2.0GB/s  html4                   +0.2%
         BM_UDataBuffer/0                      111080    109771  889.6MB/s  html                  +1.2%
         BM_UDataBuffer/1                     1051035   1085999  616.5MB/s  urls                  -3.2%
         BM_UDataBuffer/2                       25801     25463  4.6GB/s  jpg                     +1.3%
         BM_UDataBuffer/3                       50493     49946  1.8GB/s  pdf                     +1.1%
         BM_UDataBuffer/4                      447258    444138  879.5MB/s  html4                 +0.7%
         BM_UCord/0                            109350    107909  905.0MB/s  html                  +1.3%
         BM_UCord/1                           1023396   1054964  634.7MB/s  urls                  -3.0%
         BM_UCord/2                             25292     24371  4.9GB/s  jpg                     +3.8%
         BM_UCord/3                             48955     49736  1.8GB/s  pdf                     -1.6%
         BM_UCord/4                            440452    437331  893.2MB/s  html4                 +0.7%
         BM_UCordString/0                       98511     98031  996.2MB/s  html                  +0.5%
         BM_UCordString/1                      933230    963495  694.9MB/s  urls                  -3.1%
         BM_UCordString/2                       23311     24076  4.9GB/s  jpg                     -3.2%
         BM_UCordString/3                       45568     46196  1.9GB/s  pdf                     -1.4%
         BM_UCordString/4                      397791    396934  984.1MB/s  html4                 +0.2%
         BM_UCordValidate/0                     47537     46921  2.0GB/s  html                    +1.3%
         BM_UCordValidate/1                    505071    532716  1.2GB/s  urls                    -5.2%
         BM_UCordValidate/2                      1663      1621  72.9GB/s  jpg                    +2.6%
         BM_UCordValidate/3                     16890     16926  5.2GB/s  pdf                     -0.2%
         BM_UCordValidate/4                    192365    191984  2.0GB/s  html4                   +0.2%
         BM_ZFlat/0                            184708    179103  545.3MB/s  html (22.31 %)        +3.1%
         BM_ZFlat/1                           2293864   2302950  290.7MB/s  urls (47.77 %)        -0.4%
         BM_ZFlat/2                             52852     47618  2.5GB/s  jpg (99.87 %)          +11.0%
         BM_ZFlat/3                            100766     96179  935.3MB/s  pdf (82.07 %)         +4.8%
         BM_ZFlat/4                            741220    727977  536.6MB/s  html4 (22.51 %)       +1.8%
         BM_ZFlat/5                             85402     85418  274.7MB/s  cp (48.12 %)          -0.0%
         BM_ZFlat/6                             36558     36494  291.4MB/s  c (42.40 %)           +0.2%
         BM_ZFlat/7                             12706     12507  283.7MB/s  lsp (48.37 %)         +1.6%
         BM_ZFlat/8                           2336823   2335688  420.5MB/s  xls (41.23 %)         +0.0%
         BM_ZFlat/9                            701804    681153  212.9MB/s  txt1 (57.87 %)        +3.0%
         BM_ZFlat/10                           606700    597194  199.9MB/s  txt2 (61.93 %)        +1.6%
         BM_ZFlat/11                          1852283   1803238  225.7MB/s  txt3 (54.92 %)        +2.7%
         BM_ZFlat/12                          2475527   2443354  188.1MB/s  txt4 (66.22 %)        +1.3%
         BM_ZFlat/13                           694497    696654  702.6MB/s  bin (18.11 %)         -0.3%
         BM_ZFlat/14                           136929    129855  280.8MB/s  sum (48.96 %)         +5.4%
         BM_ZFlat/15                            17172     17124  235.4MB/s  man (59.36 %)         +0.3%
         BM_ZFlat/16                           190364    171763  658.4MB/s  pb (19.64 %)         +10.8%
         BM_ZFlat/17                           567285    555190  316.6MB/s  gaviota (37.72 %)     +2.2%
         BM_ZCord/0                            193490    187031  522.1MB/s  html                  +3.5%
         BM_ZCord/1                           2427537   2415315  277.2MB/s  urls                  +0.5%
         BM_ZCord/2                             85378     81412  1.5GB/s  jpg                     +4.9%
         BM_ZCord/3                            121898    119419  753.3MB/s  pdf                   +2.1%
         BM_ZCord/4                            779564    762961  512.0MB/s  html4                 +2.2%
         BM_ZDataBuffer/0                      213820    207272  471.1MB/s  html                  +3.2%
         BM_ZDataBuffer/1                     2589010   2586495  258.9MB/s  urls                  +0.1%
         BM_ZDataBuffer/2                      121871    118885  1018.4MB/s  jpg                  +2.5%
         BM_ZDataBuffer/3                      145382    145986  616.2MB/s  pdf                   -0.4%
         BM_ZDataBuffer/4                      868117    852754  458.1MB/s  html4                 +1.8%
         Sum of all benchmarks               33771833  33744763                                   +0.1%
      
      
      git-svn-id: https://snappy.googlecode.com/svn/trunk@71 03e5f5b5-db94-4691-08a0-1a8bf15f6143
      27a0cc39
  31. Jan 06, 2013
Loading