Re: COMPRESSION

Chris Marrin ([email protected])
Wed, 31 May 1995 18:39:37 -0700


> I have just done this by hand on a couple of small VRML files
> but I have found that ASCII->tokenized->GZIP is roughly 1/4 the
> size of ASCII->GZIP... This seems to make sense to me because
> the tokenized file has the same basic patterns as the full VRML
> document, just the patterns are XX rather than SomeCoolNode {

I think the key word here is "small". I've experimented a bit and have
found that small files compress much differently than large files. I
conjecture that this is because there is some overhead in storing a table
with the original strings (e.g. - "Separator {") mapped to their
compressed counterparts. A table of 2 character values is much shorter
than a table of 10 character values. This is significant for small test
files, much less so for real world files.

-- 
chris marrin      Silicon      http://www.sgi.com/Products/WebFORCE/WebSpace
(415) 390-5367    Graphics  ," http://reality.sgi.com/employees/cmarrin/
[email protected]   Inc.    b`    ,                             ,,.
                        mP     b"                            , 1$'
        ,.`           ,b`    ,`                              :$$' 
     ,|`             mP    ,`                                             ,mm
   ,b"              b"   ,`                ,mm      m$$    ,m          ,,`P$$
  m$`             ,b`  .` ,mm          ,.`'|$P   ,|"1$`  ,b$P       ,,`   :$1
 b$`             ,$: :,`` |$$       ,:`    $$` ,|` ,$$,,`"$$      .`      :$|
b$|            _m$`,:`    :$1    ,:`      ,$Pm|`    `    :$$,..;"'        |$:
P$b,      _;b$$b$1"       |$$ ,,``       ,$$"             ``'             $$
 ```"```'"    `"`         `""`           ""`                             ,P`
"As a general rule, don't solve puzzles that open portals to Hell."-...-'
		   - excerpt from "A Horror Movie Character's Survival Guide"