ThesaHelp: references t-z
Topic: data compression algorithms
Topic: text compression
Topic: compressed data
Topic: searching compressed data
| |
Reference
Witten, I.H., Moffat, A., Bell, T.C.,
Managing gigabytes: compressing and indexing documents and images, New York, Van Nostrand Reinhold, 1994.
Google
Other Reference
http://www.kbs.citri.edu.au/mg
ftp://munari.oz.au/pub/mg
Quotations
64 ;;Quote: comparison of compression techniques with the Calgary corpus: arithmetic coders best, then gzip's variation of LZ77
| 64+;;Quote: for large document collections, better compression with Huffman encoded words than with gzip; simple synchronization, about half as fast, stores lexicon in memory
| 138 ;;Quote: use a skipped index for fast queries in a compressed database
| 367 ;;Quote: public domain code for compressing and indexing large document collections; entire retrieval system is 40% of the original
|
Related Topics
ThesaHelp: references t-z (309 items)
Topic: data compression algorithms (53 items)
Topic: text compression (16 items)
Topic: compressed data (16 items)
Topic: searching compressed data (9 items)
|