Map
Index
Random
Help
th

Quote: compress lexicon entry into 3 bytes; 8 characters on average, shared prefix, compressed suffix, encoded count, predicted entry size

topics > all references > references t-z > QuoteRef: wittIH_1991 , p. 270



Topic:
text compression

Quotation Skeleton

The average word length in an English lexicon … [short words occur more often] … Since words are stored in alphabetical order … only the length of [the shared] prefix need be … Coupled with simple compression of suffixes leads to … [this reduces] the space required for a count to … be predicted from the word's occurrence count … This leads to … just over 3 bytes per lexicon entry …   Google-1   Google-2

Copyright clearance needed for quotation.


Related Topics up

Topic: text compression (16 items)

Copyright © 2002-2008 by C. Bradford Barber. All rights reserved.
Thesa is a trademark of C. Bradford Barber.