Quote: don't use a stop list; compress common words by predicting the inter-word gap; 100 words are 76% of references and 44% of compressed size

QuoteRef: wittIH_1991, p. 268



Topic: text compression
Topic: full-text indexing

Quotation Skeleton

If the concordance is properly compressed it is … The 100 most common words account for 76% … of omitting words on a stop list is … A simple theoretical model can be used to … [in a concordance], and this distribution can be used to encode the sequence …
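
The idea of coding inter-word gaps rather than absolute positions can be illustrated with a small sketch. The skeleton elides the actual model, so the geometric gap distribution used here is an assumption chosen for illustration, not necessarily the one in the quoted paper. The function names `gaps`, `geometric_bits`, and `fixed_bits` are hypothetical. The sketch reports the ideal code length (what an arithmetic coder would approach) rather than producing real bits.

```python
import math


def gaps(positions):
    """Convert sorted, 1-based word positions into inter-word gaps."""
    prev = 0
    out = []
    for pos in positions:
        out.append(pos - prev)
        prev = pos
    return out


def geometric_bits(positions, total_words):
    """Ideal code length in bits for one word's references.

    Assumption: each text position holds the word independently with
    probability p = occurrences / total_words, so a gap g has
    probability (1 - p)**(g - 1) * p.
    """
    p = len(positions) / total_words
    if p >= 1.0:
        return 0.0  # every position is this word; nothing to encode
    return sum(
        -math.log2(p) - (g - 1) * math.log2(1.0 - p)
        for g in gaps(positions)
    )


def fixed_bits(positions, total_words):
    """Baseline: every reference stored as a fixed-width position."""
    return len(positions) * math.ceil(math.log2(total_words))


# A very common word (every second position) and a rare one.
common = list(range(2, 1001, 2))
rare = [500]
common_per_ref = geometric_bits(common, 1000) / len(common)
rare_per_ref = geometric_bits(rare, 1000) / len(rare)
```

Under this assumed model a frequent word costs far fewer bits per reference than a rare one, which is consistent with the note's point that common words need not be dropped through a stop list.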

Copyright clearance needed for quotation.


Related Topics

Topic: text compression (16 items)
Topic: full-text indexing (35 items)

Copyright © 2002-2008 by C. Bradford Barber. All rights reserved.
Thesa is a trademark of C. Bradford Barber.