The genomic [test] data is a collection of … … It is parsed into shorter strings by extracting … where a longer inexact match may be found … [ref] … The genomic data was even more uniform [than the music data] [Nevill-Manning and Witten, IEEE Data Compression Conf, 1999], with even the rarest n-grams occurring hundreds of times, … [p. 206]
Google-1
Google-2
Copyright clearance needed for quotation.