Brain Dump

Compression Model

Tags
text-processing

A representation model/dictionary of some textual data that can be used to improve storage.

Most compression algorithms need a model before they can begin encoding. Decoding is impossible without the relevant model.

Models can be [see page 72, built] in various ways depending on the algorithm.

We can classify a model based on how it uses the surrounding [see page 70, context]:

Word based models consist of words ([A-z0-9] strings) and non-words (space/punctuation).