Table of Contents NAME mg_text_estimate - Estimate the size of the compressed text based on...
Table of Contents
NAME
mg_text_estimate - Estimate the size of the compressed text based on the text statistics and a compression dictionary.
SYNOPSIS
mg_text_estimate [ -h ] [ -H | -B | -D | -Y | -M ] stats-dict compression-dict
DESCRIPTION
This program estimates the size of the compressed text that would be generated if text with certain statistics were compressed with a particular dictionary. This program has no real use in the mg(1) system, except for experimenting with the mg_compression_dict(1) program.
OPTIONS
Options may appear in any order.
- -h
- This displays a usage line on stderr.
- -H
- This specifies that novel words will be coded character by character using Huffman codes.
- -B
- This specifies that the size of an auxiliary dictionary will be estimated. Each novel word found will be placed at the end of the auxiliary dictionary. Novel words will be coded in the compressed text using binary codes. The binary code represents their occurrence position in the auxiliary dictionary.
- -D
- This specifies that the size of an auxiliary dictionary will be estimated. Each novel word found will be placed at the end of the auxiliary dictionary. Novel words will be coded in the compressed text using delta codes. The delta code represents their occurrence position in the auxiliary dictionary.
- -Y
- This specifies that the size of an auxiliary dictionary will be estimated. Each novel word found will be placed at the end of the auxiliary dictionary. Novel words will be coded in the compressed text using a combination of gamma and binary codes. The code represents their occurrence position in the auxiliary dictionary. This generally produces better compression than -B or -D.
- -M
- This specifies that the size of an auxiliary dictionary will be estimated. Each novel word found will be placed at the end of the auxiliary dictionary. Novel words will be coded in the compressed text using a combination of gamma and binary codes. The code represents their occurrence position in the auxiliary dictionary. This method is adaptive within documents, and generally produces better compression than -B, -D or -Y.
FILES
- *.text.stats
- Statistics about the text.
- *.text.dict
- Compressed compression dictionary.
SEE ALSO
mg(1),
mg_compression_dict(1),
mg_fast_comp_dict(1),
mg_get(1),
mg_invf_dict(1),
mg_invf_dump(1),
mg_invf_rebuild(1),
mg_passes(1),
mg_perf_hash_build(1),
mg_weights_build(1),
mgbilevel(1),
mgbuild(1),
mgdictlist(1),
mgfelics(1),
mgquery(1),
mgstat(1),
mgtic(1),
mgticbuild(1),
mgticdump(1),
mgticprune(1),
mgticstat(1).
Table of Contents