Universal lossless compression via multilevel pattern matching

J. Kieffer; E. Yang; G. Nelson; P. Cosman

DOI:10.1109/18.850665
Corpus ID: 8191526

@article{Kieffer2000UniversalLC,
  title={Universal lossless compression via multilevel pattern matching},
  author={John C. Kieffer and En-hui Yang and G. Nelson and Pamela C. Cosman},
  journal={IEEE Trans. Inf. Theory},
  year={2000},
  volume={46},
  pages={1227-1245},
  url={https://api.semanticscholar.org/CorpusID:8191526}
}

Published in IEEE Transactions on… 1 July 2000
Computer Science

An O(1/log n) upper bound on the maximal redundancy per sample is established for the multilevel pattern matching code with respect to any class of finite-state sources of uniformly bounded complexity, when processing a finite-alphabet data string of length n.

110 Citations

Topics

Finite State Sources, Multilevel Pattern Matching, Matching Patterns, Linear Complexity, Parallel Substitutions, Input Data String

Context-dependent multilevel pattern matching for lossless image compression

Yunwei Jia, E. Yang

Computer Science

IEEE Trans. Inf. Theory

2003

It is shown that among all images of n pixels, the context-dependent 2D MPM code has an O(1/log n) worst-case redundancy against any finite-template-based arithmetic code satisfying a mild condition; this redundancy is better than that of the 2D MPM code without context models.

Structured grammar-based codes for universal lossless data compression

J. Kieffer, E. Yang

Computer Science

Commun. Inf. Syst.

2002

A coding theorem is proved which shows that a structured grammar-based code has maximal redundancy/sample O(1/log n) provided that a weak regular structure condition is satisfied.

Universal Lossless Data Compression Via Binary Decision Diagrams

J. Kieffer, P. Flajolet, E. Yang

Computer Science

ArXiv

2011

A lossless data compression algorithm in which a binary string of length a power of two is compressed via compression of the ROBDD associated to it as described above, showing that the maximal pointwise redundancy/sample with respect to any s-state binary information source has the upper bound.

Universal lossless data compression with side information by using a conditional MPM grammar transform

E. Yang, A. Kaltchenko, J. Kieffer

Computer Science

2000 IEEE International Symposium on Information…

2000

A universal lossless data compression algorithm with side information called the CMPM algorithm, which has linear time and storage complexity and asymptotically achieves the conditional entropy rate of any stationary, ergodic source pair.

Grammar-based codes: A new class of universal lossless source codes

J. Kieffer, E. Yang

Computer Science

IEEE Trans. Inf. Theory

2000

It is shown that, subject to some mild restrictions, a grammar-based code is a universal code with respect to the family of finite-state information sources over the finite alphabet.

Efficient universal lossless data compression algorithms based on a greedy sequential grammar transform. 2. With context models

E. Yang, Dake He

Computer Science

IEEE Trans. Inf. Theory

2003

It is proved that for some nonstationary sources, the proposed context-dependent algorithms can achieve better expected redundancies than any existing CFG-based codes, including the Lempel-Ziv (1978) algorithm, the multilevel pattern matching algorithm, and the context-free algorithms in Part I of this series of papers.

Efficient Variable-to-Fixed Length Coding Algorithms for Text Compression

S. Yoshida

Computer Science

2014

This thesis focuses on lossless compression for text data, that is, text compression, and Variable-to-Fixed-length coding, a coding scheme that segments an input text into a consecutive sequence of substrings and then assigns a fixed length codeword to each substring.

Effective Variable-Length-to-Fixed-Length Coding via a Re-Pair Algorithm

S. Yoshida, Takuya Kida

Computer Science

2013 Data Compression Conference

2013

This study proposes a new VF coding method that applies a fixed-length code to the set of rules extracted by the Re-Pair algorithm, a simple off-line grammar-based compression method that has good compression-ratio performance with moderate compression speed.

Lossless Data Compression Via Guided Approximate Bisections

E. Yang, John C. Kieffer

Computer Science

2000

It is shown that the modified bisection method yields maximal redundancy/sample O(1/log n) for n data samples, regardless of the manner in which the approximate bisections are guided.

A grammar-based compression using a variation of Chomsky normal form of context free grammar

M. Arimura

Computer Science

2016 International Symposium on Information…

2016

The proposed method can improve the compression performance of these algorithms through a unified procedure, and has the advantage that the transformation from a given sequence to the grammar is quite simple, using the three-step algorithm through semi-CNF.

36 References

Grammar-based codes: A new class of universal lossless source codes

J. Kieffer, E. Yang

Computer Science

IEEE Trans. Inf. Theory

2000

It is shown that, subject to some mild restrictions, a grammar-based code is a universal code with respect to the family of finite-state information sources over the finite alphabet.

Redundancy of the Lempel-Ziv incremental parsing rule

S. Savari

Computer Science

IEEE Trans. Inf. Theory

1997

It is demonstrated that for unifilar or Markov sources, the redundancy of encoding the first n letters of the source output with the Lempel-Ziv incremental parsing rule, the Welch modification, or a new variant is O(1/ln n), and the exact form of convergence is upper-bound.

Compression of individual sequences via variable-rate coding

J. Ziv, A. Lempel

Computer Science

IEEE Trans. Inf. Theory

1978

The proposed concept of compressibility is shown to play a role analogous to that of entropy in classical information theory where one deals with probabilistic ensembles of sequences rather than with individual sequences.

Universal codeword sets and representations of the integers

P. Elias

Mathematics

IEEE Trans. Inf. Theory

1975

An application is the construction of a uniformly universal sequence of codes for countable memoryless sources, in which the nth code has a ratio of average codeword length to source rate bounded by a function of n for all sources with positive rate.

Redundancy of MPM data compression system

J. Kieffer, E. Yang

Computer Science

Proceedings. 1998 IEEE International Symposium on…

1998

A finite-state information source is losslessly encoded via the multilevel pattern matching data compression system, yielding a pointwise redundancy bound better than the bound established by Plotnik et al. for the 1978 version of the Lempel-Ziv algorithm.

Progressive lossless image coding via self-referential partitions

J. Kieffer, T. Park, Y. Xu, S. Yakowitz

Computer Science

Proceedings 1998 International Conference on…

1998

The progressive image coder is fast and has a worst-case redundancy performance better than the best currently known worst-case redundancy upper bound for the 2-D Lempel-Ziv algorithm.

Arithmetic coding revisited

Alistair Moffat, Radford M. Neal, I. Witten

Computer Science, Mathematics

ACM Trans. Inf. Syst.

1998

A new implementation of arithmetic coding is described that incorporates several improvements over a widely used earlier version by Witten, Neal, and Cleary, which has become a de facto standard. A modular structure that separates the coding, modeling, and probability estimation components of a compression system is also described.

On the average redundancy rate of the Lempel-Ziv code

G. Louchard, W. Szpankowski

Mathematics, Computer Science

IEEE Trans. Inf. Theory

1997

It is proved that for a memoryless source the average redundancy rate attains asymptotically $E r_n = \frac{A + \delta(n)}{\log n} + O\!\left(\frac{\log\log n}{\log^2 n}\right)$, where $A$ is an explicitly given constant that depends on source characteristics, and $\delta(x)$ is a fluctuating function with a small amplitude.

Upper Bounds On The Probability Of Sequences Emitted By Finite-state Sources And On The Redundancy Of The Lempel-Ziv Algorithm

E. Plotnik, M. Weinberger, J. Ziv

Computer Science, Mathematics

Proceedings. 1991 IEEE International Symposium on…

1991

An upper bound on the probability of a sequence drawn from a finite-state source is derived. The bound is given in terms of the number of phrases obtained by parsing the sequence according to the…

Elements of Information Theory

T. Cover, Joy A. Thomas

Mathematics

1991

The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.
