fbpx
Wikipedia

Entropy coding

In information theory, an entropy coding (or entropy encoding) is any lossless data compression method that attempts to approach the lower bound declared by Shannon's source coding theorem, which states that any lossless data compression method must have expected code length greater or equal to the entropy of the source.[1]

More precisely, the source coding theorem states that for any source distribution, the expected code length satisfies , where is the number of symbols in a code word, is the coding function, is the number of symbols used to make output codes and is the probability of the source symbol. An entropy coding attempts to approach this lower bound.

Two of the most common entropy coding techniques are Huffman coding and arithmetic coding.[2] If the approximate entropy characteristics of a data stream are known in advance (especially for signal compression), a simpler static code may be useful. These static codes include universal codes (such as Elias gamma coding or Fibonacci coding) and Golomb codes (such as unary coding or Rice coding).

Since 2014, data compressors have started using the asymmetric numeral systems family of entropy coding techniques, which allows combination of the compression ratio of arithmetic coding with a processing cost similar to Huffman coding.

Entropy as a measure of similarity

Besides using entropy coding as a way to compress digital data, an entropy encoder can also be used to measure the amount of similarity between streams of data and already existing classes of data. This is done by generating an entropy coder/compressor for each class of data; unknown data is then classified by feeding the uncompressed data to each compressor and seeing which compressor yields the highest compression. The coder with the best compression is probably the coder trained on the data that was most similar to the unknown data.

See also

References

  1. ^ Duda, Jarek; Tahboub, Khalid; Gadgil, Neeraj J.; Delp, Edward J. (May 2015). "The use of asymmetric numeral systems as an accurate replacement for Huffman coding". 2015 Picture Coding Symposium (PCS): 65–69. doi:10.1109/PCS.2015.7170048. ISBN 978-1-4799-7783-3. S2CID 20260346.
  2. ^ Huffman, David (1952). "A Method for the Construction of Minimum-Redundancy Codes". Proceedings of the IRE. Institute of Electrical and Electronics Engineers (IEEE). 40 (9): 1098–1101. doi:10.1109/jrproc.1952.273898. ISSN 0096-8390.

External links

entropy, coding, this, article, includes, list, general, references, lacks, sufficient, corresponding, inline, citations, please, help, improve, this, article, introducing, more, precise, citations, december, 2013, learn, when, remove, this, template, message,. This article includes a list of general references but it lacks sufficient corresponding inline citations Please help to improve this article by introducing more precise citations December 2013 Learn how and when to remove this template message In information theory an entropy coding or entropy encoding is any lossless data compression method that attempts to approach the lower bound declared by Shannon s source coding theorem which states that any lossless data compression method must have expected code length greater or equal to the entropy of the source 1 More precisely the source coding theorem states that for any source distribution the expected code length satisfies E x P ℓ d x E x P log b P x displaystyle operatorname E x sim P ell d x geq operatorname E x sim P log b P x where ℓ displaystyle ell is the number of symbols in a code word d displaystyle d is the coding function b displaystyle b is the number of symbols used to make output codes and P displaystyle P is the probability of the source symbol An entropy coding attempts to approach this lower bound Two of the most common entropy coding techniques are Huffman coding and arithmetic coding 2 If the approximate entropy characteristics of a data stream are known in advance especially for signal compression a simpler static code may be useful These static codes include universal codes such as Elias gamma coding or Fibonacci coding and Golomb codes such as unary coding or Rice coding Since 2014 data compressors have started using the asymmetric numeral systems family of entropy coding techniques which allows combination of the compression ratio of arithmetic coding with a processing cost similar to Huffman coding Contents 1 Entropy as a measure of similarity 2 See also 3 References 4 External linksEntropy as a measure of similarity EditBesides using entropy coding as a way to compress digital data an entropy encoder can also be used to measure the amount of similarity between streams of data and already existing classes of data This is done by generating an entropy coder compressor for each class of data unknown data is then classified by feeding the uncompressed data to each compressor and seeing which compressor yields the highest compression The coder with the best compression is probably the coder trained on the data that was most similar to the unknown data See also EditArithmetic coding Asymmetric numeral systems ANS Context adaptive binary arithmetic coding CABAC Huffman coding Range codingReferences Edit Duda Jarek Tahboub Khalid Gadgil Neeraj J Delp Edward J May 2015 The use of asymmetric numeral systems as an accurate replacement for Huffman coding 2015 Picture Coding Symposium PCS 65 69 doi 10 1109 PCS 2015 7170048 ISBN 978 1 4799 7783 3 S2CID 20260346 Huffman David 1952 A Method for the Construction of Minimum Redundancy Codes Proceedings of the IRE Institute of Electrical and Electronics Engineers IEEE 40 9 1098 1101 doi 10 1109 jrproc 1952 273898 ISSN 0096 8390 External links EditInformation Theory Inference and Learning Algorithms by David MacKay 2003 gives an introduction to Shannon theory and data compression including the Huffman coding and arithmetic coding Source Coding by T Wiegand and H Schwarz 2011 Retrieved from https en wikipedia org w index php title Entropy coding amp oldid 1132723947, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.