fbpx
Wikipedia

Audio bit depth

In digital audio using pulse-code modulation (PCM), bit depth is the number of bits of information in each sample, and it directly corresponds to the resolution of each sample. Examples of bit depth include Compact Disc Digital Audio, which uses 16 bits per sample, and DVD-Audio and Blu-ray Disc, which can support up to 24 bits per sample.

An analog signal (in red) encoded to 4-bit PCM digital samples (in blue); the bit depth is four, so each sample's amplitude is one of 16 possible values.

In basic implementations, variations in bit depth primarily affect the noise level from quantization error—thus the signal-to-noise ratio (SNR) and dynamic range. However, techniques such as dithering, noise shaping, and oversampling can mitigate these effects without changing the bit depth. Bit depth also affects bit rate and file size.

Bit depth is useful for describing PCM digital signals. Non-PCM formats, such as those using lossy compression, do not have associated bit depths.[a]

Binary representation edit

A PCM signal is a sequence of digital audio samples containing the data providing the necessary information to reconstruct the original analog signal. Each sample represents the amplitude of the signal at a specific point in time, and the samples are uniformly spaced in time. The amplitude is the only information explicitly stored in the sample, and it is typically stored as either an integer or a floating-point number, encoded as a binary number with a fixed number of digits – the sample's bit depth, also referred to as word length or word size.

The resolution indicates the number of discrete values that can be represented over the range of analog values. The resolution of binary integers increases exponentially as the word length increases: adding one bit doubles the resolution, adding two quadruples it, and so on. The number of possible values that an integer bit depth can represent can be calculated by using 2n, where n is the bit depth.[1] Thus, a 16-bit system has a resolution of 65,536 (216) possible values.

Integer PCM audio data is typically stored as signed numbers in two's complement format.[2]

Today, most audio file formats and digital audio workstations (DAWs) support PCM formats with samples represented by floating-point numbers.[3][4][5][6] Both the WAV file format and the AIFF file format support floating-point representations.[7][8] Unlike integers, whose bit pattern is a single series of bits, a floating-point number is instead composed of separate fields whose mathematical relation forms a number. The most common standard is IEEE 754, which is composed of three fields: a sign bit representing whether the number is positive or negative, a mantissa, and an exponent determining a power-of-two factor to scale the mantissa. The mantissa is expressed as a binary fraction in IEEE base-two floating-point formats.[9]

Quantization edit

The bit depth limits the signal-to-noise ratio (SNR) of the reconstructed signal to a maximum level determined by quantization error. The bit depth has no impact on the frequency response, which is constrained by the sample rate.

Quantization error introduced during analog-to-digital conversion (ADC) can be modeled as quantization noise. It is a rounding error between the analog input voltage to the ADC and the output digitized value. The noise is nonlinear and signal-dependent.

 
An 8-bit binary number (149 in decimal), with the LSB highlighted

In an ideal ADC, where the quantization error is uniformly distributed between   least significant bit (LSB) and where the signal has a uniform distribution covering all quantization levels, the signal-to-quantization-noise ratio (SQNR) can be calculated from

 

where b is the number of quantization bits, and the result is measured in decibels (dB).[10][11]

Therefore, 16-bit digital audio found on CDs has a theoretical maximum SNR of 98 dB, and professional 24-bit digital audio tops out as 146 dB. As of 2011, digital audio converter technology is limited to an SNR of about 123 dB[12][13][14] (effectively 21 bits) because of real-world limitations in integrated circuit design.[b] Still, this approximately matches the performance of the human auditory system.[17][18] Multiple converters can be used to cover different ranges of the same signal, being combined to record a wider dynamic range in the long-term, while still being limited by the single converter's dynamic range in the short term, which is called dynamic range extension.[19][20]

Signal-to-noise ratio and resolution of bit depths (unweighted)
# bits SNR
(audio)
SNR
(video)
Minimum dB step
difference
(quantization
rounding error)
No. of possible values
(per sample)
Range (per sample)
for signed representation
4 25.84 dB 34.31 dB 1.723 dB 16 −8 to +7
8 49.93 dB 58.92 dB 0.1958 dB 256 −128 to +127
11 67.99 dB 77.01 dB 0.03321 dB 2,048 −1,024 to +1,023
12 74.01 dB 83.04 dB 0.01807 dB 4,096 −2,048 to +2,047
16 98.09 dB 107.12 dB 0.001497 dB 65,536 −32,768 to +32,767
18 110.13 dB 0.0004201 dB 262,144 −131,072 to +131,071
20 122.17 dB 0.0001165 dB 1,048,576 −524,288 to +524,287
24 146.26 dB 0.000008717 dB 16,777,216 −8,388,608 to +8,388,607
32 194.42 dB 4.52669593×10−8 dB 4,294,967,296 −2,147,483,648 to
+2,147,483,647
48 290.75 dB 1.03295047×10−12 dB 281,474,976,710,656 −140,737,488,355,328 to
+140,737,488,355,327
64 387.08 dB 2.09836113×10−17 dB 18,446,744,073,709,551,616 −9,223,372,036,854,775,808 to
+9,223,372,036,854,775,807

Floating point edit

The resolution of floating-point samples is less straightforward than integer samples because floating-point values are not evenly spaced. In floating-point representation, the space between any two adjacent values is in proportion to the value.

The trade-off between floating-point and integer formats is that the space between large floating-point values is greater than the space between large integer values of the same bit depth. Rounding a large floating-point number results in a greater error than rounding a small floating-point number whereas rounding an integer number will always result in the same level of error. In other words, integers have a round-off that is uniform, always rounding the LSB to 0 or 1, and the floating-point format has uniform SNR, the quantization noise level is always of a certain proportion to the signal level.[21] A floating-point noise floor rises as the signal rises and falls as the signal falls, resulting in audible variance if the bit depth is low enough.[22]

Audio processing edit

Most processing operations on digital audio involve the re-quantization of samples and thus introduce additional rounding errors analogous to the original quantization error introduced during analog-to-digital conversion. To prevent rounding errors larger than the implicit error during ADC, calculations during processing must be performed at higher precisions than the input samples.[23]

Digital signal processing (DSP) operations can be performed in either fixed-point or floating-point precision. In either case, the precision of each operation is determined by the precision of the hardware operations used to perform each step of the processing and not the resolution of the input data. For example, on x86 processors, floating-point operations are performed with single or double precision, and fixed-point operations at 16-, 32- or 64-bit resolution. Consequently, all processing performed on Intel-based hardware will be performed with these constraints regardless of the source format.[c]

Fixed-point digital signal processors often supports specific word lengths to support specific signal resolutions. For example, the Motorola 56000 DSP chip uses 24-bit multipliers and 56-bit accumulators to perform multiply-accumulate operations on two 24-bit samples without overflow or truncation.[24] On devices that do not support large accumulators, fixed-point results may be truncated, reducing precision. Errors compound through multiple stages of DSP at a rate that depends on the operations being performed. For uncorrelated processing steps on audio data without a DC offset, errors are assumed to be random with zero means. Under this assumption, the standard deviation of the distribution represents the error signal, and quantization error scales with the square root of the number of operations.[25] High levels of precision are necessary for algorithms that involve repeated processing, such as convolution.[23] High levels of precision are also necessary in recursive algorithms, such as infinite impulse response (IIR) filters.[26] In the particular case of IIR filters, rounding error can degrade frequency response and cause instability.[23]

Dither edit

 
Headroom and noise floor at audio process stages for the purpose of comparison with dither level

The noise introduced by quantization error, including rounding errors and loss of precision introduced during audio processing, can be mitigated by adding a small amount of random noise, called dither, to the signal before quantizing. Dithering eliminates non-linear quantization error behavior, giving very low distortion, but at the expense of a slightly raised noise floor. Recommended dither for 16-bit digital audio measured using ITU-R 468 noise weighting is about 66 dB below alignment level, or 84 dB below digital full scale, which is comparable to the microphone and room noise level, and hence of little consequence in 16-bit audio.

24-bit and 32-bit audio does not require dithering, as the noise level of the digital converter is always louder than the required level of any dither that might be applied. 24-bit audio could theoretically encode 144 dB of dynamic range, and 32-bit audio can achieve 192 dB, but this is almost impossible to achieve in the real world, as even the best sensors and microphones rarely exceed 130 dB.[27]

Dither can also be used to increase the effective dynamic range. The perceived dynamic range of 16-bit audio can be 120 dB or more with noise-shaped dither, taking advantage of the frequency response of the human ear.[28][29]

Dynamic range and headroom edit

Dynamic range is the difference between the largest and smallest signal a system can record or reproduce. Without dither, the dynamic range correlates to the quantization noise floor. For example, 16-bit integer resolution allows for a dynamic range of about 96 dB. With the proper application of dither, digital systems can reproduce signals with levels lower than their resolution would normally allow, extending the effective dynamic range beyond the limit imposed by the resolution.[30] The use of techniques such as oversampling and noise shaping can further extend the dynamic range of sampled audio by moving quantization error out of the frequency band of interest.

If the signal's maximum level is lower than that allowed by the bit depth, the recording has headroom. Using higher bit depths during studio recording can make headroom available while maintaining the same dynamic range. This reduces the risk of clipping without increasing quantization errors at low volumes.

Oversampling edit

Oversampling is an alternative method to increase the dynamic range of PCM audio without changing the number of bits per sample.[31] In oversampling, audio samples are acquired at a multiple of the desired sample rate. Because quantization error is assumed to be uniformly distributed with frequency, much of the quantization error is shifted to ultrasonic frequencies and can be removed by the digital-to-analog converter during playback.

For an increase equivalent to n additional bits of resolution, a signal must be oversampled by

 

For example, a 14-bit ADC can produce 16-bit 48 kHz audio if operated at 16× oversampling, or 768 kHz. Oversampled PCM, therefore, exchanges fewer bits per sample for more samples to obtain the same resolution.

Dynamic range can also be enhanced with oversampling at signal reconstruction, absent oversampling at the source. Consider 16× oversampling at reconstruction. Each sample at reconstruction would be unique in that for each of the original sample points sixteen are inserted, all having been calculated by a digital reconstruction filter. The mechanism of increased effective bit depth is as previously discussed, that is, quantization noise power has not been reduced, but the noise spectrum has been spread over 16× the audio bandwidth.

Historical note—The compact disc standard was developed by a collaboration between Sony and Philips. The first Sony consumer unit featured a 16-bit DAC; the first Philips units had dual 14-bit DACs. This confused the marketplace and even in professional circles, because 14-bit PCM allows for 84 dB SNR, 12 dB less than 16-bit PCM. Philips had implemented 4× oversampling with first order noise shaping which theoretically realized the full 96 dB dynamic range of the CD format.[32] In practice the Philips CD100 was rated at 90 dB SNR in the audio band of 20 Hz–20 kHz, the same as Sony's CDP-101.[33][34]

Noise shaping edit

Oversampling a signal results in equal quantization noise per unit of bandwidth at all frequencies and a dynamic range that improves with only the square root of the oversampling ratio. Noise shaping is a technique that adds additional noise at higher frequencies which cancels out some error at lower frequencies, resulting in a larger increase in dynamic range when oversampling. For nth-order noise shaping, the dynamic range of an oversampled signal is improved by an additional 6n dB relative to oversampling without noise shaping.[35] For example, for a 20 kHz analog audio sampled at 4× oversampling with second-order noise shaping, the dynamic range is increased by 30 dB. Therefore, a 16-bit signal sampled at 176 kHz would have a bit depth equal to a 21-bit signal sampled at 44.1 kHz without noise shaping.

Noise shaping is commonly implemented with delta-sigma modulation. Using delta-sigma modulation, Direct Stream Digital achieves a theoretical 120 dB SNR at audio frequencies using 1-bit audio with 64× oversampling.

Applications edit

Bit depth is a fundamental property of digital audio implementations. Depending on application requirements and equipment capabilities, different bit depths are used for different applications.

Example applications and supported audio bit depth
Application Description Audio format(s)
CD-DA (Red Book)[36] Digital media 16-bit LPCM
DVD-Audio[37] Digital media 16-, 20- and 24-bit LPCM[A]
Super Audio CD[38] Digital media 1-bit Direct Stream Digital (PDM)
Blu-ray Disc audio[39] Digital media 16-, 20- and 24-bit LPCM and others[B]
DV audio[40] Digital media 12- and 16-bit uncompressed PCM
ITU-T Recommendation G.711[41] Compression standard for telephony 8-bit PCM with companding[C]
NICAM-1, NICAM-2 and NICAM-3[42] Compression standards for broadcasting 10-, 11- and 10-bit PCM respectively, with companding[D]
Ardour DAW by Paul Davis and the Ardour Community 32-bit floating point[43]
Pro Tools 11 DAW by Avid Technology 16- and 24-bit or 32-bit floating point sessions and 64-bit floating point mixing[44]
Logic Pro X DAW by Apple Inc. 16- and 24-bit projects and 32-bit or 64-bit floating point mixing[45]
Cubase DAW by Steinberg Allows audio processing precision to 32-bit float or 64-bit float [46]
Ableton Live[6] DAW by Ableton 32-bit floating point bit depth and 64-bit summing
Reason 7 DAW by Propellerhead Software 16-, 20- and 24-bit I/O, 32-bit floating point arithmetic and 64-bit summing[47]
Reaper 5 DAW by Cockos Inc. 8-bit PCM, 16-bit PCM, 24-bit PCM, 32-bit PCM, 32-bit FP, 64-bit FP, 4-bit IMA ADPCM & 2-bit cADPCM rendering;

8-bit int, 16-bit int, 24-bit int, 32-bit int, 32-bit float, and 64-bit float mixing

GarageBand '11 (version 6) DAW by Apple Inc. 16-bit default with 24-bit real instrument recording[48]
Audacity Open source audio editor 16- and 24-bit LPCM and 32-bit floating point[49]
FL Studio DAW by Image-Line 16- and 24-bit int and 32-bit floating point (controlled by OS)[50]
  1. ^ DVD-Audio also supports optional Meridian Lossless Packing, a lossless compression scheme.
  2. ^ Blu-ray supports a variety of non-LPCM formats but all conform to some combination of 16, 20, or 24 bits per sample.
  3. ^ ITU-T specifies the A-law and μ-law companding algorithms, compressing down from 13 and 14 bits respectively.
  4. ^ NICAM systems 1, 2 and 3 compress down from 13, 14 and 14 bits respectively.

Bit rate and file size edit

Bit depth affects bit rate and file size. Bits are the basic unit of data used in computing and digital communications. Bit rate refers to the amount of data, specifically bits, transmitted or received per second. In MP3 and other lossy compressed audio formats, bit rate describes the amount of information used to encode an audio signal. It is usually measured in kb/s.[51]

See also edit

Notes edit

  1. ^ For example, in MP3, quantization is performed on the frequency-domain representation of the signal, not on the time-domain samples relevant to bit depth.
  2. ^ While 32-bit converters exist, they are purely for marketing purposes and provide no practical benefit over 24-bit converters; the extra bits are either zero or encode only noise.[15][16]
  3. ^ Intel and AMD x86 hardware can handle higher precision than 64 bits, or even arbitrarily large floating-point numbers or integers, but processing takes a lot longer than the native types.

References edit

  1. ^ Thompson, Dan (2005). Understanding Audio. Berklee Press. ISBN 978-0-634-00959-4.
  2. ^ Smith, Julius (2007). "Pulse Code Modulation (PCM)". Mathematics of the Discrete Fourier Transform (DFT) with Audio Applications, Second Edition, online book. Retrieved 22 October 2012.
  3. ^ Campbell, Robert (2013). Pro Tools 10 Advanced Music Production Techniques, pg. 247. Cengage Learning. ISBN 978-1133728016. Retrieved 12 August 2013.
  4. ^ Wherry, Mark (March 2012). "Avid Pro Tools 10". Sound On Sound. Retrieved 10 August 2013.
  5. ^ Price, Simon (October 2005). "Reason Mixing Masterclass". Sound On Sound. Retrieved 10 August 2013.
  6. ^ a b "Ableton Reference Manual Version 10, 32. Audio Fact Sheet". Ableton. 2019. Retrieved 3 September 2019.
  7. ^ Kabal, Peter (3 January 2011). "Audio File Format Specifications, WAVE Specifications". McGill University. Retrieved 10 August 2013.
  8. ^ Kabal, Peter (3 January 2011). "Audio File Format Specifications, AIFF / AIFF-C Specifications". McGill University. Retrieved 10 August 2013.
  9. ^ Smith, Steven (1997–98). "The Scientist and Engineer's Guide to Digital Signal Processing, Chapter 4 – DSP Software / Floating Point (Real Numbers)". www.dspguide.com. Retrieved 10 August 2013.
  10. ^ See Signal-to-noise ratio § Fixed point.
  11. ^ Kester, Walt (2007). "Taking the Mystery out of the Infamous Formula, "SNR = 6.02N + 1.76dB," and Why You Should Care" (PDF). Analog Devices. Retrieved 26 July 2011.
  12. ^ Nwavguy (6 September 2011). "NwAvGuy: Noise & Dynamic Range". NwAvGuy. Retrieved 2 December 2016. 24-bit DACs often only manage approximately 16-bit performance and the very best reach 21-bit (ENOB) performance
  13. ^ "PCM4222". Retrieved 21 April 2011. Dynamic Range (−60 dB input, A-weighted): 124 dB typical Dynamic Range (−60 dB input, 20 kHz bandwidth): 122 dB typical
  14. ^ "WM8741 : High-Performance Stereo DAC". Cirrus Logic. Retrieved 2 December 2016. 128dB SNR ('A'-weighted mono @ 48 kHz) 123 dB SNR (non-weighted stereo @ 48 kHz)
  15. ^ "The great audio myth: why you don't need that 32-bit DAC". Android Authority. 19 January 2016. Retrieved 2 December 2016. So your 32-bit DAC is only ever going to be able to output at most 21 bits of useful data, and the other bits will be masked by circuit noise.
  16. ^ "32-bit capable DACs". hydrogenaud.io. Retrieved 2 December 2016. all the '32-bit capable' DAC chips existent today have an actual resolution less than 24 bit.
  17. ^ D. R. Campbell. (PDF). Archived from the original (PDF) on 21 August 2011. Retrieved 21 April 2011. The dynamic range of human hearing is [approximately] 120 dB
  18. ^ "Sensitivity of Human Ear". from the original on 4 June 2011. Retrieved 21 April 2011. The practical dynamic range could be said to be from the threshold of hearing to the threshold of pain [130 dB]
  19. ^ US6317065B1, "Multiple A to D converters for enhanced dynamic range", issued 1999-07-01 
  20. ^ Christodoulou, Lakis; Lane, John; Kasparis, Takis (1 March 2010). "Dynamic range extension using multiple A/D converters". 2010 4th International Symposium on Communications, Control and Signal Processing (ISCCSP). pp. 1–4. doi:10.1109/ISCCSP.2010.5463427. ISBN 978-1-4244-6285-8. S2CID 16501096.
  21. ^ Smith, Steven (1997–1998). "The Scientist and Engineer's Guide to Digital Signal Processing, Chapter 28 – Digital Signal Processors / Fixed versus Floating Point". www.dspguide.com. Retrieved 10 August 2013.
  22. ^ Moorer, James (September 1999). (PDF). www.jamminpower.com. Archived from the original (PDF) on 14 February 2019. Retrieved 12 August 2013.
  23. ^ a b c Tomarakos, John. "Relationship of Data Word Size to Dynamic Range and Signal Quality in Digital Audio Processing Applications". www.analog.com. Analog Devices. Retrieved 16 August 2013.
  24. ^ "DSP56001A" (PDF). Freescale. Retrieved 15 August 2013.
  25. ^ Smith, Steven (1997–1998). "The Scientist and Engineer's Guide to Digital Signal Processing, Chapter 4 – DSP Software / Number Precision". Retrieved 19 August 2013.
  26. ^ Carletta, Joan (2003). "Determining appropriate precisions for signals in fixed-point IIR filters". Proceedings of the 40th annual Design Automation Conference. pp. 656–661. CiteSeerX 10.1.1.92.1266. doi:10.1145/775832.775998. ISBN 1581136889. S2CID 15615715.
  27. ^ Choosing a high-performance audio ADC, 14 September 2011, retrieved 7 May 2019
  28. ^ Montgomery, Chris (25 March 2012). . xiph.org. Archived from the original on 7 July 2013. Retrieved 26 May 2013. With use of shaped dither, which moves quantization noise energy into frequencies where it's harder to hear, the effective dynamic range of 16-bit audio reaches 120dB in practice, more than fifteen times deeper than the 96dB claim. 120dB is greater than the difference between a mosquito somewhere in the same room and a jackhammer a foot away.... or the difference between a deserted 'soundproof' room and a sound loud enough to cause hearing damage in seconds. 16 bits is enough to store all we can hear and will be enough forever.
  29. ^ Stuart, J. Robert (1997). (PDF). Meridian Audio Ltd. Archived from the original (PDF) on 7 April 2016. Retrieved 25 February 2016. One of the great discoveries in PCM was that by adding a small random noise (that we call dither) the truncation effect can disappear. Even more important was the realization that there is a right sort of random noise to add and that when the right dither is used, the resolution of the digital system becomes infinite.
  30. ^ (PDF). e2v Semiconductors. 2007. Archived from the original (PDF) on 4 October 2011. Retrieved 26 July 2011.
  31. ^ Kester, Walt. (PDF). Analog Devices. Archived from the original (PDF) on 19 May 2012. Retrieved 19 August 2013.
  32. ^ "The history of the CD". philips.com. Retrieved 7 October 2020.
  33. ^ "Philips CD100". hifiengine.
  34. ^ "Sony CDP-101". hifiengine.
  35. ^ "B.1 First and Second-Order Noise Shaping Loops". Retrieved 19 August 2013.
  36. ^ "Sweetwater Knowledge Base, Masterlink: What is a "Red Book" CD?". www.sweetwater.com. Sweetwater. 27 April 2007. Retrieved 25 August 2013.
  37. ^ (PDF). Sonic Solutions. Archived from the original (PDF) on 4 March 2012. Retrieved 25 August 2013.
  38. ^ Shapiro, L. (2 July 2001). "Surround Sound, Page 10". ExtremeTech. Retrieved 26 August 2013.
  39. ^ (PDF). Blu-ray Disc Association. April 2010. Archived from the original (PDF) on 8 July 2011. Retrieved 25 August 2013.
  40. ^ Puhovski, Nenad (April 2000). . www.stanford.edu. Archived from the original on 27 October 2004. Retrieved 26 August 2013.
  41. ^ "G.711 : Pulse code modulation (PCM) of voice frequencies" (PDF). International Telecommunication Union. Retrieved 25 August 2013.
  42. ^ (PDF). BBC Research Department. August 1978. Archived from the original (PDF) on 8 November 2012. Retrieved 26 August 2013.
  43. ^ . Ardour Community. 2014. Archived from the original on 8 April 2014. Retrieved 8 April 2014.
  44. ^ "Pro Tools Documentation, Pro Tools Reference Guide" (ZIP/PDF). Avid. 2013. Retrieved 26 August 2013.
  45. ^ "Logic Pro X: User Guide" (PDF). Apple. January 2010. Retrieved 26 August 2013.[permanent dead link]
  46. ^ "Cubase Pro 10.5 Manual" (PDF). Steinberg. 2020. Retrieved 2 September 2020.
  47. ^ (PDF). Propellerhead Software. 2013. Archived from the original (PDF) on 24 May 2013. Retrieved 26 August 2013.
  48. ^ "GarageBand '11: Set the audio resolution". Apple. 13 March 2012. Retrieved 26 August 2013.
  49. ^ "Audacity: Features". wiki.audacityteam.com. Audacity development team. Retrieved 13 September 2014.
  50. ^ "Audio Settings". www.image-line.com. Retrieved 12 February 2019.
  51. ^ "Sample Rate, Bit-Depth & Bitrate | Exclusivemusicplus". Exclusivemusicplus. 26 October 2018. Retrieved 30 November 2018.
  • Ken C. Pohlmann (15 February 2000). Principles of Digital Audio (4th ed.). McGraw-Hill Professional. ISBN 978-0-07-134819-5.

audio, depth, other, uses, music, chiptune, digital, audio, using, pulse, code, modulation, depth, number, bits, information, each, sample, directly, corresponds, resolution, each, sample, examples, depth, include, compact, disc, digital, audio, which, uses, b. For other uses of 8 bit music see chiptune In digital audio using pulse code modulation PCM bit depth is the number of bits of information in each sample and it directly corresponds to the resolution of each sample Examples of bit depth include Compact Disc Digital Audio which uses 16 bits per sample and DVD Audio and Blu ray Disc which can support up to 24 bits per sample An analog signal in red encoded to 4 bit PCM digital samples in blue the bit depth is four so each sample s amplitude is one of 16 possible values In basic implementations variations in bit depth primarily affect the noise level from quantization error thus the signal to noise ratio SNR and dynamic range However techniques such as dithering noise shaping and oversampling can mitigate these effects without changing the bit depth Bit depth also affects bit rate and file size Bit depth is useful for describing PCM digital signals Non PCM formats such as those using lossy compression do not have associated bit depths a Contents 1 Binary representation 2 Quantization 3 Floating point 4 Audio processing 5 Dither 6 Dynamic range and headroom 6 1 Oversampling 6 2 Noise shaping 7 Applications 8 Bit rate and file size 9 See also 10 Notes 11 ReferencesBinary representation editA PCM signal is a sequence of digital audio samples containing the data providing the necessary information to reconstruct the original analog signal Each sample represents the amplitude of the signal at a specific point in time and the samples are uniformly spaced in time The amplitude is the only information explicitly stored in the sample and it is typically stored as either an integer or a floating point number encoded as a binary number with a fixed number of digits the sample s bit depth also referred to as word length or word size The resolution indicates the number of discrete values that can be represented over the range of analog values The resolution of binary integers increases exponentially as the word length increases adding one bit doubles the resolution adding two quadruples it and so on The number of possible values that an integer bit depth can represent can be calculated by using 2n where n is the bit depth 1 Thus a 16 bit system has a resolution of 65 536 216 possible values Integer PCM audio data is typically stored as signed numbers in two s complement format 2 Today most audio file formats and digital audio workstations DAWs support PCM formats with samples represented by floating point numbers 3 4 5 6 Both the WAV file format and the AIFF file format support floating point representations 7 8 Unlike integers whose bit pattern is a single series of bits a floating point number is instead composed of separate fields whose mathematical relation forms a number The most common standard is IEEE 754 which is composed of three fields a sign bit representing whether the number is positive or negative a mantissa and an exponent determining a power of two factor to scale the mantissa The mantissa is expressed as a binary fraction in IEEE base two floating point formats 9 Quantization editThe bit depth limits the signal to noise ratio SNR of the reconstructed signal to a maximum level determined by quantization error The bit depth has no impact on the frequency response which is constrained by the sample rate Quantization error introduced during analog to digital conversion ADC can be modeled as quantization noise It is a rounding error between the analog input voltage to the ADC and the output digitized value The noise is nonlinear and signal dependent nbsp An 8 bit binary number 149 in decimal with the LSB highlighted In an ideal ADC where the quantization error is uniformly distributed between 1 2 displaystyle scriptstyle pm frac 1 2 nbsp least significant bit LSB and where the signal has a uniform distribution covering all quantization levels the signal to quantization noise ratio SQNR can be calculated from SQNR 20 log 10 1 5 2 b 1 76 6 02 b dB displaystyle text SQNR 20 log 10 sqrt 1 5 cdot 2 b approx 1 76 6 02 b text dB nbsp where b is the number of quantization bits and the result is measured in decibels dB 10 11 Therefore 16 bit digital audio found on CDs has a theoretical maximum SNR of 98 dB and professional 24 bit digital audio tops out as 146 dB As of 2011 update digital audio converter technology is limited to an SNR of about 123 dB 12 13 14 effectively 21 bits because of real world limitations in integrated circuit design b Still this approximately matches the performance of the human auditory system 17 18 Multiple converters can be used to cover different ranges of the same signal being combined to record a wider dynamic range in the long term while still being limited by the single converter s dynamic range in the short term which is called dynamic range extension 19 20 Signal to noise ratio and resolution of bit depths unweighted bits SNR audio SNR video Minimum dB step difference quantization rounding error No of possible values per sample Range per sample for signed representation 4 25 84 dB 34 31 dB 1 723 dB 16 8 to 7 8 49 93 dB 58 92 dB 0 1958 dB 256 128 to 127 11 67 99 dB 77 01 dB 0 03321 dB 2 048 1 024 to 1 023 12 74 01 dB 83 04 dB 0 01807 dB 4 096 2 048 to 2 047 16 98 09 dB 107 12 dB 0 001497 dB 65 536 32 768 to 32 767 18 110 13 dB 0 0004201 dB 262 144 131 072 to 131 071 20 122 17 dB 0 0001165 dB 1 048 576 524 288 to 524 287 24 146 26 dB 0 000008717 dB 16 777 216 8 388 608 to 8 388 607 32 194 42 dB 4 52669593 10 8 dB 4 294 967 296 2 147 483 648 to 2 147 483 647 48 290 75 dB 1 03295047 10 12 dB 281 474 976 710 656 140 737 488 355 328 to 140 737 488 355 327 64 387 08 dB 2 09836113 10 17 dB 18 446 744 073 709 551 616 9 223 372 036 854 775 808 to 9 223 372 036 854 775 807Floating point editThe resolution of floating point samples is less straightforward than integer samples because floating point values are not evenly spaced In floating point representation the space between any two adjacent values is in proportion to the value The trade off between floating point and integer formats is that the space between large floating point values is greater than the space between large integer values of the same bit depth Rounding a large floating point number results in a greater error than rounding a small floating point number whereas rounding an integer number will always result in the same level of error In other words integers have a round off that is uniform always rounding the LSB to 0 or 1 and the floating point format has uniform SNR the quantization noise level is always of a certain proportion to the signal level 21 A floating point noise floor rises as the signal rises and falls as the signal falls resulting in audible variance if the bit depth is low enough 22 Audio processing editMost processing operations on digital audio involve the re quantization of samples and thus introduce additional rounding errors analogous to the original quantization error introduced during analog to digital conversion To prevent rounding errors larger than the implicit error during ADC calculations during processing must be performed at higher precisions than the input samples 23 Digital signal processing DSP operations can be performed in either fixed point or floating point precision In either case the precision of each operation is determined by the precision of the hardware operations used to perform each step of the processing and not the resolution of the input data For example on x86 processors floating point operations are performed with single or double precision and fixed point operations at 16 32 or 64 bit resolution Consequently all processing performed on Intel based hardware will be performed with these constraints regardless of the source format c Fixed point digital signal processors often supports specific word lengths to support specific signal resolutions For example the Motorola 56000 DSP chip uses 24 bit multipliers and 56 bit accumulators to perform multiply accumulate operations on two 24 bit samples without overflow or truncation 24 On devices that do not support large accumulators fixed point results may be truncated reducing precision Errors compound through multiple stages of DSP at a rate that depends on the operations being performed For uncorrelated processing steps on audio data without a DC offset errors are assumed to be random with zero means Under this assumption the standard deviation of the distribution represents the error signal and quantization error scales with the square root of the number of operations 25 High levels of precision are necessary for algorithms that involve repeated processing such as convolution 23 High levels of precision are also necessary in recursive algorithms such as infinite impulse response IIR filters 26 In the particular case of IIR filters rounding error can degrade frequency response and cause instability 23 Dither edit nbsp Headroom and noise floor at audio process stages for the purpose of comparison with dither level The noise introduced by quantization error including rounding errors and loss of precision introduced during audio processing can be mitigated by adding a small amount of random noise called dither to the signal before quantizing Dithering eliminates non linear quantization error behavior giving very low distortion but at the expense of a slightly raised noise floor Recommended dither for 16 bit digital audio measured using ITU R 468 noise weighting is about 66 dB below alignment level or 84 dB below digital full scale which is comparable to the microphone and room noise level and hence of little consequence in 16 bit audio 24 bit and 32 bit audio does not require dithering as the noise level of the digital converter is always louder than the required level of any dither that might be applied 24 bit audio could theoretically encode 144 dB of dynamic range and 32 bit audio can achieve 192 dB but this is almost impossible to achieve in the real world as even the best sensors and microphones rarely exceed 130 dB 27 Dither can also be used to increase the effective dynamic range The perceived dynamic range of 16 bit audio can be 120 dB or more with noise shaped dither taking advantage of the frequency response of the human ear 28 29 Dynamic range and headroom editDynamic range is the difference between the largest and smallest signal a system can record or reproduce Without dither the dynamic range correlates to the quantization noise floor For example 16 bit integer resolution allows for a dynamic range of about 96 dB With the proper application of dither digital systems can reproduce signals with levels lower than their resolution would normally allow extending the effective dynamic range beyond the limit imposed by the resolution 30 The use of techniques such as oversampling and noise shaping can further extend the dynamic range of sampled audio by moving quantization error out of the frequency band of interest If the signal s maximum level is lower than that allowed by the bit depth the recording has headroom Using higher bit depths during studio recording can make headroom available while maintaining the same dynamic range This reduces the risk of clipping without increasing quantization errors at low volumes Oversampling edit Main article Oversampling Oversampling is an alternative method to increase the dynamic range of PCM audio without changing the number of bits per sample 31 In oversampling audio samples are acquired at a multiple of the desired sample rate Because quantization error is assumed to be uniformly distributed with frequency much of the quantization error is shifted to ultrasonic frequencies and can be removed by the digital to analog converter during playback For an increase equivalent to n additional bits of resolution a signal must be oversampled by n u m b e r o f s a m p l e s 2 n 2 2 2 n displaystyle mathrm number of samples 2 n 2 2 2n nbsp For example a 14 bit ADC can produce 16 bit 48 kHz audio if operated at 16 oversampling or 768 kHz Oversampled PCM therefore exchanges fewer bits per sample for more samples to obtain the same resolution Dynamic range can also be enhanced with oversampling at signal reconstruction absent oversampling at the source Consider 16 oversampling at reconstruction Each sample at reconstruction would be unique in that for each of the original sample points sixteen are inserted all having been calculated by a digital reconstruction filter The mechanism of increased effective bit depth is as previously discussed that is quantization noise power has not been reduced but the noise spectrum has been spread over 16 the audio bandwidth Historical note The compact disc standard was developed by a collaboration between Sony and Philips The first Sony consumer unit featured a 16 bit DAC the first Philips units had dual 14 bit DACs This confused the marketplace and even in professional circles because 14 bit PCM allows for 84 dB SNR 12 dB less than 16 bit PCM Philips had implemented 4 oversampling with first order noise shaping which theoretically realized the full 96 dB dynamic range of the CD format 32 In practice the Philips CD100 was rated at 90 dB SNR in the audio band of 20 Hz 20 kHz the same as Sony s CDP 101 33 34 Noise shaping edit Main article Noise shaping Oversampling a signal results in equal quantization noise per unit of bandwidth at all frequencies and a dynamic range that improves with only the square root of the oversampling ratio Noise shaping is a technique that adds additional noise at higher frequencies which cancels out some error at lower frequencies resulting in a larger increase in dynamic range when oversampling For nth order noise shaping the dynamic range of an oversampled signal is improved by an additional 6n dB relative to oversampling without noise shaping 35 For example for a 20 kHz analog audio sampled at 4 oversampling with second order noise shaping the dynamic range is increased by 30 dB Therefore a 16 bit signal sampled at 176 kHz would have a bit depth equal to a 21 bit signal sampled at 44 1 kHz without noise shaping Noise shaping is commonly implemented with delta sigma modulation Using delta sigma modulation Direct Stream Digital achieves a theoretical 120 dB SNR at audio frequencies using 1 bit audio with 64 oversampling Applications editBit depth is a fundamental property of digital audio implementations Depending on application requirements and equipment capabilities different bit depths are used for different applications Example applications and supported audio bit depth Application Description Audio format s CD DA Red Book 36 Digital media 16 bit LPCM DVD Audio 37 Digital media 16 20 and 24 bit LPCM A Super Audio CD 38 Digital media 1 bit Direct Stream Digital PDM Blu ray Disc audio 39 Digital media 16 20 and 24 bit LPCM and others B DV audio 40 Digital media 12 and 16 bit uncompressed PCM ITU T Recommendation G 711 41 Compression standard for telephony 8 bit PCM with companding C NICAM 1 NICAM 2 and NICAM 3 42 Compression standards for broadcasting 10 11 and 10 bit PCM respectively with companding D Ardour DAW by Paul Davis and the Ardour Community 32 bit floating point 43 Pro Tools 11 DAW by Avid Technology 16 and 24 bit or 32 bit floating point sessions and 64 bit floating point mixing 44 Logic Pro X DAW by Apple Inc 16 and 24 bit projects and 32 bit or 64 bit floating point mixing 45 Cubase DAW by Steinberg Allows audio processing precision to 32 bit float or 64 bit float 46 Ableton Live 6 DAW by Ableton 32 bit floating point bit depth and 64 bit summing Reason 7 DAW by Propellerhead Software 16 20 and 24 bit I O 32 bit floating point arithmetic and 64 bit summing 47 Reaper 5 DAW by Cockos Inc 8 bit PCM 16 bit PCM 24 bit PCM 32 bit PCM 32 bit FP 64 bit FP 4 bit IMA ADPCM amp 2 bit cADPCM rendering 8 bit int 16 bit int 24 bit int 32 bit int 32 bit float and 64 bit float mixing GarageBand 11 version 6 DAW by Apple Inc 16 bit default with 24 bit real instrument recording 48 Audacity Open source audio editor 16 and 24 bit LPCM and 32 bit floating point 49 FL Studio DAW by Image Line 16 and 24 bit int and 32 bit floating point controlled by OS 50 DVD Audio also supports optional Meridian Lossless Packing a lossless compression scheme Blu ray supports a variety of non LPCM formats but all conform to some combination of 16 20 or 24 bits per sample ITU T specifies the A law and m law companding algorithms compressing down from 13 and 14 bits respectively NICAM systems 1 2 and 3 compress down from 13 14 and 14 bits respectively Bit rate and file size editBit depth affects bit rate and file size Bits are the basic unit of data used in computing and digital communications Bit rate refers to the amount of data specifically bits transmitted or received per second In MP3 and other lossy compressed audio formats bit rate describes the amount of information used to encode an audio signal It is usually measured in kb s 51 See also editAudio system measurements Color depth the corresponding concept for digital images Effective number of bitsNotes edit For example in MP3 quantization is performed on the frequency domain representation of the signal not on the time domain samples relevant to bit depth While 32 bit converters exist they are purely for marketing purposes and provide no practical benefit over 24 bit converters the extra bits are either zero or encode only noise 15 16 Intel and AMD x86 hardware can handle higher precision than 64 bits or even arbitrarily large floating point numbers or integers but processing takes a lot longer than the native types References edit Thompson Dan 2005 Understanding Audio Berklee Press ISBN 978 0 634 00959 4 Smith Julius 2007 Pulse Code Modulation PCM Mathematics of the Discrete Fourier Transform DFT with Audio Applications Second Edition online book Retrieved 22 October 2012 Campbell Robert 2013 Pro Tools 10 Advanced Music Production Techniques pg 247 Cengage Learning ISBN 978 1133728016 Retrieved 12 August 2013 Wherry Mark March 2012 Avid Pro Tools 10 Sound On Sound Retrieved 10 August 2013 Price Simon October 2005 Reason Mixing Masterclass Sound On Sound Retrieved 10 August 2013 a b Ableton Reference Manual Version 10 32 Audio Fact Sheet Ableton 2019 Retrieved 3 September 2019 Kabal Peter 3 January 2011 Audio File Format Specifications WAVE Specifications McGill University Retrieved 10 August 2013 Kabal Peter 3 January 2011 Audio File Format Specifications AIFF AIFF C Specifications McGill University Retrieved 10 August 2013 Smith Steven 1997 98 The Scientist and Engineer s Guide to Digital Signal Processing Chapter 4 DSP Software Floating Point Real Numbers www dspguide com Retrieved 10 August 2013 See Signal to noise ratio Fixed point Kester Walt 2007 Taking the Mystery out of the Infamous Formula SNR 6 02N 1 76dB and Why You Should Care PDF Analog Devices Retrieved 26 July 2011 Nwavguy 6 September 2011 NwAvGuy Noise amp Dynamic Range NwAvGuy Retrieved 2 December 2016 24 bit DACs often only manage approximately 16 bit performance and the very best reach 21 bit ENOB performance PCM4222 Retrieved 21 April 2011 Dynamic Range 60 dB input A weighted 124 dB typical Dynamic Range 60 dB input 20 kHz bandwidth 122 dB typical WM8741 High Performance Stereo DAC Cirrus Logic Retrieved 2 December 2016 128dB SNR A weighted mono 48 kHz 123 dB SNR non weighted stereo 48 kHz The great audio myth why you don t need that 32 bit DAC Android Authority 19 January 2016 Retrieved 2 December 2016 So your 32 bit DAC is only ever going to be able to output at most 21 bits of useful data and the other bits will be masked by circuit noise 32 bit capable DACs hydrogenaud io Retrieved 2 December 2016 all the 32 bit capable DAC chips existent today have an actual resolution less than 24 bit D R Campbell Aspects of Human Hearing PDF Archived from the original PDF on 21 August 2011 Retrieved 21 April 2011 The dynamic range of human hearing is approximately 120 dB Sensitivity of Human Ear Archived from the original on 4 June 2011 Retrieved 21 April 2011 The practical dynamic range could be said to be from the threshold of hearing to the threshold of pain 130 dB US6317065B1 Multiple A to D converters for enhanced dynamic range issued 1999 07 01 Christodoulou Lakis Lane John Kasparis Takis 1 March 2010 Dynamic range extension using multiple A D converters 2010 4th International Symposium on Communications Control and Signal Processing ISCCSP pp 1 4 doi 10 1109 ISCCSP 2010 5463427 ISBN 978 1 4244 6285 8 S2CID 16501096 Smith Steven 1997 1998 The Scientist and Engineer s Guide to Digital Signal Processing Chapter 28 Digital Signal Processors Fixed versus Floating Point www dspguide com Retrieved 10 August 2013 Moorer James September 1999 48 Bit Integer Processing Beats 32 Bit Floating Point for Professional Audio Applications PDF www jamminpower com Archived from the original PDF on 14 February 2019 Retrieved 12 August 2013 a b c Tomarakos John Relationship of Data Word Size to Dynamic Range and Signal Quality in Digital Audio Processing Applications www analog com Analog Devices Retrieved 16 August 2013 DSP56001A PDF Freescale Retrieved 15 August 2013 Smith Steven 1997 1998 The Scientist and Engineer s Guide to Digital Signal Processing Chapter 4 DSP Software Number Precision Retrieved 19 August 2013 Carletta Joan 2003 Determining appropriate precisions for signals in fixed point IIR filters Proceedings of the 40th annual Design Automation Conference pp 656 661 CiteSeerX 10 1 1 92 1266 doi 10 1145 775832 775998 ISBN 1581136889 S2CID 15615715 Choosing a high performance audio ADC 14 September 2011 retrieved 7 May 2019 Montgomery Chris 25 March 2012 24 192 Music Downloads and why they make no sense xiph org Archived from the original on 7 July 2013 Retrieved 26 May 2013 With use of shaped dither which moves quantization noise energy into frequencies where it s harder to hear the effective dynamic range of 16 bit audio reaches 120dB in practice more than fifteen times deeper than the 96dB claim 120dB is greater than the difference between a mosquito somewhere in the same room and a jackhammer a foot away or the difference between a deserted soundproof room and a sound loud enough to cause hearing damage in seconds 16 bits is enough to store all we can hear and will be enough forever Stuart J Robert 1997 Coding High Quality Digital Audio PDF Meridian Audio Ltd Archived from the original PDF on 7 April 2016 Retrieved 25 February 2016 One of the great discoveries in PCM was that by adding a small random noise that we call dither the truncation effect can disappear Even more important was the realization that there is a right sort of random noise to add and that when the right dither is used the resolution of the digital system becomes infinite Dithering in Analog to Digital Conversion PDF e2v Semiconductors 2007 Archived from the original PDF on 4 October 2011 Retrieved 26 July 2011 Kester Walt Oversampling Interpolating DACs PDF Analog Devices Archived from the original PDF on 19 May 2012 Retrieved 19 August 2013 The history of the CD philips com Retrieved 7 October 2020 Philips CD100 hifiengine Sony CDP 101 hifiengine B 1 First and Second Order Noise Shaping Loops Retrieved 19 August 2013 Sweetwater Knowledge Base Masterlink What is a Red Book CD www sweetwater com Sweetwater 27 April 2007 Retrieved 25 August 2013 Understanding DVD Audio PDF Sonic Solutions Archived from the original PDF on 4 March 2012 Retrieved 25 August 2013 Shapiro L 2 July 2001 Surround Sound Page 10 ExtremeTech Retrieved 26 August 2013 White paper Blu ray Disc Format 2 B Audio Visual Application Format Specifications for BD ROM Version 2 4 PDF Blu ray Disc Association April 2010 Archived from the original PDF on 8 July 2011 Retrieved 25 August 2013 Puhovski Nenad April 2000 DV A SUCCESS STORY www stanford edu Archived from the original on 27 October 2004 Retrieved 26 August 2013 G 711 Pulse code modulation PCM of voice frequencies PDF International Telecommunication Union Retrieved 25 August 2013 DIGITAL SOUND SIGNALS tests to compare the performance of five companding systems for high quality sound signals PDF BBC Research Department August 1978 Archived from the original PDF on 8 November 2012 Retrieved 26 August 2013 Ardour Key Features Ardour Community 2014 Archived from the original on 8 April 2014 Retrieved 8 April 2014 Pro Tools Documentation Pro Tools Reference Guide ZIP PDF Avid 2013 Retrieved 26 August 2013 Logic Pro X User Guide PDF Apple January 2010 Retrieved 26 August 2013 permanent dead link Cubase Pro 10 5 Manual PDF Steinberg 2020 Retrieved 2 September 2020 Reason 7 Operation Manual PDF Propellerhead Software 2013 Archived from the original PDF on 24 May 2013 Retrieved 26 August 2013 GarageBand 11 Set the audio resolution Apple 13 March 2012 Retrieved 26 August 2013 Audacity Features wiki audacityteam com Audacity development team Retrieved 13 September 2014 Audio Settings www image line com Retrieved 12 February 2019 Sample Rate Bit Depth amp Bitrate Exclusivemusicplus Exclusivemusicplus 26 October 2018 Retrieved 30 November 2018 Ken C Pohlmann 15 February 2000 Principles of Digital Audio 4th ed McGraw Hill Professional ISBN 978 0 07 134819 5 Retrieved from https en wikipedia org w index php title Audio bit depth amp oldid 1211991120 Quantization, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.