fbpx
Wikipedia

Perceptual Evaluation of Speech Quality

Perceptual Evaluation of Speech Quality (PESQ) is a family of standards comprising a test methodology for automated assessment of the speech quality as experienced by a user of a telephony system. It was standardized as Recommendation ITU-T P.862[1] in 2001. PESQ is used for objective voice quality testing by phone manufacturers, network equipment vendors and telecom operators. Its usage requires a license. The first edition of PESQ's successor POLQA (Recommendation ITU-T P.863[2]) entered into force in 2011.

Measurement scope edit

PESQ was developed to model subjective tests commonly used in telecommunications (e.g., Recommendation ITU-T P.800) to assess the voice quality perceived by human beings. Consequently, it employs true voice samples as test signals. In order to characterize the listening quality as perceived by users, it is of paramount importance to load modern telecom equipment with speech-like signals. Many systems are optimized for speech and would respond in an unpredictable way to non-speech signals (e.g., tones, noise). Guidelines for proper applications of voice test samples are defined in the PESQ application guide contained in Recommendation ITU-T P.862.3.[3]

Genealogy of related standards edit

ITU-T's family of full reference objective voice quality measurements started in 1997 with Recommendation ITU-T P.861 (PSQM), which was superseded by ITU-T P.862 (PESQ)[1] in 2001. P.862 was later complemented with Recommendations ITU-T P.862.1[4] (mapping of PESQ scores to a MOS scale), ITU-T P.862.2[5] (wideband measurements) and ITU-T P.862.3[3] (application guide). The first edition of ITU-T P.863 (POLQA)[2] entered into force in 2011. An Application guide for Recommendation ITU-T P.863 was approved in 2019 and published as ITU-T P.863.1.[6]

In addition to the above listed full reference methods, the list of ITU-T's objective voice quality measurement standards also includes ITU-T P.563[7] (no-reference algorithm).

Testing typology edit

Depending on the information that is made available to an algorithm, voice-quality test algorithms can be divided into two main categories:

  • A "full reference" (FR) algorithm has access to and makes use of the original reference signal for a comparison (i.e., a difference analysis). It can compare each sample of the reference signal (talker side) to each corresponding sample of the degraded signal (listener side). FR measurements deliver the highest accuracy and repeatability but can only be applied for dedicated tests in live networks (e.g., drive test tools for mobile network benchmarks).
  • A "no reference" (NR) algorithm only uses the degraded signal for the quality estimation and has no information of the original reference signal. NR algorithms (e.g., Recommendation ITU-T P.563[7]) are low-accuracy estimates only, as the originating voice characteristics (e.g., male or female talker, background noise, non-voice) of the source reference is completely unknown. A common variant of NR algorithms does not even analyze the decoded audio signal, but works on an analysis of the digital bit stream on an IP packet level. The measurement is consequently limited to a transport-stream analysis.

PESQ is a full-reference algorithm and analyzes the speech signal sample-by-sample after a temporal alignment of corresponding excerpts of reference and test signal. PESQ can be applied to provide an end-to-end (E2E) quality assessment for a network, or characterize individual network components.

PESQ results principally model mean opinion scores (MOS) that cover a scale from 1 (bad) to 5 (excellent). A mapping function to MOS-LQO is outlined in Recommendation ITU-T P.862.1.[4]

See also edit

References edit

  1. ^ a b "P.862 : Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs". www.itu.int. Retrieved 2021-04-20.
  2. ^ a b "P.863 : Perceptual objective listening quality prediction". www.itu.int. Retrieved 2021-04-11.
  3. ^ a b "P.862.3 : Application guide for objective quality measurement based on Recommendations P.862, P.862.1 and P.862.2". www.itu.int. Retrieved 2021-04-20.
  4. ^ a b "P.862.1 : Mapping function for transforming P.862 raw result scores to MOS-LQO". www.itu.int. Retrieved 2021-04-11.
  5. ^ "P.862.2 : Wideband extension to Recommendation P.862 for the assessment of wideband telephone networks and speech codecs". www.itu.int. Retrieved 2021-04-11.
  6. ^ "P.863.1 : Application guide for Recommendation ITU-T P.863". www.itu.int. Retrieved 2021-04-11.
  7. ^ a b "P.563 : Single-ended method for objective speech quality assessment in narrow-band telephony applications". www.itu.int. Retrieved 2021-04-11.
  • Rix, Antony W.; Hollier, Michael P.; Hekstra, Andries P.; Beerends, John G. (2002-10-15). "Perceptual Evaluation of Speech Quality (PESQ) The New ITU Standard for End-to-End Speech Quality Assessment Part I--Time-Delay Compensation". Journal of the Audio Engineering Society. 50 (10): 755–764.
  • Beerends, John G.; Hekstra, Andries P.; Rix, Antony W.; Hollier, Michael P. (2002-10-15). "Perceptual Evaluation of Speech Quality (PESQ) The New ITU Standard for End-to-End Speech Quality Assessment Part II: Psychoacoustic Model". Journal of the Audio Engineering Society. 50 (10): 765–778.

External links edit

  • Application Note 1GA49: Psychoacoustic Audio Quality Measurements Using R&S UPV Audio Analyzer
  • Application Note 1MA119: PESQ Measurement for GSM with R&SCMUgo
  • Application Note 1MA136: PESQ Measurement for CDMA2000 with R&SCMUgo
  • Application Note 1MA137: PESQ Measurement for WCDMA with R&SCMUgo
  • Application Note 1MA149: VoIP Measurements for WiMAX

perceptual, evaluation, speech, quality, pesq, family, standards, comprising, test, methodology, automated, assessment, speech, quality, experienced, user, telephony, system, standardized, recommendation, 2001, pesq, used, objective, voice, quality, testing, p. Perceptual Evaluation of Speech Quality PESQ is a family of standards comprising a test methodology for automated assessment of the speech quality as experienced by a user of a telephony system It was standardized as Recommendation ITU T P 862 1 in 2001 PESQ is used for objective voice quality testing by phone manufacturers network equipment vendors and telecom operators Its usage requires a license The first edition of PESQ s successor POLQA Recommendation ITU T P 863 2 entered into force in 2011 Contents 1 Measurement scope 2 Genealogy of related standards 3 Testing typology 4 See also 5 References 6 External linksMeasurement scope editPESQ was developed to model subjective tests commonly used in telecommunications e g Recommendation ITU T P 800 to assess the voice quality perceived by human beings Consequently it employs true voice samples as test signals In order to characterize the listening quality as perceived by users it is of paramount importance to load modern telecom equipment with speech like signals Many systems are optimized for speech and would respond in an unpredictable way to non speech signals e g tones noise Guidelines for proper applications of voice test samples are defined in the PESQ application guide contained in Recommendation ITU T P 862 3 3 Genealogy of related standards editITU T s family of full reference objective voice quality measurements started in 1997 with Recommendation ITU T P 861 PSQM which was superseded by ITU T P 862 PESQ 1 in 2001 P 862 was later complemented with Recommendations ITU T P 862 1 4 mapping of PESQ scores to a MOS scale ITU T P 862 2 5 wideband measurements and ITU T P 862 3 3 application guide The first edition of ITU T P 863 POLQA 2 entered into force in 2011 An Application guide for Recommendation ITU T P 863 was approved in 2019 and published as ITU T P 863 1 6 In addition to the above listed full reference methods the list of ITU T s objective voice quality measurement standards also includes ITU T P 563 7 no reference algorithm Testing typology editDepending on the information that is made available to an algorithm voice quality test algorithms can be divided into two main categories A full reference FR algorithm has access to and makes use of the original reference signal for a comparison i e a difference analysis It can compare each sample of the reference signal talker side to each corresponding sample of the degraded signal listener side FR measurements deliver the highest accuracy and repeatability but can only be applied for dedicated tests in live networks e g drive test tools for mobile network benchmarks A no reference NR algorithm only uses the degraded signal for the quality estimation and has no information of the original reference signal NR algorithms e g Recommendation ITU T P 563 7 are low accuracy estimates only as the originating voice characteristics e g male or female talker background noise non voice of the source reference is completely unknown A common variant of NR algorithms does not even analyze the decoded audio signal but works on an analysis of the digital bit stream on an IP packet level The measurement is consequently limited to a transport stream analysis PESQ is a full reference algorithm and analyzes the speech signal sample by sample after a temporal alignment of corresponding excerpts of reference and test signal PESQ can be applied to provide an end to end E2E quality assessment for a network or characterize individual network components PESQ results principally model mean opinion scores MOS that cover a scale from 1 bad to 5 excellent A mapping function to MOS LQO is outlined in Recommendation ITU T P 862 1 4 See also editPerceptual Objective Listening Quality Analysis POLQA Perceptual Evaluation of Video Quality PEVQ Perceptual Evaluation of Audio Quality PEAQ Hearing Aid Speech Quality Index HASQI References edit a b P 862 Perceptual evaluation of speech quality PESQ An objective method for end to end speech quality assessment of narrow band telephone networks and speech codecs www itu int Retrieved 2021 04 20 a b P 863 Perceptual objective listening quality prediction www itu int Retrieved 2021 04 11 a b P 862 3 Application guide for objective quality measurement based on Recommendations P 862 P 862 1 and P 862 2 www itu int Retrieved 2021 04 20 a b P 862 1 Mapping function for transforming P 862 raw result scores to MOS LQO www itu int Retrieved 2021 04 11 P 862 2 Wideband extension to Recommendation P 862 for the assessment of wideband telephone networks and speech codecs www itu int Retrieved 2021 04 11 P 863 1 Application guide for Recommendation ITU T P 863 www itu int Retrieved 2021 04 11 a b P 563 Single ended method for objective speech quality assessment in narrow band telephony applications www itu int Retrieved 2021 04 11 Rix Antony W Hollier Michael P Hekstra Andries P Beerends John G 2002 10 15 Perceptual Evaluation of Speech Quality PESQ The New ITU Standard for End to End Speech Quality Assessment Part I Time Delay Compensation Journal of the Audio Engineering Society 50 10 755 764 Beerends John G Hekstra Andries P Rix Antony W Hollier Michael P 2002 10 15 Perceptual Evaluation of Speech Quality PESQ The New ITU Standard for End to End Speech Quality Assessment Part II Psychoacoustic Model Journal of the Audio Engineering Society 50 10 765 778 External links editApplication Note 1GA49 Psychoacoustic Audio Quality Measurements Using R amp S UPV Audio Analyzer Application Note 1MA119 PESQ Measurement for GSM with R amp SCMUgo Application Note 1MA136 PESQ Measurement for CDMA2000 with R amp SCMUgo Application Note 1MA137 PESQ Measurement for WCDMA with R amp SCMUgo Application Note 1MA149 VoIP Measurements for WiMAX Retrieved from https en wikipedia org w index php title Perceptual Evaluation of Speech Quality amp oldid 1219560465, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.