Comparative analysis of speech coders

Main Article Content

Ivo R Draganov*
Snejana G Pleshkova

Abstract

In this paper a comparative analysis of some of the most popular speech coders is presented. Qualitatively and quantitatively are tested Linear Prediction Coding in its implementation LPC-10e and with the use of auto-correlation and covariance, companding coding including A-law and µ-Law, ADPCM IMA, G.726 A- and µ-Law, and a fully featured MELP coder. All of them proved their efficiency for the typical applications they have been assign to providing the necessary quality vs. output data rate. The methodology for evaluation along with the codecs’ descriptions are considered useful for new coming specialists in the field of audio compression as one possible starting point for them.

Downloads

Download data is not yet available.

Article Details

Draganov, I. R., & Pleshkova, S. G. (2020). Comparative analysis of speech coders. Trends in Computer Science and Information Technology, 5(1), 001–004. https://doi.org/10.17352/tcsit.000010
Research Articles

Copyright (c) 2020 Draganov IR, et al.

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Licensing and protecting the author rights is the central aim and core of the publishing business. Peertechz dedicates itself in making it easier for people to share and build upon the work of others while maintaining consistency with the rules of copyright. Peertechz licensing terms are formulated to facilitate reuse of the manuscripts published in journals to take maximum advantage of Open Access publication and for the purpose of disseminating knowledge.

We support 'libre' open access, which defines Open Access in true terms as free of charge online access along with usage rights. The usage rights are granted through the use of specific Creative Commons license.

Peertechz accomplice with- [CC BY 4.0]

Explanation

'CC' stands for Creative Commons license. 'BY' symbolizes that users have provided attribution to the creator that the published manuscripts can be used or shared. This license allows for redistribution, commercial and non-commercial, as long as it is passed along unchanged and in whole, with credit to the author.

Please take in notification that Creative Commons user licenses are non-revocable. We recommend authors to check if their funding body requires a specific license.

With this license, the authors are allowed that after publishing with Peertechz, they can share their research by posting a free draft copy of their article to any repository or website.
'CC BY' license observance:

License Name

Permission to read and download

Permission to display in a repository

Permission to translate

Commercial uses of manuscript

CC BY 4.0

Yes

Yes

Yes

Yes

The authors please note that Creative Commons license is focused on making creative works available for discovery and reuse. Creative Commons licenses provide an alternative to standard copyrights, allowing authors to specify ways that their works can be used without having to grant permission for each individual request. Others who want to reserve all of their rights under copyright law should not use CC licenses.

Chu WC (2003) Speech Coding Algorithms: Foundation and Evolution of Standardized Coders, New Jersey, USA: John Wiley & Sons. Link: https://bit.ly/2xtX74M

Makhoul J (1975) Linear prediction: A tutorial review, Proceedings of the IEEE. 63: 561-580. Link: https://bit.ly/3aoZ8Os

G.711 (1988) Pulse code modulation (PCM) of voice frequencies, ITU-T Recommendation (11/1988), Link: https://bit.ly/2JlIALb

Salomon D (2007) Data Compression, The Complete Reference, 4 ed., London, UK, Springer. Link: https://bit.ly/2ya2WF6

Supplee L, Cohn R, ColluraJ, McCree A (1997) MELP: The new federal standard at 2400 bps”, In Proc. of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-97), Münich, Germany 1591-1594. Link: https://bit.ly/2JcOe2r

Benesty J, Makino S, Chen J (2005) Speech Enhancement, Springer. Link: https://bit.ly/2QGBdCn

ITU-T Recommendation P.862 (2001) Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs. Link: https://bit.ly/2vPHXGR