• Mar 31, 2016 News!JACN Vol.3, No.3 has been indexed by EI (inspec)!   [Click]
  • Jun 24, 2016 News!JACN Vol.4, No.2 has been published with online version. 15 papres about advances in computer networks are published in this issue.   [Click]
  • Mar 31, 2016 News!JACN Vol.3, No.2 has been indexed by EI (inspec)!   [Click]
General Information
    • ISSN: 1793-8244
    • Frequency: Quarterly
    • DOI: 10.18178/JACN
    • Editor-in-Chief: Dr. Ka Wai Gary Wong
    • Executive Editor: Ms. Julia S. Ma
    • Abstracting/ Indexing: EI (INSPEC, IET), Engineering & Technology Digital Library, DOAJ, Electronic Journals Library, Ulrich's Periodicals Directory, International Computer Science Digital Library (ICSDL), ProQuest, and Google Scholar.
    • E-mail: jacn@ejournal.net
Dr. Ka Wai Gary Wong
Department of Mathematics and Information Technology The Hong Kong Institute of Education, Hong Kong
It's a honor to serve as the editor-in-chief of JACN. I'll work together with the editors and reviewers to help the journal progress
JACN 2013 Vol.1(3): 183-188 ISSN: 1793-8244
DOI: 10.7763/JACN.2013.V1.37

Real-Time Estimation of Speech Quality through the Internet Using Echo State Networks

Sebastián Basterrech and Gerardo Rubino
Abstract—Audio quality in the Internet can be strongly affected by network conditions. As a consequence, many techniques to evaluate it have been developed. In particular, the ITU-T adopted in 2001 a technique called Perceptual Evaluation of Speech Quality (PESQ) to automatically measuring speech quality. PESQ is a well-known and widely used procedure, providing in general an accurate evaluation of perceptual quality by comparing the original and received voice sequences. One obvious inherent limitation of PESQ is, thus, that it requires the original signal (we say the reference), to make its evaluation. This precludes the use of PESQ for assessing the perceived quality in real-time, as the reference is in general not available. In this paper, we describe a procedure for estimating PESQ output working only with measures taken on the network state and properties of the communication system, without any use of the reference. It is based on the use of statistical learning techniques. Specifically, we rely on recent ideas for learning with specific types of neural networks, known under the name of Echo State Networks (ESNs), a member of the class of Reservoir Computing systems. These tools have been proven to be very efficient and robust in many learning tasks. The experimental results obtained show the good accuracy of the resulting procedure, and its capability to give its estimations of speech quality in a real-time context. This allows putting our measuring modules in future Internet applications or services based on voice transmission, for instance for control purposes

Index Terms—Quality assessment, speech quality, echo state networks, reservoir computing.

S. Basterrech is with the University of Rennes 2, Rennes, France (e-mail: Sebastian.Basterrechtiscordio@etudiant.uhb.fr).
G. Rubino is with the National Institute for Research in Computer Science and Control (INRIA Rennes – Bretagne Atlantique), Rennes, France (e-mail: Gerardo.Rubino@inria.fr).


Cite:Sebastián Basterrech and Gerardo Rubino, "Real-Time Estimation of Speech Quality through the Internet Using Echo State Networks," Journal of Advances in Computer Networks vol. 1, no. 3, pp. 183-188, 2013.

Copyright © 2008-2016. Journal of Advances in Computer Networks.  All rights reserved.
E-mail: jacn@ejournal.net