Simulating Conversations for the Prediction of Speech Quality
Title | Simulating Conversations for the Prediction of Speech Quality PDF eBook |
Author | Thilo Michael |
Publisher | Springer Nature |
Pages | 157 |
Release | 2023-06-30 |
Genre | Technology & Engineering |
ISBN | 3031318447 |
This book discusses the simulation of conversations through a novel approach of predicting speech quality based on the interactions of two simulated interlocutors. The author describes the setup of a simulation environment that is capable of simulating human dialogue on the speech level. The impact of delay and bursty packet loss on VoIP conversations is investigated and modeled for the use in the simulation. Based on parameters extracted from simulated conversations, the author proposes extensions to the E-model, a parametric model standardized by the International Telecommunications Union, in order to predict the quality of the simulated conversations. The author shows that predictions based on the simulated conversations outperform models that rely on the transmission parameters alone.
Assessment and Prediction of Speech Quality in Telecommunications
Title | Assessment and Prediction of Speech Quality in Telecommunications PDF eBook |
Author | Sebastian Möller |
Publisher | Springer Science & Business Media |
Pages | 253 |
Release | 2012-12-06 |
Genre | Science |
ISBN | 1475731175 |
The quality of a telecommunication voice service is largely inftuenced by the quality of the transmission system. Nevertheless, the analysis, synthesis and prediction of quality should take into account its multidimensional aspects. Quality can be regarded as a point where the perceived characteristics and the desired or expected ones meet. A schematic is presented which classifies different entities which contribute to the quality of a service, taking into account conversational, user as weIl as service related contributions. Starting from this concept, perceptively relevant constituents of speech communication quality are identified. The perceptive factors result from ele ments of the transmission configuration. A simulation model is developed and implemented which allows the most relevant parameters of traditional trans mission configurations to be manipulated, in real time and for the conversation situation. Inputs into the simulation are instrumentally measurable quality elements commonly used in transmission planning of telephone networks. A reduced set of these quality elements forms a basis for models which aim at predicting mouth-to-ear quality as it would be perceived by a user of the sys tem. These models are an important tool for the planner of telecommunication networks, as they allow the expected quality to be estimated in advance, even before the network has been set up. Two well-known models (the SUBMOD and the E-model) are analyzed in more detail, with an emphasis on the psy choacoustic and psychophysical backgrounds.
Audiovisual Quality Assessment and Prediction for Videotelephony
Title | Audiovisual Quality Assessment and Prediction for Videotelephony PDF eBook |
Author | Benjamin Belmudez |
Publisher | Springer |
Pages | 196 |
Release | 2014-12-27 |
Genre | Technology & Engineering |
ISBN | 331914166X |
The work presented in this book focuses on modeling audiovisual quality as perceived by the users of IP-based solutions for video communication like videotelephony. It also extends the current framework for the parametric prediction of audiovisual call quality. The book addresses several aspects related to the quality perception of entire video calls, namely, the quality estimation of the single audio and video modalities in an interactive context, the audiovisual quality integration of these modalities and the temporal pooling of short sample-based quality scores to account for the perceptual quality impact of time-varying degradations.
Simulating Conversations for the Prediction of Speech Quality
Title | Simulating Conversations for the Prediction of Speech Quality PDF eBook |
Author | Thilo Michael |
Publisher | |
Pages | 0 |
Release | 2023 |
Genre | |
ISBN | 9783031318450 |
This book discusses the simulation of conversations through a novel approach of predicting speech quality based on the interactions of two simulated interlocutors. The author describes the setup of a simulation environment that is capable of simulating human dialogue on the speech level. The impact of delay and bursty packet loss on VoIP conversations is investigated and modeled for the use in the simulation. Based on parameters extracted from simulated conversations, the author proposes extensions to the E-model, a parametric model standardized by the International Telecommunications Union, in order to predict the quality of the simulated conversations. The author shows that predictions based on the simulated conversations outperform models that rely on the transmission parameters alone. Presents the overview of a technical setup of a simulation able to replicate individual interactions Includes insights into the changes of individual interactions that occur due to delay and packet loss Describes and extends the state-of-the-art in parametric speech quality prediction .
Speech Quality of VoIP
Title | Speech Quality of VoIP PDF eBook |
Author | Alexander Raake |
Publisher | John Wiley & Sons |
Pages | 336 |
Release | 2007-01-11 |
Genre | Technology & Engineering |
ISBN | 0470032995 |
Finally a comprehensive overview of speech quality in VoIP from the user's perspective! Speech Quality of VoIP is an essential guide to assessing the speech quality of VoIP networks, whilst addressing the implications for the design of VoIP networks and systems. This book bridges the gap between the technical network-world and the psychoacoustic world of quality perception. Alexander Raake’s unique perspective combines awareness of the technical characteristics of VoIP networks and original research concerning the perception of speech transmitted across them. Starting from the network designer’s point of view, the different characteristics of the network are addressed, and then linked to features perceived by users. This book provides an overview of the available knowledge on the principal, relevant aspects of speech and speech quality perception, of speech quality assessment, and of transmission properties of telephone and VoIP networks, and of the related perceptual features and resulting speech quality. Discussing new research into the specific time-varying degradations VoIP brings along, but also the considerable potential of quality improvement to be achieved with wideband speech transmission, Alexander Raake demonstrates how network and service characteristics impact on the users perception of quality. Speech Quality of VoIP: Offers an insight into speech quality of VoIP from a user's perspective. Presents an overview of different modelling approaches and a parametric network-planning model for quality prediction in VoIP networks. Draws on innovative new research on the quality degradation characteristic of VoIP. Explains in detail how telephone speech quality can be greatly enhanced with VoIP’s wideband speech transmission capability. Assesses the vast collection of references into the technical and scientific literature related to VoIP quality. Illustrates concepts throughout with mathematical models, algorithms and simulations. Speech Quality of VoIP is the definitive guide for researchers, engineers and network planners working in the field of VoIP, Quality of Service, and speech communication processing in telecommunications. Advanced undergraduate and graduate students on telecommunication and networking courses will also find this text an invaluable resource.
Speech and Computer
Title | Speech and Computer PDF eBook |
Author | Alexey Karpov |
Publisher | Springer Nature |
Pages | 704 |
Release | 2020-10-04 |
Genre | Computers |
ISBN | 3030602761 |
This book constitutes the proceedings of the 22nd International Conference on Speech and Computer, SPECOM 2020, held in St. Petersburg, Russia, in October 2020. The 65 papers presented were carefully reviewed and selected from 160 submissions. The papers present current research in the area of computer speech processing including speech science, speech technology, natural language processing, human-computer interaction, language identification, multimedia processing, human-machine interaction, deep learning for audio processing, computational paralinguistics, affective computing, speech and language resources, speech translation systems, text mining and sentiment analysis, voice assistants, etc. Due to the Corona pandemic SPECOM 2020 was held as a virtual event.
Integral and Diagnostic Intrusive Prediction of Speech Quality
Title | Integral and Diagnostic Intrusive Prediction of Speech Quality PDF eBook |
Author | Nicolas Côté |
Publisher | Springer Science & Business Media |
Pages | 255 |
Release | 2011-05-06 |
Genre | Technology & Engineering |
ISBN | 3642184634 |
This work deals with the instrumental measurement methods for the perceived quality of transmitted speech. These measures simulate the speech perception process employed by human subjects during auditory experiments. The measure standardized by the International Telecommunication Union (ITU), called “Wideband-Perceptual Speech Quality Evaluation (WB-PESQ)”, is not able to quantify all these perceived characteristics on a unidimensional quality scale, the Mean Opinion Score (MOS) scale. Recent experimental studies showed that subjects make use of several perceptual dimensions to judge about the quality of speech signals. In order to represent the signal at a higher stage of perception, a new model, called “Diagnostic Instrumental Assessment of Listening quality (DIAL)”, has been developed. It includes a perceptual and a cognitive model which simulate the whole quality judgment process. Except for strong discontinuities, DIAL predicts very well speech quality of different speech processing and transmission systems, and it outperforms the WB-PESQ.