Table of Contents

Cover image

Title page

Copyright

Contributors

About the Editor

Preface

Chapter 1: Speech Processing in Healthcare: Can We Integrate?

Abstract

Chapter 2: End-to-End Acoustic Modeling Using Convolutional Neural Networks

Abstract

2.1 Introduction

2.2 Related Work

2.3 Various Architecture of ASR

2.4 Convolutional Neural Networks

2.5 CNN-Based End-to-End Approach

2.6 Experiments and Their Results

2.7 Conclusion

Chapter 3: A Real-Time DSP-Based System for Voice Activity Detection and Background Noise Reduction

Abstract

3.1 Introduction

3.2 Microchip dsPIC33 Digital Signal Controller

3.3 High Pass Filter

3.4 Fast Fourier Transform

3.5 Channel Energy Computation

3.6 Channel SNR Computation

3.7 VAD Decision

3.8 VAD Hangover

3.9 Computation of Scaling Factor

3.10 Scaling of Frequency Channels

3.11 Inverse Fourier Transform

3.12 Application Programming Interface

3.13 Resource Requirements

3.14 Microchip PIC Programmer

3.15 Audio Components

3.16 VAD and Background Noise Reduction Techniques

3.17 Results and Discussion

3.18 Conclusion and Discussion

Chapter 4: Disambiguating Conflicting Classification Results in AVSR

Abstract

4.1 Introduction

4.2 Detection of Conflicting Classes

4.3 Complementary Models for Classification

4.4 Proposed Cascade of Classifiers

4.5 Audio-Visual Databases

4.6 Experimental Results

4.7 Conclusions

Chapter 5: A Deep Dive Into Deep Learning Techniques for Solving Spoken Language Identification Problems

Abstract

5.1 Introduction

5.2 Spoken Language Identification

5.3 Cues for Spoken Language Identification

5.4 Stages in Spoken Language Identification

5.5 Deep Learning

5.6 Artificial and Deep Neural Network

5.7 Comparison of Spoken LID System Implementations with Deep Learning Techniques

5.8 Discussion

5.9 Conclusion

Chapter 6: Voice Activity Detection-Based Home Automation System for People With Special Needs

Abstract

6.1 Introduction

6.2 Conceptual Design of the System

6.3 System Implementation

6.4 Significance/Contribution

6.5 Conclusion

Chapter 7: Speech Summarization for Tamil Language

Abstract

7.1 Introduction

7.2 Extractive Summarization

7.3 Abstractive Summarization

7.4 Need for Speech Summarization

7.5 Issues in the Summarization of a Spoken Document

7.6 Tamil Language

7.7 System Design for Summarization of Speech Data in Tamil Language

7.8 Evaluation Metrics

7.9 Speech Corpora for Tamil Language

7.10 Conclusion

Chapter 8: Classifying Recurrent Dynamics on Emotional Speech Signals

Abstract

8.1 Introduction

8.2 Data Collection and Processing

8.3 Research Methodology

8.4 Numerical Experiments and Results

8.5 Conclusion

Chapter 9: Intelligent Speech Processing in the Time-Frequency Domain

Abstract

9.1 Wavelet Packet Decomposition

9.2 Empirical Mode Decomposition

9.3 Variational Mode Decomposition

9.4 Synchrosqueezing Wavelet Transform: EMD Like a Tool

9.5 Applications of the Decomposition Technique

9.6 Conclusion

Chapter 10: A Framework for Artificially Intelligent Customized Voice Response System Design using Speech Synthesis Markup Language

Abstract

10.1 Introduction

10.2 Literature Survey

10.3 AWS IoT

10.4 Amazon Voice Service (AVS)

10.5 AWS Lambda

10.6 Message Queuing Telemetry Transport (MQTT)

10.7 Proposed Architecture

10.8 Conclusion

Index

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.117.196.217