Contents

Author Biographies

Foreword

Preface

1 Introduction

1.1 The human auditory system

1.2 Spatial audio reproduction

1.3 Spatial audio coding

1.4 Book outline

2 Background

2.1 Introduction

2.2 Spatial audio playback systems

2.2.1 Stereo audio loudspeaker playback

2.2.2 Headphone audio playback

2.2.3 Multi-channel audio playback

2.3 Audio coding

2.3.1 Audio signal representation

2.3.2 Lossless audio coding

2.3.3 Perceptual audio coding

2.3.4 Parametric audio coding

2.3.5 Combining perceptual and parametric audio coding

2.4 Matrix surround

2.5 Conclusions

3 Spatial Hearing

3.1 Introduction

3.2 Physiology of the human hearing system

3.3 Spatial hearing basics

3.3.1 Spatial hearing with one sound source

3.3.2 Ear entrance signal properties and lateralization

3.3.3 Sound source localization

3.3.4 Two sound sources: summing localization

3.3.5 Superposition of signals each evoking one auditory object

3.4 Spatial hearing in rooms

3.4.1 Source localization in the presence of reflections: the precedence effect

3.4.2 Spatial impression

3.5 Limitations of the human auditory system

3.5.1 Just-noticeable differences in interaural cues

3.5.2 Spectro-temporal decomposition

3.5.3 Localization accuracy of single sources

3.5.4 Localization accuracy of concurrent sources

3.5.5 Localization accuracy when reflections are present

3.6 Source localization in complex listening situations

3.6.1 Cue selection model

3.6.2 Simulation examples

3.7 Conclusions

4 Spatial Audio Coding

4.1 Introduction

4.2 Related techniques

4.2.1 Pseudostereophonic processes

4.2.2 Intensity stereo coding

4.3 Binaural Cue Coding (BCC)

4.3.1 Time–frequency processing

4.3.2 Down-mixing to one channel

4.3.3 ‘Perceptually relevant differences’ between audio channels

4.3.4 Estimation of spatial cues

4.3.5 Synthesis of spatial cues

4.4 Coding of low-frequency effects (LFE) audio channels

4.5 Subjective performance

4.6 Generalization to spatial audio coding

5 Parametric Stereo

5.1 Introduction

5.1.1 Development and standardization

5.1.2 AacPlus v2

5.2 Interaction between core coder and spatial audio coding

5.3 Relation to BCC

5.4 Parametric stereo encoder

5.4.1 Time/frequency decomposition

5.4.2 Parameter extraction

5.4.3 Down-mix

5.4.4 Parameter quantization and coding

5.5 Parametric stereo decoder

5.5.1 Analysis filterbank

5.5.2 Decorrelation

5.5.3 Matrixing

5.5.4 Interpolation

5.5.5 Synthesis filterbanks

5.5.6 Parametric stereo in enhanced aacPlus

5.6 Conclusions

6 MPEG Surround

6.1 Introduction

6.2 Spatial audio coding

6.2.1 Concept

6.2.2 Elementary building blocks

6.3 MPEG Surround encoder

6.3.1 Structure

6.3.2 Pre- and post-gains

6.3.3 Time–frequency decomposition

6.3.4 Spatial encoder

6.3.5 Parameter quantization and coding

6.3.6 Coding of residual signals

6.4 MPEG Surround decoder

6.4.1 Structure

6.4.2 Spatial decoder

6.4.3 Enhanced matrix mode

6.5 Subjective evaluation

6.5.1 Test 1: operation using spatial parameters

6.5.2 Test 2: operation using enhanced matrix mode

6.6 Conclusions

7 Binaural Cues for a Single Sound Source

7.1 Introduction

7.2 HRTF parameterization

7.2.1 HRTF analysis

7.2.2 HRTF synthesis

7.3 Sound source position dependencies

7.3.1 Experimental procedure

7.3.2 Results and discussion

7.4 HRTF set dependencies

7.4.1 Experimental procedure

7.4.2 Results and discussion

7.5 Single ITD approximation

7.5.1 Procedure

7.5.2 Results and discussion

7.6 Conclusions

8 Binaural Cues for Multiple Sound Sources

8.1 Introduction

8.2 Binaural parameters

8.3 Binaural parameter analysis

8.3.1 Binaural parameters for a single sound source

8.3.2 Binaural parameters for multiple independent sound sources

8.3.3 Binaural parameters for multiple sound sources with varying degrees of mutual correlation

8.4 Binaural parameter synthesis

8.4.1 Mono down-mix

8.4.2 Extension towards stereo down-mixes

8.5 Application to MPEG Surround

8.5.1 Binaural decoding mode

8.5.2 Binaural parameter synthesis

8.5.3 Binaural encoding mode

8.5.4 Evaluation

8.6 Conclusions

9 Audio Coding with Mixing Flexibility at the Decoder Side

9.1 Introduction

9.2 Motivation and details

9.2.1 ICTD, ICLD and ICC of the mixer output

9.3 Side information

9.3.1 Reconstructing the sources

9.4 Using spatial audio decoders as mixers

9.5 Transcoding to MPEG Surround

9.6 Conclusions

10 Multi-loudspeaker Playback of Stereo Signals

10.1 Introduction

10.2 Multi-channel stereo

10.3 Spatial decomposition of stereo signals

10.3.1 Estimating ps,b, Ab and pn,b

10.3.2 Least-squares estimation of sm, n1,m and n2,m

10.3.3 Post-scaling

10.3.4 Numerical examples

10.4 Reproduction using different rendering setups

10.4.1 Multiple loudspeakers in front of the listener

10.4.2 Multiple front loudspeakers plus side loudspeakers

10.4.3 Conventional 5.1 surround loudspeaker setup

10.4.4 Wavefield synthesis playback system

10.4.5 Modifying the decomposed audio signals

10.5 Subjective evaluation

10.5.1 Subjects and playback setup

10.5.2 Stimuli

10.5.3 Test method

10.5.4 Results

10.6 Conclusions

10.7 Acknowledgement

Frequently Used Terms, Abbreviations and Notation

Terms and abbreviations

Notation and variables

Bibliography

Index

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.119.106.135