Details

Speech Coding Algorithms


Speech Coding Algorithms

Foundation and Evolution of Standardized Coders
1. Aufl.

von: Wai C. Chu

184,99 €

Verlag: Wiley
Format: PDF
Veröffentl.: 04.03.2004
ISBN/EAN: 9780471668879
Sprache: englisch
Anzahl Seiten: 592

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

<ul> <li>Speech coding is a highly mature branch of signal processing deployed in products such as cellular phones, communication devices, and more recently, voice over internet protocol</li> <li>This book collects many of the techniques used in speech coding and presents them in an accessible fashion</li> <li>Emphasizes the foundation and evolution of standardized speech coders, covering standards from 1984 to the present</li> <li>The theory behind the applications is thoroughly analyzed and proved</li> </ul>
<p>Preface xiii</p> <p>Acronyms xix</p> <p>Notation xxiii</p> <p><b>1 Introduction 1</b></p> <p>1.1 Overview of Speech Coding 2</p> <p>1.2 Classification of Speech Coders 8</p> <p>1.3 Speech Production and Modeling 11</p> <p>1.4 Some Properties of the Human Auditory System 18</p> <p>1.5 Speech Coding Standards 22</p> <p>1.6 About Algorithms 26</p> <p>1.7 Summary and References 31</p> <p><b>2 Signal Processing Techniques 33</b></p> <p>2.1 Pitch Period Estimation 33</p> <p>2.2 All-Pole and All-Zero Filters 45</p> <p>2.3 Convolution 52</p> <p>2.4 Summary and References 57</p> <p>Exercises 57</p> <p><b>3 Stochastic Processes and Models 61</b></p> <p>3.1 Power Spectral Density 62</p> <p>3.2 Periodogram 67</p> <p>3.3 Autoregressive Model 69</p> <p>3.4 Autocorrelation Estimation 73</p> <p>3.5 Other Signal Models 85</p> <p>3.6 Summary and References 86</p> <p>Exercises 87</p> <p><b>4 Linear Prediction 91</b></p> <p>4.1 The Problem of Linear Prediction 92</p> <p>4.2 Linear Prediction Analysis of Nonstationary Signals 96</p> <p>4.3 Examples of Linear Prediction Analysis of Speech 101</p> <p>4.4 The Levinson–Durbin Algorithm 107</p> <p>4.5 The Leroux–Gueguen Algorithm 114</p> <p>4.6 Long-Term Linear Prediction 120</p> <p>4.7 Synthesis Filters 127</p> <p>4.8 Practical Implementation 131</p> <p>4.9 Moving Average Prediction 137</p> <p>4.10 Summary and References 138</p> <p>Exercises 139</p> <p><b>5 Scalar Quantization 143</b></p> <p>5.1 Introduction 143</p> <p>5.2 Uniform Quantizer 147</p> <p>5.3 Optimal Quantizer 149</p> <p>5.4 Quantizer Design Algorithms 151</p> <p>5.5 Algorithmic Implementation 155</p> <p>5.6 Summary and References 158</p> <p>Exercises 158</p> <p><b>6 Pulse Code Modulation and Its Variants 161</b></p> <p>6.1 Uniform Quantization 161</p> <p>6.2 Nonuniform Quantization 166</p> <p>6.3 Differential Pulse Code Modulation 172</p> <p>6.4 Adaptive Schemes 175</p> <p>6.5 Summary and References 180</p> <p>Exercises 181</p> <p><b>7 Vector Quantization 184</b></p> <p>7.1 Introduction 185</p> <p>7.2 Optimal Quantizer 188</p> <p>7.3 Quantizer Design Algorithms 189</p> <p>7.4 Multistage VQ 194</p> <p>7.5 Predictive VQ 216</p> <p>7.6 Other Structured Schemes 219</p> <p>7.7 Summary and References 221</p> <p>Exercises 222</p> <p><b>8 Scalar Quantization of Linear Prediction Coefficient 227</b></p> <p>8.1 Spectral Distortion 227</p> <p>8.2 Quantization Based on Reflection Coefficient and Log Area Ratio 232</p> <p>8.3 Line Spectral Frequency 239</p> <p>8.4 Quantization Based on Line Spectral Frequency 252</p> <p>8.5 Interpolation of LPC 256</p> <p>8.6 Summary and References 258</p> <p>Exercises 260</p> <p><b>9 Linear Prediction Coding 263</b></p> <p>9.1 Speech Production Model 264</p> <p>9.2 Structure of the Algorithm 268</p> <p>9.3 Voicing Detector 271</p> <p>9.4 The FS1015 LPC Coder 275</p> <p>9.5 Limitations of the LPC Model 277</p> <p>9.6 Summary and References 280</p> <p>Exercises 281</p> <p><b>10 Regular-pulse Excitation Coders 285</b></p> <p>10.1 Multipulse Excitation Model 286</p> <p>10.2 Regular-Pulse-Excited–Long-Term Prediction 289</p> <p>10.3 Summary and References 295</p> <p>Exercises 296</p> <p><b>11 Code-excited Linear Prediction 299</b></p> <p>11.1 The CELP Speech Production Model 300</p> <p>11.2 The Principle of Analysis-by-Synthesis 301</p> <p>11.3 Encoding and Decoding 302</p> <p>11.4 Excitation Codebook Search 308</p> <p>11.5 Postfilter 317</p> <p>11.6 Summary and References 325</p> <p>Exercises 326</p> <p><b>12 The Federal Standard Version of CELP 330</b></p> <p>12.1 Improving the Long-Term Predictor 331</p> <p>12.2 The Concept of the Adaptive Codebook 333</p> <p>12.3 Incorporation of the Adaptive Codebook to the CELP Framework 336</p> <p>12.4 Stochastic Codebook Structure 338</p> <p>12.5 Adaptive Codebook Search 341</p> <p>12.6 Stochastic Codebook Search 344</p> <p>12.7 Encoder and Decoder 346</p> <p>12.8 Summary and References 349</p> <p>Exercises 350</p> <p><b>13 Vector Sum Excited Linear Prediction 353</b></p> <p>13.1 The Core Encoding Structure 354</p> <p>13.2 Search Strategies for Excitation Codebooks 356</p> <p>13.3 Excitation Codebook Searches 357</p> <p>13.4 Gain Related Procedures 362</p> <p>13.5 Encoder and Decoder 366</p> <p>13.6 Summary and References 368</p> <p>Exercises 369</p> <p><b>14 Low-delay CELP 372</b></p> <p>14.1 Strategies to Achieve Low Delay 373</p> <p>14.2 Basic Operational Principles 375</p> <p>14.3 Linear Prediction Analysis 377</p> <p>14.4 Excitation Codebook Search 380</p> <p>14.5 Backward Gain Adaptation 385</p> <p>14.6 Encoder and Decoder 389</p> <p>14.7 Codebook Training 391</p> <p>14.8 Summary and References 393</p> <p>Exercises 394</p> <p><b>15 Vector Quantization of Linear Prediction Coefficient 396</b></p> <p>15.1 Correlation Among the LSFs 396</p> <p>15.2 Split VQ 399</p> <p>15.3 Multistage VQ 403</p> <p>15.4 Predictive VQ 407</p> <p>15.5 Summary and References 418</p> <p>Exercises 419</p> <p><b>16 Algebraic CELP 423</b></p> <p>16.1 Algebraic Codebook Structure 424</p> <p>16.2 Adaptive Codebook 425</p> <p>16.3 Encoding and Decoding 433</p> <p>16.4 Algebraic Codebook Search 437</p> <p>16.5 Gain Quantization Using Conjugate VQ 443</p> <p>16.6 Other ACELP Standards 446</p> <p>16.7 Summary and References 451</p> <p>Exercises 451</p> <p><b>17 Mixed Excitation Linear Prediction 454</b></p> <p>17.1 The MELP Speech Production Model 455</p> <p>17.2 Fourier Magnitudes 456</p> <p>17.3 Shaping Filters 464</p> <p>17.4 Pitch Period and Voicing Strength Estimation 466</p> <p>17.5 Encoder Operations 474</p> <p>17.6 Decoder Operations 477</p> <p>17.7 Summary and References 481</p> <p>Exercises 482</p> <p><b>18 Source-controlled Variable Bit-rate CELP 486</b></p> <p>18.1 Adaptive Rate Decision 487</p> <p>18.2 LP Analysis and LSF-Related Operations 494</p> <p>18.3 Decoding and Encoding 496</p> <p>18.4 Summary and References 498</p> <p>Exercises 499</p> <p><b>19 Speech Quality Assessment 501</b></p> <p>19.1 The Scope of Quality and Measuring Conditions 501</p> <p>19.2 Objective Quality Measurements for Waveform Coders 502</p> <p>19.3 Subjective Quality Measures 504</p> <p>19.4 Improvements on Objective Quality Measures 505</p> <p>Appendix A Minimum-phase Property of the Forward Prediction-error Filter 507</p> <p>Appendix B Some Properties of Line Spectral Frequency 514</p> <p>Appendix C Research Directions in Speech Coding 518</p> <p>Appendix D Linear Combiner for Pattern Classification 522</p> <p>Appendix E CELP: Optimal Long-term Predictor to Minimize the Weighted Difference 531</p> <p>Appendix F Review of Linear Algebra: Orthogonality, Basis, Linear Independence, and the Gram–schmidt Algorithm 537</p> <p>Bibliography 542</p> <p>Index 553</p>
“…well equipped with exercises and with procedures which are helpful in implementing the coders…” (<i>Zentralblatt Math</i>, Vol.1041, No.16, 2004)
WAI C. CHU earned a PhD in Electrical Engineering from the Pennsylvania State University. His industry experience includes positions at Texas Instruments and various startup companies in the multimedia field. Currently at DoCoMo USA Labs (San Jose, California), he is involved with R&D activities in speech/audio coding, digital signal processing, and multimedia applications.
A thought-provoking look at the historical and practical evolution of speech coding <p>Speech coding has evolved into a highly important branch of applied signal processing, resulting in an explosion of products such as cellular phones, answering machines, communication devices, and most recently, voice over Internet protocol. Because of the increasing demand for speech communication, speech coding technology has captured more and more attention from the research, standardization, and business communities–all of whom have been encouraged to pursue and investigate alternative coding methods with the objective of minimizing its deficiencies and limitations.</p> <p>Covering standards from the inception of digital communication, Speech Coding Algorithms: Foundation and Evolution of Standardized Coders compiles many of the complex techniques widely used in speech coding and presents them in a logical and accessible fashion. By understanding the principles, performances, and limitations of the most successful standards developed throughout history, practitioners will be able to gain sound insights of widely adopted methodologies, with the ultimate goal of developing the next generation of improved algorithms.</p> <p>A strong pedagogical format makes this text a valuable resource for professionals and students alike.</p> <ul> <li>Theories behind each application are thoroughly analyzed and proven</li> <li>A clear, simple writing style makes complex material easy to understand</li> <li>Abundant charts and illustrations help to clarify ideas discussed</li> <li>End of Chapter summaries, exercises, and references help to cement understanding and encourage further study</li> </ul> <p>Speech Coding Algorithms: Foundation and Evolution of Standardized Coders is an invaluable textbook for students and researchers eager to enter the speech coding arena, providing a solid theoretical foundation as well as directions for further research. For professionals working in industry who need to access information regarding the many popular standards, the book represents a one-stop reference where the architecture, bit allocation, computational complexity, and operational aspects are thoroughly explained. Among the standards covered are:</p> <ul> <li>FS 1015 LPC</li> <li>TIA IS54 VSELP</li> <li>ITU-T G.728 LD-CELP</li> <li>ITU-T G.729 CS-ACELP</li> <li>ETSI GSM 6.10 RPE-LTP</li> <li>FS 1015 CELP</li> <li>TIA IS96 VBR-CELP</li> <li>FS MELP</li> </ul>

Diese Produkte könnten Sie auch interessieren:

Pulsed Power
Pulsed Power
von: Gennady A. Mesyats
PDF ebook
309,23 €
High-Frequency Oscillator Design for Integrated Transceivers
High-Frequency Oscillator Design for Integrated Transceivers
von: J. van der Tang, Dieter Kasperkovitz, Arthur H.M. van Roermund
PDF ebook
149,79 €
MEMS Vibratory Gyroscopes
MEMS Vibratory Gyroscopes
von: Cenk Acar, Andrei Shkel
PDF ebook
149,79 €