Robust and efficient quantization of speech LSP parameters using structured vector quantizers

R. Laroia; N. Phamdo; N. Farvardin

Abstract:

Three robust algorithms based on the recently proposed concept of structured vector quantization have been developed for quantization of speech LSP (line spectrum pair) parameters. The first algorithm exploits interframe correlation of the LSP parameters and requires 24 bits per LSP vector to achieve 1 dB average spectral distortion. The second and third algorithms quantize each LSP vector independently and for 1 dB distortion require 29 and 25 bits per vector, respectively. The last two algorithms result in a considerably smaller fraction of frames with distortions greater than 2 dB as compared to other schemes proposed so far.
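
The abstract reports results as average spectral distortion in dB and as the fraction of frames exceeding 2 dB, but does not define the measure. The following is a minimal sketch, assuming the commonly used definition (root-mean-square difference between the log power spectra of the original and quantized LPC filters over the full band); the paper may use a different frequency range or weighting. The function names and the coefficient layout `[1, a1, ..., ap]` are illustrative choices, not taken from the paper.

```python
import cmath
import math


def lpc_power_spectrum_db(a, n_points=256):
    """Log power spectrum (dB) of the all-pole filter 1/A(z) on (0, pi).

    `a` is assumed to hold [1, a1, ..., ap] for
    A(z) = 1 + a1*z^-1 + ... + ap*z^-p, with all roots inside the
    unit circle so that A never vanishes on the frequency grid.
    """
    spectrum = []
    for k in range(n_points):
        w = math.pi * (k + 0.5) / n_points  # midpoint frequency grid
        A = sum(c * cmath.exp(-1j * w * i) for i, c in enumerate(a))
        spectrum.append(-20.0 * math.log10(abs(A)))
    return spectrum


def spectral_distortion_db(a_ref, a_quant, n_points=256):
    """RMS difference (dB) between the two log power spectra."""
    p = lpc_power_spectrum_db(a_ref, n_points)
    q = lpc_power_spectrum_db(a_quant, n_points)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)) / n_points)


def summarize(frames_ref, frames_quant, outlier_db=2.0):
    """Return (average SD in dB, fraction of frames with SD > outlier_db)."""
    sds = [spectral_distortion_db(r, q)
           for r, q in zip(frames_ref, frames_quant)]
    average = sum(sds) / len(sds)
    outlier_fraction = sum(1 for s in sds if s > outlier_db) / len(sds)
    return average, outlier_fraction
```

Under this assumed definition, a quantizer meeting the abstract's target would give an average of about 1 dB from `summarize`, and the outlier comparison in the last sentence corresponds to the second returned value with `outlier_db=2.0`.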

Published in: [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing

Date of Conference: 14-17 April 1991

Date Added to IEEE Xplore: 06 August 2002

Print ISBN: 0-7803-0003-3

Print ISSN: 1520-6149

DOI: 10.1109/ICASSP.1991.150421

Conference Location: Toronto, ON, Canada

Cites in Papers - IEEE (73)

1.

R. Sugiura, Y. Kamamoto, N. Harada, H. Kameoka, T. Moriya, "Direct linear conversion of LSP parameters for perceptual control in speech and audio coding", 2014 22nd European Signal Processing Conference (EUSIPCO), pp.56-60, 2014.

2.

Young-Sun Joo, Chi-Sang Jung, Hong-Goo Kang, "Enhancement of spectral clarity for HMM-based text-to-speech systems", 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7840-7843, 2013.

3.

Yao Qian, Frank K. Soong, Zhi-Jie Yan, "A Unified Trajectory Tiling Approach to High Quality Speech Rendering", IEEE Transactions on Audio, Speech, and Language Processing, vol.21, no.2, pp.280-290, 2013.

4.

Yao Qian, Ji Xu, Frank K. Soong, "A frame mapping based HMM approach to cross-lingual voice transformation", 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5120-5123, 2011.

5.

L. Anders Ekman, Volodya Grancharov, W. Bastiaan Kleijn, "Double-Ended Quality Assessment System for Super-Wideband Speech", IEEE Transactions on Audio, Speech, and Language Processing, vol.19, no.3, pp.558-569, 2011.

6.

Merouane Bouzid, Salah Eddine Cheraitia, Moussa Hireche, "Switched split vector quantizer applied for encoding the LPC parameters of the 2.4 Kbits/s MELP speech coder", 2010 7th International Multi-Conference on Systems, Signals and Devices, pp.1-5, 2010.

7.

Jianhua Tao, Meng Zhang, Jani Nurminen, Jilei Tian, Xia Wang, "Supervisory Data Alignment for Text-Independent Voice Conversion", IEEE Transactions on Audio, Speech, and Language Processing, vol.18, no.5, pp.932-943, 2010.

8.

Oytun Turk, Marc Schroder, "Evaluation of Expressive Speech Synthesis With Voice Conversion and Copy Resynthesis Techniques", IEEE Transactions on Audio, Speech, and Language Processing, vol.18, no.5, pp.965-973, 2010.

9.

Ming Lei, Zhen-Hua Ling, Li-Rong Dai, "Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis", 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4230-4233, 2010.

10.

Meng Zhang, Jianhua Tao, Jani Nurminen, Jilei Tian, Xia Wang, "Phoneme cluster based state mapping for text-independent voice conversion", 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4281-4284, 2009.

11.

Saikat Chatterjee, T. V. Sreenivas, "Switched Conditional PDF-Based Split VQ Using Gaussian Mixture Model", IEEE Signal Processing Letters, vol.15, pp.91-94, 2008.

12.

Saikat Chatterjee, T.V. Sreenivas, "Gaussian Mixture Model Based Switched Split Vector Quantization of LSF Parameters", 2007 IEEE International Symposium on Signal Processing and Information Technology, pp.1054-1059, 2007.

13.

Ping Chen, Mingbo Zhao, Shaoqing Jia, "The Application of Fuzzy Neural Network in the Iatrical Monitor", Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), vol.3, pp.98-103, 2007.

14.

Thomas Eriksson, Fredrik Norden, "Memory-Based Vector Quantization of LSF Parameters by a Power Series Approximation", IEEE Transactions on Audio, Speech, and Language Processing, vol.15, no.4, pp.1146-1155, 2007.

15.

Saikat Chatterjee, T.V. Sreenivas, "Sequential Split Vector Quantization of LSF Parameters using Conditional Pdf", 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol.4, pp.IV-1101-IV-1104, 2007.

16.

Saikat Chatterjee, T.V. Sreenivas, "Computationally efficient optimum weighting function for vector quantization of LSF parameters", 2007 9th International Symposium on Signal Processing and Its Applications, pp.1-4, 2007.

17.

V. Grancharov, D.Y. Zhao, J. Lindblom, W.B. Kleijn, "Low-Complexity, Nonintrusive Speech Quality Assessment", IEEE Transactions on Audio, Speech, and Language Processing, vol.14, no.6, pp.1948-1956, 2006.

18.

F. Lahouti, A.R. Fazel, A.H. Safavi-Naeini, A.K. Khandani, "Single and double frame coding of speech LPC parameters using a lattice-based quantization scheme", IEEE Transactions on Audio, Speech, and Language Processing, vol.14, no.5, pp.1624-1632, 2006.

19.

H. Duxans, A. Bonafonte, "Residual Conversion Versus Prediction on Voice Morphing Systems", 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, vol.1, pp.I-I, 2006.

20.

B. Denby, Y. Oussar, G. Dreyfus, M. Stone, "Prospects for a Silent Speech Interface using Ultrasound Imaging", 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, vol.1, pp.I-I, 2006.

21.

T.Z. Shabestary, P. Hedelin, "LSP quantization by a union of locally trained codebooks", IEEE Transactions on Speech and Audio Processing, vol.13, no.5, pp.811-820, 2005.

22.

F. Norden, T. Eriksson, "Time evolution in LPC spectrum coding", IEEE Transactions on Speech and Audio Processing, vol.12, no.3, pp.290-301, 2004.

23.

S. Srinivasan, J. Samuelsson, W.B. Kleijn, "Estimation of short-term predictor parameters for coding and enhancement of noisy speech", 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.1, pp.I-705, 2004.

24.

F. Lahouti, A.K. Khandani, "Quantization of LSF parameters using a trellis modeling", IEEE Transactions on Speech and Audio Processing, vol.11, no.5, pp.400-412, 2003.

25.

R.C. de Lamare, A. Alcaim, "Effects of adaptive postfilters on the LSF quantisation for low bit rate speech coders in tandem connections", Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings., vol.1, pp.393-396 vol.1, 2003.

26.

Woo-Jin Han, Eun-Kyoung Kim, Yung-Hwan Oh, "Multicodebook split vector quantization of LSF parameters", IEEE Signal Processing Letters, vol.9, no.12, pp.418-421, 2002.

27.

Wesley Pereira, Peter Kabal, "Improved spectral tracking using interpolated linear prediction parameters", 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.1, pp.I-261-I-264, 2002.

28.

R.C. de Lamare, A. Alcaim, "Analysis of LSF switched-predictive vector quantisers", Proceedings of the Sixth International Symposium on Signal Processing and its Applications (Cat.No.01EX467), vol.2, pp.727-730 vol.2, 2001.

29.

Ho Young Hur, Hyung Soon Kim, "Formant weighted cepstral feature for LSP-based speech recognition", 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), vol.1, pp.141-144 vol.1, 2001.

30.

Hai Le Vu, L. Lois, "Efficient distance measure for quantization of LSF and its Karhunen-Loeve transformed parameters", IEEE Transactions on Speech and Audio Processing, vol.8, no.6, pp.744-746, 2000.

Cites in Papers - Other Publishers (24)

1.

Marcelo S. Alencar, Valdemar C. da Rocha, "Speech Coding", Communication Systems, pp.97, 2022.

2.

Marcelo S. Alencar, Valdemar C. da Rocha, "Speech Coding", Communication Systems, pp.89, 2020.

3.

Weixun GAO, Qiying CAO, Yao QIAN, "Cross-Dialectal Voice Conversion with Neural Networks", IEICE Transactions on Information and Systems, vol.E97.D, no.11, pp.2872, 2014.

4.

Byungsik YOON, Heewan PARK, Sangwon KANG, "A Low Power Bandwidth Extension Technique", IEICE Transactions on Communications, vol.E95-B, no.1, pp.358, 2012.

5.

Fatiha Merazka, "Intraframe quantization of speech line spectrum pairs for code-excited linear prediction based coders in packet networks", Transactions on Emerging Telecommunications Technologies, vol.23, no.8, pp.789, 2012.

6.

Elif Bozkurt, Engin Erzin, Çiğdem Eroğlu Erdem, A. Tanju Erdem, "Formant position based weighted spectral features for emotion recognition", Speech Communication, vol.53, no.9-10, pp.1186, 2011.

7.

Sonia L. Q. Dall'Agnol, Abraham Alcaim, Jose Roberto B. de Marca, "Performance of LSF vector quantizers for VSELP coders in noisy channels", European Transactions on Telecommunications, vol.5, no.5, pp.553, 2010.

8.

Abdellah KADDAI, Mohammed HALIMI, "Low-Complexity Wideband LSF Quantization Using Algebraic Trellis VQ", IEICE Transactions on Information and Systems, vol.E92-D, no.12, pp.2478, 2009.

9.

Saikat Chatterjee, T.V. Sreenivas, "Reduced complexity two stage vector quantization", Digital Signal Processing, vol.19, no.3, pp.476, 2009.

10.

Oytun Turk, Levent M. Arslan, "Automatic source speaker selection for voice conversion", The Journal of the Acoustical Society of America, vol.125, no.1, pp.480, 2009.

11.

Merouane Bouzid, Amar Djeradi, "Optimisation de la quantification vectorielle codee par treillis: application au codage des parametres LSF", Annales Des Telecommunications, vol.60, no.5-6, pp.744, 2005.

12.

Alexander Petrovsky, Andrzej Sawicki, Alexander Pavlovec, Information Processing and Security Systems, pp.67, 2005.

13.

Merouane Bouzid, Amar Djeradi, Bachir Boudraa, "Optimized trellis coded vector quantization of LSF parameters, application to the 4.8kbps FS1016 speech coder", Signal Processing, vol.85, no.9, pp.1675, 2005.

14.

Juan M. Lopez-Soler, Victoria Sanchez, Angel de la Torre, Antonio J. Rubio-Ayuso, "Linear inter-frame dependencies for very low bit-rate speech coding", Speech Communication, vol.34, no.4, pp.333, 2001.

15.

Mi Suk Lee, Hong Kook Kim, Hwang Soo Lee, "A new distortion measure for spectral quantization based on the LSF intermodel interlacing property", Speech Communication, vol.35, no.3-4, pp.191, 2001.

16.

Seung Ho Choi, Hong Kook Kim, Hwang Soo Lee, "Speech recognition using quantized LSP parameters and their transformations in digital communication", Speech Communication, vol.30, no.4, pp.223, 2000.

17.

Levent M. Arslan, David Talkin, "Codebook based face point trajectory synthesis algorithm using speech input", Speech Communication, vol.27, no.2, pp.81, 1999.

18.

C.Q Chen, S.N Koh, I.Y Soon, "An associatively classified partitioned vector quantizer", Signal Processing, vol.76, no.3, pp.311, 1999.

19.

Levent M. Arslan, "Speaker Transformation Algorithm using Segmental Codebooks (STASC)", Speech Communication, vol.28, no.3, pp.211, 1999.

20.

Balázs Kövesi, Samir Saoudi, Jean Marc Boucher, Gábor Horváth, "Real time vector quantization of LSP parameters", Speech Communication, vol.29, no.1, pp.39, 1999.

21.

Engin Erzin, A. Enis Çetin, Speech Recognition and Coding, pp.431, 1995.

22.

Nam Phamdo, Tai Hong Lee, Nariman Farvardin, Speech Recognition and Coding, pp.493, 1995.

23.

Shihua Wang, Erdal Paksoy, Allen Gersho, Speech and Audio Coding for Wireless and Network Applications, pp.251, 1993.

24.

Nam Phamdo, Nariman Farvardin, Takehiro Moriya, "Combined Source-Channel Coding of LSP Parameters Using Multi-Stage Vector Quantization", Speech and Audio Coding for Wireless and Network Applications, pp.181, 1993.
