Abstract:
Three robust algorithms based on the recently proposed concept of structured vector quantization have been developed for quantization of speech LSP (line spectrum pair) parameters. The first algorithm exploits interframe correlation of the LSP parameters and requires 24 bits per LSP vector to achieve 1 dB average spectral distortion. The second and third algorithms quantize each LSP vector independently and, for 1 dB distortion, require 29 and 25 bits per vector, respectively. The last two algorithms yield a considerably smaller fraction of frames with distortion greater than 2 dB than other schemes proposed so far.
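The two figures of merit quoted in the abstract (average spectral distortion in dB, and the fraction of frames above 2 dB) can be illustrated with a minimal sketch. It assumes the root-mean-square log-spectral distortion commonly used in the LSP quantization literature, SD = sqrt((1/pi) * integral over [0, pi] of (10 log10 P(w) - 10 log10 P_q(w))^2 dw); the abstract does not state which definition the paper uses. The function names, the 256-point frequency grid, and the toy coefficients are hypothetical, and the LSP-to-LPC conversion is omitted, so the inputs are LPC polynomial coefficients `[1, a1, ..., ap]`.

```python
import cmath
import math


def lpc_log_spectrum_db(a, n_points=256):
    """Log power spectrum (dB) of the all-pole filter 1/A(z),
    sampled on a uniform grid over [0, pi).
    `a` holds the LPC polynomial coefficients [1, a1, ..., ap]."""
    spectrum = []
    for k in range(n_points):
        w = math.pi * k / n_points
        # A(e^{jw}) = sum_i a[i] * e^{-j w i}
        A = sum(c * cmath.exp(-1j * w * i) for i, c in enumerate(a))
        # 10*log10(1/|A|^2) = -20*log10|A|
        spectrum.append(-20.0 * math.log10(abs(A)))
    return spectrum


def spectral_distortion_db(a_ref, a_quant, n_points=256):
    """RMS difference (dB) between two LPC log spectra (assumed definition)."""
    s_ref = lpc_log_spectrum_db(a_ref, n_points)
    s_q = lpc_log_spectrum_db(a_quant, n_points)
    mean_sq = sum((x - y) ** 2 for x, y in zip(s_ref, s_q)) / n_points
    return math.sqrt(mean_sq)


def summarize(distortions, outlier_db=2.0):
    """Return (average distortion, fraction of frames above outlier_db)."""
    avg = sum(distortions) / len(distortions)
    frac = sum(1 for d in distortions if d > outlier_db) / len(distortions)
    return avg, frac


if __name__ == "__main__":
    # Toy frames: (unquantized, quantized) coefficient pairs, purely illustrative.
    frames = [
        ([1.0, -0.90], [1.0, -0.88]),
        ([1.0, -0.50, 0.20], [1.0, -0.45, 0.25]),
    ]
    per_frame = [spectral_distortion_db(a, aq) for a, aq in frames]
    avg_sd, outlier_fraction = summarize(per_frame)
    print(f"average SD = {avg_sd:.3f} dB, frames > 2 dB = {outlier_fraction:.1%}")
```

Under this reading, a quantizer "achieves 1 dB average spectral distortion" when the first value returned by `summarize` is about 1.0 over a test set, and the outlier claim in the abstract corresponds to the second value.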
Published in: [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing
Date of Conference: 14-17 April 1991
Date Added to IEEE Xplore: 06 August 2002
Print ISBN: 0-7803-0003-3
Print ISSN: 1520-6149
Cites in Papers - IEEE (73)
1.
R. Sugiura, Y. Kamamoto, N. Harada, H. Kameoka, T. Moriya, "Direct linear conversion of LSP parameters for perceptual control in speech and audio coding", 2014 22nd European Signal Processing Conference (EUSIPCO), pp.56-60, 2014.
2.
Young-Sun Joo, Chi-Sang Jung, Hong-Goo Kang, "Enhancement of spectral clarity for HMM-based text-to-speech systems", 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7840-7843, 2013.
3.
Yao Qian, Frank K. Soong, Zhi-Jie Yan, "A Unified Trajectory Tiling Approach to High Quality Speech Rendering", IEEE Transactions on Audio, Speech, and Language Processing, vol.21, no.2, pp.280-290, 2013.
4.
Yao Qian, Ji Xu, Frank K. Soong, "A frame mapping based HMM approach to cross-lingual voice transformation", 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5120-5123, 2011.
5.
L. Anders Ekman, Volodya Grancharov, W. Bastiaan Kleijn, "Double-Ended Quality Assessment System for Super-Wideband Speech", IEEE Transactions on Audio, Speech, and Language Processing, vol.19, no.3, pp.558-569, 2011.
6.
Merouane Bouzid, Salah Eddine Cheraitia, Moussa Hireche, "Switched split vector quantizer applied for encoding the LPC parameters of the 2.4 Kbits/s MELP speech coder", 2010 7th International Multi-Conference on Systems, Signals and Devices, pp.1-5, 2010.
7.
Jianhua Tao, Meng Zhang, Jani Nurminen, Jilei Tian, Xia Wang, "Supervisory Data Alignment for Text-Independent Voice Conversion", IEEE Transactions on Audio, Speech, and Language Processing, vol.18, no.5, pp.932-943, 2010.
8.
Oytun Turk, Marc Schroder, "Evaluation of Expressive Speech Synthesis With Voice Conversion and Copy Resynthesis Techniques", IEEE Transactions on Audio, Speech, and Language Processing, vol.18, no.5, pp.965-973, 2010.
9.
Ming Lei, Zhen-Hua Ling, Li-Rong Dai, "Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis", 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4230-4233, 2010.
10.
Meng Zhang, Jianhua Tao, Jani Nurminen, Jilei Tian, Xia Wang, "Phoneme cluster based state mapping for text-independent voice conversion", 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4281-4284, 2009.
11.
Saikat Chatterjee, T. V. Sreenivas, "Switched Conditional PDF-Based Split VQ Using Gaussian Mixture Model", IEEE Signal Processing Letters, vol.15, pp.91-94, 2008.
12.
Saikat Chatterjee, T.V. Sreenivas, "Gaussian Mixture Model Based Switched Split Vector Quantization of LSF Parameters", 2007 IEEE International Symposium on Signal Processing and Information Technology, pp.1054-1059, 2007.
13.
Ping Chen, Mingbo Zhao, Shaoqing Jia, "The Application of Fuzzy Neural Network in the Iatrical Monitor", Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), vol.3, pp.98-103, 2007.
14.
Thomas Eriksson, Fredrik Norden, "Memory-Based Vector Quantization of LSF Parameters by a Power Series Approximation", IEEE Transactions on Audio, Speech, and Language Processing, vol.15, no.4, pp.1146-1155, 2007.
15.
Saikat Chatterjee, T.V. Sreenivas, "Sequential Split Vector Quantization of LSF Parameters using Conditional Pdf", 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol.4, pp.IV-1101-IV-1104, 2007.
16.
Saikat Chatterjee, T.V. Sreenivas, "Computationally efficient optimum weighting function for vector quantization of LSF parameters", 2007 9th International Symposium on Signal Processing and Its Applications, pp.1-4, 2007.
17.
V. Grancharov, D.Y. Zhao, J. Lindblom, W.B. Kleijn, "Low-Complexity, Nonintrusive Speech Quality Assessment", IEEE Transactions on Audio, Speech, and Language Processing, vol.14, no.6, pp.1948-1956, 2006.
18.
F. Lahouti, A.R. Fazel, A.H. Safavi-Naeini, A.K. Khandani, "Single and double frame coding of speech LPC parameters using a lattice-based quantization scheme", IEEE Transactions on Audio, Speech, and Language Processing, vol.14, no.5, pp.1624-1632, 2006.
19.
H. Duxans, A. Bonafonte, "Residual Conversion Versus Prediction on Voice Morphing Systems", 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, vol.1, pp.I-I, 2006.
20.
B. Denby, Y. Oussar, G. Dreyfus, M. Stone, "Prospects for a Silent Speech Interface using Ultrasound Imaging", 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, vol.1, pp.I-I, 2006.
21.
T.Z. Shabestary, P. Hedelin, "LSP quantization by a union of locally trained codebooks", IEEE Transactions on Speech and Audio Processing, vol.13, no.5, pp.811-820, 2005.
22.
F. Norden, T. Eriksson, "Time evolution in LPC spectrum coding", IEEE Transactions on Speech and Audio Processing, vol.12, no.3, pp.290-301, 2004.
23.
S. Srinivasan, J. Samuelsson, W.B. Kleijn, "Estimation of short-term predictor parameters for coding and enhancement of noisy speech", 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.1, pp.I-705, 2004.
24.
F. Lahouti, A.K. Khandani, "Quantization of LSF parameters using a trellis modeling", IEEE Transactions on Speech and Audio Processing, vol.11, no.5, pp.400-412, 2003.
25.
R.C. de Lamare, A. Alcaim, "Effects of adaptive postfilters on the LSF quantisation for low bit rate speech coders in tandem connections", Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings., vol.1, pp.393-396, 2003.
26.
Woo-Jin Han, Eun-Kyoung Kim, Yung-Hwan Oh, "Multicodebook split vector quantization of LSF parameters", IEEE Signal Processing Letters, vol.9, no.12, pp.418-421, 2002.
27.
Wesley Pereira, Peter Kabal, "Improved spectral tracking using interpolated linear prediction parameters", 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.1, pp.I-261-I-264, 2002.
28.
R.C. de Lamare, A. Alcaim, "Analysis of LSF switched-predictive vector quantisers", Proceedings of the Sixth International Symposium on Signal Processing and its Applications (Cat.No.01EX467), vol.2, pp.727-730, 2001.
29.
Ho Young Hur, Hyung Soon Kim, "Formant weighted cepstral feature for LSP-based speech recognition", 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), vol.1, pp.141-144, 2001.
30.
Hai Le Vu, L. Lois, "Efficient distance measure for quantization of LSF and its Karhunen-Loeve transformed parameters", IEEE Transactions on Speech and Audio Processing, vol.8, no.6, pp.744-746, 2000.
Cites in Papers - Other Publishers (24)
1.
Marcelo S. Alencar, Valdemar C. da Rocha, "Speech Coding", Communication Systems, pp.97, 2022.
2.
Marcelo S. Alencar, Valdemar C. da Rocha, "Speech Coding", Communication Systems, pp.89, 2020.
3.
Weixun GAO, Qiying CAO, Yao QIAN, "Cross-Dialectal Voice Conversion with Neural Networks", IEICE Transactions on Information and Systems, vol.E97.D, no.11, pp.2872, 2014.
4.
Byungsik YOON, Heewan PARK, Sangwon KANG, "A Low Power Bandwidth Extension Technique", IEICE Transactions on Communications, vol.E95-B, no.1, pp.358, 2012.
5.
Fatiha Merazka, "Intraframe quantization of speech line spectrum pairs for code-excited linear prediction based coders in packet networks", Transactions on Emerging Telecommunications Technologies, vol.23, no.8, pp.789, 2012.
6.
Elif Bozkurt, Engin Erzin, Çiğdem Eroğlu Erdem, A. Tanju Erdem, "Formant position based weighted spectral features for emotion recognition", Speech Communication, vol.53, no.9-10, pp.1186, 2011.
7.
Sonia L. Q. Dall'Agnol, Abraham Alcaim, Jose Roberto B. de Marca, "Performance of LSF vector quantizers for VSELP coders in noisy channels", European Transactions on Telecommunications, vol.5, no.5, pp.553, 2010.
8.
Abdellah KADDAI, Mohammed HALIMI, "Low-Complexity Wideband LSF Quantization Using Algebraic Trellis VQ", IEICE Transactions on Information and Systems, vol.E92-D, no.12, pp.2478, 2009.
9.
Saikat Chatterjee, T.V. Sreenivas, "Reduced complexity two stage vector quantization", Digital Signal Processing, vol.19, no.3, pp.476, 2009.
10.
Oytun Turk, Levent M. Arslan, "Automatic source speaker selection for voice conversion", The Journal of the Acoustical Society of America, vol.125, no.1, pp.480, 2009.
11.
Merouane Bouzid, Amar Djeradi, "Optimisation de la quantification vectorielle codee par treillis: application au codage des parametres LSF", Annales Des Telecommunications, vol.60, no.5-6, pp.744, 2005.
12.
Alexander Petrovsky, Andrzej Sawicki, Alexander Pavlovec, Information Processing and Security Systems, pp.67, 2005.
13.
Merouane Bouzid, Amar Djeradi, Bachir Boudraa, "Optimized trellis coded vector quantization of LSF parameters, application to the 4.8kbps FS1016 speech coder", Signal Processing, vol.85, no.9, pp.1675, 2005.
14.
Juan M. Lopez-Soler, Victoria Sanchez, Angel de la Torre, Antonio J. Rubio-Ayuso, "Linear inter-frame dependencies for very low bit-rate speech coding", Speech Communication, vol.34, no.4, pp.333, 2001.
15.
Mi Suk Lee, Hong Kook Kim, Hwang Soo Lee, "A new distortion measure for spectral quantization based on the LSF intermodel interlacing property", Speech Communication, vol.35, no.3-4, pp.191, 2001.
16.
Seung Ho Choi, Hong Kook Kim, Hwang Soo Lee, "Speech recognition using quantized LSP parameters and their transformations in digital communication", Speech Communication, vol.30, no.4, pp.223, 2000.
17.
Levent M. Arslan, David Talkin, "Codebook based face point trajectory synthesis algorithm using speech input", Speech Communication, vol.27, no.2, pp.81, 1999.
18.
C.Q. Chen, S.N. Koh, I.Y. Soon, "An associatively classified partitioned vector quantizer", Signal Processing, vol.76, no.3, pp.311, 1999.
19.
Levent M. Arslan, "Speaker Transformation Algorithm using Segmental Codebooks (STASC)", Speech Communication, vol.28, no.3, pp.211, 1999.
20.
Balázs Kövesi, Samir Saoudi, Jean Marc Boucher, Gábor Horváth, "Real time vector quantization of LSP parameters", Speech Communication, vol.29, no.1, pp.39, 1999.
21.
Engin Erzin, A. Enis Çetin, Speech Recognition and Coding, pp.431, 1995.
22.
Nam Phamdo, Tai Hong Lee, Nariman Farvardin, Speech Recognition and Coding, pp.493, 1995.
23.
Shihua Wang, Erdal Paksoy, Allen Gersho, Speech and Audio Coding for Wireless and Network Applications, pp.251, 1993.
24.
Nam Phamdo, Nariman Farvardin, Takehiro Moriya, "Combined Source-Channel Coding of LSP Parameters Using Multi-Stage Vector Quantization", Speech and Audio Coding for Wireless and Network Applications, pp.181, 1993.