Loading [MathJax]/extensions/MathZoom.js
Huisheng Chi - IEEE Xplore Author Profile

Showing 1-17 of 17 results

Filter Results

Show

Results

In this paper, we presented an asynchronous multiple stream based Chinese tonal acoustic modeling framework. In this framework, toneless phonetic units and tones are modeled separately with different acoustic features. During the training and decoding process, a set of models are coupled together with a product hidden Markov models (PHMM) to represent whole tonal phonetic units. Through this, a co...Show More
Outlier problem is one of the typical problems in an incomplete data based machine learning system [1][2][3]. An outlier is a pattern that was either mislabeled in the training data, or inherently ambiguous and hard to recognize, therefore, it usually brings extra trouble for a learning task, either in debasing the performance or leading the learning process to be more complicated. In order to tac...Show More
Text categorization is one of the typical machine learning tasks that suffer from an incomplete training data problem. A main reason is the existence of outliers in training data, such as non-sense documents, documents mislabeled or lying on the border between different categories, and documents that are out of the defined categories, etc. Therefore, in a text categorization task, outlier learning...Show More
Text categorization task always suffers from a high dimension problem, which leads the learning system to be in a status of either lower efficiency or lower performance. A number of feature selection methods have therefore been adopted or proposed for its dimensional reduction, such as DF, IG, Chi Square and so on. Unlike those traditional feature selection methods, in this paper, a feature select...Show More
The perception mechanisms of the human auditory periphery and cochlear nucleus were simulated and the potential application to the voice password gatekeeper was discussed. A biomimetics speaker identification system was implemented based on the auditory processing. Obvious improvement in the robustness was shown under a noisy environment.Show More
Model-based approach is one of methods widely used for speaker identification, where a statistical model is used to characterize a specific speaker's voice but no interspeaker information is involved in its parameter estimation. It is observed that interspeaker information is very helpful in discriminating between different speakers. In this paper, we propose a novel method for the use of interspe...Show More
Many speaker identification systems are created by model-based approaches, where a statistical model is used to characterize a speaker's voice and no inter-speaker information is used in parameter estimation. It is well known that inter-speaker information is very helpful in discrimination of different speakers. We propose a method for the use of inter-speaker information to improve performance of...Show More
We propose an alternative method for the use of different feature sets in pattern classification. Unlike traditional methods, e.g. combination of multiple classifiers and use of a composite feature set, our method copes with the problem based on an idea of soft competition on different feature sets, a modular neural network architecture is proposed to implement the idea accordingly. The proposed a...Show More
According to the characteristics of the auditory periphery and cochlear nucleus, as well as attempting to simulate the mechanism of auditory system as a whole, two kinds of novel speech feature are presented in this paper, and a framework of neural network has been adopted. The two features considered are: the weighted average localized synchronized rate cepstrum, and the weighted firing rate ceps...Show More
A hybrid architecture based upon hidden Markov models (HMMs) and multilayer feedforward neural network (MFNN) is presented for speaker identification. Unlike most of the previous combing methods, the proposed architecture uses HMMs to model individual speaker and uses MFNN to deal with the inter-speaker information for improving performance. Learning in the proposed architecture consists of two ph...Show More
A modular neural architecture, MME, is considered here as an alternative to the standard mixtures of experts architecture for classification with diverse features. Unlike the standard mixtures of experts architecture, a gate-bank consisting of multiple gating networks is introduced to the proposed architecture, and those gating networks in the gate-bank receive different input vectors while expert...Show More
Current speech or speaker recognition systems rely largely on voiced parts of utterance, though a great amount of information for speech perception is contained in the nonstationary consonants and transition. How to model and characterize the dynamic spectral features describing the transition still remains a question. This paper investigates the modeling and detection of the spectral transition b...Show More
In this paper, we present a scheme of combining multiple time-delay nets with the structure of the hierarchical mixture of experts (HME) in order to complete a multi-scale analysis for the temporal data. This method extends the technique of combining multiple classifiers based upon the same static input to sequence processing. We have already applied the proposed method to a real-world problem, ca...Show More
A modified hierarchical mixtures of experts (HME) architecture is presented for text-dependent speaker identification. A new gating network is introduced to the original HME architecture for the use of instantaneous and transitional spectral information in text-dependent speaker identification. The statistical model underlying the proposed architecture is presented and learning is treated as a max...Show More
In this paper, we explore the hierarchical mixture of experts (HME) architecture for a substantial problem, that of text-dependent speaker identification. For a specific multiway classification, we propose a generalized Bernolli density instead of the multinomial logit density. Time-delay technique is also introduced to HME for spatio-temporal processing. Using the proposed density and the time-de...Show More
A TDM/CDMA VSAT network is described in which the data and real-time vocoder service can be simultaneously provided. The network architecture, link design, switching equipment, communication protocol and network management etc. are given.<>Show More