Method of extracting features in a voice recognition system

Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C704S206000

Reexamination Certificate

active

06182036

ABSTRACT:

FIELD OF THE INVENTION
The present invention pertains to voice recognition, and more particularly to feature extraction in a voice recognition system.
BACKGROUND OF THE INVENTION
In a speaker dependent speech recognition system, users must enroll the vocabulary words they wish to have available when using the system. A vocabulary “word” can be a single spoken word or a short phrase, and the vocabulary words chosen depend on the particular application. For example, a speech recognition implementation for a portable radiotelephone might require the user to provide the names and locations of frequently called people (e.g., “Fred's office”), or commands for frequently used features usually available in a user interface (e.g., “battery meter”, “messages”, or “phone lock”).
During an enrollment procedure, a speech recognition system is responsive to the user's input to derive a representative template for each vocabulary word. In some systems, this template can be represented by a hidden Markov model (HMM) which consists of a series of states. Each state represents a finite section of a speech utterance: utterance as used herein referring to a “vocabulary word” which may comprise one or more words. A statistical representation of each state of an HMM is calculated using one or more enrollment speech samples of a particular vocabulary word uttered by the user. This is accomplished through frame-to-state assignments.
Such state assignments are used both for training and voice recognition modes of operation. In particular, the assigned states are used to create models in a training mode which are used as a comparison reference during speech recognition mode. The assignments for input utterances in a voice recognition mode of operation are used to compare the input utterances to stored reference models during the voice recognition mode.
An alignment algorithm, such as a Viterbi algorithm is used for frame-to-state alignment of an utterance. This alignment algorithm, which provides the best match of the speech utterance onto the model, is used to assign each frame of the vocabulary word utterance to individual states of the model. Using this assignment, the statistical representations for each state can be refined.
Because of the amount of information, most speech recognition systems require large amounts of both volatile memory, such as random access memory (RAM), and non-volatile memory (NVM), such as flash ROM or electronically erasable read only memory (EEPROM). These memory requirements can be prohibitively expensive for cost-sensitive applications such as portable wireless communication devices. Additionally, speech recognition systems require significant computational requirements measured in millions of instructions per second (MIPS). The large number of MIPS are required for training and voice recognition. This large MIPS requirement can negatively impact the performance of the device in which voice recognition is employed by using valuable resources and slowing down operating speeds.
In order to implement a speaker dependent training and recognition algorithm on a portable device, such as wireless communication device where very little random access memory (RAM) is available, there is a need for a method that supports a smaller memory and uses fewer MIPS without significantly negatively impacting on recognition in all environments.


REFERENCES:
patent: 5097509 (1992-03-01), Lennig
patent: 6029124 (2000-02-01), Gillick et al.
patent: 0192898 A1 (1986-03-01), None
Hunt, M. J., and Lefebvre, C., “A Comparison of Several Acoustic Representations for Speech recognition with Degraded and Undegraded Speech,” 1989 Int. Conf. Acoust. Speech Sig. Proc. ICASSP-89, vol. 1, pp. 262-265, 23-26 May 1989.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method of extracting features in a voice recognition system does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method of extracting features in a voice recognition system, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method of extracting features in a voice recognition system will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2504032

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.