Rapid tree-based method for vector quantization

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

395 209, G10L 302

Patent

active

057347913

ABSTRACT:
The branching decision for each node in a vector quantization (VQ) binary tree is made by a simple comparison of a pre-selected element of the candidate vector with a stored threshold resulting in a binary decision for reaching the next lower level. Each node has a preassigned element and threshold value. Conventional centroid distance training techniques (such as LBG and k-means) are used to establish code-book indices corresponding to a set of VQ centroids. The set of training vectors are used a second time to select a vector element and threshold value at each node that approximately splits the data evenly. After processing the training vectors through the binary tree using threshold decisions, a histogram is generated for each code-book index that represents the number of times a training vector belonging to a given index set appeared at each index. The final quantization is accomplished by processing and then selecting the nearest centroid belonging to that histogram. Accuracy comparable to that achieved by conventional binary tree VQ is realized but with almost a full magnitude increase in processing speed.

REFERENCES:
patent: Re34562 (1994-03-01), Murakami et al.
patent: 4348553 (1982-09-01), Baker et al.
patent: 4727354 (1988-02-01), Lindsay
patent: 4878230 (1989-10-01), Murakami et al.
patent: 4903305 (1990-02-01), Gillick et al.
patent: 5021971 (1991-06-01), Lindsay
patent: 5027406 (1991-06-01), Roberts et al.
patent: 5194950 (1993-03-01), Murakami et al.
patent: 5291286 (1994-03-01), Murakami et al.
patent: 5297170 (1994-03-01), Eyuboglu et al.
George M. White, "Speech Recognition, Neural Nets, and Brains", Jan. 1992, pp. 1-48.
Kai-Fu Lee, "Large-Vocabulary Speaker-Independent Continuous Speech Recogniton: The Sphinx System" Carnegie Mellon University, Pittsburgh, Pennsylvania, Apr. 1988, pp. 1-184.
Ronald W. Schafer and Lawrence R. Rabiner, "Digital Representations of Speech Signals" The Institute of Electrical and Electronics Engineers, Inc., 1975, pp. 49-63.
D. Raj Reddy, "Speech Recognition by Machine: A Review"IEEE Proceedings 64(4):502-531, Apr. 1976, pp. 8-35.
Robert M. Gray, "Vector Quantization" IEEE,1984, pp. 75-100.
Rabiner, L., Sondhi, M. and Levison, S., "Note on the Properties of a Vector Quantizer for LPC Coefficients," vol. 62, No. 8, Oct. 1983, pp. 2603-2615, Bell System Technical Journal.
Linde, Y., Buzo, A., and Gray, R.M., "An Algorithm for Vector Quantization," IEEE Trans. Commun., COM-28, No. 1 (Jan. 1980) pp. 84-95.
Bahl, I.R., et al., "Large Vocabulary National Language Continuous Speech Recognition," Proceeding of the IEEE ICASSP 1989, Glasgow, pp. 465-467.
Gray, R.M., "Vector Quantization", IEEE ASSP Magazine, Apr. 1984, vol. 1, No. 2, pp. 4-29.
Bahl, L.R., Baker, J.L., Cohen, P.S., Jelineck, F., Lewis, B.L., Mercer, R.L., "Recognition of a Continuously Read Natural Corpus" IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Apr. 1978, pp. 422-424.
Schwartz, R., Chow, Y., Kimball, Ol, Roucos, S., Krasner, M., Makhoul, J., "Context-Dependent Modeling for Acoustic-Phonetic Recognition of Continuous Speech," IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Apr. 1985, pp. 1205-1208.
Schwartz, R.M., Chow, S.L., Roucos, S., Krauser, M., Makhoul, J., "Improved Hidden Markov Modeling of Phonemes for Continuous Speech Recognition," IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Apr. 1984, pp. 35.6.1-35.6.4.
Alleva, F. Hon, H., Huang, X., Hwang, M., Rosenfeld, R., Weide, R., "Applying Sphinx II to DARPA Wall Street Journal CSR Task", Proc. of the DARPA Speech and NL Workshop, Feb. 1992, Morgan Kaufman Pub., San Mateo, CA, pp. 393-398.
Kai-Fu Lee, "Automatic Speech Recognition," Kluwer Academic Publishers, Boston/Dordrecht/London, 1989, pp. 1-203.
Tenenbaum et al., Data Structures Using Pascal, 1981, Prentice-Hall, Inc., pp. 252-283.
Buzo et al., "Speech Coding Based Upon Vector Quantization," IEEE Trans on ASSP, vol. ASSP-28, No. 5, Oct. 1980, pp. 562-574.
Parsons, "Voice and Speech Processing," 1987 by McGraw-Hill, Inc., pp. 203-213.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Rapid tree-based method for vector quantization does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Rapid tree-based method for vector quantization, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Rapid tree-based method for vector quantization will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-58767

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.