Patent
1995-11-07
1999-08-03
Hudspeth, David R.
Data processing: speech signal processing, linguistics, language
Speech signal processing
Application
704/207, 704/218, 704/241, G10L 5/04, G10L 9/08
active
059338083
ABSTRACT:
A system that synchronously segments a speech waveform using pitch period and a center of the pitch waveform. The pitch waveform center is determined by finding a local minimum of a centroid histogram waveform of the low-pass filtered speech waveform for one pitch period. The speech waveform can then be represented by one or more of such pitch waveforms or segments during speech compression, reconstruction or synthesis. The pitch waveform can be modified by frequency enhancement/filtering, waveform stretching/shrinking in speech synthesis or speech disguise. The utterance rate can also be controlled to speed up or slow down the speech.
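The abstract does not say how the stretching/shrinking or the utterance-rate control is carried out. As a rough illustration only, and assuming the speech has already been cut into pitch-synchronous segments (one pitch waveform per list), the two operations might be sketched as below. The function names `resample_linear`, `change_rate`, and `synthesize` are hypothetical, and linear interpolation and segment repetition/skipping are stand-in techniques chosen for simplicity, not the patent's stated method.

```python
def resample_linear(segment, new_len):
    """Stretch or shrink one pitch waveform to new_len samples.

    Uses plain linear interpolation (an assumption; the abstract does not
    name the interpolation method).
    """
    if new_len < 2 or len(segment) < 2:
        raise ValueError("need at least two samples in and out")
    scale = (len(segment) - 1) / (new_len - 1)
    out = []
    for i in range(new_len):
        pos = i * scale
        k = min(int(pos), len(segment) - 2)  # left neighbour index
        frac = pos - k                       # distance past the left neighbour
        out.append(segment[k] * (1.0 - frac) + segment[k + 1] * frac)
    return out


def change_rate(segments, rate):
    """Speed up (rate > 1) or slow down (rate < 1) by skipping or repeating
    whole pitch segments, leaving each segment's shape untouched."""
    if rate <= 0:
        raise ValueError("rate must be positive")
    out = []
    i = 0
    while int(i * rate) < len(segments):
        out.append(segments[int(i * rate)])
        i += 1
    return out


def synthesize(segments):
    """Concatenate pitch segments back into one sample sequence."""
    return [sample for seg in segments for sample in seg]


if __name__ == "__main__":
    # Six toy "pitch waveforms" of four samples each (made-up data).
    segs = [[float(i)] * 4 for i in range(6)]
    faster = synthesize(change_rate(segs, 2.0))   # every other segment
    slower = synthesize(change_rate(segs, 0.5))   # each segment twice
    print(len(faster), len(slower))
    print(resample_linear([0.0, 2.0, 4.0], 5))
```

Because `change_rate` works on whole segments, it changes duration without altering the pitch period inside each segment, whereas `resample_linear` changes the length of an individual pitch waveform and therefore its pitch.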
REFERENCES:
patent: 3535454 (1970-10-01), Miller
patent: 3649765 (1972-03-01), Rabiner et al.
patent: 3928722 (1975-12-01), Nakata et al.
patent: 4246617 (1981-01-01), Portnoff
patent: 4435832 (1984-03-01), Asada et al.
patent: 4520502 (1985-05-01), Fujita
patent: 4561337 (1985-12-01), Wachi
patent: 4672667 (1987-06-01), Scott et al.
patent: 4852169 (1989-07-01), Veeneman et al.
patent: 5003604 (1991-03-01), Ozaki et al.
patent: 5054085 (1991-10-01), Meisel et al.
patent: 5113449 (1992-05-01), Blanton et al.
patent: 5127053 (1992-06-01), Koch
patent: 5422977 (1995-06-01), Patterson et al.
patent: 5479564 (1995-12-01), Vogten et al.
Carl W. Helstrom, Statistical Theory of Signal Detection, second edition, Pergamon, p. 19, 1968.
L.R. Rabiner and R.W. Schafer, "Digital Processing of Speech Signals", Prentice-Hall Inc., Englewood Cliffs, NJ, 1978, Chapter 4.
G.S. Kang, L.J. Fransen and E.L. Kline, "Multirate Processor (MRP) for Digital Voice Communications", Naval Research Laboratory, Washington, D.C., Mar. 21, 1979, p. 60.
G.S. Kang and L.J. Fransen, "Second Report of the Multirate Processor (MRP) for Digital Voice Communications", Naval Research Laboratory, Washington, D.C., Sep. 30, 1982.
G.S. Kang and L.J. Fransen, "Low-Bit Rate Speech Encoders Based on Line-Spectrum Frequencies (LSFs)", Naval Research Laboratory, Washington, D.C., Jan. 24, 1985.
G.S. Kang and L.J. Fransen, "High-Quality 800-b/s Voice Processing Algorithm", Naval Research Laboratory, Washington, D.C., Feb. 25, 1991.
Colin J. Powell, "C4I for the Warrior", Jun. 12, 1992.
"Digital Voice Processor Consortium Report on Performance of the LPC-10e Voice Processor".
Proceedings ICASSP 85, IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, "Automatic Speaker Recognition Using Vocoded Speech", Stephanie S. Everett, Naval Research Laboratory, Washington, D.C., pp. 383-386.
Alan V. Oppenheim and Ronald W. Schafer, "Discrete-Time Signal Processing", Prentice-Hall, Englewood Cliffs, NJ, Chapter 10 - Discrete Hilbert Transforms, pp. 674-675.
G.S. Kang, T.M. Moran and D.A. Heide, Voice Message Systems for Tactical Applications (Canned Speech Approach), Naval Research Laboratory, Washington, D.C., Sep. 3, 1993.
Ralph K. Potter, George A. Kopp and Harriet Green Kopp, "Visible Speech", Dover Publications, Inc., New York, pp. 1-3 and 4.
Athanasios Papoulis, "Signal Analysis", McGraw-Hill Book Company, p. 66.
Thomas E. Tremain, "The Government Standard Linear Predictive Coding Algorithm: LPC-10", Speech Technology - Man/Machine Voice Communications, vol. 1, No. 2, Apr. 1982, pp. 40-43.
Homer Dudley, "The Carrier Nature of Speech", Speech Synthesis, Benchmark Papers in Acoustics, 1940, pp. 22-43.
FF9, Identifying familiar talkers over a 2.4 kbps LPC voice system, Astrid Schmidt-Nielsen (Code 7526, Naval Research Laboratory, Washington, D.C. 20375).
George S. Kang and Lawrence J. Fransen, "Speech Analysis and Synthesis Based on Pitch-Synchronous Segmentation of the Speech Waveform", Naval Research Laboratory, Nov. 9, 1994.
DARPA TIMIT Acoustic Phonetic Continuous Speech Database, Training Set: 420 Talkers, 4200 Sentences, Prototype, Dec. 1988.
G.S. Kang and Stephanie S. Everett, "Improvement of the Excitation Source in the Narrow-Band Linear Prediction Vocoder", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-33, No. 2, Apr. 1985, pp. 377-386.
Fransen Lawrence J.
Kang George S.
Hudspeth David R.
Jameson George
McDonnell Thomas E.
Smits Talivaldis Ivars
The United States of America as represented by the Secretary of
Method and apparatus for generating modified speech from pitch-s