Data processing: speech signal processing – linguistics – language – Audio signal bandwidth compression or expansion
Reexamination Certificate
2000-03-03
2003-05-20
To, Doris H. (Department: 2655)
Data processing: speech signal processing, linguistics, language
Audio signal bandwidth compression or expansion
C704S501000
Reexamination Certificate
active
06567781
ABSTRACT:
FIELD OF INVENTION
The present invention generally relates to the field of audio compression, and more particularly to a method and apparatus for audio compression which operates on dynamical systems, such as cellular automata (CA).
BACKGROUND OF THE INVENTION
The need frequently arises to transmit digital audio data across communications networks (e.g., the Internet; the Plain Old Telephone System, POTS; Local Area Networks, LAN; Wide Area Networks, WAN; Satellite Communications Systems). Many applications also require digital audio data to be stored on electronic devices such as magnetic media, optical disks and flash memories. The volume of data required to encode raw audio data is large. Consider a stereo audio data sampled at 44100 samples per second and with a maximum of 16 bits used to encode each sample per channel. A one-hour recording of a raw digital stereo music with that fidelity will occupy about 606 Megabytes of storage space. To transmit such an audio file over a 56 kilobits per second communications channel (e.g., the rate supported by most POTS through modems), will take over 24.6 hours.
The best approach for dealing with the bandwidth limitation and also reduce huge storage requirement is to compress the audio data. The most popular technique for compressing audio data combines transform approaches (e.g. the Discrete Cosine Transform, DCT) with psycho-acoustic techniques. The current industry standard is the so-called MP3 format (or MPEG audio developed by the International Standards Organization International Electrochemical Committee, ISO/IEC) which uses the aforementioned approach. Various enhancements to the standard have been proposed. For example, Bolton and Fiocca, in U.S. Pat. No. 5,761,636, taught a method for improving the audio compression system by a bit allocation scheme that favors certain frequency subband. Davis, in U.S. Pat. No. 5,699,484, taught a split-band perceptual coding system that makes use of predictive coding in frequency bands.
Other audio compression inventions that are based on variations of the traditional DCT transform and/or some bit allocation schemes (utilizing perceptual models) include those taught by Mitsuno et al. (U.S. Pat. No. 5,590,108), Shimoyoshi et al (U.S. Pat. No. 5,548,574), Johnston (U.S. Pat. No. 5,481,614), Fielder and Davidson (U.S. Pat. No. 5,109,417), Dobson et al. (U.S. Pat. No. 5,819,215), Davidson et al. (U.S. Pat. No. 5,632,003), Anderson et al. (U.S. Pat. No. 5,388,181), Sudharsanan et al. (U.S. Pat. No. 5,764,698) and Herre (U.S. Pat. No. 5,781,888).
Some recent inventions (e.g., Dobson et al. in U.S. Pat. No. 5,819,215) teach the use of the wavelet transform as the tool for audio compression. The bit allocation schemes on the wavelet-based compression methods are generally based on the so-called embedded zero-tree concept taught by Shapiro (U.S. Pat. Nos. 5,321,776 and 5,412,741). Other audio compression schemes that utilize wavelets as basis functions are described in the paper by Painter & Spanias (1999) and they include the work by Tewik et al (1993a,b,c); Black & Zeytinoglu (1995); Kudumakis and Sandler (1995a,b); and Boland & Deriche (1995,1996).
In order to achieve a better compression of digital audio data, the present. invention makes use of a transform method that uses dynamical systems. In accordance with a preferred embodiment, the evolving fields of cellular automata are used to generate building blocks for audio data. The rules governing the evolution of the dynamical system can be adjusted to produce building blocks that satisfy the requirements of low-bit rate audio compression process.
The concept of cellular automata transform (CAT) is taught in U.S. Pat. No. 5,677,956 by Lafe, as an apparatus for encrypting and decrypting data. The present invention teaches the use of more complex dynamical systems that produce efficient building blocks for encoding audio data. The present invention also teaches a psycho-acoustic method developed specially for the sub-band encoding process arising from the cellular automata transform. A special bit allocation scheme that also facilitates audio streaming is taught as an efficient means for encoding the quantized transform coefficients obtained after the cellular automata transform process.
SUMMARY OF THE INVENTION
According to the present invention there is provided a method of compressing audio data comprising: determining a multi-state dynamical rule set and an associated transform basis function, receiving input audio data, and performing a forward transform using the transform basis function to obtain transform coefficients suitable for reconstructing the input audio data.
An advantage of the present invention is the provision of a method and apparatus for audio compression which provides improvements in the efficiency of digital media storage.
Another advantage of the present invention is the provision of a method and apparatus for audio compression which provides faster data transmission through communication channels.
Still another advantage of the present invention is the provision of a method and apparatus for audio compression which utilizes psycho-acoustics.
Yet another advantage of the present invention is the provision of a method and apparatus for audio compression which facilitates audio streaming.
Still other advantages of the invention will become apparent to those skilled in the art upon a reading and understanding of the following detailed description, accompanying drawings and appended claims.
REFERENCES:
patent: 4769644 (1988-09-01), Frazier
patent: 4866636 (1989-09-01), Fukami et al.
patent: 5109417 (1992-04-01), Fielder et al.
patent: 5321776 (1994-06-01), Shapiro
patent: 5388181 (1995-02-01), Anderson et al.
patent: 5412741 (1995-05-01), Shapiro
patent: 5479562 (1995-12-01), Fielder et al.
patent: 5481614 (1996-01-01), Johnston
patent: 5511146 (1996-04-01), Simar, Jr.
patent: 5548574 (1996-08-01), Shimoyoshi et al.
patent: 5570305 (1996-10-01), Fattouche et al.
patent: 5590108 (1996-12-01), Mitsuno et al.
patent: 5611038 (1997-03-01), Shaw et al.
patent: 5632003 (1997-05-01), Davidson et al.
patent: 5677956 (1997-10-01), Lafe
patent: 5680462 (1997-10-01), Miller et al.
patent: 5699484 (1997-12-01), Davis
patent: 5761636 (1998-06-01), Bolton et al.
patent: 5764698 (1998-06-01), Sudharsanan et al.
patent: 5781888 (1998-07-01), Herre
patent: 5819215 (1998-10-01), Dobson et al.
patent: 6006179 (1999-12-01), Wu et al.
patent: 6363350 (2002-03-01), Lafe
patent: 6393154 (2002-05-01), Lafe
patent: 6400766 (2002-06-01), Lafe
patent: 6456744 (2002-09-01), Lafe
patent: 97/12330 (1997-04-01), None
Y. Mahieux, et al.; “Transform Coding of Audio Signals at 64 Kbit/s”; 1990 IEEE; pp. 518-522.
M. Wada, et al.; “Possibility of Digital Data Description by Means of Rule Dynamics in Cellular Automata”; 1999 IEEE; pp. 278-283.
M. Goodwin, et al.; “Automic Decompositions of Audio Signals”; XP-002161889.
P. J. Hahn, et al.; “Perceptually Lossless Image Compression”; XP-002163214.
A. Aggarwal, et al.; “Perceptual Zerotrees for Scalable Wavelet Coding of Wideband Audio”; XP002163214.
EP International Search Report mailed Mar. 3, 2001.
Jaffe Michael A.
Kusner Mark
Opsasnick Michael N.
QuikCAT.com, Inc.
To Doris H.
LandOfFree
Method and apparatus for compressing audio data using a... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for compressing audio data using a..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for compressing audio data using a... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3071215