Reexamination Certificate
2007-03-20
2011-12-27
Sked, Matthew (Department: 2626)
Data processing: speech signal processing, linguistics, language
Audio signal time compression or expansion
With content reduction encoding
C704S201000, C704S203000, C704S204000, C704S503000
active
08086465
ABSTRACT:
A “STAC Codec” provides audio transcoding and decoding by processing an encoded audio signal using a backward-adaptive run-length Golomb-Rice (RLGR) decoder to recover transform coefficients of the encoded audio signal. The transform coefficients are then either transcoded in the transform domain to lossy or other formats, or decoded to the time domain by applying an inverse integer-reversible modulated lapped transform (MLT) to the recovered transform coefficients to recover an uncompressed time domain representation of the compressed audio signal. In additional embodiments, an inter-block spectral estimation and inverse data sorting strategy is used in recovering the transform coefficients from the encoded audio signal. In other embodiments, conversion from lossless encoding to near-lossless encoding is achieved by right-shifting the recovered transform coefficients by some number of bits, chosen such that quantization errors are not perceived as distortion in the decoded audio signal, and then re-encoding the right-shifted transform coefficients.
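The lossless to near-lossless conversion described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the sample coefficient values, and the choice to reconstruct by a plain left shift are all assumptions made for illustration. The RLGR entropy coding and the inverse MLT are omitted entirely.

```python
# Illustrative sketch only: names and sample data are hypothetical,
# and the left-shift reconstruction is an assumption, not the patent's method.

def to_near_lossless(coeffs, shift):
    """Discard the `shift` least-significant bits of each integer coefficient.

    Python's >> is an arithmetic (floor) shift, so negative values round
    toward negative infinity.
    """
    return [c >> shift for c in coeffs]


def reconstruct(shifted, shift):
    """Approximate the original coefficients by shifting back to the left."""
    return [s << shift for s in shifted]


coeffs = [1234, -87, 5, 0, -3, 412]  # hypothetical integer transform coefficients
shift = 2                            # number of bits dropped (assumed value)

shifted = to_near_lossless(coeffs, shift)
approx = reconstruct(shifted, shift)
errors = [c - a for c, a in zip(coeffs, approx)]

print(shifted)  # values that would be re-encoded
print(approx)   # decoder-side approximation
print(errors)   # each error lies in the range 0 .. 2**shift - 1
```

With a floor shift, each coefficient's reconstruction error is non-negative and strictly less than `2**shift`, so the shift count directly bounds the quantization error in the transform domain. Whether that error is audible depends on the signal and is not something this sketch evaluates.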
REFERENCES:
patent: 5839100 (1998-11-01), Wegener
patent: 6240380 (2001-05-01), Malvar
patent: 6567562 (2003-05-01), Nakayama et al.
patent: 6778965 (2004-08-01), Bruekers et al.
patent: 7126506 (2006-10-01), Malvar
patent: 7133832 (2006-11-01), Heo
patent: 7283967 (2007-10-01), Nishio et al.
patent: 7318027 (2008-01-01), Lennon et al.
patent: 7333929 (2008-02-01), Chmounk
patent: 7340391 (2008-03-01), Herre
patent: 7395210 (2008-07-01), Li
patent: 7483581 (2009-01-01), Raveendran et al.
patent: 7630563 (2009-12-01), Irvine et al.
patent: 2004/0044534 (2004-03-01), Chen
patent: 2005/0083216 (2005-04-01), Li
patent: 2005/0131660 (2005-06-01), Yadegar
patent: 2005/0180586 (2005-08-01), Kim et al.
patent: 2005/0192799 (2005-09-01), Kim et al.
patent: 2005/0203731 (2005-09-01), Oh
patent: 2005/0231396 (2005-10-01), Dunn
patent: 2006/0103556 (2006-05-01), Malvar
patent: 2006/0257036 (2006-11-01), Hou
Malvar, “Lossless and near-lossless audio compression using integer-reversible modulated lapped transforms”, in: Proceedings of the IEEE Data Compression Conference (DCC'2007), Snowbird, UT, Mar. 2007, pp. 1-10.
Garcia, J. “Backward Linear Prediction for Lossless Coding of Stereo Audio,” AES 116th Convention, Berlin, Germany May 8-11, 2004.
Malvar, H.S. “Adaptive run-length/Golomb-Rice encoding of quantized generalized Gaussian sources with unknown statistics,” Data Compression Conference, 2006. DCC 2006. Proceedings, Issue Date: Mar. 28-30, 2006.
Ashland, M. T., Monkey's audio: a fast and powerful lossless audio compressor, available at http://www.monkeysaudio.com.
Brandenburg, K., and T. Sporer, NMR and Masking Flag: Evaluation of quality using perceptual criteria, Proc. 11th Int. AES Conf., May 1992, pp. 169-179, Portland, OR.
Burges, C. J. C., D. Plastina, J. Platt, E. Renshaw, and H. S. Malvar, Using audio fingerprinting for duplicate detection and thumbnail generation, Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. III, Mar. 2005, pp. 9-12, Philadelphia, PA.
Coalson, J., FLAC—Free Lossless Audio Codec, available at http://flac.sourceforge.net.
Ghido, F., Ghido's data compression page, available at http://www.losslessaudio.org.
Giurcaneanu, C., I. Tabus, and J. Astola, Integer wavelet transform based lossless audio compression, Proc. of the IEEE-EURASIP Workshop on Nonlinear Signal and Image Processing (NSIP'99), Jun. 20-23, 1999, pp. 378-382, Antalya, Turkey.
Huang, H., S. Rahardja, Integer MDCT with enhanced approximation of the DCT-IV, IEEE Trans. on Signal Processing, Mar. 2006, pp. 1156-1159, vol. 54.
Hydrogen Audio: Lossless comparison, available at http://wiki.hydrogenaudio.org/index.php?title=Lossless_comparison.
Hydrogen Audio: “Monkey's Audio,” available at http://wiki.hydrogenaudio.org/index.php?title=Monkey's_Audio.
Kim, J., Lossless wideband audio compression: Prediction and transform, Communication Engineering, Technical University Berlin, Germany, 2003.
Krishnan, T., and S. Oraintara, Fast and lossless implementation of the forward and inverse MDCT computation in MPEG audio coding, Proc. Int. Symp. Circuits and Systems, May 2002, pp. 181-184, vol. 2, Scottsdale, AZ.
Li, J., A progressive to lossless embedded audio coder (PLEAC) with reversible modulated lapped transform, Proc. Int. Conf. Acoustics, Speech, Signal Processing, Apr. 2003, pp. 221-224, Hong Kong, vol. III.
Li, J., Low noise reversible MDCT (RMDCT) and its application in progressive-to-lossless embedded audio coding, IEEE Trans. on Signal Processing, May 2005, pp. 1870-1880, vol. 53.
Liebchen, T., and Y. Reznik, MPEG-4 ALS: an emerging standard for lossless audio coding, Proc. Data Compression Conf., Mar. 2006, pp. 439-448, Snowbird, UT.
Malvar, H. S., Adaptive run-length/Golomb-Rice encoding of quantized generalized Gaussian sources with unknown statistics, Proc. Data Compression Conf., Mar. 2006, pp. 23-32, Snowbird, UT.
Robinson, A. J., Shorten: Simple lossless and near-lossless waveform compression, Tech. Rep. CUED/F-INFENG/TR.156, Cambridge University Eng. Dept., Dec. 1994.
Wikipedia: Audio data compression, available at: http://en.wikipedia.org/wiki/Audio_data_compression.
Yokotani, Y., R. Geiger, G.D.T. Schuller, S. Oraintara, K. R. Rao, Lossless audio coding using the IntMDCT and rounding error shaping, IEEE Transactions on Audio, Speech, and Language Processing, Nov. 2006, pp. 2201-2211, vol. 14, No. 6.
Yu, R., X. Lin, S. Rahardja, and C. C. Ko, A statistics study of the MDCT coefficient distribution for audio, Proc. IEEE Int. Conf. on Multimedia and Expo, Jun. 2004, pp. 1483-1486, vol. 2, Taipei, Taiwan.
Yu, R., S. Rahardja, L. Xiao, and C. C. Ko, A fine granular scalable to lossless audio coder, IEEE Trans. on Audio, Speech, and Language Processing, Jul. 2006, pp. 1352-1363, vol. 14.
Ritz, C. H., J. Parsons, Lossless wideband speech coding, Proceedings of the 10th Australian Int'l Conf. on Speech Science & Tech., Macquarie University, Sydney, pp. 249-252, Dec. 8-10, 2004.
International Search Report, Application No. PCT/US2008/057657, completed Aug. 22, 2008, mailed Aug. 22, 2008.
Wozniak, James S., U.S. Appl. No. 11/688,851, U.S. Office Action, Jun. 21, 2010.
Sked, Matthew J., USPTO Office Action dated Nov. 16, 2010 for U.S. Appl. No. 11/688,851.
Lyon & Harr LLP
Microsoft Corporation
Sked Matthew
Watson Mark A.