Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2004-06-29
2008-12-02
Han, Qi (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S500000, C704S501000, C704S200100
Reexamination Certificate
active
07460990
ABSTRACT:
Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.
REFERENCES:
patent: 5438643 (1995-08-01), Akagiri et al.
patent: 5539829 (1996-07-01), Lokhoff et al.
patent: 5581653 (1996-12-01), Todd
patent: 5819214 (1998-10-01), Suzuki et al.
patent: 5845243 (1998-12-01), Smart et al.
patent: 6029126 (2000-02-01), Malvar
patent: 6115688 (2000-09-01), Brandenburg et al.
patent: 6341165 (2002-01-01), Gbur et al.
patent: 6393392 (2002-05-01), Minde
patent: 6680972 (2004-01-01), Liljeryd et al.
patent: 6760698 (2004-07-01), Gao
patent: 6766293 (2004-07-01), Herre et al.
patent: 6771777 (2004-08-01), Gbur et al.
patent: 2003/0093271 (2003-05-01), Tsushima et al.
patent: 2003/0115041 (2003-06-01), Chen et al.
patent: 2003/0115042 (2003-06-01), Chen et al.
patent: 2003/0115050 (2003-06-01), Chen et al.
patent: 2003/0115051 (2003-06-01), Chen et al.
patent: 2003/0115052 (2003-06-01), Chen et al.
patent: 2003/0233234 (2003-12-01), Truman et al.
patent: 2003/0236580 (2003-12-01), Wilson et al.
patent: 2004/0243397 (2004-12-01), Averty et al.
patent: 2005/0065780 (2005-03-01), Wiser et al.
patent: 2005/0108007 (2005-05-01), Bessette et al.
patent: 2005/0149322 (2005-07-01), Bruhn et al.
patent: 2005/0165611 (2005-07-01), Mehrotra et al.
patent: 199529 (1995-07-01), None
patent: 0910927 (1999-04-01), None
patent: 0931386 (1999-07-01), None
patent: 1396841 (2004-03-01), None
patent: WO 02/43054 (2002-05-01), None
Mark Hasegawa-Johnson and Abeer Alwan, “Speech coding: fundamentals and applications,”Handbook of Telecommunications, John Wiley and Sons, Inc., pp. 1-33 (2003). [available at http://citeseer.ist.psu.edu/617093.html].
Najafzadeh-Azghandi, Hossein and Kabal, Peter, “Perceptual coding of narrowband audio signals at 8 Kbit/s” (1997), available at http://citeseer.ist.psu.edu
ajafzadeh-azghandi97perceptual.html.
Painter, T. and Spanias, A., “Perceptual Coding Of Digital Audio,”Proceedings Of The IEEE, vol. 88, Issue 4, pp. 451-515, Apr. 2000, available at http://www.eas.asu.edu/˜spanias/papers/paper-audio-tedspanias-00.pdf.
M. Schroeder, B. Atal, “Code-excited linear prediction (CELP): High-quality speech at very low bit rates,”Proc. IEEE Int. Conf ASSP, pp. 937-940, 1985.
Schulz, D., “Improving audio codecs by noise substitution,”Journal Of The AES, vol. 44, No. 7/8, pp. 593-598, Jul./Aug. 1996.
Th. Sporer, Kh. Brandenburg, B. Elder, “The Use of Multirate Filter Banks for Coding of High Quality Digital Audio,”6thEuropean Signal Processing Conference(EUSIPCO), Amsterdam, vol. 1, pp. 211-214, Jun. 1992.
Advanced Television Systems Committee,ATSC Standard: Digital Audio Compression(AC-3),Revision A, 140 pp. (1995).
Brandenburg, “ASPEC Coding”,AES 10thInternational Conference, pp. 81-90 (1991).
“ISO/IEC 11172-3, Information Technology—Coding of Moving Pictures and Associated Audio for Digital Storage Media at Up to About 1.5 Mbit/s—Part 3: Audio,” 154 pp. (1993).
“ISO/IEC 13818-7, Information Technology—Generic Coding of Moving Pictures and Associated Audio Information—Part 7: Advanced Audio Coding (AAC),” 174 pp. (1997).
“ISO/IEC 13818-7, Information Technology—Generic Coding of Moving Pictures and Associated Audio Information—Part 7: Advanced Audio Coding (AAC), Technical Corrigendum 1” 22 pp. (1998).
ITU, Recommendation ITU-R BS 1115, Low Bit-Rate Audio Coding, 9 pp. (1994).
Search Report from PCT/US04/24935, dated Feb. 24, 2005.
Search Report from PCT/US06/27238, dated Aug. 15, 2007.
Chen Wei-Ge
Mehrotra Sanjeev
Han Qi
Klarquist & Sparkman, LLP
Microsoft Corporation
LandOfFree
Efficient coding of digital media spectral data using... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Efficient coding of digital media spectral data using..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Efficient coding of digital media spectral data using... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4027084