Reexamination Certificate
2007-11-20
2007-11-20
Hudspeth, David (Department: 2626)
Data processing: speech signal processing, linguistics, language
Audio signal bandwidth compression or expansion
C704S200100
active
10642551
ABSTRACT:
An audio encoder and decoder use architectures and techniques that improve the efficiency of quantization (e.g., weighting) and inverse quantization (e.g., inverse weighting) in audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder quantizes audio data in multiple channels, applying multiple channel-specific quantizer step modifiers, which give the encoder more control over balancing reconstruction quality between channels. The encoder also applies multiple quantization matrices and varies the resolution of the quantization matrices, which allows the encoder to use more resolution if overall quality is good and use less resolution if overall quality is poor. Finally, the encoder compresses one or more quantization matrices using temporal prediction to reduce the bitrate associated with the quantization matrices. An audio decoder performs corresponding inverse processing and decoding.
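The three ideas in the abstract (per-channel step modifiers, quantization matrices of varying resolution, and temporal prediction of those matrices) can be illustrated with a minimal sketch. It is not the patented method: the function names, the integer band weights, the group-averaging rule used to lower matrix resolution, and the plain difference used as the temporal predictor are all assumptions made for illustration.

```python
import math

def quantize(coeffs, matrix, global_step, channel_modifier):
    """Weight each coefficient by its matrix entry, then quantize uniformly.

    The effective step for a channel is global_step * channel_modifier
    (the modifier is a hypothetical per-channel scale factor).
    """
    step = global_step * channel_modifier
    return [math.floor(c / (w * step) + 0.5) for c, w in zip(coeffs, matrix)]

def dequantize(levels, matrix, global_step, channel_modifier):
    """Inverse weighting and inverse quantization for one channel."""
    step = global_step * channel_modifier
    return [q * w * step for q, w in zip(levels, matrix)]

def coarsen(matrix, factor):
    """Lower the matrix resolution: share one weight per group of bands.

    Assumed rule: each group of `factor` entries is replaced by its
    integer mean, so fewer distinct values need to be signalled.
    """
    out = []
    for i in range(0, len(matrix), factor):
        group = matrix[i:i + factor]
        out.extend([sum(group) // len(group)] * len(group))
    return out

def matrix_residual(current, previous):
    """Temporal prediction: keep only the change from the previous matrix."""
    return [c - p for c, p in zip(current, previous)]

def matrix_from_residual(residual, previous):
    """Decoder side: rebuild the current matrix from the residual."""
    return [r + p for r, p in zip(residual, previous)]

# Illustrative data only (not taken from the patent).
previous_matrix = [1, 1, 2, 4]
current_matrix = [1, 2, 2, 4]
coeffs = [10.0, -7.5, 3.2, 0.4]

levels = quantize(coeffs, current_matrix, global_step=0.5, channel_modifier=2.0)
restored = dequantize(levels, current_matrix, global_step=0.5, channel_modifier=2.0)
residual = matrix_residual(current_matrix, previous_matrix)

print(levels, restored, residual, coarsen(current_matrix, 2))
```

In this sketch a larger channel modifier coarsens one channel without touching the others, and a mostly-zero residual is cheaper to entropy code than the full matrix, which is the bitrate saving the abstract attributes to temporal prediction.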
REFERENCES:
patent: 5079547 (1992-01-01), Fuchigama et al.
patent: 5260980 (1993-11-01), Akagiri et al.
patent: 5388181 (1995-02-01), Anderson et al.
patent: 5524054 (1996-06-01), Spille
patent: 5627938 (1997-05-01), Johnston
patent: 5629780 (1997-05-01), Watson
patent: 5661755 (1997-08-01), Van De Kerkhof et al.
patent: 5661823 (1997-08-01), Yamauchi et al.
patent: 5682152 (1997-10-01), Wang et al.
patent: 5684920 (1997-11-01), Iwakami et al.
patent: 5686964 (1997-11-01), Tabatabai et al.
patent: 5701346 (1997-12-01), Herre et al.
patent: 5812971 (1998-09-01), Herre
patent: 5835030 (1998-11-01), Tsutsui et al.
patent: 5845243 (1998-12-01), Smart et al.
patent: 5956674 (1999-09-01), Smyth et al.
patent: 5974380 (1999-10-01), Smyth et al.
patent: 5995151 (1999-11-01), Naveen et al.
patent: 6029126 (2000-02-01), Malvar
patent: 6041295 (2000-03-01), Hinderks
patent: 6058362 (2000-05-01), Malvar
patent: 6064954 (2000-05-01), Cohen et al.
patent: 6115688 (2000-09-01), Brandenburg et al.
patent: 6115689 (2000-09-01), Malvar
patent: 6182034 (2001-01-01), Malvar
patent: 6240380 (2001-05-01), Malvar
patent: 6249614 (2001-06-01), Kolesnik et al.
patent: 6370502 (2002-04-01), Wu et al.
patent: 6418405 (2002-07-01), Satyamurti et al.
patent: 6445739 (2002-09-01), Shen et al.
patent: 6658162 (2003-12-01), Zeng et al.
patent: 6738074 (2004-05-01), Rao et al.
patent: 6766293 (2004-07-01), Herre et al.
patent: 6771777 (2004-08-01), Gbur et al.
patent: 6934677 (2005-08-01), Chen et al.
patent: 7062445 (2006-06-01), Kadatch
patent: 2002/0143556 (2002-10-01), Kadatch
patent: 2004/0044527 (2004-03-01), Thumpudi et al.
patent: 0597649 (1994-05-01), None
patent: 0669724 (1995-08-01), None
patent: 0910927 (1999-04-01), None
patent: 0931386 (1999-07-01), None
patent: WO 99/43110 (1999-08-01), None
Advanced Television Systems Committee, ATSC Standard: Digital Audio Compression (AC-3), Revision A, 140 pp. (1995).
Beerends, “Audio Quality Determination Based on Perceptual Measurement Techniques,” Applications of Digital Signal Processing to Audio and Acoustics, Chapter 1, Ed. Mark Kahrs, Karlheinz Brandenburg, Kluwer Acad. Publ., pp. 1-38 (1998).
Bosi et al., “ISO/IEC MPEG-2 Advanced Audio Coding,” Journal of the Audio Engineering Society, Audio Engineering Society, vol. 45, No. 10, pp. 789-812 (1997).
Caetano et al., “Rate Control Strategy for Embedded Wavelet Video Coders,” Electronics Letters, pp. 1815-1817 (Oct. 14, 1999).
De Luca, “AN1090 Application Note: STA013 MPEG 2.5 Layer III Source Decoder,” STMicroelectronics, 17 pp. (1999).
de Queiroz et al., “Time-Varying Lapped Transforms and Wavelet Packets,” IEEE Transactions on Signal Processing, vol. 41, pp. 3293-3305 (1993).
Dolby Laboratories, “AAC Technology,” 4 pp. [Downloaded from the web site aac-audio.com on World Wide Web on Nov. 21, 2001.].
Fraunhofer-Gesellschaft, “MPEG Audio Layer-3,” 4 pp. [Downloaded from the World Wide Web on Oct. 24, 2001.].
Fraunhofer-Gesellschaft, “MPEG-2 AAC,” 3 pp. [Downloaded from the World Wide Web on Oct. 24, 2001.].
ISO/IEC 13818-7, Information technology—Generic coding of moving pictures and associated audio information—Part 7: Advanced Audio Coding (AAC), 150 pp. (1997).
ITU, Recommendation ITU-R BS 1387, Method for Objective Measurements of Perceived Audio Quality, 89 pp. (1998).
Kondoz, Digital Speech: Coding for Low Bit Rate Communications Systems, “Chapter 3.3: Linear Predictive Modeling of Speech Signals” and “Chapter 4: LPC Parameter Quantisation Using LSFs,” John Wiley & Sons, pp. 42-53 and 79-97 (1994).
Malvar, “Biorthogonal and Nonuniform Lapped Transforms for Transform Coding with Reduced Blocking and Ringing Artifacts,” appeared in IEEE Transactions on Signal Processing, Special Issue on Multirate Systems, Filter Banks, Wavelets, and Applications, vol. 46, 29 pp. (1998).
Malvar, “Lapped Transforms for Efficient Transform/Subband Coding,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 38, No. 6, pp. 969-978 (1990).
Malvar, “Signal Processing with Lapped Transforms,” Artech House, Norwood, MA, pp. iv, vii-xi, 175-218, and 353-57 (1992).
OPTICOM GmbH, “Objective Perceptual Measurement,” 14 pp. [Downloaded from the World Wide Web on Oct. 24, 2001.].
Phamdo, “Speech Compression,” 13 pp. [Downloaded from the World Wide Web on Nov. 25, 2001.].
Ribas Corbera et al., “Rate Control in DCT Video Coding for Low-Delay Communications,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, No. 1, pp. 172-185 (Feb. 1999).
Search Report for European Patent Application No. 03 020 110.7.
Search Report for European Patent Application No. 03 020 111.5.
Shlien, “The Modulated Lapped Transform, Its Time-Varying Forms, and Its Application to Audio Coding Standards,” IEEE Transactions on Speech and Audio Processing, vol. 5, No. 4, pp. 359-366 (Jul. 1997).
Srinivasan et al., “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Transactions on Signal Processing, vol. 46, No. 4, pp. 1085-1093 (Apr. 1998).
Terhardt, “Calculating Virtual Pitch,” Hearing Research, 1:155-182 (1979).
Wragg et al., “An Optimised Software Solution for an ARM Powered™ MP3 Decoder,” 9 pp. [Downloaded from the World Wide Web on Oct. 27, 2001.].
Zwicker, Psychoakustik, Title Page, Table of Contents, “Teil I: Einführung,” Index, Springer-Verlag, Berlin Heidelberg, New York, pp. II, IX-XI, 1-30, and 157-162 (1982).
Zwicker et al., Das Ohr als Nachrichtenempfänger, Title page, Table of Contents, “I: Schallschwingungen,” Index, Hirzel-Verlag, Stuttgart, pp. III, IX-XI, 1-26, and 231-32 (1967).
Brandenburg, “ASPEC Coding,” AES 10th International Conference, pp. 81-90 (1991).
“ISO/IEC 13818-7, Information Technology—Generic Coding of Moving Pictures and Associated Audio Information—Part 7: Advanced Audio Coding (AAC), Technical Corrigendum 1,” 22 pp. (1998).
Jesteadt et al., “Forward Masking as a Function of Frequency, Masker Level, and Signal Delay,” Journal of the Acoustical Society of America, 71:950-962 (1982).
Lutfi, “Additivity of Simultaneous Masking,” Journal of the Acoustical Society of America, 73:262-267 (1983).
Yang et al., “An Inter-Channel Redundancy Removal Approach for High-Quality Multichannel Audio Compression,” in AES 109th Convention, Los Angeles, California, 8 pp. (Sep. 2000).
Wang et al., “A Multichannel Audio Coding Algorithm for Inter-Channel Redundancy Removal,” in AES 100th Convention, Amsterdam, the Netherlands, 6 pp. (May 2001).
Yang et al., “Adaptive Karhunen-Loeve Transform for Enhanced Multichannel Audio Coding,” Proc. SPIE vol. 4475, 13 pp., Mathematics of Data/Image Coding, Compression, and Encryptio
Chen Wei-Ge
Thumpudi Naveen
Hudspeth David
Microsoft Corporation
Rider Justin W.
Quantization and inverse quantization for audio
Profile ID: LFUS-PAI-O-3844793