Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2006-06-26
2009-06-16
Armstrong, Angela A (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S200100
Reexamination Certificate
active
07548855
ABSTRACT:
An audio processing tool measures the quality of reconstructed audio data. For example, an audio encoder measures the quality of a block of reconstructed frequency coefficient data in a quantization loop. The invention includes several techniques and tools, which can be used in combination or separately. First, before measuring quality, the tool normalizes the block to account for variation in block sizes. Second, for the quality measurement, the tool processes the reconstructed data by critical bands, which can differ from the quantization bands used to compress the data. Third, the tool accounts for the masking effect of the reconstructed data, not just the masking effect of the original data. Fourth, the tool band weights the quality measurement, which can be used to account for noise substitution or band truncation. Finally, the tool changes quality measurement techniques depending on the channel coding mode.
REFERENCES:
patent: 5414796 (1995-05-01), Jacobs et al.
patent: 5686964 (1997-11-01), Tabatabai et al.
patent: 5845243 (1998-12-01), Smart et al.
patent: 5995151 (1999-11-01), Naveen et al.
patent: 6029126 (2000-02-01), Malvar
patent: 6058362 (2000-05-01), Malvar
patent: 6064954 (2000-05-01), Cohen et al.
patent: 6115689 (2000-09-01), Malvar
patent: 6182034 (2001-01-01), Malvar
patent: 6240380 (2001-05-01), Malvar
patent: 6810083 (2004-10-01), Chen et al.
patent: 6934677 (2005-08-01), Chen et al.
patent: 7027982 (2006-04-01), Chen et al.
patent: 7062445 (2006-06-01), Kadatch
patent: 7143030 (2006-11-01), Chen et al.
patent: 7146313 (2006-12-01), Chen et al.
patent: 7155383 (2006-12-01), Chen et al.
patent: 7240001 (2007-07-01), Chen et al.
patent: 7249016 (2007-07-01), Chen et al.
patent: 7260525 (2007-08-01), Chen et al.
patent: 7263482 (2007-08-01), Chen et al.
patent: 7277848 (2007-10-01), Chen et al.
patent: 7283952 (2007-10-01), Chen et al.
patent: 7295971 (2007-11-01), Chen et al.
patent: 7295973 (2007-11-01), Chen et al.
patent: 7299175 (2007-11-01), Chen et al.
patent: 7340394 (2008-03-01), Chen et al.
patent: 2003/0115052 (2003-06-01), Chen et al.
patent: 2007/0185706 (2007-08-01), Chen et al.
patent: 2008/0015850 (2008-01-01), Chen et al.
Beerends, “Audio Quality Determination Based on Perceptual Measurement Techniques,”Applications of Digital Signal Processing to Audio and Acoustics, Chapter 1, Ed. Mark Kahrs, Karlheinz Brandenburg, Kluwer Acad. Publ., pp. 1-38 (1998).
Caetano et al., “Rate Control Strategy for Embedded Wavelet Video Coders,”Electronics Letters, pp. 1815-1817 (Oct. 14, 1999).
De Luca, “AN1090 Application Note: STA013 MPEG 2.5 Layer III Source Decoder,” STMicroelectronics, 17 pp. (1999).
de Queiroz et al., “Time-Varying Lapped Transforms and Wavelet Packets,”IEEE Transactions on Signal Processing, vol. 41, pp. 3293-3305 (1993).
Dolby Laboratories, “AAC Technology,” 4 pp. [Downloaded from the web site aac-audio.com on World Wide Web on Nov. 21, 2001.].
Fraunhofer-Gesellschaft, “MPEG Audio Layer-3,” 4 pp. [Downloaded from the World Wide Web on Oct. 24, 2001.].
Fraunhofer-Gesellschaft, “MPEG-2 AAC,” 3 pp. [Downloaded from the World Wide Web on Oct. 24, 2001:].
Gibson et al.,Digital Compression for Multimedia, Title Page, Contents, “Chapter 7: Frequency Domain Coding,” Morgan Kaufman Publishers, Inc., pp. iii, v-xi, and 227-262 (1998).
Herley et al., “Tilings of the Time-Frequency Plane: Construction of Arbitrary Orthogonal Bases and Fast Tiling Algorithms,”IEEE Transactions on Signal Processing, vol. 41, No. 12, pp. 3341-3359 (1993).
ISO/IEC 11172-3, Information Technology—Coding of Moving Pictures and Associated Audio for Digital Storage Media at Up to About 1.5 Mbit/s—Part 3: Audio, 154 pp. (1993).
ITU, Recommendation ITU-R BS 1115, Low Bit-Rate Audio Coding, 9 pp. (1994).
ITU, Recommendation ITU-R BS 1387, Method for Objective Measurements of Perceived Audio Quality, 89 pp. (1998).
Jesteadt et al., “Forward Maskings as a Function of Frequency, Masker Level, and Signal Delay,”Journal of Acoustical Society of America, 71:950-962 (1982).
Kondoz,Digital Speech: Coding for Low Bit Rate Communications Systems, “Chapter 3.3: Linear Predictive Modeling of Speech Signals” and “Chapter 4: LPC Parameter Quantisation Using LSFs,” John Wiley & Sons, pp. 42-53 and 79-97 (1994).
Lutfi, “Additivity of Simultaneous Masking,”Journal of Acoustic Society of America, 73:262-267 (1983).
Malvar, “Biorthogonal and Nonuniform Lapped Transforms for Transform Coding with Reduced Blocking and Ringing Artifacts,” appeared inIEEE Transactions on Signal Processing, Special Issue on Multirate Systems, Filter Banks, Wavelets, and Applications, vol. 46, 29 pp. (1998).
Malvar, “Lapped Transforms for Efficient Transform/Subband Coding,”IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 38, No. 6, pp. 969-978 (1990).
Malvar,Signal Processing with Lapped Transforms, Artech House, Norwood, MA, pp. iv, vii-xi, 175-218, and 353-357 (1992).
OPTICOM GmbH, “Objective Perceptual Measurement,” 14 pp. [Downloaded from the World Wide Web on Oct. 24, 2001.].
Phamdo, “Speech Compression,” 13 pp. [Downloaded from the World Wide Web on Nov. 25, 2001.].
Ribas Corbera et al., “Rate Control in DCT Video Coding for Low-Delay Communications,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, No. 1, pp. 172-185 (Feb. 1999).
Schlien, “The Modulated Lapped Transform, Its Time-Varying Forms, and Its Application to Audio Coding Standards,”IEEE Transactions on Speech and Audio Processing, vol. 5, No. 4, pp. 359-366 (Jul. 1997).
Solari,Digital Video and Audio Compression, Title Page, Contents, “Chapter 8: Sound and Audio,” McGraw-Hill, Inc., pp. iii, v-vi, and 187-211 (1997).
Srinivasan et al., “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,”IEEE Transactions on Signal Processing, vol. 46, No. 4, pp. 1085-1093 (Apr. 1998).
Terhardt, “Calculating Virtual Pitch,”Hearing Research, 1:155-182 (1979).
Wragg et al., “An Optimised Software Solution for an ARM Powered™ MP3 Decoder,” 9 pp. [Downloaded from the World Wide Web on Oct. 27, 2001.].
Zwicker et al.,Das Ohr als Nachrichtenempfänger, Title Page, Table of Contents, “I: Schallschwingungen,” Index, Hirzel-Verlag, Stuttgart, pp. III, IX-XI, 1-26, and 231-232 (1967).
Zwicker,Psychoakustik, Title Page, Table of Contents, “Teil I: Einfuhrung,” Index, Springer-Verlag, Berlin Heidelberg, New York, pp. II, IX-XI, 1-30, and 157-162 (1982).
Chen Wei-Ge
Lee Ming-Chieh
Thumpudi Naveen
Armstrong Angela A
Klarquist & Sparkman, LLP
Microsoft Corporation
LandOfFree
Techniques for measurement of perceptual audio quality does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Techniques for measurement of perceptual audio quality, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Techniques for measurement of perceptual audio quality will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4060584