Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2001-12-14
2008-12-02
Armstrong, Angela A (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S200100
Reexamination Certificate
active
07460993
ABSTRACT:
A transform coder adaptively configures window sizes for transform coding in a two-pass process to maximize coding efficiency, while achieving necessary time resolution to avoid pre-echo. In a first pass, the coder places small size windows over detected transient regions of an input signal in an open-loop window configuration process. In a second pass, the coder adjusts the window size configuration according to measurements of the achieved quality in a closed-loop window configuration process. Where quality measurement shows unacceptable quantization noise, the coder increases window size. Where pre-echo is detected, the coder reduces window size within coding bit rate constraints.
REFERENCES:
patent: 5325215 (1994-06-01), Shibata et al.
patent: 5357594 (1994-10-01), Fielder
patent: 5379351 (1995-01-01), Fandrianto et al.
patent: 5394473 (1995-02-01), Davidson
patent: 5590066 (1996-12-01), Ohki
patent: 5686964 (1997-11-01), Tabatabai et al.
patent: 5845243 (1998-12-01), Smart et al.
patent: 5848391 (1998-12-01), Bosi et al.
patent: 5970173 (1999-10-01), Lee et al.
patent: 5995151 (1999-11-01), Naveen et al.
patent: 5995539 (1999-11-01), Miller
patent: 6029126 (2000-02-01), Malvar
patent: 6073153 (2000-06-01), Malvar
patent: 6115689 (2000-09-01), Malvar
patent: 6154762 (2000-11-01), Malvar
patent: 6167093 (2000-12-01), Tsutsui et al.
patent: 6301304 (2001-10-01), Jing et al.
patent: 6311154 (2001-10-01), Gersho et al.
patent: 6324560 (2001-11-01), Malvar
patent: 6363117 (2002-03-01), Kok
patent: 6370502 (2002-04-01), Wu et al.
patent: 6473534 (2002-10-01), Merhav et al.
patent: 6487574 (2002-11-01), Malvar
patent: 6496795 (2002-12-01), Malvar
patent: 6507614 (2003-01-01), Li
patent: 6687726 (2004-02-01), Schneider
patent: 6694342 (2004-02-01), Mou
patent: 6701019 (2004-03-01), Wu et al.
patent: 6728317 (2004-04-01), Demos
patent: 6882685 (2005-04-01), Malvar
patent: 2003/0115052 (2003-06-01), Chen et al.
patent: 2005/0165611 (2005-07-01), Mehrotra et al.
patent: 2452343 (2003-01-01), None
patent: 854653 (1998-07-01), None
patent: 2003-348598 (2003-12-01), None
Gibson et al.,Digital Compression for Multimedia, Title Page, Contents, “Chapter 7: Frequency Domain Coding,” Morgan Kaufman Publishers, Inc., pp. iii, v-xi, and 227-262 (1998).
H.S. Malvar,Signal Processing with Lapped Transforms, Artech House, Norwood, MA, pp. iv, vii-xi, 175-218, and 353-357 (1992).
H.S. Malvar, “Lapped Transforms for Efficient Transform/Subband Coding,”IEEE Transactions on Acoustics on Acoustics, Speech and Signal Processing, vol. 38, No. 6, pp. 969-978 (1990).
Seymour Schlien, “The Modulated Lapped Transform, Its Time-Varying Forms, and Its Application to Audio Coding Standards,”IEEE Transactions on Speech and Audio Processing, vol. 5, No. 4, pp. 359-366 (Jul. 1997).
de Queiroz et al., “Time-Varying Lapped Transforms and Wavelet Packets,”IEEE Transactions on Signal Processing, vol. 41, pp. 3293-3305 (1993).
Herley et al., “Tilings of the Time-Frequency Plane: Construction of Arbitrary Orthogonal Bases and Fast Tiling Algorithms,”IEEE Transactions on Signal Processing, vol. 41, No. 12, pp. 3341-3359 (1993).
ISO/IEC 11172-3, Information Technology—Coding of Moving Pictures and Associated Audio for Digital Storage Media at Up to About 1.5 Mbit/s—Part 3: Audio, 154 pp. (1993).
Dolby Laboratories, “AAC Technology,” 4 pp. [Downloaded from the web site aac-audio.com on World Wide Web on Nov. 21, 2001.].
Srinivasan et al., “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,”IEEE Transactions on Signal Processing, vol. 46, No. 4, pp. 1085-1093 (Apr. 1998).
Caetano et al., “Rate Control Strategy for Embedded Wavelet Video Coders,”Electronics Letters, pp. 1815-1817 (Oct. 14, 1999).
Ribas Corbera et al., “Rate Control in DCT Video Coding for Low-Delay Communications,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, No. 1, pp. 172-185 (Feb. 1999).
Zwicker et al.,Das Ohr als Nachrichtenempfänger, Title Page, Table of Contents, “I: Schallschwingungen,” Index, Hirzel-Verlag, Stuttgart, pp. III, IX-XI, 1-26, and 231-232 (1967).
Terhardt, “Calculating Virtual Pitch,”Hearing Research, 1:155-182 (1979).
Lufti, “Additivity of Simultaneous Masking,”Journal of Acoustic Society of America, 73:262-267 (1983).
Jesteadt et al., “Forward Masking as a Function of Frequency, Masker Level, and Signal Delay,”Journal of Acoustical Society of America, 71:950-962 (1982).
ITU, Recommendation ITU-R BS 1387, Method for Objective Measurements of Perceived Audio Quality, 89 pp. (1998).
ITU, Recommendation ITU-R BS 1115, Low Bit-Rate Audio Coding, 9 pp. (1994).
Beerends, “Audio Quality Determination Based on Perceptual Measurement Techniques,”Applications of Digital Signal Processing to Audio and Acoustics, Chapter 1, Ed. Mark Kahrs, Karlheinz Brandenburg, Kluwer Acad. Publ., pp. 1-38 (1998).
Zwicker,Psychoakustik, Title Page, Table of Contents, “Teil I: Einfuhrung,” Index, Springer-Verlag, Berlin Heidelberg, New York, pp. II, IX-XI, 1-30, and 157-162 (1982).
Solari,Digital Video and Audio Compression, Title Page, Contents, “Chapter 8: Sound and Audio,” McGraw-Hill, Inc., pp. iii, v-vi, and 187-211 (1997).
A.M. Kondoz,Digital Speech: Coding for Low Bit Rate Communications Systems, “Chapter 3.3: Linear Predictive Modeling of Speech Signals” and “Chapter 4: LPC Parameter Quantisation Using LSFs,” John Wiley & Sons, pp. 42-53 and 79-97 (1994).
Kadatch, U.S. Appl. No. 09/771,371, entitled, “Quantization Loop with Heuristic Approach,” filed Jan. 26, 2001.
Chen et al., U.S. Appl. No. 10/017,694, entitled, “Quality and Rate Control Strategy for Digital Audio,” filed Dec. 14, 2001.
Chen et al., U.S. Appl. No. 10/017,702, entitled, “Quantization Matrices for Digital Audio,” filed Dec. 14, 2001.
Chen et al., U.S. Appl. No. 10/017,861, entitled, “Techniques for Measurement of Perceptual Audio Quality,” filed Dec. 14, 2001.
Chen et al., U.S. Appl. No. 10/016,918, entitled, “Quality Improvement Techniques in an Audio Encoder,” filed Dec. 14, 2001.
Wragg et al., “An Optimised Software Solution for an ARM Powered™ MP3 Decoder,” 9 pp. [Downloaded from the World Wide Web on Oct. 27, 2001.].
Fraunhofer-Gesellschaft, “MPEG Audio Layer-3,” 4 pp. [Downloaded from the World Wide Web on Oct. 24, 2001.].
Fraunhofer-Gesellschaft, “MPEG-2 AAC,” 3 pp. [Downloaded from the World Wide Web on Oct. 24, 2001.].
OPTICOM GmbH, “Objective Perceptual Measurement,” 14 pp. [Downloaded from the World Wide Web on Oct. 24, 2001.].
De Luca, “AN1090 Application Note: STA013 MPEG 2.5 Layer III Source Decoder,” STMicroelectronics, 17 pp. (1999).
Pharndo, “Speech Compression,” 13 pp. [Downloaded from the World Wide Web on Nov. 25, 2001.].
Malvar, “Biorthogonal and Nonuniform Lapped Transforms for Transform Coding with Reduced Blocking and Ringing Artifacts,” appeared inIEEE Transactions on Signal Processing, Special Issue on Multirate Systems, Fiter Banks, Wavelets, and Applications, vol. 46, 29 pp. (1998).
Advanced Television Systems Committee, “ATSC Standard: Digital Audio Compression (AC-3), Revision A,” pp. 1-140 (Aug. 2001).
Arai, et al., “A Fast DCT-SQ Scheme for Images,” The Transactions of the IEICE, vol. E 71, No. 11, Nov. 1988, pp. 1095-1097.
Bjontegaard, “H.26L Test Model Long Term No. 8 (TML-8) Draft 0,”Video Coding Experts Group(VCEG), pp. 1-46.
Brandenburg, “ASPEC Coding”,AES 10thInternational Conference, pp. 81-90 (1991).
Calderbank et al., “Wavelet Transforms that Map Integers to Intergers,” pp. 1-39 (Aug. 1996).
Cham, “Development of Integer Cosine Transforms by the Prin
Chen Wei-Ge
Lee Ming-Chieh
Thumpudi Naveen
Armstrong Angela A
Klarquist & Sparkman, LLP
Microsoft Corporation
LandOfFree
Adaptive window-size selection in transform coding does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Adaptive window-size selection in transform coding, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Adaptive window-size selection in transform coding will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4051898