Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2000-05-26
2001-10-30
Korzuch, William (Department: 2741)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
C704S270000, C704S200000, C381S027000, C381S010000
Reexamination Certificate
active
06311155
ABSTRACT:
FIELD OF THE INVENTION
Embodiments of the present invention relate generally to a method and apparatus for processing audio signals, and more particularly, to a method and apparatus for processing audio signals for use in consumer applications.
BACKGROUND OF THE INVENTION
End users with “high-end” or expensive equipment including multi-channel amplifiers and multi-speaker systems, currently have a limited capability to adjust the volume on the center channel signal of a multi-channel audio system independently of the audio signals on the other remaining channels. Since many movies have mostly dialog on the center channel and other sound effects located on other channels, this limited adjustment capability allows the end user to raise the amplitude of the mostly dialog channel so that it is more intelligible during sections with loud sound effects. Currently, this limited adjustment has important shortcomings. First, it is an adjustment capability that is only available to the end users that have an expensive digital versatile disk (DVD) player and a multi-channel speaker system such as a six-speaker home theater system that permits volume level adjustment of all speakers independently. Thus, user that cannot afford such a system cannot enjoy listening to a recording or broadcast program that allows the end user to raise the amplitude of the mostly dialog channel so that it is more intelligible.
Second, it is an adjustment that can only services one customer at a time. For example, if a user selects a level of mostly dialog to background that improves his listening intelligibility, this level however, may not be satisfactory to other individual in the room. Thus, there is no way to simultaneously provide this adjustment feature to multiple listeners having different hearing preferences.
Also, it is an adjustment that needs to be continuously modified during transients in a preferred audio signal or voice dialog (center channel) and remaining audio signal (all other channels). The final shortcoming is that a voice to remaining audio (VRA) adjustments that were acceptable during one audio segment of the movie program may not be good for another audio segment if the remaining audio level increases too much or the dialog level reduces too much.
It is a fact that a large majority of end users do not and will not have a home theater that permits this adjustment capability, i.e., Dolby Digital decoder, six-channel variable gain amplifier and multi-speaker system for many years. In addition, the end uses does not have the ability to ensure that the VRA ratio selected at the beginning of the program will stay the same for an entire program.
FIG. 3
illustrates the intended spatial positioning setup of a common home theater system. Although there are no written rules for audio production in 5.1 spatial channels, there are industry standards. As used herein, the term “spatial channels refers to the physical location of an output device (e.g., speakers) and how the sound from the output device is delivered to the end user. One of these standards is to locate the majority of dialog on the center channel
526
. Likewise other sound effects that require spatial positioning will be placed on any of the other four speakers labeled L
521
, R
522
, Ls
523
, and Rs
524
for left, right, left surround and right surround. In addition, to avoid damage to midrange speakers, low frequency effects (LFE) are placed on the 0.1 channel directed toward a subwoofer speaker
525
. Digital audio compression allows the producer to provide the user with a greater dynamic range for the audio that was not possible through analog transmission. This greater dynamic range causes most dialog to sound too low in the presence of some very loud sound effects. The following example provides an explanation. Suppose an analog transmission (or recording) has the capability to transmit dynamic range amplitudes up to 95 dB and dialog is typically recorded at 80 dB. Loud segments of remaining audio may obscure the dialog when that remaining audio reaches the upper limit while someone is speaking. However, this situation is exacerbated when digital audio compression allows a dynamic range up to 105 dB. Clearly, the dialog will remain at the same level (80 dB) with respect to other sounds, only now the loud remaining audio can be more realistically reproduced in terms of its amplitude. User complaints that dialog levels have been recorded too low on DVD's are very common. In fact, the dialog IS at the proper level and is more appropriate and realistic than what exists for analog recordings with limited dynamic range.
Even for consumers who currently have properly calibrated home theater systems, dialog is frequently masked by the loud remaining audio sections in many DVD movies produced today. A small group of consumers are able to find some improvement in intelligibility by increasing the volume of the center channel and/or decreasing the volume of all of the other channels. However, this fixed adjustment is only acceptable for certain audio passages and it disrupts the levels from the proper calibration. The speaker levels are typically calibrated to produce certain sound pressure level (SPL)s in the viewing location. This proper calibration ensures that the viewing is as realistic as possible. Unfortunately this means that loud sounds are reproduced very loud. During late night viewing, this may not be desirable. However, any adjustment of the speaker levels will disrupt the calibration.
SUMMARY OF THE INVENTION
A method for providing multiple users with voice-to-remaining audio (VRA) adjustment capability includes receiving at a first decoder a voice signal and a remaining audio signal and simultaneouly receiving at a second decoder, the voice signal and the remaining audio signal, wherein the voice signal and the remaining audio signal are received separately; and separately adjusting by each of the decoders, the separately received voice and remaining audio signals.
REFERENCES:
patent: 2783677 (1957-03-01), Becker
patent: 3046337 (1962-07-01), Hornyak
patent: 3110769 (1963-11-01), Bertram
patent: 4024344 (1977-05-01), Dolby et al.
patent: 4051331 (1977-09-01), Strong et al.
patent: 4052559 (1977-10-01), Paul et al.
patent: 4074084 (1978-02-01), van den Berg
patent: 4150253 (1979-04-01), Knoppel
patent: 4405831 (1983-09-01), Michelson
patent: 4406001 (1983-09-01), Klasco et al.
patent: 4454609 (1984-06-01), Kates
patent: 4484345 (1984-11-01), Stearns
patent: 4516257 (1985-05-01), Torick
patent: 4622440 (1986-11-01), Slavin
patent: 4631525 (1986-12-01), Serravalle, Jr.
patent: 4647980 (1987-03-01), Steventon et al.
patent: 4776016 (1988-10-01), Hansen
patent: 4809337 (1989-02-01), Scholz et al.
patent: 4816905 (1989-03-01), Tweedy et al.
patent: 4866515 (1989-09-01), Tagawa et al.
patent: 4868881 (1989-09-01), Zwicker et al.
patent: 4890170 (1989-12-01), Inohana et al.
patent: 4941179 (1990-07-01), Bergenstoff et al.
patent: 5003605 (1991-03-01), Phillipps et al.
patent: 5033036 (1991-07-01), Ohmori et al.
patent: 5113447 (1992-05-01), Hately et al.
patent: 5131311 (1992-07-01), Murakami et al.
patent: 5138498 (1992-08-01), Takigami
patent: 5144454 (1992-09-01), Cury
patent: 5146504 (1992-09-01), Pinckley
patent: 5155510 (1992-10-01), Beard
patent: 5155770 (1992-10-01), Maejima
patent: 5197100 (1993-03-01), Shiraki
patent: 5210366 (1993-05-01), Sykes, Jr.
patent: 5212764 (1993-05-01), Ariyoshi
patent: 5216718 (1993-06-01), Fukuda
patent: 5228098 (1993-07-01), Kane et al.
patent: 5285503 (1994-02-01), Satoh et al.
patent: 5289272 (1994-02-01), Rabowsky et al.
patent: 5294746 (1994-03-01), Tsumura et al.
patent: 5297209 (1994-03-01), Kowaki
patent: 5319713 (1994-06-01), Waller, Jr. et al.
patent: 5323467 (1994-06-01), Hermes
patent: 5341253 (1994-08-01), Liao et al.
patent: 5384599 (1995-01-01), Casavant et al.
patent: 5395123 (1995-03-01), Kondo
patent: 5396560 (1995-03-01), Arcos et al.
patent: 5400409 (1995-03-01), Linhard
patent: 5408686 (1995-04-01), Mankovitz
patent: 5434922 (1995-07-01), Miller et al.
patent:
Saunders William R.
Vaudrey Michael A.
Abebe Daniel
Hearing Enhancement Company LLC
Kenyon & Kenyon
Korzuch William
LandOfFree
Use of voice-to-remaining audio (VRA) in consumer applications does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Use of voice-to-remaining audio (VRA) in consumer applications, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Use of voice-to-remaining audio (VRA) in consumer applications will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2573269