Data processing: speech signal processing – linguistics – language – Speech signal processing – For storage or transmission
Reexamination Certificate
2002-08-30
2008-11-18
Opsasnick, Michael N. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
For storage or transmission
Reexamination Certificate
active
07454331
ABSTRACT:
Mechanisms are known that allow receivers to control loudness of speech in broadcast signals but these mechanisms require an estimate of speech loudness be inserted into the signal. Disclosed techniques provide improved estimates of loudness. According to one implementation, an indication of the loudness of an audio signal containing speech and other types of audio material is obtained by classifying segments of audio information as either speech or non-speech. The loudness of the speech segments is estimated and this estimate is used to derive the indication of loudness. The indication of loudness maybe used to control audio signal levels so that variations in loudness of speech between different programs is reduced. A preferred method for classifying speech segments is described.
REFERENCES:
patent: 4281218 (1981-07-01), Chuang et al.
patent: 4543537 (1985-09-01), Kuhn et al.
patent: 5097510 (1992-03-01), Graupe
patent: 5457769 (1995-10-01), Valley
patent: 5548638 (1996-08-01), Yamaguchi et al.
patent: 5649060 (1997-07-01), Ellozy et al.
patent: 5712954 (1998-01-01), Dezonno
patent: 5819247 (1998-10-01), Freund et al.
patent: 5878391 (1999-03-01), Aarts
patent: 6061647 (2000-05-01), Barrett
patent: 6094489 (2000-07-01), Ishige et al.
patent: 6125343 (2000-09-01), Schuster
patent: 6182033 (2001-01-01), Accardi et al.
patent: 6233554 (2001-05-01), Heimbigner et al.
patent: 6272360 (2001-08-01), Yamaguchi et al.
patent: 6275795 (2001-08-01), Tzirkel-Hancock
patent: 6298139 (2001-10-01), Poulsen et al.
patent: 6311155 (2001-10-01), Vaudrey et al.
patent: 6314396 (2001-11-01), Monkowski
patent: 6351731 (2002-02-01), Anderson et al.
patent: 6353671 (2002-03-01), Kandel et al.
patent: 6370255 (2002-04-01), Schaub et al.
patent: 6411927 (2002-06-01), Morin et al.
patent: 6625433 (2003-09-01), Poirier et al.
patent: 6772127 (2004-08-01), Saunders et al.
patent: 6807525 (2004-10-01), Li et al.
patent: 6823303 (2004-11-01), Su et al.
patent: 6889186 (2005-05-01), Michaelis
patent: 6985594 (2006-01-01), Vaudrey et al.
patent: 7065498 (2006-06-01), Thomas et al.
patent: 7068723 (2006-06-01), Foote et al.
patent: 7155385 (2006-12-01), Berestesky et al.
patent: 19509149 (1996-09-01), None
patent: 19848491 (2000-04-01), None
patent: 0517233 (1992-12-01), None
patent: 0746116 (1996-12-01), None
patent: 0637011 (1998-10-01), None
patent: 9827543 (1998-06-01), None
patent: WO 0045379 (2000-08-01), None
patent: WO 0078093 (2000-12-01), None
WO 00/78093 Vaudrey et al, “Voice to Remaining Audio (VRA) Interactive Hearing Aid and Auxiliary Equipment”, Dec. 21, 2000.
Juang et al, “Technical Advances in Digital Audio Radio Broadcasting”, Proceedings of the IEEE, vol. 90, Issue 8, Aug. 2002, pp. 1303-1333.
Atkinson, I. A.; et al., “Time Envelope LP Vocoder: A New Coding Technique at Very Low Bit Rates,” 4th E 1995, ISSN 1018-4074, pp. 241-244.
Belger, “The Loudness Balance of Audio Broadcast Programs,” J. Audio Eng. Soc., vol. 17, No. 3, Jun. 1969, pp. 282-285.
Saunders, “Real-Time Discrimination of Broadcast Speech/Music,” Proc. of Int. Conf. on Acoust. Speech and Sig. Proc., 1996, pp. 993-996.
Moore, Glasberg and Baer, “A Model for the Prediction of Thresholds, Loudness and Partial Loudness,” J. Audio Eng. Soc., vol. 45, No. 4, Apr. 1997, pp. 224-240.
Bosi, et al., “ISO/IEC MPEG-2 Advanced Audio Coding,” J. Audio Eng. Soc., vol. 45, No. 10, Oct. 1997, pp. 789-814.
Scheirer and Slaney, “Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator,” Proc. of Int. Conf. on Acoust. Speech and Sig. Proc., 1997, pp. 1331-1334.
Schapire, “A Brief Introduction to Boosting,” Proc. of the 16th Int. Joint Conf. on Artifical Intelligence, 1999.
Glasberg and Moore, “A Model of Loudness Applicable to Time-Varying Sounds”, J. Audio Eng. Soc., vol. 50, No. 5, May 2002, pp. 331-342.
Guide to the Use of the ATSC Digital Television Standard, Oct. 1995, Sections 1-4 and 6.0-6.6.
ATSC Standard: Digital Audio Compression (AC-3), Revision A, Aug. 2001, Sections 1-4, 6, 7.3, 7.6, 7.7 and 8.
ATSC Standard: Digital Television Standard, Revision B, Aug. 2001, Sections 1-5 and Annex B.
ISO Standard 532:1975 published 1975.
CEI/IEC Standard 60804 published Oct. 2000.
Gundry Kenneth James
Riedmiller Jeffrey Charles
Robinson Charles Quito
Venezia Steven Joseph
Vinton Mark Stuart
Dolby Laboratories Licensing Corporation
Gallagher & Lathrop
Lathrop David N.
Opsasnick Michael N.
LandOfFree
Controlling loudness of speech in signals that contain... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Controlling loudness of speech in signals that contain..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Controlling loudness of speech in signals that contain... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4047030