Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
2007-11-13
2007-11-13
Edouard, Patrick N. (Department: 2626)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S232000, C084S616000
Reexamination Certificate
active
09939954
ABSTRACT:
The method of the present invention utilizes machine-learning techniques, particularly Support Vector Machines in combination with a neural network, to process a unique machine-learning enabled representation of the audio bitstream. Using this method, a classifying machine is able to autonomously detect characteristics of a piece of music, such as the artist or genre, and classify it accordingly. The method includes transforming digital time-domain representation of music into a frequency-domain representation, then dividing that frequency data into time slices, and compressing it into frequency bands to form multiple learning representations of each song. The learning representations that result are processed by a group of Support Vector Machines, then by a neural network, both previously trained to distinguish among a given set of characteristics, to determine the classification.
REFERENCES:
patent: 4843562 (1989-06-01), Kenyon et al.
patent: 5335291 (1994-08-01), Kramer et al.
patent: 5479575 (1995-12-01), Yoda
patent: 5546477 (1996-08-01), Knowles et al.
patent: 5619616 (1997-04-01), Brady et al.
patent: 5763800 (1998-06-01), Rossum et al.
patent: 5918223 (1999-06-01), Blum et al.
patent: 5969654 (1999-10-01), Goldin
patent: 6192360 (2001-02-01), Dumais et al.
patent: 6201176 (2001-03-01), Yourlo
patent: 6657117 (2003-12-01), Weare et al.
patent: 6697564 (2004-02-01), Toklu et al.
patent: 6931451 (2005-08-01), Logan et al.
Matityaho et al; “Neural Network Based Model for Classification of Music Type;” Electrical and Electronics Engineers in Israel, 1995., Eighteenth Convention of , Mar. 7-8, 1995, pp. 4.3.4/1-4.3.4/5.
H. Soltau, T. Schultz, Martin Westphal, and Alex Waibel. “Recognition of Music Types”. ICASSP 1998, vol. II, pp. 1137-1140, 1998.
Herrera, Perfecto; Amatriain, Xavier; Batlle, Eloi and Serra, Xavier. “Towards instrument segmentation for music content description: a critical review of instrument classification techniques.” Proceedings of ISMIR 2000, Plymouth MA, 2000.
Pye, “Content-Based Methods for the Management of Digital Music,” in Proc. Int. Conf Acoustics, Speech, Signal Processing (ICASSP), 2000.
Scott, “Music Classification Using Neural Networks,” Tech. Rep., Stanford University, 2001.
“An Overview of Audio Information Retrieval,” J. Foote, ACM-Springer Multimedia Systems, 1998. “Towards Instrument Segmentation for Music Content Description: a Critical Review of Instrument Classification Techniques,” Herrera, P. et al., Proceedings of ISMIR 2000, Plymouth, MA.
“Audio Information Retrieval (AIR) Tools,” Tzanetakis, G. et al., Proceedings of ISMIR 2000, Plymouth, MA.” “Arthur: Retrieving Orchestral Music by Long-Term Structure,” J. Foote, Proceedings of ISMIR 2000, Plymouth, MA” “Mel Frequency Cepstral Coefficients for Music Modeling,” B. Logan, Proceedings of ISMIR 2000.
Moodlogic, www.Moodlogic.com, http://www.moodlogic.com/p117.html. Relatable, www.Relatable.com, http://www.relatable.com/tech/engine.html.
“A Comparison of Language modeling and Probabilistic Text Information Retrieval Approaches to Monophonic Music Retrieval,” J. Pickens, Proceedings of ISMIR 2000, Plymouth, MA. “NODElib Documentation,” G.W. Flake,http://www.neci.nj.nec.com/homepages/flake
odelib/html /.
“Efficient SVM Regression Training with SMO,” Machine Learning, to appear 2001, G.W. Flake, et al. “Musical Instrument Identification: A Pattern-Recognition Approach,” K.D. Martin, et al., Presented at the 136th Meeting of the Acoustical Society of America, 1998.
“A Tutorial on Support Vector Machines for Pattern Recognition,” C.J.C. Burges, Kluwer Academic Publishers, Boston, Manufactured in The Netherlands, 1998; “Neural Networks for Real World Tasks Limitations and Solutions,” S.R. Lawrence, p. 39 of thesis dated Aug. 12, 1996.
The Nature of Statistical Learning Theory, V.N. Vapnik, pp. 133-141, 1995. Bitbop, www.Bitbop.com, http://www.bitbop.com/press-bp20010508.html.
Flake Gary W.
Lawrence Stephen R.
Whitman Brian
Edouard Patrick N.
NEC Laboratories America, Inc.
Scully , Scott, Murphy & Presser, P.C.
Wozniak James S.
LandOfFree
Extracting classifying data in music from an audio bitstream does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Extracting classifying data in music from an audio bitstream, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Extracting classifying data in music from an audio bitstream will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3854194