Content-driven speech- or audio-browser

Data processing: speech signal processing – linguistics – language – Speech signal processing – Application

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

Reexamination Certificate

active

06772124

ABSTRACT:

FIELD OF THE INVENTION
The invention relates in particular to a method for categorizing web sites that provide audio broadcasts over the Internet.
BACKGROUND ART
Currently, nearly 10,000 radio stations broadcast over the Internet. The stations stream their audio content. A streamed file is a file that can be started for playing out before the download is completed. With a proper network connection, decoding and playback software on one's PC or set top box the audio can be captured. Audio output hardware, e.g., analog sound cards, USB speakers, and streaming media tools, such as RealPlayer from RealNetworks, Inc., have become widely available and enable to add the radio functionality to one's PC.
SUMMARY OF THE INVENTION
The user has to check many listings of large numbers of web sites that provide these broadcasts over the Internet. Accordingly, there is a need for helping the user to select from among the huge number of stations available. To this end, the invention provides a method for categorizing web sites or resources on the Internet that provide audio (e.g., speech and music) streaming based on their typical content. A web resource that provides audio streaming is identified by its resource type. The resource type is determined by way of the type extension in its URL that indicates the file format, e.g., “.ram”, “.tsp” or “.swa”. This extension enables, for example, to automatically open the proper software applications (or “plug-ins”) in the user's browser when the hyperlink is clicked. Accordingly, the relevant resources on the Internet can be identified based on their URL. If the file extension is not available through the URL, the resource type is determined by the MIME type or content-type information provided in the HTTP header of the resource. Taking into consideration the resource's country domain extension, e.g., “.nl” for the Netherlands or “.ru” for Russia, further optimizes the analysis of the URL, for example if one is interested in audio content in a specific natural language. Upon finding a relevant resource, i.e., one that provides streaming of audio, the resource's file is retrieved from the relevant server and analyzed based on its audio content. In a preferred embodiment, the inventor proposes to use speech recognition or music (tune/rhythm) recognition software to search through and categorize these stations by, e.g., language, style of music, absence of commercials. Speech recognition software is capable of determining the signature of various kinds of music, thus allowing categorization of music with just this kind of software. For example, classical music has typically a different speech recognition signature than rock music. A server can be dedicated to categorize stations or channels in a data base, similar as to what PlanetSearch or Altavista does for text documents. One or more web crawlers can be used in parallel to automatically fetch web sites that supply audio so as to identify them for a search engine. Additionally, the resource's server can be evaluated by the crawler for the quality of the connection, e.g., connection speed, reliability, etc. For example, the categorizing server may recommend to a user, who has broadband network access (e.g., ISDB, cable, T1), higher connection speed sources. An audio browser is provided, analogous to PlanetSearch's or Alta Vista's for text, to provide a searchable collection of Internet audio web sites based from which specific pages are returned to the user based on certain audio search criteria. Alternatively, the catalog approach (Yahoo experts hand-pick and assign sites to categories) can be taken to categorize the stations at the server and make them accessible through a search engine. Once the sites are categorized, a user provides a query input to the server and receives a list of URLs representative of the channels that match the query input (e.g., give me a French language station that plays music like this). As an alternative or supporting this, the server provides a customized electronic program guide to the user based on a profile of the user stored on the server, e.g., using the SmartConnect infrastructure of Philips Electronics.
The invention is of commercial interest in particular to, e.g., cable providers and network owners, and service providers in order to serve as an incentive for subscribers.
As to music recognition, see, for example U.S. Ser. No. 08/840,356 filed Apr. 28, 1997 or Mark Hoffberg for BIBLIOGRAPHIC MUSIC DATA BASE WITH NORMALIZED MUSICAL THEMES, now allowed and herein incorporated by reference. This patent document discusses, among other things, how rhythm information or tonal information of a musical theme can be used to identify the theme. The rhythm information comprises the time signature (meter) and the accentuations of the theme. The time signature determines the number of beats to the measure. The accentuation determines which beat gets an accent and which one does not. For example, the sign
6
8
in a musical score is the time signature indicating that the meter is 6 beats to the measure and that an eighth note gets one beat. Flamenco music has a variety of different styles, each determined by its own compàs (rhythmic accentuation pattern). Typical examples of flamenco music are Alegrias, Bulerìas, Siguiriyas and Soleares that all have 12 beats to the measure. In the Alegrias, Bulerìas and Soleares, the third, sixth, eighth, tenth and twelfth beats are accentuated. The first, third, fifth, eighth and eleventh beats are emphasized in the Siguiriyas style. In this system rhythmic accentuation patterns are used as input data in order to retrieve bibliographic information associated with the theme that is represented by the rhythm. For example, the rhythmic accentuation pattern is entered into the system as a substantially monotonic sequence of accentuated and unaccentuated sounds. The input data then is represented by, e.g., a sequence of beats or peaks of varying height in the time domain. The relative distances between successive peaks represent the temporal aspects of the pattern and the relative heights represent the accentuations in the pattern. The sequence of beats and rests in between is represented by a digital word. The words can be stored lexicographically to enable a fast and orderly retrieval. If tonal information and/or rhythm information can be used to identify individual musical themes, they can also be used to identify with more or less accuracy a certain style of music.
As to SmartConnect, see, for example U.S. Ser. No. 09/160,490 filed Sep. 25, 1998 for Adrian Turner et al., for CUSTOMIZED UPGRADING OF INTERNET-ENABLED DEVICES BASED ON USER-PROFILE, herein incorporated by reference. This document relates to a server system that maintains a user profile of a particular end-user of consumer electronics network-enabled equipment and a data base of new technical features for this type of equipment, e.g., a home network. If there is a match between the user-profile and a new technical feature, and the user has indicates to prefer receiving information about updates or sales offers, the user gets notified via the network of the option to obtain the feature.
As to SmartConnect, also see U.S. Ser. No. 09/189,535 filed Nov. 10, 1998 for Yevgeniy Shteyn for UPGRADING OF SYNERGETIC ASPECTS OF HOME NETWORKS, herein incorporated by reference. This document relates to a system with a server that has access to an inventory of devices and capabilities on a user's home network. The inventory is, for example, a look-up service as provided by HAVi, JINI and Home API architectures. The server has also access to a data base with information of features for a network. The server determines if the synergy of the apparatus present on the user's network can be enhanced based on the listing of the inventory and on the user's profile. If there are features that are relevant to the synergy, based on these criteria, the user gets notified.


REFERENCES:
patent: 6185527 (2001-02-01), Petkovic et al.
patent: 6243676 (20

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Content-driven speech- or audio-browser does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Content-driven speech- or audio-browser, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Content-driven speech- or audio-browser will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3313190

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.