Extracting information from formatted sources

Data processing: database and file management or data structures – Database design – Data structure types

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C707S793000, C707S793000, C715S200000

Reexamination Certificate

active

07630968

ABSTRACT:
An extraction manager extracts information from formatted input. The input is annotated with presentation information, and parsed into a set of elements comprising a canonical representation thereof. An information analyzer analyzes the elements in order to glean additional information. An entity extractor determines entities to extract from the input. The entity extractor analyzes elements according to specific entities to be extracted, and creates entity specific observations for analyzed elements. These observations comprise possible values for the relevant entities. A heuristics processor maintains a collection of entity specific heuristics, each comprising a test to help determine the suitability of data as a value for the corresponding entity. The heuristics processor selects heuristics for the entities to be extracted, and tests observations for these entities against the selected heuristics. Responsive to this testing, ordered possible values for entities to extract are determined.

REFERENCES:
patent: 6606659 (2003-08-01), Hegli et al.
patent: 6947985 (2005-09-01), Hegli et al.
patent: 2002/0091671 (2002-07-01), Prokoph
patent: 2004/0158799 (2004-08-01), Breuel
patent: 2005/0131896 (2005-06-01), Cao et al.
patent: 2005/0289452 (2005-12-01), Kashi et al.
patent: 2006/0224950 (2006-10-01), Takaai et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Extracting information from formatted sources does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Extracting information from formatted sources, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Extracting information from formatted sources will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4146248

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.