Method and apparatus for identifying words described in a portab

Data processing: database and file management or data structures – Database design – Data structure types

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

G06F 1500

Patent

active

058325301

ABSTRACT:
A method and apparatus for identifying words stored in a portable electronic document. A digital computation apparatus stores a page of a document including characters in text segments that have not been identified as words. A word identifying mechanism analyzes the text segments of the page and stores the text segments as text objects in a linked list. The word identifying mechanism identifies words from the text objects in the linked list by analyzing the text objects for word breaks and by analyzing gaps between text objects using position data associated with the text segments. The identified words are stored in a word list and are sorted if necessary. A method of the present invention receives a text segment from a page of a document having multiple text segments and associated position data, including x and y coordinates for each text segment. A text object is created for each text segment, and the text objects are entered into a linked list. Words are then identified from the linked list by analyzing the text objects for word breaks and by analyzing gaps between text objects using the associated position data. Words that are identified in the text objects are added to a word list. The above steps are repeated until the end of the page is reached. The method and apparatus can be used for searching for words in a portable electronic document.

REFERENCES:
patent: 4741045 (1988-04-01), Denning
patent: 5003614 (1991-03-01), Tanaka et al.
patent: 5161245 (1992-11-01), Fenwick
patent: 5167016 (1992-11-01), Bagley et al.
patent: 5224040 (1993-06-01), Tou
patent: 5265171 (1993-11-01), Sangu
patent: 5278918 (1994-01-01), Bernzott et al.
patent: 5321770 (1994-06-01), Huttenlocher et al.
patent: 5325444 (1994-06-01), Cass et al.
patent: 5359673 (1994-10-01), De La Beaujardiere
patent: 5369714 (1994-11-01), Withgott et al.
patent: 5384864 (1995-01-01), Spitz
patent: 5390259 (1995-02-01), Withgott et al.
patent: 5410611 (1995-04-01), Huttenlocher et al.
patent: 5438630 (1995-08-01), Chen et al.
patent: 5455871 (1995-10-01), Bloomberg et al.
patent: 5465309 (1995-11-01), Motoyama et al.
patent: 5483629 (1996-01-01), Johnson
patent: 5483653 (1996-01-01), Furman
patent: 5488719 (1996-01-01), Kaplan et al.
patent: 5491760 (1996-02-01), Withgott et al.
patent: 5493634 (1996-02-01), Bonk et al.
patent: 5504843 (1996-04-01), Catapano et al.
patent: 5504891 (1996-04-01), Motoyama et al.
patent: 5506985 (1996-04-01), Motoyama et al.
patent: 5513311 (1996-04-01), McKiel Jr.
patent: 5539841 (1996-07-01), Huttenlocher et al.
Lang, "About GSview", GSview.exe help file, Jan. 1997.
Birrell et al., "The ps to text program", http://www.research.digital.com/SRC/virtualpaper/pstotext.html, Oct. 29, 1996.
Birrell et al., "ps to text", man page documentation, http://www.research.digital.com/SRC/virtualpaper/manpages/pstotext.1.html, Oct. 29, 1996.
R. Skinner, "Cross-Platform Formatting Programs," Library Software Review, Summer, 1994, vol. 13, n. 2, pp. 152-156.
IBM Technical Disclosure Bulletin, vol. 37, No. 5, May 1, 1994, pp. 163-166, "Generating Words from Characters using an Adoptive `Learning` Algorithm".
Lau, "Building a Hypermedia Information System on the Internet", IPCC '94--Scaling New Heights in Technical Commnication, Sep. 28, 1994, pp. 192-197.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for identifying words described in a portab does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for identifying words described in a portab, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for identifying words described in a portab will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-704517

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.