Data processing: database and file management or data structures – Database design – Data structure types
Patent
1997-06-27
1998-11-03
Burwell, Joseph R.
Data processing: database and file management or data structures
Database design
Data structure types
G06F 1500
Patent
active
058325301
ABSTRACT:
A method and apparatus for identifying words stored in a portable electronic document. A digital computation apparatus stores a page of a document including characters in text segments that have not been identified as words. A word identifying mechanism analyzes the text segments of the page and stores the text segments as text objects in a linked list. The word identifying mechanism identifies words from the text objects in the linked list by analyzing the text objects for word breaks and by analyzing gaps between text objects using position data associated with the text segments. The identified words are stored in a word list and are sorted if necessary. A method of the present invention receives a text segment from a page of a document having multiple text segments and associated position data, including x and y coordinates for each text segment. A text object is created for each text segment, and the text objects are entered into a linked list. Words are then identified from the linked list by analyzing the text objects for word breaks and by analyzing gaps between text objects using the associated position data. Words that are identified in the text objects are added to a word list. The above steps are repeated until the end of the page is reached. The method and apparatus can be used for searching for words in a portable electronic document.
REFERENCES:
patent: 4741045 (1988-04-01), Denning
patent: 5003614 (1991-03-01), Tanaka et al.
patent: 5161245 (1992-11-01), Fenwick
patent: 5167016 (1992-11-01), Bagley et al.
patent: 5224040 (1993-06-01), Tou
patent: 5265171 (1993-11-01), Sangu
patent: 5278918 (1994-01-01), Bernzott et al.
patent: 5321770 (1994-06-01), Huttenlocher et al.
patent: 5325444 (1994-06-01), Cass et al.
patent: 5359673 (1994-10-01), De La Beaujardiere
patent: 5369714 (1994-11-01), Withgott et al.
patent: 5384864 (1995-01-01), Spitz
patent: 5390259 (1995-02-01), Withgott et al.
patent: 5410611 (1995-04-01), Huttenlocher et al.
patent: 5438630 (1995-08-01), Chen et al.
patent: 5455871 (1995-10-01), Bloomberg et al.
patent: 5465309 (1995-11-01), Motoyama et al.
patent: 5483629 (1996-01-01), Johnson
patent: 5483653 (1996-01-01), Furman
patent: 5488719 (1996-01-01), Kaplan et al.
patent: 5491760 (1996-02-01), Withgott et al.
patent: 5493634 (1996-02-01), Bonk et al.
patent: 5504843 (1996-04-01), Catapano et al.
patent: 5504891 (1996-04-01), Motoyama et al.
patent: 5506985 (1996-04-01), Motoyama et al.
patent: 5513311 (1996-04-01), McKiel Jr.
patent: 5539841 (1996-07-01), Huttenlocher et al.
Lang, "About GSview", GSview.exe help file, Jan. 1997.
Birrell et al., "The ps to text program", http://www.research.digital.com/SRC/virtualpaper/pstotext.html, Oct. 29, 1996.
Birrell et al., "ps to text", man page documentation, http://www.research.digital.com/SRC/virtualpaper/manpages/pstotext.1.html, Oct. 29, 1996.
R. Skinner, "Cross-Platform Formatting Programs," Library Software Review, Summer, 1994, vol. 13, n. 2, pp. 152-156.
IBM Technical Disclosure Bulletin, vol. 37, No. 5, May 1, 1994, pp. 163-166, "Generating Words from Characters using an Adoptive `Learning` Algorithm".
Lau, "Building a Hypermedia Information System on the Internet", IPCC '94--Scaling New Heights in Technical Commnication, Sep. 28, 1994, pp. 192-197.
Ayers Robert M.
Paknad Mohammad Daryoush
Adobe Systems Incorporated
Burwell Joseph R.
LandOfFree
Method and apparatus for identifying words described in a portab does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method and apparatus for identifying words described in a portab, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for identifying words described in a portab will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-704517