Patent
1994-09-28
1998-08-25
Weinhardt, Robert A.
Data processing: speech signal processing, linguistics, language
Linguistics
Natural language
704/10, G06F 17/20, G06F 17/21, G06F 17/28, G06F 17/60
active
057992681
ABSTRACT:
A method involving computer-mediated linguistic analysis of online technical documentation to extract and catalog from the documentation knowledge essential to, for example, creating an online help database useful in providing online assistance to users in performing a task. The method comprises stripping markup tags from the documentation, linguistically analyzing and annotating the text, including the steps of morphologically and lexically analyzing the text, disambiguating between possible parts-of-speech for each word, and syntactically analyzing and labeling each word. The method further comprises the steps of combining the linguistically analyzed, annotated, and labeled text and previously stripped markup information into a merged file, mining the merged file for domain knowledge, including the steps of identifying and creating a list of technical terminology, mining the merged file for manifestations of domain primitives and maintaining a list of manifestations of such domain primitives in an observations file, analyzing the discourse context of each sentence or phrase in the merged file, analyzing the frequency of manifestations of domain primitives in the observations file to determine those that are important, expanding the list of key terms by searching for terms sanctioned by a domain primitive deemed important in the previous step, and searching the merged file for larger relations by searching for particular lexico-syntactic patterns involving key terms and manifestations of domain primitives previously identified. The method further comprises the steps of structuring the knowledge thus mined and building a domain catalog.
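The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (strip_markup, mine_observations, important_terms), the sample text, and the verb/noun pattern are all hypothetical, and a single regular expression stands in for the morphological, part-of-speech, and syntactic analysis that the abstract describes. The sketch strips markup while recording where each tag occurred, collects matches of one simple lexico-syntactic pattern as observations, and keeps the terms whose frequency reaches a threshold.

```python
import re
from collections import Counter

# Matches simple SGML/HTML-style tags such as <p> or </b>.
TAG_RE = re.compile(r"<[^>]+>")

# Hypothetical pattern: an action verb, "the", then an optional modifier
# and a user-interface noun. A real system would work on parsed, tagged text.
ACTION_RE = re.compile(
    r"\b(choose|click|open|select) the ((?:\w+ )?(?:button|menu|window|icon))\b",
    re.IGNORECASE,
)


def strip_markup(doc):
    """Return (plain_text, tags); tags holds (offset_in_plain_text, tag)."""
    pieces, tags = [], []
    src_pos = 0      # current position in the marked-up source
    plain_len = 0    # length of the plain text emitted so far
    for match in TAG_RE.finditer(doc):
        chunk = doc[src_pos:match.start()]
        pieces.append(chunk)
        plain_len += len(chunk)
        tags.append((plain_len, match.group()))
        src_pos = match.end()
    pieces.append(doc[src_pos:])
    return "".join(pieces), tags


def mine_observations(plain_text):
    """Collect (verb, object) pairs matched by the assumed pattern."""
    return [(v.lower(), o.lower()) for v, o in ACTION_RE.findall(plain_text)]


def important_terms(observations, min_count=2):
    """Keep objects observed at least min_count times, sorted by name."""
    counts = Counter(obj for _verb, obj in observations)
    return sorted(term for term, n in counts.items() if n >= min_count)


# Made-up sample documentation, for illustration only.
sample = (
    "<p>Click the <b>Print</b> button.</p>\n"
    "<p>Choose the File menu, then click the Print button.</p>"
)
plain, tags = strip_markup(sample)
observations = mine_observations(plain)
print(plain)
print(observations)
print(important_terms(observations))
```

Recording tag offsets alongside the plain text is one way to keep the stripped markup available for a later merge step; the abstract does not specify how the merged file is laid out.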
REFERENCES:
patent: 4965763 (1990-10-01), Zamora
patent: 4992972 (1991-02-01), Brooks et al.
patent: 5424947 (1995-06-01), Nagao et al.
patent: 5475587 (1995-12-01), Anick et al.
L.L. Briner, "Identifying Keywords in Text Data Processing", Directions and Challenges, Fifteenth Annual Technical Symposium, Jun. 17, 1996, pp. 85-90.
Alan F. Smeaton, "Natural Language Processing and Information Retrieval", Information Processing and Management, vol. 26, No. 1, pp. 73-92, 1990.
Apple Computer Inc.
Hughet William N.
Weinhardt Robert A.