Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2007-09-04
2007-09-04
Wassum, Luke (Department: 2167)
Data processing: database and file management or data structures
Database design
Data structure types
C707S793000, C707S793000
Reexamination Certificate
active
10187859
ABSTRACT:
A full text indexing system is provided for processing content associated with data applications such as encyclopedia and dictionary applications. A build process collects data from various sources, processes the data into constituent parts, including alternative word sets, and stores the constituent parts in structured database tables. A run-time process is used to query the database tables and the results in order to provide effective matches in an efficient manner. Run-time processing is optimized by preprocessing all steps that are query-independent during the build process. A double word table representing all possible word pair combinations for each index entry and an alternative word table are used to further optimize run-time processing.
REFERENCES:
patent: 5265065 (1993-11-01), Turtle
patent: 5321833 (1994-06-01), Chang et al.
patent: 5369577 (1994-11-01), Kadashevich et al.
patent: 5374928 (1994-12-01), Moore et al.
patent: 5469354 (1995-11-01), Hatakeyama et al.
patent: 5701469 (1997-12-01), Brandli et al.
patent: 5809502 (1998-09-01), Burrows
patent: 5835905 (1998-11-01), Pirolli et al.
patent: 5864863 (1999-01-01), Burrows
patent: 5963965 (1999-10-01), Vogel
patent: 6067552 (2000-05-01), Yu
patent: 6112202 (2000-08-01), Kleinberg
patent: 6175830 (2001-01-01), Maynard
patent: 6202064 (2001-03-01), Julliard
patent: 6336112 (2002-01-01), Chakrabarti et al.
patent: 6484166 (2002-11-01), Maynard
patent: 6493692 (2002-12-01), Kobayashi et al.
patent: 6493705 (2002-12-01), Kobayashi et al.
patent: 6502091 (2002-12-01), Chundi et al.
patent: 6542889 (2003-04-01), Aggarwal et al.
patent: 6574622 (2003-06-01), Miyauchi et al.
patent: 6665837 (2003-12-01), Dean et al.
patent: 6678694 (2004-01-01), Zimmermann et al.
patent: 6775666 (2004-08-01), Stumpf et al.
patent: 2004/0225497 (2004-11-01), Callahan
Fagan, J.L. “The Effectiveness of a Nonsyntactic Approach to Automatic Phrase Indexing for Document Retrieval”, Journal of the American Society for Information Science, vol. 40, No. 2, 1989, pp. 115-132.
Brill, E. and M. Marcus “Tagging an Unfamiliar Text with Minimal Human Supervision”, Proceedings of the Fall Symposium on Probabilistic Approaches to Natural Language, AAAI Technical Report, 1992.
Salton, G. and J. Allan “Selective Text Utilization and Text Traversal”, Proceedings of the Fifth ACM Conference on Hypertext and Hypermedia, Nov. 14-18, 1993, pp. 131-144.
Salton, G., J. Allan and A. Singhal “Automatic Text Decomposition and Structuring”, Information Processing & Management, vol. 32, No. 2, Mar. 1996, pp. 127-138.
Beeferman, D. “Lexical Discovery with an Enriched Semantic Network”, Proceedings of the ACL/COLING Workshop on Applications of WordNet in Natural Language Processing Systems, 1998, pp. 135-141.
Jacqeumin, C. “Syntagmatic and Paradigmatic Representations of Term Variation”, Proceedings of the 37thAnnual Meeting of the Association for Computational Linguistics and Conference on Computational Linguistics, 1999, pp. 341-348.
Salton, G. “Automatic Text Indexing Using Complex Identifiers”, Proceedings of the ACM Conference on Document Processing Systems, 2000, pp. 135-144.
Dillon, M. and A.S. Gray “FASIT: A Fully Automatic Syntactically Based Indexing System”, Journal of the American Society for Information Science, vol. 34, No. 2, 1983, pp. 99-108.
Croft, W.B., H.R. Tuttle and D.D. Lewis “The Use of Phrases and Structured Queries in Information Retrieval”, Proceedings of the 14thAnnual ACM SIGIR Conference on Research and Development in Information Retrieval, 1991, pp. 32-45.
Mauldin, M.L. “Retrieval Performance in FERRET: A Conceptual Information Retrieval System”, Proceedings of the 14th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, 1991, pp. 347-355.
Riloff, E. and W. Lenhert “Information Extraction as a Basis for High-Precision Text Classification”, ACM Transactions on Information Systems, vol. 12, No. 3, 1994, pp. 296-333.
Kochtanek, T.R. “Document Clustering, Using Macro Retrieval”, Journal of the American Society for Information Science, vol. 34, No. 5, pp. 356-359, Sep. 1983.
Anderson Christopher Walter
Jayanti Harish
Microsoft Corporation
Wassum Luke
Workman Nydegger
LandOfFree
Content data indexing does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Content data indexing, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Content data indexing will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3756355