Data processing: database and file management or data structures – Database design – Data structure types
Reexamination Certificate
2005-12-02
2009-08-11
Mizrahi, Diane (Department: 2617)
Data processing: database and file management or data structures
Database design
Data structure types
Reexamination Certificate
active
07574449
ABSTRACT:
Various technologies and techniques are disclosed that improve the identification of related content. An article for which to identify matching content is received or selected. The raw text of the article is analyzed to reduce the raw text to a core set of words, and the results are stored in a document feature vector array. The formatted text of the article is analyzed and vector array scores are updated based on the formatting. Anchor text words for documents that link to the article are added to the vector array. Articles linking to and from the particular article are identified and added to the vector array as appropriate. Transformations are performed, such as to adjust the vector scores based on how common or generic the words are. Vector arrays are created for other potentially related documents. The vectors are compared to determine how related they are to each other.
REFERENCES:
patent: 2002/0103809 (2002-08-01), Starzl et al.
patent: 2002/0165873 (2002-11-01), Kwok et al.
patent: 2003/0086515 (2003-05-01), Trans et al.
patent: 2003/0126139 (2003-07-01), Lee et al.
patent: 2005/0071300 (2005-03-01), Bartlett et al.
patent: 2007/0022072 (2007-01-01), Kao et al.
patent: WO01/95155 (2001-12-01), None
patent: WO03/001413 (2003-01-01), None
patent: WO2004/001979 (2003-12-01), None
International Search Report dated Feb. 26, 2007 for Application No. PCT/US2006/041058, 11 pages.
Microsoft Corporation
Mizrahi Diane
LandOfFree
Content matching does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Content matching, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Content matching will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-4133311