Data processing: speech signal processing – linguistics – language – Linguistics – Dictionary building – modification – or prioritization
Reexamination Certificate
2011-03-29
2011-03-29
Sked, Matthew J (Department: 2626)
Data processing: speech signal processing, linguistics, language
Linguistics
Dictionary building, modification, or prioritization
C704S001000, C704S009000
Reexamination Certificate
active
07917355
ABSTRACT:
Methods, systems, and apparatus, including computer program products, in which data from web documents are partitioned into a training corpus and a development corpus are provided. First word probabilities for words are determined for the training corpus, and second word probabilities for the words are determined for the development corpus. Uncertainty values based on the word probabilities for the training corpus and the development corpus are compared, and new words are identified based on the comparison.
REFERENCES:
patent: 6052657 (2000-04-01), Yamron et al.
patent: 6128613 (2000-10-01), Wong et al.
patent: 6167368 (2000-12-01), Wacholder
patent: 6651058 (2003-11-01), Sundaresan et al.
patent: 6711577 (2004-03-01), Wong et al.
patent: 7024624 (2006-04-01), Hintz
patent: 7478033 (2009-01-01), Wu et al.
patent: 7680649 (2010-03-01), Park
patent: 2004/0225667 (2004-11-01), Hu et al.
patent: 2005/0021324 (2005-01-01), Brants et al.
patent: 2005/0278613 (2005-12-01), Morinaga et al.
patent: 2007/0143101 (2007-06-01), Goutte
patent: 2009/0055381 (2009-02-01), Wu et al.
Ren, He. “A chinese word extraction algorithm based on information entropy,” Journal of Chinese Information Processing, May 2006.
Sui, Z. et al. “Automatic recognition of Chinese scientific and technological terms using integrated linguistic knowledge,” Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on, Oct. 26-29, 2003, pp. 444-451.
He, S. et al. “Bootstrap method for Chinese new words extraction,” Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on, pp. 581-584 vol. 1.
Jiang, W. et al. “An Improved Unknown Word Recognition Model based on Multi-Knowledge Source Method,” Intelligent Systems Design and Applications, 2006. ISDA '06. Sixth International Conference on, Oct. 16-18, 2006, pp. 825-832.
He et al., “An Approach to Automatically Constructing Domain Ontology”, In: PACLIC 2006, Wuhan, China, Nov. 1-3, 2006.
Hitamitsu et al., “Topic Word Selection Based on Combinatorial Probability”, NLPRS-2001, pp. 289-296, 8 pages.
Lavrenko et al., “Relevance Models for Topic Detection and Tracking”, In Proceeding of HLT-2002, 12 pages.
Notification of Transmittal of the International Search Report and the Written Opinion of the International Searching Authority, or the Declaration, PCT/CN2008/072128, Dec. 4, 2008, 16 pages, 7 pages.
Notification Concerning Transmittal of International Preliminary Report on Patentability and the Written Opinion of the International Searching Authority, PCT/CN2008/072128, Mar. 4, 2010, 10 pages.
Ryu et al., “Determining the Specificity of Terms based on Information Theoretic Measures”, CompuTerm 2004 Poster Session—3rdInternational Workshop on Computational Terminology, pp. 87-90, 4 pages.
USPTO Non-Final Office Action in U.S. Appl. No. 11/844,067, mailed Aug. 5, 2010, 19 pages.
Fish & Richardson P.C., Amendment in Reply to Action dated Aug. 5, 2010 in U.S. Appl. No. 11/844,067, filed Nov. 5, 2010, 17 pages.
Hong Feng
Liu Tang Xi
Wang Yonggang
Wu Jun
Yang Bo
Fish & Richardson P.C.
Google Inc.
Sked Matthew J
LandOfFree
Word detection does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Word detection, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Word detection will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2634098