Systems and methods for detecting text

Image analysis – Pattern recognition – Classification

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S176000, C707S793000

Reexamination Certificate

active

07570816

ABSTRACT:
The subject invention relates to facilitating text detection. The invention employs a boosted classifier and a transductive classifier to provide accurate and efficient text detection systems and/or methods. The boosted classifier is trained through features generated from a set of training connected components and labels. The boosted classifier utilizes the features to classify the training connected components, wherein inferred labels are conveyed to a transductive classifier, which generates additional properties. The initial set of features and the properties are utilized to train the transductive classifier. Upon training, the system and/or methods can be utilized to detect text in data under text detection, wherein unlabeled data is received, and connected components are extracted therefrom and utilized to generate corresponding feature vectors, which are employed to classify the connected components using the initial boosted classifier. Inferred labels are utilized to generate properties, which are utilized along with the initial feature vectors to classify each connected component using the transductive classifier.

REFERENCES:
patent: 3701095 (1972-10-01), Yamaguchi et al.
patent: 4955066 (1990-09-01), Notenboom
patent: 5109433 (1992-04-01), Notenboom
patent: 5181255 (1993-01-01), Bloomberg
patent: 5237628 (1993-08-01), Levitan
patent: 5297216 (1994-03-01), Sklarew
patent: 5465353 (1995-11-01), Hull et al.
patent: 5499294 (1996-03-01), Friedman
patent: 5526444 (1996-06-01), Kopec et al.
patent: 5542006 (1996-07-01), Shustorovich et al.
patent: 5594809 (1997-01-01), Kopec et al.
patent: 5699244 (1997-12-01), Clark et al.
patent: 5812698 (1998-09-01), Platt et al.
patent: 5832474 (1998-11-01), Lopresti et al.
patent: 5867597 (1999-02-01), Peairs et al.
patent: 5999653 (1999-12-01), Rucklidge et al.
patent: 6137908 (2000-10-01), Rhee
patent: 6233353 (2001-05-01), Danisewicz
patent: 6279014 (2001-08-01), Schilit et al.
patent: 6356922 (2002-03-01), Schilit et al.
patent: 6393395 (2002-05-01), Guha et al.
patent: 6397212 (2002-05-01), Biffar
patent: 6470094 (2002-10-01), Lienhart et al.
patent: 6487301 (2002-11-01), Zhao
patent: 6523134 (2003-02-01), Korenshtein
patent: 6546385 (2003-04-01), Mao et al.
patent: 6580806 (2003-06-01), Sato
patent: 6587217 (2003-07-01), Lahey et al.
patent: 6594393 (2003-07-01), Minka et al.
patent: 6658623 (2003-12-01), Schilit et al.
patent: 6687876 (2004-02-01), Schilit et al.
patent: 6869023 (2005-03-01), Hawes
patent: 6928548 (2005-08-01), Hale et al.
patent: 6938203 (2005-08-01), Dimarco et al.
patent: 7010751 (2006-03-01), Shneiderman
patent: 7024054 (2006-04-01), Cahill et al.
patent: 7062497 (2006-06-01), Hamburg et al.
patent: 7111230 (2006-09-01), Euchner et al.
patent: 7120299 (2006-10-01), Keskar et al.
patent: 7327883 (2008-02-01), Polonowski
patent: 7373291 (2008-05-01), Garst
patent: 2002/0032698 (2002-03-01), Cox
patent: 2002/0078088 (2002-06-01), Kuruoglu et al.
patent: 2002/0116379 (2002-08-01), Lee et al.
patent: 2003/0076537 (2003-04-01), Brown
patent: 2003/0123733 (2003-07-01), Keskar et al.
patent: 2003/0152293 (2003-08-01), Bresler et al.
patent: 2004/0003261 (2004-01-01), Hayashi
patent: 2004/0015697 (2004-01-01), de Queiroz
patent: 2004/0078757 (2004-04-01), Golovchinsky et al.
patent: 2004/0090439 (2004-05-01), Dillner
patent: 2004/0107348 (2004-06-01), Iwamura
patent: 2004/0189667 (2004-09-01), Beda et al.
patent: 2004/0205542 (2004-10-01), Bargeron et al.
patent: 2004/0205545 (2004-10-01), Bargeron et al.
patent: 2004/0252888 (2004-12-01), Bargeron et al.
patent: 2005/0138541 (2005-06-01), Euchner et al.
patent: 2005/0165747 (2005-07-01), Bargeron et al.
patent: 2005/0234955 (2005-10-01), Zeng et al.
patent: 2006/0045337 (2006-03-01), Shilman et al.
patent: 2006/0050969 (2006-03-01), Shilman et al.
R. Lienhart, A. Kuranov, and V. Pisarevsky. Empirical analysis of detection cascades of boosted classifiers for rapid object detection. In DAGM, 2003.
Taira, H. and M. Haruno (2001). Text Categorization Using Transductive Boosting. Proceedings of ECML-01, 12th European Conference on Machine Learning. Freiburg, DE, Springer Verlag, Heidelberg, DE: 454-465.
S. Marinai, et al., “Recognizing Freeform Digital Ink Annotations” Proceedings of the 6th International Workshop on Document Analysis Systems, 2004, vol. 2163, pp. 322-331.
G. Golovchinsky, et al., “Moving Markup: Repositioning Freeform Annotation” UIST 02. Proceedings of the 15th Annual ACM Symposium on user Interface Software and Technology, 2002, vol. conf. 15, pp. 21-29.
European Search Report dated Dec. 29, 2006, mailed for European Patent Application Serial No. 05 108 068.7, 2 Pages.
Murphey, et al. “Neural Learning Using AdaBoost” (2001) IEEE, 6 pages.
U.S. Appl. No. 11/165,070, David Bargeron.
U.S. Appl. No. 11/171,064, David Bargeron.
Vinajak R. Borkar, et al., Automatically extracting structure from free text addresses, 2000, 6 pages, In Bulletin of the IEEE Computer Society Technical committee on Data Engineering. IEEE.
Remco Bouckaert, Low level information extraction: A bayesian network based approach, 2002, 9 pages, In Proceedings of TextML 2002, Sydney, Australia.
Claire Cardie, et al., Proposal for an interactive environment for information extraction, 1998, 12 pages, Technical Report TR98-1702, 2.
Rich Caruana, et al., High precision information extraction, Aug. 2000, 7 pages. In KDD-2000 Workshop on Text Mining.
M. Collins, Discriminative training methods for hidden markov models : Theory and experiments with perception algorithms, Jul. 2002, p. 1-8, In Proceedings of Empirical Methods in Natural Language Processing (EMNLP02).
Corinna Cortes, et al., Support-vector networks. Machine Learning, 1995, 20(3): 273-297.
Y. Freund, et al., Large margin classification using the perceptron algorithm, Machine earning, 37(3):277-296.
Y. Freund, et al., Experiments with a new boosting algorithm, 1996, In International Conference on Machine Learning, pp. 148-156.
T. Kristjansson, et al., Interactive information extraction with constrained conditional random fields, 2004, In Proceedings of the 19th international conference on artificial intelligence, AAAI. pp. 412-418.
John Lafferty, et al., Conditional random fields: Probabilistic models for segmenting and labeling sequence data, 2001, In Proc. 18th International Conf. on Machine Learning, pp. 282-289. Morgan Kaufmann, San Francisco, CA.
M. Marcus, et al., The penn treebank: Annotating predicate argument structure, 1994, pp. 114-119.
Andrew McCallum, Efficiently inducing features of conditional random fields, 2003, 8 pages, In Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI03).
Andrew McCallum, et al., Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons, 2003, 4 pages, In Hearst/Ostendorf, Eds, HLT-NAACL, Ass'n for Computational Linguistics, Edmonton, Alberta, Canada.
Kamal Nigam, et al., Using maximum entropy for text classification, 1999, 7 pages, In Proceedings of the IJCAI'99 Workshop on Information Filtering.
David Pinto, et al., Table extraction using conditional random fields, 2003, 8 pages, In Proceedings of the ACM SIGIR'03, Jul. 28-Aug. 1, 2003, Toronto, Canada.
L.R. Rabiner, A tutorial on hidden markov models and selected applications in speech recognition, 1989, In Proceedings of the IEEE, vol. 77, pp. 257-286.
Fei Sha, et al., Shallow parsing with conditional random fields. In Hearst/Ostendorf, Eds, 2003, HLT-NAACL: Main Proceedings, pp. 213-220, Ass'n for Computational Linguistics, Edmonton, Alberta, Canada.
J. Stylos, et al., Citrine:providing intelligent copy-and-paste, 2005, In Proceedings of ACM Symposium on User Interface Software and Technology (UIST 2004), pp. 185-188.
B. Taskar, et al., Max-margin parsing, 2004, 8 pages, In Empirical Methods in Natural Language Processing (EMNLP04).
S. Mao, et al., Document structure analysis algorithms: A literature survey, Jan. 2003, vol. 5010, pp. 197-207, In Proc. SPIE Electronic Imaging.
M. Krishnamoorthy, et al., Syntactic seg

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Systems and methods for detecting text does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Systems and methods for detecting text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Systems and methods for detecting text will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-4075678

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.