Determining language for character sequence

Data processing: speech signal processing – linguistics – language – Linguistics – Natural language

Reexamination Certificate

Details

C704S008000

active

07139697

ABSTRACT:
A method for selecting the language for a character sequence fed into a data processing device, wherein decision trees are trained for different characters on the basis of lexicons of predetermined languages. The decision trees describe language probabilities on the basis of characters in the environments of the characters. The decision trees for at least some of the characters of the character sequence fed into the data processing device are traversed, thus obtaining a probability of at least one language for each character. The language for the character sequence is selected on the basis of the probabilities obtained.
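
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the trees are trained or how the per-character probabilities are combined, so the sketch assumes a fixed question order (next character, then previous character) in place of learned splits, and it assumes that summing the per-character probabilities is an acceptable way to pick the language. The names `PAD`, `Node`, `train`, `char_probs`, `select_language` and the toy `LEXICONS` are all hypothetical.

```python
from collections import Counter, defaultdict

PAD = "#"  # hypothetical marker for "no neighbour" at a word boundary


def contexts(word):
    """Yield (char, next_char, prev_char) for every character of a word."""
    padded = PAD + word.lower() + PAD
    for i in range(1, len(padded) - 1):
        yield padded[i], padded[i + 1], padded[i - 1]


class Node:
    """Tree node: language counts plus children keyed by a context character."""

    def __init__(self):
        self.counts = Counter()
        self.children = defaultdict(Node)


def train(lexicons):
    """Build one tree per character from a {language: [words]} mapping.

    Assumption: level 1 always asks about the next character and level 2
    about the previous one; a real trainer would choose the questions.
    """
    trees = defaultdict(Node)
    for lang, words in lexicons.items():
        for word in words:
            for ch, nxt, prv in contexts(word):
                root = trees[ch]
                root.counts[lang] += 1
                mid = root.children[nxt]
                mid.counts[lang] += 1
                mid.children[prv].counts[lang] += 1
    return trees


def char_probs(trees, ch, nxt, prv, languages):
    """Traverse the tree for one character; return language probabilities."""
    if ch not in trees:
        return None  # character never seen in any lexicon
    node = trees[ch]
    for key in (nxt, prv):
        if key not in node.children:
            break  # unseen context: stop at the deepest node reached
        node = node.children[key]
    total = sum(node.counts.values())
    return {lang: node.counts[lang] / total for lang in languages}


def select_language(trees, text, languages):
    """Sum the per-character probabilities and return the best language."""
    scores = dict.fromkeys(languages, 0.0)
    for ch, nxt, prv in contexts(text):
        probs = char_probs(trees, ch, nxt, prv, languages)
        if probs is not None:
            for lang, p in probs.items():
                scores[lang] += p
    return max(scores, key=scores.get)


# Toy lexicons, far too small for real use; they only exercise the code path.
LEXICONS = {
    "english": ["the", "which", "with", "this", "that"],
    "german": ["der", "sich", "nicht", "das", "und"],
}
LANGUAGES = list(LEXICONS)
trees = train(LEXICONS)

print(select_language(trees, "what", LANGUAGES))
print(select_language(trees, "durch", LANGUAGES))
```

Stopping at the deepest node reached is a design choice of this sketch: an unseen context falls back to the statistics of a shorter context, so the lookup still returns a probability.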

REFERENCES:
patent: 5062143 (1991-10-01), Schmitt
patent: 5548507 (1996-08-01), Martino et al.
patent: 5805832 (1998-09-01), Brown et al.
patent: 6016471 (2000-01-01), Kuhn et al.
patent: 6026410 (2000-02-01), Allen et al.
patent: 6912499 (2005-06-01), Sabourin et al.
patent: 1014276 (2000-06-01), None
patent: 2338369 (1999-12-01), None
“A Tree-Based Statistical Language Model For Natural Language Speech Recognition”, Bahl et al., IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 37, No. 7, 1989.
“Automatic Construction of Decision Trees From Data: A Multi-Disciplinary Survey”, Murthy, Data Mining and Knowledge Discovery, vol. 4, 1998.
“Text Classification And Segmentation Using Minimum Cross-Entropy”, Teahan, International Conference on Content-Based Multimedia Information Access, 2000.
“Incorporating POS Tagging Into Language Modeling”, Heeman et al., Proceedings of the 5th European Conference on Speech Communication and Technology, 1997.
“A Study Of N-Gram And Decision Tree Letter Language Modeling Methods”, Potamianos et al., Speech Communication, vol. 24(3), 1998.
“Two Decades Of Statistical Language Modeling: Where Do We Go From Here?”, Rosenfeld, Proceedings of the IEEE, 2000.

Profile ID: LFUS-PAI-O-3643771
