System and method for evaluating character sets of a message...

Image analysis – Pattern recognition – Context analysis or word recognition

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C382S218000, C704S008000, C707S793000

Reexamination Certificate

active

06539118

ABSTRACT:

FIELD OF THE INVENTION
The invention relates to the field of information processing, and more particularly to the matching of candidate character sets to the intended language of an electronic message containing a plurality of character sets.
BACKGROUND OF THE INVENTION
With the use of the Internet, email and related electronic services, communications software has been increasingly called upon to handle data in a variety of formats. While the barriers to simple communications have been removed from many hardware implementations, the problem of operating system or application software being unable to display text in different languages remains.
For instance, a person browsing the World Wide Web may wish to input a search string in their native language. Some Web pages or search engines will simply accept that string in the form in which it was input, but not process the spelling, syntax or character set in native form. The search engine then performs a search as though the search were in English, usually resulting in no hits. Other Web pages may allow a user to manually specify the desired language for browsing and searching. There is a need for more robust and more highly automated language handling for general searching, messaging and other communications purposes.
SUMMARY OF THE INVENTION
The invention overcoming these and other problems in the art relates to a system and method whereby electronic messages coded in a universal character set such as Unicode or others can be reliably and accurately transmitted using standard conventional encoding methods over the Internet, or other networks. The encoded documents may be in MIME Multipurpose Internet Mail Extensions).
An object of the invention is to provide an automatic and rigorous language evaluation facility by which the content of a message represented in a universal character set is tested against a bank of available language character sets, to determine which if any of those candidate languages can express the message.
Another object of the invention is to provide a system and method for evaluating character sets which identify languages which are capable of expressing the message from the language bank, to present to a user or otherwise.
Another object of the invention is to provide a system and method for evaluating character sets which assign a rating to languages which can express a given message, to determine which of those candidate languages offers the best fit to express the message.
Another object of the invention is to provide a system and method for evaluating a character set which permit searching and reading of text expressions in their native character sets, improving the quality of search results.
The system and method of the invention accomplishing these and other objects employs a character table bank against which the ability of a number of character sets, representing different languages, to encode a given character is tested. When a message of unknown origin is presented to the system, its characters are parsed and tested against the character table bank to separate the character sets (hence languages) to identify which of the pool of character sets can express each character.
A character set which contains a match for every character of the message is likely to be the native language of the original message. Tallies of matches to individual characters across all available character sets in the character table bank can also be made for the message as a whole. The invention has been implemented in and will be described in one regard with respect to the Lotus Notesrm environment, but it will be understood that the invention has universal application and can be used in any system that needs to receive and display information in multiple languages.


REFERENCES:
patent: 4289411 (1981-09-01), Cornelius et al.
patent: 4428694 (1984-01-01), Ragen
patent: 4456969 (1984-06-01), Herzik et al.
patent: 4777617 (1988-10-01), Frisch et al.
patent: 4873634 (1989-10-01), Frisch et al.
patent: 5009276 (1991-04-01), Raikes et al.
patent: 5165014 (1992-11-01), Vassar
patent: 5222200 (1993-06-01), Callister et al.
patent: 5377280 (1994-12-01), Nakayama
patent: 5392419 (1995-02-01), Walton
patent: 5418718 (1995-05-01), Lim et al.
patent: 5438650 (1995-08-01), Motoyama et al.
patent: 5500931 (1996-03-01), Sonnenschein
patent: 5506940 (1996-04-01), Bamford et al.
patent: 5526469 (1996-06-01), Brindle et al.
patent: 5548507 (1996-08-01), Martino et al.
patent: 5659770 (1997-08-01), Yamada
patent: 5706413 (1998-01-01), Takabayashi et al.
patent: 5717840 (1998-02-01), Pardo
patent: 5754748 (1998-05-01), Rivers et al.
patent: 5778213 (1998-07-01), Shakib et al.
patent: 5778361 (1998-07-01), Nanjo et al.
patent: 5778400 (1998-07-01), Tateno
patent: 5793381 (1998-08-01), Edberg et al.
patent: 5802539 (1998-09-01), Daniels et al.
patent: 5812818 (1998-09-01), Adler et al.
patent: 5819303 (1998-10-01), Calhoun
patent: 5828817 (1998-10-01), Landau
patent: 5844991 (1998-12-01), Hochberg et al.
patent: 5873111 (1999-02-01), Edberg
patent: 5946648 (1999-08-01), Halstead, Jr. et al.
patent: 6073147 (2000-06-01), Chan et al.
patent: 6098071 (2000-08-01), Aoyama et al.
patent: 6138086 (2000-10-01), Rose et al.
patent: 6141656 (2000-10-01), Ozbutun et al.
patent: 6157905 (2000-12-01), Powell
patent: 6240186 (2001-05-01), Hyde et al.
patent: 6321192 (2001-11-01), Houchin et al.
patent: 2001/0020243 (2001-09-01), Koppolu et al.
patent: 0 457 705 (1991-11-01), None
patent: 0 886 228 (1998-12-01), None
patent: 1 056 024 (2000-11-01), None
patent: WO 01/20500 (2001-03-01), None
U.S. Patent Application Ser. No. 09/384,088, Brendan P. Murray et al., filed Aug. 27, 1999.
U.S. Patent Application Ser. No. 09/384,089, David D. Taieb, filed Aug. 27, 1999.
U.S. Patent Application Ser. No. 09/384,371, Brendan P. Murray et al., filed Aug. 27, 1999.
U.S. Patent Application Ser. No. 09/384,443, Brendan P. Murray et al., filed Aug. 27, 1999.
U.S. Patent Application Ser. No. 09/384,538, David D. Taieb, filed Aug. 27, 1999.
U.S. Patent Application Ser. No. 09/384,541, David D. Taieb, filed Aug. 27, 1999.
U.S. Patent Application Ser. No. 09/384,542, David D. Taieb, filed Aug. 27, 1999.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

System and method for evaluating character sets of a message... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with System and method for evaluating character sets of a message..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for evaluating character sets of a message... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3021743

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.