Data processing: speech signal processing – linguistics – language – Linguistics – Multilingual or national language support
Reexamination Certificate
1998-12-15
2002-05-14
Edouard, Patrick N. (Department: 2747)
Data processing: speech signal processing, linguistics, language
Linguistics
Multilingual or national language support
C707S793000, C707S793000
Reexamination Certificate
active
06389386
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Technical Field
The present invention relates in general to sorting text strings and in particular to sorting text strings in a culturally correct order where the text string language does not provide pronunciation information and/or data processing system character codes are unsorted for the text string language. Still more particularly, the present invention relates to sorting text strings based on either pronunciation or user-defined sort orders artificially promoting selected text strings while displaying pronunciation- or sort order-independent representations of the text strings.
2. Description of the Related Art
Multinational companies often run information system (IS) networks which span multiple locales spread around the globe. To maximize the usefulness of such networks, operations within each country tend to run in the local language of the country. Where possible, names of abstract objects in user applications are in the local language and match the local language organization, city, or human names which the abstract objects represent. In the case of system management software, often abstract objects would represent each of a global enterprise's local offices.
For offices located in Japan, objects would be named in Japanese. However, Japanese is a logosyllabic or ideographic language which does not have an alphabet representing simple sounds, but instead has a very large character set with symbols (“ideographs”) corresponding to concepts and objects rather than simple sounds. For instance, the Joyo Kanji List (Kanji for Daily Use) adopted for the Japanese language in 1981 includes 1945 symbols.
In Japanese, the same ideograph may have multiple meanings and/or pronunciations. For instance, the symbol depicted in
FIG. 4A
may be pronounced either “hayashi” or “rin” (or “lin”); and the characters depicted in
FIG. 4B
may be pronounced “suga no,” “suga ya,” “kan no,” or “kan ya.” This circumstance is based in part on the history of the Japanese language, in which the Kanji characters were adopted from the Chinese language. Thus, for example, the “rin” symbol depicted in
FIG. 4A
is On-Yomi, basically a simulation of the Chinese pronunciation when the character was imported to Japan, while “hayashill” is Kun-Yomi, a Japanese word assigned to the character which has the same meaning.
A culturally correct sort order for names represented by identical ideographs might be based on pronunciation. However, ideographs or data processing system codes (e.g., Unicode codes) representing ideographs generally provide no information as to the correct pronunciation. Sorting a plurality of abstract objects by name, therefore, may result in an incorrect sort order being displayed.
An additional and independent complication arises in sorting abstract objects by name from the fact that the same Kanji character set is largely shared by the Chinese, Japanese and other Asian languages. The UNICODE Han unification has reordered the Kanji characters for Chinese, unsorting them for Japanese, as well as for Vietnamese and Korean.
Yet another independent sorting problem is that frequently a need arises to alter the collation sequence of a list or to create a second-order list. For instance, it may be desirable to sort a list of users alphabetically within each hierarchical level of an enterprise. That is, the user list is to be sorted with the enterprise's principal officer first, followed by other officers sorted alphabetically and then by staff members sorted alphabetically. Currently names are “floated” to the top of a sorted list by inserting an extraneous character with a low sort value at the front of the string (e.g., “_Frank Moss”). However, the extraneous characters will normally be displayed in the sorted list.
It would be desirable, therefore, to provide a mechanism for sorting Japanese text strings in a culturally correct order. It would further be advantageous if the mechanism could be employed to sort Kanji character strings in a culturally correct order despite the sorting order employed by character sets. It would also be advantageous to sort text strings in a user-defined sort order with selected strings artificially promoted without displaying the mechanism by which the selected text strings are promoted.
SUMMARY OF THE INVENTION
It is therefore one object of the present invention to provide an improved method, system and computer program product for sorting text strings.
It is another object of the present invention to provide a method, system and computer program product for sorting text strings in a culturally correct order where the text string language does not provide pronunciation information and/or data processing system character codes are unsorted for the text string language.
It is yet another object of the present invention to provide a method, system and computer program product for sorting text strings based on either pronunciation or user-defined sort orders artificially promoting selected text strings while displaying representations of the text strings which are independent of pronunciation or artificial sort order mechanisms.
The foregoing objects are achieved as is now described. A multi-field text string contains display characters in a first field and sort characters in a second field. For ideographic languages such as Japanese, the display characters may be Kanji symbols for the text string while the sort characters are phonetic syllabary representations of the Kanji symbols. A plurality of such multi-field text strings may then be sorted by the contents of the second field rather than the contents of the first. Despite both the multiple pronunciations or meanings associated with the same Kanji symbols in Japanese and the unsorted ordering of Kanji symbols within the character set for Japanese, a culturally correct sort order is achieved for the multi-field text strings. Additionally, the contents of the second field may be altered to artificially promote a specific item within the sort order, while displaying the sorted text strings utilizing the contents of the first field. The mechanism for promoting particular text strings within the sort order does not interfere with user viewing of the displayed text strings.
The above as well as additional objects, features, and advantages of the present invention will become apparent in the following detailed written description.
REFERENCES:
patent: 4379288 (1983-04-01), Leung et al.
patent: 4384329 (1983-05-01), Rosenbaum et al.
patent: 4544276 (1985-10-01), Horodeck
patent: 4611280 (1986-09-01), Linderman
patent: 4641264 (1987-02-01), Nitta et al.
patent: 4706212 (1987-11-01), Toma
patent: 4730270 (1988-03-01), Okajima et al.
patent: 4737040 (1988-04-01), Moon
patent: 4951202 (1990-08-01), Yan
patent: 4954984 (1990-09-01), Kaijima et al.
patent: 4962452 (1990-10-01), Nogami et al.
patent: 5040218 (1991-08-01), Vitale et al.
patent: 5056021 (1991-10-01), Ausborn
patent: 5091878 (1992-02-01), Nagasawa et al.
patent: 5109352 (1992-04-01), O'Dell
patent: 5136503 (1992-08-01), Takagi et al.
patent: 5164900 (1992-11-01), Bernath
patent: 5175803 (1992-12-01), Yeh
patent: 5214583 (1993-05-01), Miike et al.
patent: 5243519 (1993-09-01), Andrews et al.
patent: 5251130 (1993-10-01), Andrews et al.
patent: 5268990 (1993-12-01), Cohen et al.
patent: 5307267 (1994-04-01), Yang
patent: 5371844 (1994-12-01), Andrew et al.
patent: 5377317 (1994-12-01), Bates et al.
patent: 5384700 (1995-01-01), Lim et al.
patent: 5390295 (1995-02-01), Bates et al.
patent: 5416903 (1995-05-01), Malcolm
patent: 5418718 (1995-05-01), Lim et al.
patent: 5420976 (1995-05-01), Schell et al.
patent: 5426583 (1995-06-01), Uribe-Echebarria Diaz De Mendibil
patent: 5432948 (1995-07-01), Davis et al.
patent: 5434777 (1995-07-01), Luciw
patent: 5440482 (1995-08-01), Davis
patent: 5448474 (1995-09-01), Zamora
patent: 5485373 (1996-01-01), Davis et al.
patent: 5490061 (1996-02-01), Tolin et al.
patent: 5523946 (1996-06-01), Kaplan et al.
patent: 5546575 (1996-08-01), Potter et al.
patent: 5550965 (1996-08-01)
Hetherington David James
Kumhyr David Bruce
Bracewell & Patterson L.L.P.
Edouard Patrick N.
LaBaw Jeffrey S.
LandOfFree
Method, system and computer program product for sorting text... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Method, system and computer program product for sorting text..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method, system and computer program product for sorting text... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2865562