Method and apparatus for classifying text

Boots – shoes – and leggings

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

G06F 1538

Patent

active

051827085

ABSTRACT:
The present invention provides a method and apparatus for classifying text by using two constants determined by analyzing the text. The first constant, G, classifies text in the order of constraint. It is defined by the equation G=log (N/L)/ {log(N)-1}, where N is the number of words and L is the number of different words in the text being classified. The second constant, R, is the correlation coefficient between the word length and the logarithm scaled rank order of word frequency. The values of the two constants can be used to determine how to classify text. In the case of English text, the text may be classified as computer language, text from a technical manual, English text written by foreigners or English text written by native English speakers.

REFERENCES:
patent: 4580241 (1985-04-01), McKucera
patent: 4888730 (1989-12-01), McRae et al.
Booth, A. D.; A Law of Occurrences for Words of Low Frequency Information and Control, 10(4); pp. 386-393 (1967).
Kennedy, J.; Neville, A,; Basic Statistical Method for Engineers and Scientists pp. 407-416, Harper and Row Publishers, Inc. (1986).
Mitzutani, S.; Lecture on Japanese, Asakura, Tokyo, 1983.
Tankard, J.; The Literary Detective Byte, Feb. 1986, pp. 231-238.
Zipf, G. K.; The Psycho-biology of Language The MITPress (1965), Originally Printed by Houghton Mifflin Co. 1935, pp. 20-48.
Nicolis, J.; Dynamics of Hierachical Systems Springer-Verlag, Berlin 1956, pp. 344-359.
Hormann, H.; Psycholinguistics 2nd Ed. Rev., Springer-Verlag, pp. 88-92, 1979.
Mandelbrot, B. B.; Fractal Geometry of Nature, W. H. Freeman and Co., pp. 344-348, New York 1982.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for classifying text does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for classifying text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for classifying text will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1416642

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.