Data processing: measuring – calibrating – or testing – Measurement system in a specific environment – Biological or biochemical
Reexamination Certificate
1998-10-28
2003-07-15
Borin, Michael (Department: 1631)
Data processing: measuring, calibrating, or testing
Measurement system in a specific environment
Biological or biochemical
C382S133000, C382S134000, C435S235100, C436S800000
Reexamination Certificate
active
06594586
ABSTRACT:
FIELD OF THE INVENTION
The invention relates to techniques of identifying and classifying objects. More specifically, the present system describes using contextual information to improve statistical accuracy.
BACKGROUND
A common assumption made in the field of machine learning is that the examples are drawn independently from some joint input-output distribution.
Classical statistics begins with an assumption of a population distribution. Tiat population distribution describes how some attribute varies over a set of entities. The goal is to obtain information about the population distribution by sampling or by some other real world technique. It is often impractical to evaluate the parameters exactly. Therefore, one attempts, using the techniques of statistics, to obtain as much information as one can based on experimental evidence. Then all of the parameters can be viewed a coordinates of a single vector often called x . The value of x is fixed, but there is uncertainty about its value.
The population often has a distribution. For example, if the population has a Gaussian distribution, the parameters may have a mean value &mgr; and a standard deviation &Sgr;.
The Bayesian approach models the population by considering elements of the population to be randomly selected from a class of populations. Each class of the population is characterized by parameters that are under investigation. Bayesian analysis considers two parameters x and y. Bayesian analysis is largely an effort at determining the probability for the occurrence of the event y=&bgr; conditioned by the fact that x=&agr; has occurred. During N trials the event x=&agr; occurs n
x
(&agr;) times. The joint effect x=&agr; and y=&bgr; occurs n
xy
(&agr;, &bgr;) times.
The usual Bayesian approach assumes the distribution for x by looking at the trials for x. Hence, Bayesian analysis considers the trials in which x=&agr;. Once the random sampling processes are obtained, the notion of the random sampling process is extended by assuming that the class x
i
: 1 is less than or equal to I; is conditionally independent. Each of the x
i
have the same conditional distribution. This is because each of the x
i
are defined through the same random parameter x. While the events x and y are independent, both have the same conditional probabilities. Hence, the conditional probability that y=&bgr; given that x=&agr; is independent of &agr;. This means that
P
x
|
y
⁢
(
α
|
β
)
=
P
x
|
y
⁢
(
β
|
α
)
⁢
P
x
⁢
(
α
)
P
y
⁢
(
β
)
where the left part is the conditional probability that y=&bgr; given that x=&agr;, the right part is the conditional probability that x=&agr; given that y=&bgr;, p
x
(&agr;) is the probability of &agr; and p
x
(&bgr;) is the probability of &bgr;.
Bayesian analysis is often used in estimator design. For instance, the conditional probability of a received message may be known and the conditional probability of the transmitted message is to be obtained. Bayesian analysis requires that the examples are drawn independently from the joint input/output distribution, as described above.
There are cases, however, where this no context assumption is not valid. One application where the independence assumption does not hold is the identification of white blood cell images. Abnormal cells are much more likely to appear in bunches than in isolation. Specifically, in a sample of several hundred cells, it is more likely to find either no abnormal cells or many abnormal cells than it is to find just a few. More generally, certain random processes are subject to “aliasing” whereby they occur in clumps.
SUMMARY OF THE INVENTION
The present specification describes pattern classification in situations where an independence assumption is not necessarily correct. In such cases, the probability of identity of an object may be dependent on the identities of the accompanying objects. Those accompanying objects which provides the contextual information.
The techniques take into consideration the joint distribution of all the classes, and uses the joint distribution to adjust the object-by-object classification.
This method can be used in a number of settings. For example, the classification of white blood cells is improved by the use of contextual information.
REFERENCES:
patent: 4538299 (1985-08-01), DeForest
patent: 5123055 (1992-06-01), Kasdan
patent: 5343538 (1994-08-01), Kasdan
patent: 5436978 (1995-07-01), Kasdan
patent: 5733721 (1998-03-01), Hemstreet, III et al.
Database GenBank accesion No. Q74185, Donnelly et al. May 31, 1995, See sequence alignment.*
Database GenEmbl accesion No. AB008889, Otsuka et al., Feb. 13, 1999. See sequence alignment.
Abu-Mostafa Yaser
Kasdan Harvey
Sill Joseph
Song Xubo
Borin Michael
California Institute of Technology
LandOfFree
Incorporation of contextual information in object... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Incorporation of contextual information in object..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Incorporation of contextual information in object... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3054261