Data processing: speech signal processing – linguistics – language – Speech signal processing – Recognition
Reexamination Certificate
1998-12-11
2001-01-16
Hudspeth, David R. (Department: 2741)
Data processing: speech signal processing, linguistics, language
Speech signal processing
Recognition
C704S202000, C704S259000, C704S211000
Reexamination Certificate
active
06175818
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates to signal processing arrangements, and more particularly to such arrangements which are adapted for use with varying, e.g. time varying, band-limited input signals, such as speech.
2. Description of the Related Art
For a number of years the Time Encoding of speech and other time varying band-limited signals has been known, as a means for the economical coding of time varying signals into a plurality of Time Encoded Speech or Signal (TES) descriptors or symbols to afford a TES symbol stream, and for forming such a symbol stream into fixed dimensional, fixed size data matrices, where the dimensionality and size of the matrix is fixed, a priori, by design, irrespective of the duration of the input speech or other event to be recognised. See, for example:
1. U.K. Patent No. 2145864 and corresponding European Patent No. 0141497.
2. Article by J. Holbeche, R. D. Hughes, and R. A. King, “Time Encoded Speech (TES) descriptors as a symbol feature set for voice recognition systems”, published in IEE Int. Conf. Speech Input/Output; Techniques and Applications, pages 310-315, London, March 1986.
3. Article by Martin George “A New Approach to Speaker Verification”, published in “VOICE+”, October 1995, Vol. 2, No. 8.
4. U.K. Patent No. 2268609 and corresponding International Application No. PCT/GB92/00285 (W092/00285).
5. Article by Martin George “Time for TESPAR” published in “CONDITION MONITOR”, September 1995, No. 105.
6. Article by R. A. King “TESPAR/FANN An Effective New Capability for Voice Verification In The Defence Environment” published by the Royal Aeronautical Society, 4 Hamilton Place, London W1V OBQ, “The Role of Intelligent Systems in Defence”, 27-28 March 1995.
7. Article by M. H. George and R. A. King “A Robust Speaker Verification Biometric”. Proceedings IEE 29th Annual 1995 International Carnahan
Conference on Security Technology. Pages 41-46, 18-20 October 1995.
The Time Encoding of speech and other signals described in the above references have, for convenience, been referred to as TESPAR coding, where TESPAR stands for Time Encoded Signal Processing and Recognition.
It should be appreciated that references in this document to Time Encoded Speech, or Time Encoded signals, or TES, are intended to indicate solely, the concepts and processes of Time Encoding, set out in the aforesaid references and not to any other processes.
In U.K. Patent No. 2145864 and in some of the other references already referred to, it is described in detail how a speech waveform, which may typically be an individual word or a group of words, may be coded using time encoded speech (TES) coding, in the form of a stream of TES symbols, and also how the symbol stream may be coded in the form of, for example, an “A” matrix, which is of fixed size regardless of the length of the speech waveform.
As has already been mentioned and as is described in others of the references referred to, it has been appreciated that the principle of TES coding is applicable to any time varying band-limited signal ranging from seismic signals with frequencies and bandwidths of fractions of a Hertz, to radio frequency signals in the gigaHertz region and beyond. One particularly important application is in the evaluation of acoustic and vibrational emissions from rotating machinery.
In the references referred to it has been shown that time varying input signals may be represented in TESPAR matrix form where the matrix may typically be one dimensional or two dimensional. For the purposes of this disclosure two dimensional or “A” matrices will be used but the processes are identical with “N” dimensional matrices where “N” may be any number greater than 1, and typically between 1 and 3. It has also been shown how numbers of “A” matrices purporting to represent a particular word, or person, or condition, may be grouped together simply to form archetypes, that is to say archetype matrices, such that those events which are consistent in the set are enhanced and those which are inconsistent and variable, are reduced in significance. It is then possible to compare an “A” matrix derived from an input signal being investigated with the archetype matrices in order to provide an indication of the identification or verification of the input signal. In this respect see U.K. Patent No. 2268609 (Reference 4) in which the comparison of the input matrix with the archetype matrices is carried out using fast artificial neural networks (FANNS).
It will be appreciated, as is explained in the prior art, for time varying waveforms especially, this process is several orders of magnitude simpler and more effective than similar processes deployed utilising conventional procedures and frequency domain data sets.
It has now been appreciated that the performance of TESPAR and TESPAR/FANN recognition, classification, verification and discrimination systems can, nevertheless, be further significantly improved.
The invention to be disclosed will use as its example architecture TESPAR/FANN data sets and networks, but it will be appreciated by those skilled in the art that the invention may equally be applied to data sets other than TESPAR.
BRIEF SUMMARY OF THE INVENTION
According to the present invention there is provided a signal processing arrangement for a varying band-limited input signal, comprising a plurality N of signal comparators, each signal comparator being adapted to compare said input signal with a plurality of different exemplar signals and for affording an output indicative of which of said exemplar signals corresponds most closely to said input signal, characterised in that each of the exemplar signals of said signal comparators is arbitrarily derived indepently of any expected input signal, and by means for deriving an N-part output signal which is indicative of said input signal, each part of said N-part output signal being derived from the output signal of a respective one of said N signal comparators.
In a preferred arrangement in accordance with the present invention each of said signal comparators is based on TESPAR coding.
In carrying out the invention each of said signal comparators comprises coding means operable on said input signal for affording a time encoded signal symbol stream, means operable on said symbol stream for deriving matrix dependent signals corresponding to a fixed size matrix formable from said symbol stream, and artificial neural network processing means responsive to said matrix dependent signals for affording an output indicative of said input signal.
In an especially preferred arrangement for speech input signals it will be arranged that means is provided for storing said output signal on a magnetic stripe card, a smart card or on a plastic card e.g. using a bar code.
In carrying out the invention it may be arranged that said especially preferred arrangement comprises means for detecting an input speech signal, means for applying said detected input speech signal to said plurality of signal comparators to derive said N-part output signal, card reader means for detecting an output signal stored on said card, and means for comparing the N-part output signal derived from said input speech signal with the output signal derived from said card to afford an output indicative of whether they correspond or not.
Advantageously, it may be arranged that each of said signal comparators is effective for comparing said input signal with eight exemplar signals whereby said output signal comprises N, three bit words.
REFERENCES:
patent: 5528728 (1996-06-01), Matsuura et al.
patent: 5621848 (1997-04-01), Wang
patent: 5687286 (1997-11-01), Bar-Yam
patent: 5689616 (1997-11-01), Li
patent: 5751904 (1998-05-01), Inazumi
patent: 5787393 (1998-07-01), Inazumi
patent: 5787394 (1998-07-01), Bahl et al.
patent: 5794190 (1998-08-01), Linggard et al.
patent: 5842161 (1998-11-01), Cohrs et al.
patent: 5903863 (1999-05-01), Wang
Joubert et al.; Speech Recognition Using TES Symbols; Communications and Signal Processing; pp. 116-119, 1991.
Timms et al.; Speake
Azad Abul K.
Domain Dynamics Limited
Hudspeth David R.
Jacobson Price Holman & Stern PLLC
LandOfFree
Signal verification using signal processing arrangement for... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Signal verification using signal processing arrangement for..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Signal verification using signal processing arrangement for... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2491401