Data processing: speech signal processing – linguistics – language – Audio signal bandwidth compression or expansion
Reexamination Certificate
2000-07-12
2003-05-20
To, Doris H. (Department: 2655)
Data processing: speech signal processing, linguistics, language
Audio signal bandwidth compression or expansion
C704S275000, C704S270000, C704S270100, C704S504000
Reexamination Certificate
active
06567782
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to a portable information terminal for use as a portable telephone set, a PHS (Portable Handy-phone System) terminal, a portable game machine, or a PDA (Personal Digital Assistant), a method of processing audio data for compressing and expanding audio data used in the portable information terminal, a recording medium which stores a program for carrying out at least the method of processing audio data and data to be referred to by the program, and a program for carrying out the method of processing audio data.
2. Description of the Related Art
Generally, an audio coding process removes redundancy from audio data so as to be able to handle the audio data in a reduced amount of information. The audio coding process is an indispensable technology for digital audio data transmission, e.g., audio data transmission via portable telephone and the Internet, and digital audio data storage, e.g., audio data storage in CD-ROMs, DVDs (Digital Video Discs), and MDs (Mini Discs).
The audio coding process utilizes audio data properties and human auditory characteristics in order to code audio data efficiently.
The audio coding process is based on various techniques including nonuniform quantization, predictive coding, differential PCM, subband coding based on human auditory and visual characteristics, and transform coding.
According to the transform coding technique, for example, a time-domain signal is sliced with a window ranging from 5 to 50 ms, converted into a frequency domain by DCT (Discrete Cosine Transform) or DFT (Discrete Fourier Transform), classified into a corresponding group with a critical bandwidth, and coded with a minimum number required of quantization bits in view of masking.
Specifically, audio data is converted into a frequency domain by MDCT (Modified Discrete Cosine Transform), and also subjected to FFT (Fast Fourier Transform). A masking threshold value for each critical band is determined from the FFT signal, and the MDCT audio data is quantized and thereafter subjected to variable-length coding, i.e., entropy coding such as Huffman coding.
The encoded audio data is decoded by variable-length decoding and inverse quantization, and then inverse DCT (IDCT) to restore time-domain data, i.e., audio data.
In order to meet demands for smaller size and weight, efforts have been made to improve portable information terminals such as portable telephone sets, PHS terminals, portable game machines, or PDAs by employing piezoelectric speakers and small memories of reduced storage capacity for operating programs.
A program for achieving the audio coding process (hereinafter referred to as “audio coding program”) is made up of a vast number of steps. If the audio coding program is ported directly to a portable information terminal, then the following programs are liable to arise:
(1) Since it is necessary to provide a storage capacity large enough to store the audio coding program, there is a greater need for a memory for storing the audio coding program.
(2) If no sufficient storage capacity is available, then the audio coding program cannot be made resident. Therefore, each time there is a request for audio data output, the audio coding program must be stored in a working area and activated. However, the process leads to a reduction in the processing speed, and the timing for audio data output is liable to be retarded.
(3) Because the number of steps of the audio coding program is large, the audio coding program is required to be paged frequently. The frequent paging process is liable to increase the processing time and operational errors.
SUMMARY OF THE INVENTION
It is therefore an object of the present invention to provide a portable information terminal having a simplified audio coding program that is capable of quickly processing audio data.
Another object of the present invention is to provide a method of processing audio data which is suitable for the processing of audio data in a portable information terminal.
Still another object of the present invention is to provide a recording medium which stores a program for quickly processing audio data that is suitable for the processing of audio data in a portable information terminal, and data to be referred to by the program.
A portable information terminal according to the present invention has a memory for storing data, expansion processing means stored in the memory, for expanding compressed data which has not been subjected to variable-length coding, into audio data, sound output means for converting the audio data into sound and outputting the sound, and a plurality of data buffers accessible by at least the expansion processing means, for successively selectively reading and writing at least the audio data.
The expansion processing means expands the compressed data to restore the audio data. The compressed data has not been subjected to variable-length coding.
Generally, Huffman coding is used as the variable-length coding (entropy coding). If the variable-length coding employed, the number of steps required to decode the data is increased, resulting in an increase in the size of a program that is used to expand the compressed data.
According to the present invention, no decoding process is required because the expansion processing means expands the compressed data which has not been subjected to variable-length coding, and hence the number of steps carried out by the expansion processing means is greatly reduced. As a result, the compressed data can quickly be restored without having to increase the storage capacity of the memory.
Furthermore, since the plural data buffers are used, no data interference occurs during the processing, and the compressed data can be expanded quickly with high accuracy.
If the number of data based on a processable number that is preset to meet the performance of at least the sound output means is regarded as one sample, then the expansion processing means may comprise means for expanding the compressed data for each sample. Therefore, the compressed data can be expanded depending on the specifications of the portable information terminal, preventing sound from being outputted out of synchronism with displayed images.
The expansion processing means may comprise inverse quantization processing means for inversely quantizing the compressed data into inversely quantized data, and IDCT processing means for effecting an IDCT process on the inversely quantized data to produce the audio data, the data buffers including two inverse quantization buffers accessible for successively selectively reading and writing the inversely quantized data, and two audio data buffers accessible for successively selectively reading and writing the audio data.
The IDCT processing means may comprise means for effecting the IDCT process using a cosine table which has been generated in advance. With the cosine table used, the IDCT process is simplified, and the compressed data can be expanded more quickly.
The IDCT processing means may have low-pass filter processing means for logically effecting a low-pass filter on IDCT-processed data to reduce a high-frequency component contained therein.
The portable information terminal may further comprise communication means for sending data to and receiving data from an external device, the compressed data comprising data stored in the memory via the communication means.
The compressed data may comprise data which is produced when uncompressed data recorded in a recording medium accessible via the external device is compressed by the external device, or may comprise data which is recorded in a recording medium accessible via the external device and compressed when recorded in the recording medium.
The uncompressed data may be compressed by effecting a DCT process on the uncompressed data to produce DCT-processed data and quantizing the DCT-processed data. The DCT-processed data may be quantized by determining maximum values at respective frequencies of the DCT-processed data to generate a quantiz
Guss Paul A.
Nolan Daniel
Sony Computer Entertainment Inc.
To Doris H.
LandOfFree
Portable information terminal, method of processing audio... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Portable information terminal, method of processing audio..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Portable information terminal, method of processing audio... will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3087480