Modular and scalable system for signal and multimedia...

Electrical computers and digital data processing systems: input/ – Intrasystem connection – Bus access regulation

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C710S120000

Reexamination Certificate

active

06275891

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates to the field of processing system architectures suitable for signal and multimedia processing. More specifically, this invention relates to system having a shared memory architecture which simultaneously provides high memory access bandwidth to multiple processing devices.
2. Description of the Related Art
Digital multimedia systems require a substantial digital signal processing capability. This requirement is shared by many other digital systems including image rendering systems, artificial vision systems, digital communication systems, and speech recognition systems. The typical architecture for such systems is shown in FIG.
1
.
FIG. 1
shows a microcontroller bus
102
which couples a microcontroller
104
to a microcontroller memory
106
. A digital signal processor (DSP)
108
is similarly coupled to a DSP memory
110
by a DSP bus
112
. The two busses are coupled by a bus bridge
114
.
This architecture is popular since the microcontroller
104
can assume the responsibility for system-level functions (such as controlling a user interface, initiating and terminating operation of various system modules, and coordinating data transfers), and the DSP
108
can assume the responsibility for computationally-intensive tasks (such as various coding and compression algorithms, filtering operations, and data transforms). This division of labor provides eases system design and programming.
However, this architecture is inadequate for future generations of digital multimedia systems. The processing requirements are being increased as designers take advantage of compression algorithms and higher bandwidths to transmit more information. To keep pace, this architecture requires a more powerful DSP.
A more powerful DSP can be created in two ways. The clock speed can be increased, but this requires careful optimization and redesign of the DSP for every incremental improvement in semiconductor processing technology. Alternatively, the DSP can be provided with wider data paths, e.g. an 8-bit DSP could be replaced with a 32-bit DSP. However, the increases in the required area and power consumption are quadratic (i.e. to double the data path width, the area and power requirements increase by approximately a factor of four). This alternative is undesirable since power consumption is a perennial design constraint, particularly in view of the increasing popularity of portable devices.
Furthermore, larger data path widths are likely to be a poor “fit” for the data granularity, leading to inefficient use of the more powerful DSPs. For example, MPEG video compression operates on 8-bit blocks of video data. Even if multiple blocks were retrieved at a time, the DSP could only perform (at most) one 8-bit block operation per clock cycle. The rest of the data path width is unused for these operations.
To address these problems, this architecture may be modified by the addition of a dedicated hardware accelerator that is custom-designed to efficiently and quickly carry out specific algorithms. The hardware accelerator may be coupled to the DSP
108
and the DSP memory
110
via the DSP bus
112
. The DSP
108
then performs the less demanding computationally-intensive tasks of pre-processing and post-processing the data, and allows the hardware accelerator to perform the processing steps that the DSP
108
is too inefficient to perform.
If the hardware accelerator includes its own memory buffer, then direct memory transfers may be used to move the data across bus
112
. This represents undesirable power consumption, but the alternative is to require that the DSP bus
112
provide a high bandwidth connection between the hardware accelerator and DSP memory
110
. This alternative presents a memory contention issue that practically requires the DSP
108
to halt whenever the hardware accelerator is operating.
Various work-arounds may be designed, such as additional memories or additional busses, but these may be expected to result in complex, custom-designed interfaces between the hardware accelerator and the DSP, and many limitations or liabilities of the DSP
108
(such as insufficient address space) may be difficult for the hardware accelerator to overcome.
Accordingly, it is desirable to have an architecture that provides modularity and support for high-bandwidth memory connections for each processor element in the system. Such an architecture would preferably provide a shared memory to minimize power consumption, and yet be scalable so as to support multiple processor elements.
SUMMARY OF THE INVENTION
The problems outlined above are addressed by a modular, scalable system architecture that includes a traffic master for providing high-bandwidth, shared memory connections to two or more processor units. In one embodiment, the system architecture includes an array of memory modules coupled to an array of processor units by a traffic master. Each of the memory modules is connected to the traffic master by a data channel, and each data channel includes an address path and one or more data paths. The data channels all share a common data path bit-width. On the other hand, the processor units are each coupled to the traffic master by a data bus that has address and data path widths which are dictated by the design of the processor unit. Although the address path width of a given processor unit may be unable to span the address space of the shared memory, the processor unit can nonetheless access any memory location through the use of page pointers. Further, although the data path width of a given processor unit may be too large for a single data channel to support, several data channels may be combined to provide the required data path width. The traffic master includes a processor interface port for each processor, and a router. The processor interface ports convert address and write data signals from a data bus into corresponding address and write data signals for one or more data channels, and also convert read data signals from the data channels into corresponding read data signals for the data bus. The router routes the data channel signals between the processor interface ports and the memory modules. When applied to multimedia systems with a microcontroller, one or more digital signal processors, and one or more hardware accelerators, this architecture is expected to provide a substantial increase in processing capabilities which can be maintained under power consumption constraints and which will be enhanced as semiconductor technologies continue to improve.


REFERENCES:
patent: 5249301 (1993-09-01), Keryvel et al.
patent: 5392446 (1995-02-01), Tower et al.
patent: 5408677 (1995-04-01), Nogi
patent: 5535406 (1996-07-01), Kolchinsky
patent: 5535408 (1996-07-01), Hillis
patent: 5701507 (1997-12-01), Bonneau, Jr. et al.
patent: 5842034 (1998-11-01), Bolstad et al.
patent: 6101565 (2000-08-01), Nishtala et al.
patent: 6116768 (2000-09-01), Guttag et al.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Modular and scalable system for signal and multimedia... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Modular and scalable system for signal and multimedia..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Modular and scalable system for signal and multimedia... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2473628

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.