Electrical computers and digital data processing systems: input/ – Intrasystem connection – Bus access regulation
Reexamination Certificate
2000-05-17
2002-01-29
Dharia, Rupal (Department: 2181)
Electrical computers and digital data processing systems: input/
Intrasystem connection
Bus access regulation
C710S120000, C708S209000, C712S002000
Reexamination Certificate
active
06343337
ABSTRACT:
BACKGROUND OF THE INVENTION
1. Technical Field
The present invention relates in general to consolidation of multimedia facilities and in particular to reusing existing circuitry for one multimedia instruction in place of comparable circuitry for other multimedia instructions. Still more particularly, the present invention relates to employing a crossbar within a vector permute unit for wide shifting functions required for other multimedia instructions.
2. Description of the Related Art
Multimedia applications are increasing, leading to an increased demand for multimedia facilities within processors. Processors, such as the PowerPC™ processor available from IBM Corporation of Armonk, N.Y., are increasingly incorporating such multimedia facilities. In the case of the PowerPC™, the multimedia facility is the vector multimedia extensions (VMX) facility.
Several of the instructions implemented by the VMX facility require a multiplexing function for at least one stage. For example, the traditional approach to implementing the vpack instruction, which compresses either 32 bits into 16 bits or 16 bits into 8 bits, would involve a multiplexer. An example is depicted in
FIG. 3. A
vpack instruction is received by decode logic
302
, which generates selects for multiplexer
304
based on whether the operand
306
is being compressed from 16 bits to 8 bits or from 32 bits to 16 bits. Multiplexer
304
selects possible alternatives for the top target byte
308
a
from the bytes of 32 bit operand
306
. Saturation multiplexers
310
a
and
310
b
, under the control of saturation detection logic
312
, select between source bytes from operand
306
or their saturated values
314
a
and
314
b
for target bytes
308
a
and
308
b
. Multiplexer
304
, in particular, requires a significant amount of area within the multimedia facility and may incur undesirable latency in instruction execution.
Other instructions supported by a typical multimedia facility within a processor architecture require other, specialized hardware. Shift operations, for example are traditionally performed utilizing a barrel shifter. However, this approach becomes more expensive as the shifts become wider. The ability to perform wide shifts (64 to 128 bits) is useful in performing multimedia operations, but is very expensive in hardware implementations. The area needed for a barrel shifter performing wide shifts, and the latency incurred by such a shifter, may become unacceptable.
It would be desirable, therefore, to utilize existing hardware within the multimedia facilities of a processor to performing comparable multiplexing and shifting functions for other instructions. It would further be advantageous if the resulting mechanism reduced latencies for the instructions.
SUMMARY OF THE INVENTION
It is therefore one object of the present invention to provide a method and apparatus for consolidation of multimedia facilities.
It is another object of the present invention to provide a method and apparatus for reusing existing circuitry for one multimedia instruction in place of comparable circuitry for other multimedia instructions.
It is yet another object of the present invention to provide a method and apparatus for employing a crossbar within a vector permute unit for wide shifting functions required for other multimedia instructions.
The foregoing objects are achieved as is now described. A crossbar is implemented within multimedia facilities of a processor to perform vector permute operations, in which the bytes of a source operand are reordered in the target output. The crossbar is then reused for other instructions requiring multiplexing or shifting operations, particularly those in which the size of additional multiplexers or the size and delay of a barrel shifter is significant. A wide shift operation, for example, may be performed with one cycle latency by the crossbar and one additional layer of multiplexers or a small barrel shifter. The crossbar facility thus gets reused with improved performance of the instructions now sharing the crossbar and a reduction in the total area required by a multimedia facility within a processor.
The above as well as additional objects, features, and advantages of the present invention will become apparent in the following detailed written description.
REFERENCES:
patent: 4740993 (1988-04-01), Skubnik
patent: 5155698 (1992-10-01), Niimi
patent: 5262971 (1993-11-01), Yamaguchi
patent: 5471628 (1995-11-01), Phillips et al.
patent: 5557734 (1996-09-01), Wilson
patent: 5726926 (1998-03-01), Makino
patent: 5948050 (1999-09-01), Diamondstein et al.
patent: 5996057 (1999-11-01), Scales, III et al.
patent: 6119224 (2000-09-01), Roth
patent: 6178500 (2001-01-01), Roth
Diefendorf Keith Everett
Dubey Pradeep Kumar
Hochsprung Ronald Ray
Olsson Brett
Roth Charles Philip
Bracewell & Patterson L.L.P.
Dharia Rupal
England Anthony V. S.
LandOfFree
Wide shifting in the vector permute unit does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Wide shifting in the vector permute unit, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Wide shifting in the vector permute unit will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-2839113