Arrayed processing element redundancy architecture

Electronic digital logic circuitry – Reliability – Redundant

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C326S009000, C326S038000, C365S200000, C716S030000, C716S030000

Reexamination Certificate

active

06803782

ABSTRACT:

FIELD OF THE INVENTION
The present invention relates to semiconductor memory. In particular, the present invention relates to column redundancy architectures of computational random access memories.
BACKGROUND OF THE INVENTION
Redundancy circuits are essential to boosting production yield of semiconductor devices, especially high density memory devices, since defects are more likely to occur in the high density memory arrays than in the peripheral circuits. Various redundancy schemes have been developed to repair memories having faulty memory cells during testing. Such fault tolerant schemes can involve column redundancy for replacing the column having the defect, with a redundant column of the memory array.
Many redundancy schemes have been proposed in the art for increasing semiconductor yield, which are generally implemented as follows. Once the location of a defective memory cell or cells is identified during testing, the column it is part of is effectively removed from the memory array by ensuring that it can no longer be addressed. A spare column of memory cells physically located elsewhere on the chip is programmed to be accessed by the logical address that would have accessed the defective column. Address programming is typically done through the use of laser-blown fuses and anti-fuses, for example.
Another type of integrated circuit device that requires column redundancy to increase yield are computational random access memories (CRAM). CRAM is a memory device having arrayed parallel processors with hundreds to thousands of processors coupled to a memory array. CRAM is a processor in memory architecture that takes advantage of the large internal memory bandwidth available at the sense amplifiers. By pitch matching each bit serial processing element (PE) to one or more memory columns, CRAM can operate as a massively parallel single instruction stream, multiple data stream (SIMD) computer. CRAM architectures and arrayed parallel processor architectures are well known in the art.
An example of a prior art CRAM is shown in FIG.
1
. The CRAM
20
shown in
FIG. 1
includes two banks
22
and
24
, labeled “Bank
1
” and “Bank
2
” respectively, although a CRAM can contain any number of banks. Bank
22
includes a memory array
26
coupled to peripheral circuits such as row decoders
28
, processing elements (PE's)
30
, and column decoders
32
. Bank
24
is identically configured to bank
22
, and includes a memory array
34
coupled to peripheral circuits such as row decoders
36
, PE's
38
, and column decoders
40
. Memory arrays
26
and
34
can be of any type of memory, such as dynamic random access memory (DRAM) or static random access memory (SRAM), for example, with row decoders
28
and column decoders
32
selecting particular memory cells for read and write operations. Each PE
30
has direct access to a single column of memory for use as its local memory, and is coupled to a common broadcast bus
42
. As shown in
FIG. 1
, PE's
30
and
38
are all coupled to the same broadcast bus
42
, which can further extend to other banks of the chip. The PE's
30
are connected to the common broadcast bus
42
in a wired AND configuration, allowing the common broadcast bus
42
to function as a dynamic zero detect circuit. Furthermore, if at least one PE
30
writes a zero to the common broadcast bus
42
, all other PE's
30
receive the zero value for register write back.
An example of a prior art PE
30
or
38
used in CRAM
20
of
FIG. 1
is shown in
FIG. 2. A
pair of adjacent PE's
30
are shown in
FIG. 2
, illustrating the interconnections between each other and the broadcast bus
42
. The presently shown PE's
30
support bit-serial computation, left-right nearest-neighbor communication, wired-AND broadcast bus evaluation and external databus access. PE
30
includes a single-bit memory register
50
, a single-bit write enable register
52
, a single-bit shift left register
54
and a single-bit shift right register
56
, an arithmetic logic unit (ALU)
58
, and a transceiver
60
. Memory register
50
can include well known bitline sense amplifiers, for example. In addition to providing shifting functionality, registers
54
and
56
can be used as temporary storage locations for the results provided by the ALU
58
. Each of the four single-bit registers is implemented as a six transistor dual rail RAM cell in the present embodiments, but can be implemented with any equivalent register circuit. The registers can include additional gating circuits that receive control signals for controlling the input and output of data, which are not shown in
FIG. 4
to simplify the schematic, but understood to be required for proper operation by those of skill in the art. For example, shift left register
54
can receive a right shift control signal for storing data from the PE to the right. In this particular example, the registers store data as complementary logic levels, and complementary signal lines carry the data between the PE components. It should be obvious to those of skill in the art that single-ended logic levels and signal lines can be used in alternate embodiments.
Memory register
50
stores data received from, or to be written to, a memory cell of its associated memory column, and write enable register
52
includes combinational logic to determine whether a PE
30
should write its result to its local memory via memory register
50
. Shift left register
54
receives result data from the PE
30
to its right, while shift right register
56
receives result data from the PE
30
to its left. ALU
58
receives an 8-bit opcode, a single bit of data from memory register
50
, shift left register
54
and shift right register
56
, and provides a result from its output. ALU
58
consists of a multiplexer that can implement any boolean function of the three single bit inputs it receives. The result output of ALU
58
is provided to each register of PE
30
, and to transceiver
60
for communicating data between the PE
30
and the broadcast bus
42
. The bus transceiver
60
is implemented with static CMOS NOR gates that connect to NMOS pull down transistors. The memory can also be accessed externally through a conventional databus
62
.
Because memory array
26
of
FIG. 1
is no different than the memory array of commodity semiconductor memory devices, a column redundancy scheme to correct memory defects is necessary to maximize manufacturing yield. Unfortunately for CRAM devices, replacing the defective column of memory cells with a spare column of memory cells inherently requires replacement the PE
30
coupled to the defective column. This is significant because the communication lines between adjacent PE's must remain uninterrupted for proper operation. As can be seen in
FIG. 2
, the PE's adjacent to the PE to be removed (disabled) will need to communicate with each other in order to maintain proper operation of the CRAM.
It should be noted that manufacturing defects can occur in the PE
30
itself, eventhough its associated memory column has no defective cells. Correspondingly, a defective PE
30
requires replacement of the PE
30
and its associated memory column.
Several column redundancy schemes that can be applied to CRAM are known in the art. For example, address remapping circuits can be used to preserve the sequential address space of the memory columns, with defective columns being “bypassed” when addressed. These remapping circuits tend to have high latency that negatively impact the performance of the CRAM.
Another column redundancy scheme has nearest-neighbor interconnect that can be utilized for PE fault tolerance, but is not useful for situations in which arbitrary numbers of series-adjacent PE's are defective.
Redundancy can be provided with bypass switches that are used as conductors which are shorted or blown by a laser. Unfortunately, these bypass switches consume large circuit area and cannot be incorporated into the pitch limited PE area.
It is, therefore, desirable to provide a CRA

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Arrayed processing element redundancy architecture does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Arrayed processing element redundancy architecture, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Arrayed processing element redundancy architecture will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3272694

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.