Electrical computers: arithmetic processing and calculating – Electrical digital calculating computer – Particular function performed
Reexamination Certificate
2000-10-13
2004-06-22
Mai, Tan V. (Department: 2124)
Electrical computers: arithmetic processing and calculating
Electrical digital calculating computer
Particular function performed
C326S039000
Reexamination Certificate
active
06754686
ABSTRACT:
BACKGROUND
This invention relates to programmable integrated circuit devices. More specifically, the present invention relates to field programmable gate arrays (FPGAs).
An FPGA is a type of programmable logic device (PLD) that can be configured to perform various logic functions. An FPGA includes an array of configurable logic blocks (CLBs) connectable via programmable interconnect structures. For example, a first FPGA, invented by Freeman, is described in U.S. Pat. No. RE 34,363. CLBs and interconnect structures in FPGAs are shown in U.S. Pat. No. 5,889,411 issued to Chaudhary et al. and pages 4-32 through 4-37 of the Xilinx 1996 Data Book entitled “The Programmable Logic Data Book” available from Xilinx, Inc., 2100 Logic Drive, San Jose, Calif. 95124. The Freeman reference, the Chaudhary reference, and the Data Book are incorporated herein by reference.
In addition to the structures discussed above, FPGAs also include structures for performing special functions. In particular, FPGAs include carry circuits and lines for connecting the carry output of one bit generated in one CLB to the carry input of another CLB, and cascade lines for allowing wide functions to be generated by combining several adjacent CLBs. Carry structures are discussed by Hsieh et al. in U.S. Pat. No. 5,267,187 and by New in U.S. Pat. No. 5,349,250.
Cascade structures are discussed by Goetting et al in U.S. Pat. No. 5,365,125 and by Chiang et al. in U.S. Pat. No. 5,357,153. These patents are also incorporated herein by reference.
As discussed by the above-incorporated references, each CLB may include one or more slices (“slice” or “CLB slice”). Each slice, in turn, includes at least one configurable function generator. The configurable function generator is typically implemented as a four-input lookup table (LUT). The incorporated references also point out that the carry circuits and cascade structures increase the speed at which the FPGA can perform certain functions, such as arithmetic functions.
FIG. 1A
is a simplified block diagram of a conventional CLB
100
. The illustrated CLB
100
includes a first slice
110
and a second slice
120
. First slice
110
includes a first function generator G
112
, a second function generator F
114
, a third function generator
116
, and an output control block
118
. Output control block
118
may include multiplexers, flip-flops, or both. Four independent input terminals are provided to each of the G and F function generators
112
and
114
. A single input terminal C
1
-in is provided to third function generator C
1
116
. Each of function generators
112
and
114
is typically implemented as a four-input LUT, and is capable of implementing any arbitrarily defined Boolean function of the inputs signals. Each of the input terminals may be assigned a number or a letter and referred to as a “literal.” For example, in CLB
100
, function generator
112
receives four input signals, or literals, G
1
, G
2
, G
3
, and G
4
. Function generator
116
, typically implemented as a set of configurable multiplexers, is often used to handle carry bits, but can implement some Boolean functions of its three input signals C
1
-in, G′, and F′. These Boolean functions include bypass, inverter, 2-input AND (product), and 2-input OR (sum). Signals G′, F′, and C
1
-out are multiplexed through output control block
118
. Multiplexer
118
provides output signal lines Y, QY, X, and QX. For this reason, output control block
118
may also be referred to as the “output multiplexer” or “output select multiplexer.” Slice
110
may also provide the carry out signal, C
1
-out. Second slice
120
is similar to first slice
110
. Accordingly, operations of second slice
120
are similar to the operations of first slice
110
.
Operation of CLB
100
is also described by the incorporated references, and, in particular, in chapters seven and eight of the above-incorporated Data Book. For simplicity, CLB
100
of
FIG. 1
is illustrated with two slices; however, the number of slices constituting a CLB is not limited to two.
FIG. 1B
is a simplified block diagram of another conventional CLB
100
a
. CLB
100
a
is similar to CLB
100
of
FIG. 1A
but has an additional LUT
113
. LUT
113
takes outputs of LUT
112
and
114
as well as another input K
1
to slice
110
a
. Thus, LUT
113
allows slice
110
a
to implement any arbitrarily defined Boolean function of nine literals G
1
, G
2
, G
3
, G
4
, F
1
, F
2
, F
3
, F
4
, and K
1
. CLB
110
a
may include additional slices represented by ellipses
120
a.
Technology mapping for LUT-based FPGAs involves decomposition of a circuit into combinational logic having nodes with 4-input (“fan-in”) functions that can be realized in the LUTs of CLB slices. This is because, as shown in slice
110
, the slices commonly include 4-input LUTs as their function generators. By conventionally specifying the functions of function generators F, G, and C
1
, and output control block
118
, slice
110
can be programmed to implement various functions including, without limitation, two independent functions of up to four variables each.
Circuit designs are mapped to FPGAs as combinational logic. The combinational logic may be expressed in Boolean expressions including a number of logic levels and routing between the logic levels. The Boolean expressions include product (logical AND) and sum (logical OR) operations. Two levels of combinational logic may be expressed using sum-of-products (SOP) format. In fact, given a set of inputs and their inverse, any logic equation can be expressed using the SOP format.
In the FPGA art, there is a continuing challenge to increase speed (performance) of FPGA-implemented functions, or circuits. Circuit performance, or speed, is increased when circuit delay is decreased. Circuit delay includes two main components: logic delay and routing delay.
Using logical axioms and Boolean algebraic rules, it is possible to partially collapse a circuit design to reduce the number of logic levels, thus reducing the routing delay. However, this creates wide fan-in nodes. The wide fan-in nodes require use of several levels of LUTs for implementation. This is because, as described above, the LUTs have limited fan-in, for example fan-in of four. Therefore, to implement wide fan-in nodes, multiple levels of CLBs must be used. The requirement to use multiple levels of CLBs increases the logic delay as well as creating other routing delays. These negative effects cancel out the benefits from the routing delay reduction provided by the partial collapse of the circuit design.
Accordingly, there is a need for a method to implement wide fan-in nodes in FPGAs while avoiding the negative effects described above. Additionally, there is a need for CLB and CLB slice designs that allow for fast implementation of wide fan-in SOP functions.
SUMMARY
According to one aspect of the present invention, there is provided a literal-sharing decomposition method for combinational logic circuits expressed as a sum of product terms. A first product term (P
1
) is combined with a second product term (P
2
) resulting in a product chain P
1
+P
2
if P
1
may be implemented in a number of configurable logic block (CLB) slices and the product chain P
1
+P
2
may be implemented on the same number of configurable logic block (CLB) slices. The product chain is then used to configure CLB slices to implement the product terms. Because the product terms are combined, they can be implemented using fewer CLB slices than the number of slices needed for separate implementation. The reduction in the number of slices leads to faster implementation.
A “product chain” is a combination of product terms (“Pterms”) that share one or more literals. A product chain would typically include at least two Pterms; however, a single Pterm may be designated as a product chain to which other Pterms may be combined. A Pterm or a product chain may be implemented on one or more CLB slices. A “slice chain” is one or more slices configured to implement a Pterm or a product ch
Behiel Arthur Joseph
Cartier Lois D.
Mai Tan V.
Xilinx , Inc.
Young Edel M.
LandOfFree
Literal sharing method for fast sum-of-products logic does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Literal sharing method for fast sum-of-products logic, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Literal sharing method for fast sum-of-products logic will most certainly appreciate the feedback.
Profile ID: LFUS-PAI-O-3322311