Computer system implementing flush operation

Electrical computers and digital data processing systems: input/ – Input/output data processing – Input/output command process

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C710S006000, C710S107000, C710S112000, C710S120000, C710S305000, C710S306000, C710S310000, C710S311000, C370S402000

Reexamination Certificate

active

06553430

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention is related to the field of computer systems and, more particularly, to computer systems wherein certain write operations may be considered completed by a source upon transmission (i.e., posted write operations).
2. Description of the Related Art
Generally, personal computers (PCs) and other types of computer systems have been designed around a shared bus system for accessing memory. One or more processors and one or more input/output (I/O) devices are coupled to memory through the shared bus. The I/O devices may be coupled to the shared bus through an I/O bridge which manages the transfer of information between the shared bus and the I/O devices, while processors are typically coupled directly to the shared bus or are coupled through a cache hierarchy to the shared bus.
Unfortunately, shared bus systems suffer from several drawbacks. For example, the multiple devices attached to the shared bus present a relatively large electrical capacitance to devices driving signals on the bus. In addition, the multiple attach points on the shared bus produce signal reflections at high signal frequencies which reduce signal integrity. As a result, signal frequencies on the bus are generally kept relatively low in order to maintain signal integrity at an acceptable level. The low signal frequencies reduce signal bandwidth, limiting the performance of devices attached to the bus.
Lack of scalability to larger numbers of devices is another disadvantage of shared bus systems. As mentioned above, the available bus bandwidth is substantially fixed (and may decrease if adding additional devices causes a reduction in signal frequencies upon the bus). Once the bandwidth requirements of the devices attached to the bus (either directly or indirectly) exceeds the available bandwidth of the bus, devices will frequently be stalled when attempting access to the bus. Overall performance of the computer system including the shared bus will most likely be reduced.
On the other hand, distributed memory systems lack many of the above disadvantages. A computer system with a distributed memory system includes multiple nodes, two or more of which are coupled to different memories. The nodes are coupled to one another using any suitable interconnect. For example, each node may be coupled to each other node using dedicated lines. Alternatively, each node may connect to a fixed number of other nodes, and transactions may be routed from a first node to a second node to which the first node is not directly connected via one or more intermediate nodes. A memory address space of the computer system is assigned across the memories in each node.
In general, a “node” is a device which is capable of participating in transactions upon the interconnect. For example, the interconnect may be packet based, and the node may be configured to receive and transmit packets. Generally speaking, a “packet” is a communication between two nodes: an initiating or “source” node which transmits the packet and a destination or “target” node which receives the packet. When a packet reaches the target node, the target node accepts the information conveyed by the packet and processes the information internally. Alternatively, a node located on a communication path between the source and target nodes may relay the packet from the source node to the target node.
Distributed memory systems present design challenges which differ from the challenges in shared bus systems. For example, shared bus systems regulate the initiation of transactions through bus arbitration. Accordingly, a fair arbitration algorithm allows each bus participant the opportunity to initiate transactions. The order of transactions on the bus may represent the order that transactions are performed (e.g. for coherency purposes). On the other hand, in distributed systems, nodes may initiate transactions concurrently and use the interconnect to transmit the transactions to other nodes. These transactions may have logical conflicts between them (e.g. coherency conflicts for transactions involving the same address) and may experience resource conflicts (e.g. buffer space may not be available in.various nodes) since no central mechanism for regulating the initiation of transactions is provided. Accordingly, it is more difficult to ensure that information continues to propagate among the nodes smoothly and that deadlock situations (in which no transactions are completed due to conflicts between the transactions) are avoided.
For example, certain deadlock conditions may occur in Peripheral Component Interconnect (PCI) I/O systems if “posted” write operations are not allowed to become unordered with respect to other operations. Generally speaking, a posted write operation is considered complete by the source when the write command and corresponding data are transmitted by the source (e.g., by a source interface). A posted write operation is thus in effect completed at the source. As a result, the source may continue with other operations while the packet or packets of the posted write operation travel to the target and the target completes the posted write operation. The source is not directly aware of when the posted write operation is actually completed by the target.
In contrast, a “non-posted” write operation is not considered complete by the source until the target (e.g., a target interface) has completed the non-posted write operation. The target generally transmits an acknowledgement to the source when the non-posted write operation is completed. Such acknowledgements consume interconnect bandwidth and must be received and accounted for by the source. Non-posted write operations may be required when the write operations must be performed in a particular order (i.e., serialized).
When a source must accomplish multiple write operations, and the write operations need not be completed in any particular order, it is generally preferable from a system performance standpoint to accomplish the write operations as posted write operations. Situations may arise, however, where the posted write operations need to be properly ordered within their targets with respect.to other pending operations such that memory coherency is preserved within the computer system before processing operations within the source may be continued.
It would thus be desirable to have a computer system implementing a special operation which provides assurance to the source that all posted write operations previously issued by the source have been properly ordered within their targets with respect to other pending operations. The computer system may have, for example, a distributed memory system, and the special operation may help preserve memory coherency within the computer system.
SUMMARY OF THE INVENTION
A computer system is presented which implements a “flush” operation providing a response to a source which signifies that all posted write operations previously issued by the source have been properly ordered within their targets with respect to other pending operations. The flush operation helps to preserve memory coherency within the computer system.
In one embodiment, the computer system includes a processing subsystem and an input/output (I/O) node. The processing subsystem includes multiple processing nodes interconnected via coherent communication links. Each processing node includes a processor preferably executing software instructions. Each processing node may include, for example, a processor core configured to execute instructions of a predefined instruction set. One of the processing nodes includes a host bridge. The I/O node is coupled to the processing node including the host bridge via a non-coherent communication link. The I/O node may be part of an I/O subsystem including multiple I/O nodes serially interconnected via non-coherent communication links.
The processing subsystem may include, for example, a first processing node, a second processing node, and a memory coupled to the first processing node. Either the first or second pro

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Computer system implementing flush operation does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Computer system implementing flush operation, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Computer system implementing flush operation will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-3019207

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.