Method and apparatus for implementing non-temporal loads

Electrical computers and digital processing systems: memory – Storage accessing and control – Hierarchical memories

Reexamination Certificate

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

C711S133000

Reexamination Certificate

active

06223258

ABSTRACT:

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates generally to the field of processors, and specifically, to a method and apparatus for implementing non-temporal loads.
2. Background Information
The use of a cache memory with a processor is well known in the computer art. A primary purpose of utilizing cache memory is to bring the data closer to the processor in order for the processor to operate on that data. It is generally understood that memory devices closer to the processor operate faster than memory devices farther away on the data path from the processor. However, there is a cost trade-off in utilizing faster memory devices. The faster the data access, the higher the cost to store a bit of data. Accordingly, a cache memory tends to be much smaller in storage capacity than main memory, but is faster in accessing the data.
A computer system may utilize one or more levels of cache memory. Allocation and de-allocation schemes implemented for the cache for various known computer systems are generally similar in practice. That is, data that is required by the processor is cached in the cache memory (or memories). If a cache miss occurs, then an allocation is made at the entry indexed by the access. The access can be for loading data to the processor or storing data from the processor to memory. The cached information is retained by the cache memory until it is no longer needed, made invalid or replaced by other data, in which instances the cache entry is de-allocated.
Recently, there has been an increase in demand on processors to provide high performance for graphics applications, especially three-dimensional graphics applications. The impetus behind the increase in demand is mainly due to the fact that graphics applications tend to cause the processor to move large amounts of data (e.g., display data) from cache and/or system memory to a display device. This data, for the most part, is used once or at most only a few times (referred to as “non-reusable data”).
For example, assume a cache set with two ways, one with data A and another with data B. Assume further that data A, data B, and data C target the same cache set, and assume also that a program reads and writes data A and data B multiple times. In the middle of the reads and writes of data A and data B, if the program performs an access of non-reusable data C, the cache will have to evict, for example, data A from way one and replace it with data C. If the program then tries to access data A again, a cache “miss” occurs, in which case data A is retrieved from external memory and data B is evicted from way two and replaced with data A. If the program then tries to access data B again, another cache “miss” occurs, in which case data B is retrieved from external memory and data C is evicted from way one and replaced with data B. Since data C is non-reusable by the program, this procedure wastes a considerable amount of clock cycles, decreases efficiency, and pollutes the cache.
Therefore, there is a need in the technology for a method and apparatus to efficiently read non-reusable data from external memory without polluting cache memory.
SUMMARY OF THE INVENTION
The present invention is a processor. The processor includes a decoder to decode instructions and a circuit, in response to a decoded instruction, to detect an incoming load instruction that misses a cache, allocate a buffer to service the incoming load instruction, and issue a bus request to load the data in the buffer without accessing said cache.


REFERENCES:
patent: 5404484 (1995-04-01), Schlansher et al.
patent: 5630075 (1997-05-01), Joshi et al.
patent: 5829026 (1998-10-01), Leung et al.
Visual Instruction Set (VIS) User's Guide, Sun Microsystems, version 1.1, Mar. 1997, pp. 1-127.
AMD-3D Technology manual, /Rev. B, Feb. 1998, pp. 1-58.
The UltraSPARC Processor—Technology White Paper The UltraSPARC Architecture, Sun Microsystems, Jul. 17, 1997, pp. 1-10.
21164 Alpha microprocessor Data Sheet, 1997 Samsung Electronics, pp. 1-77.
TM1000 Preliminary Data Book, (Tri Media), 1997, Philips Electronics.

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Method and apparatus for implementing non-temporal loads does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method and apparatus for implementing non-temporal loads, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method and apparatus for implementing non-temporal loads will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-2439544

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.