Predictive snooping of cache memory for master-initiated...

Electrical computers and digital processing systems: memory – Storage accessing and control – Hierarchical memories

Reexamination Certificate

Rate now

[ 0.00 ] – not rated yet Voters 0 Comments 0

Details Predictive snooping of cache memory for master-initiated... Predictive snooping of cache memory for master-initiated...

: 2000-08-02
: 2002-06-11
: Ellis, Kevin L. (Department: 2185)
: Electrical computers and digital processing systems: memory
: Storage accessing and control
: Hierarchical memories

: Reexamination Certificate
: active
: 06405291
: ABSTRACT:

BACKGROUND
1. Field of the Invention
The invention relates to computer systems in which a host processor and a bus master can access the same address space, and more particularly, to techniques for facilitating burst accesses by such a master.
2. Description of Related Art
In a typical IBM PC/AT-compatible computer system, a host processing unit is coupled to a host bus and most I/O peripheral devices are coupled to a separate I/O bus. The host processing unit typically comprises an Intel i386, i486 or Pentium™ microprocessor, and the I/O bus typically conforms to a standard known as ISA (Industry Standard Architecture). I/O interface circuitry, which usually comprises one or more chips in a “core logic chipset”, provides an interface between the two buses. A typical system also includes a memory subsystem, which usually comprises a large array of DRAM and perhaps a cache memory.
General information on the various forms of IBM PC AT-compatible computers can be found in IBM, “Technical Reference, Personal Computer AT” (1985), in Sanchez, “IBM Microcomputers: A Programmer's Handbook” (McGraw-Hill: 1990), in MicroDesign Resources, “PC Chip Sets” (1992), and in Solari, “AT Bus Design” (San Diego: Annabooks, 1990). See also the various data books and data sheets published by Intel Corporation concerning the structure and use of the 80×86 family of microprocessors, including Intel Corp., “Pentium™ Processor”, Preliminary Data Sheet (1993); Intel Corp., “Pentium™ Processor User's Manual”(1994); “i486 Microprocessor Hardware Reference Manual”, published by Intel Corporation, copyright date 1990, “386 SX Microprocessor”, data sheet, published by Intel Corporation (1990), and “386 DX Microprocessor”, data sheet, published by Intel Corporation (1990). In addition, a typical core logic chipset includes the OPTi 82C802G and either the 82C601 or 82C602, all incorporated herein by reference. The 82C802G is described in OPTi, Inc., “OPTi PC/AT Single Chip 82C802G Data Book”, Version 1.2a (Dec. 1, 1993), and the 82C601 and 82C602 are described in OPTi, Inc., “PC/AT Data Buffer Chips, Preliminary, 82C601/82C602 Data Book”, Version 1.0e (Oct. 13, 1993). All the above references are incorporated herein by reference.
Many IBM PC AT-compatible computers today include one, and usually two, levels of cache memory. A cache memory is a high-speed memory that is positioned between a microprocessor and main memory in a computer system in order to improve system performance. Cache memories (or caches) store copies of portions of main memory data that are actively being used by the central processing unit (CPU) while a program is running. Since the access time of a cache can be faster than that of main memory, the overall access time can be reduced. Descriptions of various uses of and methods of employing caches appear in the following articles: Kaplan, “Cache-based Computer Systems,”
Computer
, 3/73 at 30-36; Rhodes, “Caches Keep Main Memories From Slowing Down Fast CPUs,”
Electronic Design
, Jan. 21, 1982, at 179; Strecker, “Cache Memories for PDP-11 Family Computers,” in Bell, “
Computer Engineering
” (Digital Press), at 263-67, all incorporated herein by reference. See also the description at pp. 6-1 through 6-11 of the “i486 Processor Hardware Reference Manual” incorporated above.
Many microprocessor-based systems implement a “direct mapped” cache memory. In general, a direct mapped cache memory comprises a high speed data Random Access Memory (RAM) and a parallel high speed tag RAM. The RAM address of each line in the data cache is the same as the low-order portion of the main memory line address to which the entry corresponds, the high-order portion of the main memory address being stored in the tag RAM. Thus, if main memory is thought of as 2
m
blocks of 2
n
“lines” of one or more bytes each, the i'th line in the cache data RAM will be a copy of the i'th line of one of the 2
m
blocks in main memory. The identity of the main memory block that the line came from is stored in the i'th location in the tag RAM.
When a CPU requests data from memory, the low-order portion of the line address is supplied as an address to both the cache data and cache tag RAMs. The tag for the selected cache entry is compared with the high-order portion of the CPU's address and, if it matches, then a “cache hit” is indicated and the data from the cache data RAM is enabled onto a data bus of the system. If the tag does not match the high-order portion of the CPU's address, or the tag data is invalid, then a “cache miss” is indicated and the data is fetched from main memory. It is also placed in the cache for potential future use, overwriting the previous entry. Typically, an entire line is read from main memory and placed in the cache on a cache miss, even if only a byte is requested. On a data write from the CPU, either the cache RAM or main memory or both may be updated, it being understood that flags may be necessary to indicate to one that a write has occurred in the other.
Accordingly, in a direct mapped cache, each “line” of secondary memory can be mapped to one and only one line in the cache. In a “fully associative” cache, a particular line of secondary memory may be mapped to any of the lines in the cache; in this case, in a cacheable access, all of the tags must be compared to the address in order to determine whether a cache hit or miss has occurred. “k-way set associative” cache architectures also exist which represent a compromise between direct mapped caches and fully associative caches. In a k-way set associative cache architecture, each line of secondary memory may be mapped to any of k lines in the cache. In this case, k tags must be compared to the address during a cacheable secondary memory access in order to determine whether a cache hit or miss has occurred. Caches may also be “sector buffered” or “sub-block” type caches, in which several cache data lines, each with its own valid bit, correspond to a single cache tag RAM entry.
When the CPU executes instructions that modify the contents of the cache, these modifications must also be made in the main memory or the data in main memory will become “stale.” There are two conventional techniques for keeping the contents of the main memory consistent with that of the cache—(1) the write-through method and (2) the write-back or copy-back method. In the write-through method, on a cache write hit, data is written to the main memory immediately after or while data is written into the cache. This enables the contents of the main memory always to be valid and consistent with that of the cache. In the write-back method, on a cache write hit, the system writes data into the cache and sets a “dirty bit” which indicates that a data word has been written into the cache but not into the main memory. A cache controller checks for a dirty bit before overwriting any line of data in the cache, and if set, writes the line of data out to main memory before loading the cache with new data.
A computer system can have more than one level of cache memory for a given address space. For example, in a two-level cache system, the “level one” (L1) cache is logically adjacent to the host processor. The second level (L2) cache is logically behind the first level cache, and DRAM memory (which in this case can be referred to as tertiary memory) is located logically behind the second level cache. When the host processor performs an access to an address in the memory address space, the first level cache responds if possible. If the first level cache cannot respond (for example, because of an L1 cache miss), then the second level cache responds if possible. If the second level cache also cannot respond, then the access is made to DRAM itself. The host processor does not need to know how many levels of caching are present in the system or indeed that any caching exists at all. Similarly, the first level cache does not need to know whether a second level of caching exists prior to the DRAM. Thus, to the host processing unit, the combination of both c

Affiliated with

Ghosh Subir

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Tung Hsu-Tien

Inventor

[ 0.00 ] – not rated yet Voters 0 Comments 0

Also associated with

Ellis Kevin L.

Examiner

[ 0.00 ] – not rated yet Voters 0 Comments 0

Haynes Beffel & Wolfeld LLP

Law Firm

[ 0.00 ] – not rated yet Voters 0 Comments 0

OPTi Inc.

Corporate Assignee

[ 0.00 ] – not rated yet Voters 0 Comments 0

Wolfeld Warren S.

Attorney

[ 0.00 ] – not rated yet Voters 0 Comments 0

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Predictive snooping of cache memory for master-initiated... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Predictive snooping of cache memory for master-initiated..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Predictive snooping of cache memory for master-initiated... will most certainly appreciate the feedback.

Rate now

Comments { 0 }

Profile ID: LFUS-PAI-O-2892809

All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.

Canada

Charities
Companies
MP Candidates
Patents
Employee Salary Disclosure

World

Places of the World
Scientific Papers

United States

Banks
Companies
Counties
Patents
Employee Salary Disclosure