Performance of data stream touch events


: 1999-03-31
: 2002-12-24
: Iqbal, Nadeem (Department: 2184)
: Error detection/correction and fault detection/recovery
: Data processing system error or fault handling
: Reliability and availability

: C711S146000
: Reexamination Certificate
: active
: 06499116
: ABSTRACT:

TECHNICAL FIELD
The present invention relates in general to data processing systems, and in particular, to performance monitoring of events in data processing systems.
BACKGROUND INFORMATION
In typical computer systems utilizing processors, system developers desire optimization of execution software for more effective system design. Usually, studies of a program's access patterns to memory and interaction with a system's memory hierarchy are performed to determine system efficiency. Understanding the memory hierarchy behavior aids in developing algorithms that schedule and/or partition tasks, as well as distribute and structure data for optimizing the system.
Performance monitoring is often used in optimizing the use of software in a system. A performance monitor is generally regarded as a facility incorporated into a processor to monitor selected characteristics to assist in the debugging and analyzing of systems by determining a machine's state at a particular point in time. Often, the performance monitor produces information relating to the utilization of a processor's instruction execution and storage control. For example, the performance monitor can be utilized to provide information regarding the amount of time that has passed between events in a processing system. The information produced usually guides system architects toward ways of enhancing performance of a given system or of developing improvements in the design of a new system.
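As an illustration of measuring the time that has passed between events, the following minimal sketch computes cycle intervals from a recorded event trace. The `Event` type, the event names, and the `cycles_between` helper are hypothetical and are not taken from the patent; a hardware performance monitor would count these intervals in dedicated registers rather than in software.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Event:
    """One monitored event (hypothetical record layout)."""
    cycle: int  # processor cycle at which the event was observed
    name: str   # label of the event, e.g. "load_issued"


def cycles_between(trace: Iterable[Event], start_name: str, end_name: str) -> List[int]:
    """Return the cycle count from each start event to the next end event."""
    deltas: List[int] = []
    start_cycle = None
    for ev in trace:
        if ev.name == start_name and start_cycle is None:
            # Open a new interval at the first unmatched start event.
            start_cycle = ev.cycle
        elif ev.name == end_name and start_cycle is not None:
            # Close the open interval and record its length in cycles.
            deltas.append(ev.cycle - start_cycle)
            start_cycle = None
    return deltas
```

For example, a trace with `load_issued` at cycles 10 and 50 and `load_done` at cycles 42 and 55 yields two intervals, one per matched pair of events.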
SUMMARY OF THE INVENTION
The present invention provides a representation of the use of software-directed asynchronous prefetch instructions that occur during execution of a program within a processing system. Ideally, the instructions are used in perfect synchronization with the actual memory fetches that they are trying to speed up. In practical situations, it is difficult to predict ahead of time all side effects of these instructions and memory access latencies/throughput during the execution of any large program. Incorrect usage of such software-directed asynchronous prefetch instructions can cause degraded performance of the system.
Understanding the efficient use of these instructions is not enough in itself to solve all memory access performance problems. It is necessary to identify the most prevalent causes for limitations in the memory subsystem bandwidth. Then, the most appropriate solutions to increase memory bandwidth can be determined.
The present invention concerns the measuring of the effectiveness of such software-directed asynchronous prefetch instructions (“sdapis”). The sdapis are used in a context such as video streaming. Prefetching data in this context is unlike prefetching instructions based on an instruction sequence or branch instruction history. It is assumed in the video streaming context that data location is virtually unknowable without software direction. Consequently, it is reasonable to assume that virtually every software-directed prefetch results in a cache hit that would not have been a hit in the absence of the software-directed prefetch.
Assume that a program, or a simulation of a program, is running with sdapis (program execution without sdapis is expected to be slower). The number of clock cycles for running the program is counted. In a first aspect, the invention deduces that performance is improved, compared to not running sdapis, according to the reduction in memory access misses, i.e., the increase in cache hits, wherein it is assumed that each instance of sdapis causes a cache hit that otherwise would have been a cache miss. In terms of cycles, this improvement is expressed as the average cache miss penalty, in cycles, multiplied by the number of cache misses avoided (i.e., the increase in cache hits). Another aspect concerns measuring well-timed sdapis and poorly-timed sdapis. The extent of well-timed and poorly-timed sdapis is deduced by counting certain events, as described herein, that concern instances where sdapis result in loading data and the data is not used at all, or is not used soon enough to avoid being cast out, and by measuring certain time intervals in instances where sdapis result in loading data and the data is used. Another aspect concerns measuring an extent to which sdapis impede certain memory management functions. This extent is deduced by counting certain disclosed events involving tablewalks and translation lookaside buffer castouts. Another aspect concerns measuring an extent to which sdapis are contemplated, but stopped. Events concerning cancellations and suspensions are disclosed. In another aspect, the above measurements are included in numerous streams.
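The first two aspects can be sketched in a few lines. This is an illustrative sketch only, under stated assumptions: the record layout, the category names, and the function names are hypothetical, and the patent summary does not give a threshold separating well-timed from poorly-timed sdapis, so the sketch only separates the cases the summary names (data never used, data cast out before use, data used) and reports the load-to-use interval for the last case.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional, Tuple


def estimated_cycles_saved(misses_avoided: int, avg_miss_penalty_cycles: int) -> int:
    """Average cache miss penalty (cycles) times the number of misses avoided."""
    return misses_avoided * avg_miss_penalty_cycles


@dataclass(frozen=True)
class PrefetchRecord:
    """Outcome of one prefetch (hypothetical record layout)."""
    load_cycle: int                 # cycle at which the prefetched data was loaded
    first_use_cycle: Optional[int]  # cycle of first use, or None if never used
    castout_cycle: Optional[int]    # cycle of castout, or None if never cast out


def classify(rec: PrefetchRecord) -> str:
    """Place a prefetch in one of three illustrative categories."""
    if rec.first_use_cycle is None:
        return "unused"
    if rec.castout_cycle is not None and rec.castout_cycle < rec.first_use_cycle:
        # The data left the cache before the program asked for it.
        return "cast_out_before_use"
    return "used"


def summarize(records: Iterable[PrefetchRecord]) -> Tuple[Dict[str, int], List[int]]:
    """Count each category and collect load-to-use intervals for used data."""
    counts = {"unused": 0, "cast_out_before_use": 0, "used": 0}
    intervals: List[int] = []
    for rec in records:
        kind = classify(rec)
        counts[kind] += 1
        if kind == "used":
            intervals.append(rec.first_use_cycle - rec.load_cycle)
    return counts, intervals
```

Under the stated assumption that every sdapi converts one miss into a hit, `misses_avoided` would simply be the number of sdapis executed. The interval list is what a timing judgment would be based on: a short interval suggests the data arrived just before it was needed, while a long one suggests it occupied the cache longer than necessary.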
The foregoing has outlined rather broadly the features and technical advantages of the present invention in order that the detailed description of the invention that follows may be better understood. Additional features and advantages of the invention will be described hereinafter which form the subject of the claims of the invention.


Affiliated with

Roth Charles Philip (Inventor)
Snyder Michael Dean (Inventor)

Also associated with

Carwell Robert M. (Attorney)
International Business Machines Corp. (Corporate Assignee)
Iqbal Nadeem (Examiner)
Kordzik Kelly K. (Attorney)
Winstead Sechrest & Minick P.C. (Law Firm)
