Data processing system with synchronization coprocessor for mult

Boots – shoes – and leggings

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

395375, 364230, 3642443, 364DIG1, G06F 1580

Patent

active

055600298

DESCRIPTION:

BRIEF SUMMARY
BACKGROUND OF THE INVENTION

Many are interested in the goal of general purpose computing that achieves very high speeds by exploiting parallelism in a scalable, cost-effective way. There seems to be widespread consensus that the architecture of such machines will be composed of a number of nodes interconnected with a high speed, regular network, where each node is built with an off-the-shelf microprocessor. Because such machines are built out of commodity parts, and because the topology is scalable, it is felt that such a machine with hundreds or thousands of nodes will be cheaper and faster than classical supercomputers, which are built with exotic technology and are thus very expensive.
To date, the prevailing opinion seems to be that microprocessors have their own evolutionary momentum (from CISC to RISC and, now, to a multiple instruction issue), and that a massively parallel machine will simply track this wave, using whatever microprocessors are currently available. However, a massively parallel machine is in fact a hostile environment for today's micros, arising largely because certain properties of the memory system in a massively parallel machine are fundamentally different from those assumed-during the evolution of these micros. In particular, most micros today assume that all memory is equally distant, and that memory access time can be made effectively small by cacheing. Both these assumptions are questionable in a massively parallel machine.
On the other hand, dataflow processors have been designed from the start by keeping in mind the properties of the memory system in a parallel machine. However, past dataflow processor designs have neglected single-thread performance, and hence must be classified as exotic, not the kind of processor to be found in commodity workstations.
To be cost-effective, the micros used in massively parallel machines should be commodity parts, i.e., they should be the same micros as those used in workstations and personal computers. Market forces are such that a lot more design effort can be expended on a stock microprocessor than on a processor that is sold only in small quantities. In addition, there is a question of software cost. Parallel programs are often evolved from sequential programs, and will continue to use components that were developed for single-thread uniprocessors (such as transcendental function libraries, Unix, etc.). This does not mean that we are restricted to using good, conventional microprocessors in any parallel machine that we build. All it means is that any new processor that we design for multiprocessors must also stand on its own as a cheap and viable uniprocessor.
Parallel programs contain synchronization events. It is well known that processor utilization suffers if it busy-waits; to avoid this, some form of multiplexing amongst threads (tasks or processes) is necessary. This is true even in uniprocessors.
In order to build parallel machines that are scalable both physically and economically, we must face the fact that inter-node latency in the machine will grow with machine size, at least by a factor of log (N), where N is the number of nodes in the machine. Thus, access to a non-local datum in a parallel machine may take tens to hundreds of cycles, or more. If we are to maintain effective utilization of the machine, a processor must perform some other useful work instead of idling during such a remote access. This requires that the processor be multiplexed amongst many threads, and that remote accesses must be performed as split transactions, i.e., a request and its response should be treated as two separate communication events across the machine. If we follow this argument a step further, we see that a communication entering a node will arrive at some relatively unpredictable time, and that we need some means of identifying the thread that is waiting for this communication. This is, in fact, a synchronization event.
Thus, the following picture emerges. In a parallel machine, the way to deal with long inter-node latencies is exactly th

REFERENCES:
patent: 4320455 (1982-03-01), Woods et al.
patent: 4481573 (1984-11-01), Fukunaga et al.
patent: 4615001 (1986-09-01), Hudgins, Jr.
patent: 4819155 (1989-04-01), Wulf et al.
patent: 4858105 (1989-08-01), Kuriyama et al.
patent: 4943908 (1990-07-01), Emma et al.
patent: 5050068 (1991-09-01), Dollas et al.
patent: 5050070 (1991-09-01), Chastain et al.
patent: 5226131 (1993-07-01), Grafe et al.
patent: 5241635 (1993-08-01), Papadopoulos et al.
patent: 5353418 (1994-10-01), Nikhil et al.
Nikhil, Rishiyur S., et al., "Can dataflow subsume von Neumann computing?" In Proceedings of the 16th Annual International Symposium on Computer Architecture, Jerusalem, Israel, May 29-31, 1991, pp. 262-272.
Papadopoulos et al., "Monsoon: an Explicit Token-store Architecture," In Proceedings 17th International Symposium on Computer Architecture, Seattle, Wa, May 1990.
Sakai et al., "An Architecture of a Dataflow Single Chip Processor," In Proc. 16th Annual International Symposium on Computer Archtecture, Jerusalem, Israel, pp. 46-53, May 28-Jun. 1, 1989.
Dennis et al., "An Efficient Pipelined Dataflow Processor Architecture," In Proc. Supercomputing Conference, Orlando, FL, pp.368-373, Nov. 14-18, 1988.
Agarwal et al., "APRIL: A Processor Achitecture for Multiprocessing," In Proc. 17th Annual Intl. on Computer Architecture, Seattle, Washington, U.S.A., pp. 104-114, May 28-31, 1990.
Halstead, Jr. et al., "MASA: A Multithreaded Processor Architecture for Parallel Symbolic Computing," In Proceedings of the IEEE 15th Annual International Symposium on Computer Architecture, Honolulu, Hawaii, Jun. 1988.
Weber et al., "Exploring the Benefits of Multiple Hardware Contexts in a Multiprocessor Architecture: Preliminary Results," In Proceedings of the 16th Annual International Symposium on Computer Architecture, Jerusalem, Israel, pp. 273-280, May 29-31, 1989.
Arvind et al., "Two Fundamental Issues in Multiprocessing," In Proceedings of DFVLR-Conference 1987 on Parallel Processing in Science and Engrg, Bonn-Bad Godesberg, W. Germany, Springer-Verlag LNCS 295, Jun. 25-29, 1987.
Kuehn et al., "The Horizon Supercomputing System: Architecture and Software," In Prof. IEEE Supercomputing Conference, Florida, pp. 28-34, 1988.
Thistle et al, "A Processor Architecture For Horizon," In Prof. IEEE Supercomputing Conference, Florida, pp.35-41, 1988.
Dally et al., "Architecture of a Message-Driven Processor," In Proc. 14th. Annual Intl. Symp. on Computer Architecture, Pittsburgh, PA, pp. 189-196, Jun. 1987.
Arvind et al., "I-Structure: Data Structures for Parallel Computing," ACM Transactions on Proramming Languages and Systems, 11(4): 598-632, (1989).
Richard Buehrer et al., "Incorporating Data Flow Ideas into von Neumann Processors for Parallel Execution," IEEE Transactions on Computer, C-36(12): 1515-1522, (1987).
Musciano, Albert J. et al., "Efficient Dynamic Scheduling of Medium-Grained Tasts for General Purposing Parallel Processing," Proceedings of the 1988 International Conference on Parallel Processing, vol. II, 15 Aug. 1988, London, GB, pp. 166-175.
Hieb, Robert et al., "Continuations and Concurrency," Sigplan Notices, vol. 25, No. 3, 14 Mar. 1990, Seattle, U.S., pp. 128-136.
Holmes, V. P. et al., "A Designer's Perspective of the Hawk Multiprocessor Operating System Kernel," Operating Systems Review (SIGOPS), vol. 23, 3 Jul. 1989, New York, U.S., pp. 158-172.
Hansen, Per Brinch, "Operating System Principles," Prentice-Hall, Inc., Englewood Cliffs, New Jersey, 1973, pp. 145-150.
Roos, Joachim, "A Real-Time Support Processor for Ada Tasking," Computer Architecture News, vol. 17, No. 2, Apr. 1989, New York, U.S., pp. 162-171.
Draves, Richard P. et al., "Using Continuations to Implement Thread Management and Communication in Operating Systems," Operating Systems Review, vol. 25, No. 5, Oct. 13, 1991, pp. 122-136.
Iannucci Rebert A., "Toward a Dataflow/Von Neumann Hybrid Architecture", The Computer Society of the IEEE, The 15th Annual International Symposium on Computer Architecture,

LandOfFree

Say what you really think

Search LandOfFree.com for the USA inventors and patents. Rate them and share your experience with other people.

Rating

Data processing system with synchronization coprocessor for mult does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Data processing system with synchronization coprocessor for mult, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Data processing system with synchronization coprocessor for mult will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFUS-PAI-O-1948279

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.