Method and apparatus for testing the responsiveness of a...

Error detection/correction and fault detection/recovery – Data processing system error or fault handling – Reliability and availability

Reexamination Certificate

Details

C714S011000, C714S047300

active

06199172

ABSTRACT:

FIELD OF THE INVENTION
This invention relates to fault management of computer networks and, more particularly, to a method and apparatus wherein a first network device employs a proxy or recruit network device to test the responsiveness of another network device.
BACKGROUND OF THE INVENTION
Networks provide increased computing power, sharing of resources and communications between users. A network may include a number of computer devices within a room, building, or site that are interconnected by a high speed local data link to form a local area network (LAN), such as a token ring network, Ethernet network, or the like. LANs in the same or different locations may be interconnected by different media and protocols such as packet switching, microwave links and satellite links to form a wide area network. There may be several hundred or more interconnected devices in a network.
As a network becomes larger and more complex, issues arise as to the amount of traffic on the network, utilization of resources, security and the isolation of network faults. In U.S. Pat. No. 5,436,909, which issued to Roger Dev et al. on Jul. 25, 1995, and which is herein incorporated by reference in its entirety, a system for isolating network faults is disclosed. In the '909 patent, a network management system models network devices and relations between network devices. A contact status of each device is contained in a corresponding model. Each model receives status updates from and/or regularly polls the corresponding network device.
The '909 patent uses a technique known as “status suppression” in order to isolate network faults. When a first network device has lost contact with its corresponding model, the models which correspond to network devices adjacent to the first network device are polled to see if they have also lost contact with their corresponding network devices. If the adjacent models cannot contact their corresponding network devices, then presumably the first network device is not the cause of the fault and a fault status in the first model is suppressed or overridden. If it is determined that all adjacent network devices are not communicating, then the network fault can be more easily determined as something common to all of these devices.
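The status suppression rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Model` class, its fields, and the `fault_status` function are hypothetical names, not taken from the '909 patent, and the sketch assumes suppression applies only when every adjacent model has also lost contact.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Model:
    """Hypothetical stand-in for the management system's model of one device."""
    name: str
    in_contact: bool  # outcome of this model's most recent poll of its device
    neighbors: list[Model] = field(default_factory=list)


def fault_status(model: Model) -> str:
    """Return 'ok', 'fault', or 'suppressed' for the given model."""
    if model.in_contact:
        return "ok"
    # Assumption: suppress only when all adjacent models have lost contact too,
    # since the fault is then presumably common to the whole group.
    if model.neighbors and all(not n.in_contact for n in model.neighbors):
        return "suppressed"
    return "fault"
```

Note that this check has to consult every adjacent model, which is the polling cost the following paragraph raises for large networks.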
It may be advantageous to focus the failure analysis on the first network device without polling all of the adjacent network devices. In some large networks, such polling could involve hundreds, possibly thousands, of network devices thereby increasing the amount of traffic on the network and degrading network performance. In addition, there may be network devices that, although they have lost contact with the network management system, are still in contact with some other network device.
It is an object of the present invention to provide a method to facilitate fault management in a network which can be used alone or together with other fault management services to deduce the location and/or cause of a network failure.
SUMMARY OF THE INVENTION
The present invention relates to a method and apparatus for determining the responsiveness of a network device through the use of proxy or recruit network devices. More specifically, when a first network device has lost contact with a second network device, a proxy device is recruited to attempt to contact the second network device. Typically, this recruit utilizes a different physical path to the second network device and/or a different communication protocol for contacting the second device. The recruit then reports on whether the contact was successful. If it was successful, then the first network device can infer that the cause of its contact loss may lie with its path to the second network device or with the protocol the first device uses to contact the second device.
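The inference in the preceding paragraph might be sketched as follows. The `Probe` type, the `diagnose` function, and the returned strings are hypothetical names chosen for illustration; how a device actually contacts another (which path, which protocol) is abstracted into a callable that returns True on success.

```python
from typing import Callable

# Hypothetical probe signature: takes a target identifier, returns True on contact.
Probe = Callable[[str], bool]


def diagnose(target: str, own_probe: Probe, recruit_probe: Probe) -> str:
    """Use a recruit to test a target that the first device cannot reach."""
    if own_probe(target):
        return "target reachable directly"
    # Contact lost: ask the recruit, which is assumed to use a different
    # physical path and/or a different protocol.
    if recruit_probe(target):
        return "target responds to recruit; suspect first device's path or protocol"
    return "target did not respond to recruit either"


# Usage with stub probes standing in for real network calls.
print(diagnose("device-2", lambda t: False, lambda t: True))
```

The third outcome is deliberately non-committal: the source only states what can be inferred when the recruit succeeds.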
In one embodiment, a list of potential recruits is maintained at one or more locations in the network. Then, when a first network device loses contact with a second network device, one or more recruits from the list can be selected to attempt to contact the second network device. Where a plurality of recruits are selected, the recruits may attempt to contact the second device either in series or in parallel. The recruits then report back the results of their attempts, from which a better understanding of the location and/or cause of the network failure may be determined. This method may be used alone or in combination with other fault management services. It may advantageously be used in conjunction with a network management platform, such as the SPECTRUM® management system, available from Cabletron Systems, Inc., Rochester, N.H., which models the various devices (i.e., physical devices and applications) on the network, and maintains a contact status for each such device.
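A minimal sketch of the series and parallel options, assuming the recruit list is a simple mapping from recruit name to a probe callable. The names `probe_in_series` and `probe_in_parallel` are hypothetical, and stopping the series at the first success is an assumption made for this example rather than something the source specifies.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict

# Hypothetical recruit interface: a callable that tries to contact the target.
Recruit = Callable[[str], bool]


def probe_in_series(recruits: Dict[str, Recruit], target: str) -> Dict[str, bool]:
    """Ask recruits one at a time, in list order."""
    results: Dict[str, bool] = {}
    for name, probe in recruits.items():
        results[name] = probe(target)
        if results[name]:
            break  # assumption: one successful contact is enough
    return results


def probe_in_parallel(recruits: Dict[str, Recruit], target: str) -> Dict[str, bool]:
    """Ask all recruits at once and collect every report."""
    with ThreadPoolExecutor(max_workers=max(1, len(recruits))) as pool:
        futures = {name: pool.submit(probe, target) for name, probe in recruits.items()}
        return {name: future.result() for name, future in futures.items()}
```

Series probing keeps added traffic low, while parallel probing returns a fuller picture sooner; which trade-off suits a given network is left open here.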
These and other advantages of the present invention will be understood from the following drawings and detailed description of an exemplary embodiment.


REFERENCES:
patent: 4251858 (1981-02-01), Cambique et al.
patent: 4545011 (1985-10-01), Lyon et al.
patent: 4695946 (1987-09-01), Andreasen et al.
patent: 4701845 (1987-10-01), Andreasen et al.
patent: 4827411 (1989-05-01), Arrowood et al.
patent: 4833592 (1989-05-01), Yamanaka
patent: 4858152 (1989-08-01), Estes
patent: 4868818 (1989-09-01), Madan et al.
patent: 4872165 (1989-10-01), Mori et al.
patent: 5008853 (1991-04-01), Bly et al.
patent: 5038318 (1991-08-01), Roseman
patent: 5049873 (1991-09-01), Robins et al.
patent: 5065399 (1991-11-01), Hasegawa et al.
patent: 5187706 (1993-02-01), Frankel et al.
patent: 5226120 (1993-07-01), Brown et al.
patent: 5235599 (1993-08-01), Nishimura et al.
patent: 5247620 (1993-09-01), Fukuzawa et al.
patent: 5283783 (1994-02-01), Nguyen et al.
patent: 5321813 (1994-06-01), McMillen et al.
patent: 5408618 (1995-04-01), Aho et al.
patent: 5408649 (1995-04-01), Beshears et al.
patent: 5430729 (1995-07-01), Rahnema
patent: 5436909 (1995-07-01), Dev et al.
patent: 5448723 (1995-09-01), Rowett
patent: 5448724 (1995-09-01), Hayashi
patent: 5473599 (1995-12-01), Li et al.
patent: 5513345 (1996-04-01), Sato et al.
patent: 5559955 (1996-09-01), Dev et al.
patent: 5581689 (1996-12-01), Slominski et al.
patent: 5583860 (1996-12-01), Iwakawa et al.
patent: 5590277 (1996-12-01), Fuchs et al.
patent: 5592611 (1997-01-01), Midgely et al.
patent: 5603029 (1997-02-01), Aman et al.
patent: 5630184 (1997-05-01), Roper et al.
patent: 5649108 (1997-07-01), Speigel et al.
patent: WO 93/10495 (1993-05-01), None
Steven L. Fulton et al., “An Introduction To Model-Based Reasoning,” AI Expert, Jan. 1990, pp. 48-55.
Rodger Knaus, “A Portable Inference Engine,” AI Expert, Jan. 1990, pp. 17-20.
R. S. Gilbert et al., “CNMGRAF—Graphic Presentation Serv. for Network Mgmt.,” Proc. 9th Data Comm. Symp., Sep. 10-13, 1985, pp. 199-206.
D. Bursky, “Simulator Eases Communication Network Design,” Electronic Design, vol. 37, No. 21, Oct. 12, 1989, pp. 97-98, 100.
R. Cantone et al., “Model-Based Probabilistic Reasoning For Electronics Troubleshooting,” Proc. 8th International Joint Conference on AI, Aug. 8-12, 1983, pp. 207-211.
W. Hseush et al., “A Network Architecture For Reliable Distributed Computing,” Proc. 1987 Symp. On Simulation of Computer Networks, pp. 11-22.
E. Jones et al., “Monitoring And Analysis Strategies For Digital Networks,” IEEE J. On Selected Areas In Communications, vol. 6, No. 4, May 1988, pp. 715-721.
M. Sutter et al., “Designing Expert Systems For Real-Time Diagnosis Of Self-Correcting Networks,” IEEE Network Magazine, Sep. 1988, pp. 43-51.
L. Feldkhun et al., “Event Management As A Common Functional Area Of Open Systems Management,” Integrated Network Management, I, Meandzija, B. et al. (Eds.) 1989, pp. 365-376.
K. Scott, “Taking Care Of Business With SNMP,” Data Communications, Mar. 21, 1990, pp. 31-41.
R. Presuhn, “Considering CMIP,” Data Communications, Mar. 21, 1990, pp. 55-60.
Cabletron Systems, Inc. trade Literature, “SPECTRUM And DLM, Distributed LAN Monitor (DLM),” Oct. 7, 1993.
Sakauchi, Hideki et al., “A Self-Healing Network With An Economical Spare-Channel Assignment”, IEEE Teleco
