Clustering Cannot Determine If a Shared Disk Is Working Properly (198513)



The information in this article applies to:

  • Microsoft Windows 2000 Advanced Server
  • Microsoft Windows NT Server, Enterprise Edition 4.0

This article was previously published under Q198513

SUMMARY

A drive that is having hardware problems that allow the drive to be seen but no files to be accessed may appear to be online in Cluster Administrator until a program attempts to gain access to the files.

MORE INFORMATION

The cluster service, which implements IsAlive poll operation, does not actually test the volumes on a cluster disk resource for file readability. File access is not checked because no volume on a disk resource is required to have files on it; if a disk has no files, the check for file readability would report that as a failure and try to fix the problem.

Instead, the cluster service checks to determine if the root directory of each volume is accessible. If no problem is found with this check the volume remains online. Therefore, IsAlive may return a SUCCESS value on a bad drive until an actual input/output (I/O) operation does not succeed. If a problem is found with the root directory check, a failure notification is sent to the cluster service, which takes the resource offline and then attempt to bring it online again. The cluster service checks to see if the dirty bit is set and if found, attempts to run Chkdsk.exe with the /f switch on the volume to fix it.

The only exception to this behavior is the volume that contains the quorum log. When the disk resource that contains this volume is brought online the files in the MSCS folder (or the folder specified in the quorum resource properties) are checked.

If Chkdsk.exe cannot be run on the volume, the cluster logging function adds the following entry to the cluster log:
   FixCorruption: CHKDSK returned status of <error code>
				
where an error code of zero indicates success.

Modification Type:MinorLast Reviewed:1/5/2006
Keywords:kbinfo KB198513