Thursday, July 11, 2013

Microsoft Failover Cluster event 1196, standard fixes do not work

Working at a client of mine we ran into an error 1196 Source Microsoft-Windows-Failover-clustering. this error indicates:

Cluster network name resource 'Cluster Name' failed registration of one or more associated DNS name(s) for the following reason:

The handle is invalid.
.
Ensure that the network adapters associated with dependent IP address resources are configured with at least one accessible DNS server.

There are many posts on the internet addressing this problem but none of them worked for us. take a look at them if you want:

  1. http://smtp25.blogspot.nl/2008/10/cluster-network-name-resource-name_23.html
  2. http://blog.subvertallmedia.com/2012/12/06/repairing-a-failover-cluster-in-windows-server-2012-live-migration-fails-dns-cluster-name-errors/
  3. http://haythamalex.wordpress.com/2012/10/02/cluster-name-failed-registration-of-one-or-more-associated-dns-names-for-the-following-reason/
  4. http://jaminquimby.com/joomla253/9-uncategorised/473-cluster-name-resource-failed-registeration-in-dns
We used the suggestions of these post and found out they are not sufficient for us; what is the matter?
All posts point to DNS registration, all fixes do not really touch DNS records.

I our case the DNS record in DNS (the CNO (Cluster Name Object)) A-record turned out to be faulty. when we removed the record we cleaned up our AD-integrated DNS records like this:

  1. removed A record from the DNS server on which the cluster nodes try to register themselves
  2. Update Server Data Files
  3. Clear Cache
  4. Restart DNS service
  5. Do this on all DNS servers in your domain or wait for the DNS replication to distribute these changes
Now we fixed the Cluster with the following recipe:
  1. Temporarily move the CNO account into the Computers container
  2. Log into one of the cluster nodes with a domain account that has the ResetPassword right in the domain
  3. Simulate failures for the cluster Network Name resource until it is in a permanent failed state
  4. Once the resource is in a Failed state, right-click on the resource, choose More Actions and then click Repair
Once this was done the A-record was created on the primary DNS server the cluster nodes point to. (this time with the right object rights)

till next time.

Important information on Error 1196: Microsoft has provided a Hotfix alailable for download here:
http://support.microsoft.com/kb/2838043
 
 

No comments:

Post a Comment