Alert: Root Management Server Unavailable.
Source: SCOM01.DOMAIN.LOCAL
Path: SCOM01.DOMAIN.LOCAL
Last modified by: System
Last modified time: 9/2/2008 1:18:19 AM
Alert description: The root management server (Healthservice) has stopped heartbeating soon after 9/2/2008 1:17:18 AM. This adversely affects all availability calculation for the entire management group.
After the upgrade to SP1, our SCOM server started to give the above alert every minute or so. After much research and a call to Microsoft to confirm. This is in fact a bug, Microsoft’s explaination is as follows “As the RMS gets busy with hundreds of client to monitor, the health service will be slow to response which causes it to skip heartbeats. This in turn trigger the alerts.” This can get really annoying if someone is subscribed to email/IM notification. Here is the work around:
1. Regedt32
2. Locate HKEY_LOCAL_MACHINE -> SOFTWARE -> MICROSOFT -> MICROSOFT OPERATIONS MANANGER -> 3.0 -> SDK SERVICE
3. Right click – New Key
4. Enter “RHS Watcher”
5. Right click “RHS Watcher” -> New -> DWORD
6. Enter “MinutesToWaitBeforeAlerting”
7. Double-click on “MinutesToWaitBeforeAlerting” and enter value of 5
8. Close regdt32 and open Services.msc
9. Restart OpsMgr Config Service, OpsMgr Health Service, and OpsMgr Health Service
This should do the trick, basically what you just did is tell delaying the SDK Service from sending out alerts for 5 minutes, at which time it should have received the next heartbeat.