What's the reconstruction mechanism about hm?

51 views
Skip to first unread message

cg

unread,
Aug 18, 2014, 8:06:20 AM8/18/14
to bosh-...@cloudfoundry.org
I hava deployed a CF by BOSH.When i restart vm about uaa, after a while, the uaa was rebuilt. In fact, no matter what i restart vm, it will be rebuilt.I found the following stacks-trace in /var/vcap/sys/log/health_monitor/health_monitor.log:

I, [2014-08-18T16:45:40.497018 #8607]  INFO : Found deployment `cf'
I, [2014-08-18T16:46:10.168655 #8607]  INFO : Analyzing agents...
I, [2014-08-18T16:46:10.169876 #8607]  INFO : [ALERT] Alert @ 2014-07-24 16:46:10 UTC, severity 2: f4e1f9eb-ca4c-4f8a-b959-70b6a155d0fc has timed out
W, [2014-07-24T16:46:10.170662 #8607]  WARN : (Resurrector) notifying director to recreate unresponsive VM: cf uaa/0

Why hm notify director rebuild uaa. What's the reconstruction mechanism about hm?

Thanks in advance.

James Bayer

unread,
Aug 18, 2014, 11:30:32 AM8/18/14
to bosh-users

i believe this is telling you the bosh health monitor thinks the VM is missing

I, [2014-08-18T16:46:10.169876 #8607]  INFO : [ALERT] Alert @ 2014-07-24 16:46:10 UTC, severity 2: f4e1f9eb-ca4c-4f8a-b959-70b6a155d0fc has timed out

perhaps check your network paths to make sure the BOSH VMs can communicate to the UAA VM?

To unsubscribe from this group and stop receiving emails from it, send an email to bosh-users+...@cloudfoundry.org.

--
Thank you,

James Bayer

Dmitriy Kalinin

unread,
Aug 18, 2014, 12:22:22 PM8/18/14
to bosh-...@cloudfoundry.org
What you've found is BOSH Health Monitor. See http://docs.cloudfoundry.org/bosh/bosh-resurrector.html for more details.

cg

unread,
Aug 18, 2014, 10:12:35 PM8/18/14
to bosh-...@cloudfoundry.org
Thanks jbayer.
 It happen when i restart UAA VM, but vm launch just 30 seconds. I found bosh resurrector in health_monitor.yml:
- name: resurrector
    events
:
     
- alert
    options
:
      director
: *director
      minimum_down_jobs
: 5
      percent_threshold
: 0.2
      time_threshold
: 600

How long the hm thinks the VM is missing or unresponsive? 
在 2014年8月18日星期一UTC+8下午11时30分32秒,jbayer写道:

gaoch...@gmail.com

unread,
Aug 18, 2014, 10:19:54 PM8/18/14
to bosh-...@cloudfoundry.org
Thanks Dmitriy.
 I found bosh resurrector in health_monitor.yml:
- name: resurrector
    events
:
      
- alert
    options
:
      director
: *director
      minimum_down_jobs
: 5
      percent_threshold
: 0.2
      time_threshold
: 600

Any suggest will be very appreciated.

在 2014年8月19日星期二UTC+8上午12时22分22秒,Dmitriy Kalinin写道:
Reply all
Reply to author
Forward
0 new messages