On Tue, Nov 27, 2012 at 3:16 PM, Alan Laird <alan....@gmail.com> wrote:Can you connect to the Razor server API endpoint as specified in the
>
> I'm new to razor and and running into an issue with MK and vmware guests. I
> could really use a suggestion to get past this.
>
> My razor server is a vmware guest and I have two physical servers that
> happily PXE through to the MK and then will install ubuntu based on policy.
> I configured a couple vmware guests and they PXE boot the MK but they do not
> seem to connect to the razor server. Running razor node on the the razor
> server only shows the physical hosts and not the VM hosts.
>
> I changed the MK to use the debug image and was able to login to the node
> and ping the razor server (to validate networking).
>
> Any thoughts?
configuration, from those problem guests?
I've got a really similar issue. Everything is working fine, even through the node checking in the first time, but then it never checks in again, goes inactive and then gets removed after 5 minutes of not checking in. This is at the end of my rz_mk_controller.log:
Wondering if your log file looks similar. BTW, my log file was rz_mk_controller.log, not rz_mk_common.log
--
You received this message because you are subscribed to the Google Groups "puppet-razor" group.
To post to this group, send email to puppet...@googlegroups.com.
To unsubscribe from this group, send email to puppet-razor...@googlegroups.com.
Visit this group at http://groups.google.com/group/puppet-razor?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
--
# cat ./get_razor_collection_counts.rbAs I said, if that's not the issue then I'm fresh out of ideas...the only other thing I can think of doing is to run the debug microkernel (which gathers a lot more data during the initial boot) and to set the 'mk_log_level' parameter in your Razor server configuration file to 'Logger::DEBUG' so that the debug output continues to be collected after the first checkin...
#!/usr/bin/env ruby
require 'rubygems'
require 'mongo'
hostname = '127.0.0.1'
port = 27017
connection = Mongo::Connection.new(hostname, port)
db = connection.db("project_razor")
db.collections.each { |collection|
print "#{collection.name}.count = "
puts collection.count
}
#
What does a 'razor config' show on your Razor server?
Then again, perhaps it's an issue with the new code :( have you tried a slightly older version of the Razor Microkernel (v0.9.1.7, perhaps)? If not, please give that a shot and let me know if that works for you (since it doesn't have the "fix" that adds the other sources for determining the Razor server)...The odd thing is that the change in question (adding in multiple sources for determining the Razor IP address) was added a little over three weeks ago. Did you just update to a more recent version of the Microkernel, perhaps? If that's the case, perhaps we need to revisit the syntax of the changes made by @GregSutcliffe (pull request #13) in order to add this new feature...
So I guess I'm confused here...is it working for some nodes but not for others? Do the "problem nodes" ever check in successfully with the Razor server or do they show this error from the very beginning (with the hostname in the URI set to "false" for all checkin requests)?
It's pretty apparent that the Microkernel is booting properly, so the issue is probably not related to your "next-server" settings in your TFTP server directory. If I understand what you're seeing correctly, then I think the issue is that the configuration that is being returned by Razor (for only some of the nodes? If so, that's odd) or the initial configuration changes that are made by the Microkernel Controller during the boot process (which should effect ALL Microkernel instances, not just a subset of them). Those configuration parameters are saved to the /tmp/mk_conf.yaml file on the Microkernel. Can you verify that you have a /tmp/mk_conf.yaml file that shows "false" for the hostname of the Razor server in that URI on one of those problem nodes???
What is in the /tmp/nextServerIP.addr file on the Microkernel instances
that are failing to register?
Hopefully the contents of those files will give a bit more
information as to what is going on here...if not I'll try to come
up with a custom MK that you can use to gather more diagnostic
output for debugging...
Thanks in advance for your help with this. I don't seem to be
able to recreate the error you are seeing in my own environment
(running either VMware Workstation 8.x or VMware Workstation 9.x
on a 64-bit Linux host and iPXE booting VMs using a Razor instance
running in a separate VM using that same VMware environment)...