|
I have one customer that has reproduced this issue, or pretty close to it, at will and in front of me via Webex.
Details: After buildout of Windows SQL Server VM (and now others non-SQL), the nose fingerprint seems to get lost after the first agent run:
{{C:\Program Files\Puppet Labs\Puppet\bin>puppet agent --fingerprint
C:\Program Files\Puppet Labs\Puppet\bin>}}
Full Description Issue: After provisioning a Windows 2012r2 server, puppet agent gets installed. Custom fact is generated, and `puppet agent –t –waitforcert 120` is issued from the node. Puppet master signs certificated (via vRO WF) and node is classified via the custom fact. Puppet run begins and all configuration is successfully applied. Any subsequent puppet runs result in the following message from the node: ruby 2.1.8p440 (2015-12-16 revision 53160) [x64-mingw32]
C:\Program Files\Puppet Labs\Puppet\bin>puppet agent -t Error: Could not request certificate: The certificate retrieved from the masterdoes not match the agent's private key. Certificate fingerprint: <FINGERPRINT SCRUBBED> To fix this, remove the certificate from both the master and the agent and then start a puppet run, which will automatically regenerate a certficate. On the master: puppet cert clean <HOSTNAME SCRUBBED> On the agent: 1a. On most platforms: find C:/ProgramData/PuppetLabs/puppet/etc/ssl -name <CERTNAME SCRUBBED>.pem -delete 1b. On Windows: del "C:/ProgramData/PuppetLabs/puppet/etc/ssl/<CERTNAME SCRUBBED>.pem" /f 2. puppet agent -t
Exiting; failed to retrieve certificate and waitforcert is disabled
Node is in Puppet console: 
Output from `puppet cert list –all` on master: + "<NODENAME SCRUBBED>" (SHA256) 91:4C:9F:82:D4:57:A1:64:C2:95:D1:9B:A3:C0:07:7F:F5:AA:F4:AA:D5:CA:24:94:BE:6F:B2:12:85:C5:7E:9D
Removing the node from master, deleting certs on the node and re-adding does fix the problem.
This is happening intermittently, but seems to be more consistent with MS SQL servers. These do have significantly longer puppet run times. (oftern greater than 30 minutes). Not sure if that is anything…
Here is the node’s puppet.conf file:
PS C:\ProgramData\PuppetLabs\puppet\etc> cat .\puppet.conf [main] server=<HOSTNAME SCRUBBED> pluginsync=true autoflush=true environment=production runinterval = 6h
Attached is the Application log from the affected node. Let me know what other info you may need to help troubleshoot this issue.
|