Hi,
we are experiencing a weird issue related to the metadata service.
About 1 in 5 instances are not reachable because the instance cannot retrieve the ssh public key from the metadata service. This seems to be different for different zones, us-central1-b gave us 1 in 5 failures, while on europe-west1-b we experience it in about 1 in 40 instances.
While debugging using an image that also includes a hardcoded SSH key, we see the following:
* we can ping the link-local IP where the metadata service is supposed to be running
* port 80 is unreachable, so any call to the metadata service fails unfortunately.
* rebooting doesn't solve the issue
Has anybody experienced any such issue? I feel like I'm at a dead-end.
Cheers,
Rob
==========
Outputs of commands used to determine the above facts:
$ ping 169.254.169.254
PING 169.254.169.254 (169.254.169.254) 56(84) bytes of data.
; (1 server found)
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 21168
;; flags: qr rd ra; QUERY: 1, ANSWER: 6, AUTHORITY: 0, ADDITIONAL: 1
;; OPT PSEUDOSECTION:
; EDNS: version: 0, flags:; udp: 512
;; QUESTION SECTION:
;; ANSWER SECTION:
;; Query time: 4 msec
;; SERVER: 169.254.169.254#53(169.254.169.254)
;; WHEN: Mon Jul 28 17:08:22 CEST 2014
;; MSG SIZE rcvd: 139
$ nmap 169.254.169.254
Nmap scan report for metadata.google.internal (169.254.169.254)
Host is up (0.00013s latency).
Not shown: 999 closed ports
PORT STATE SERVICE
53/tcp open domain
Nmap done: 1 IP address (1 host up) scanned in 2.43 seconds
* Hostname was NOT found in DNS cache
* Trying 169.254.169.254...
* connect to 169.254.169.254 port 80 failed: Connection refused
* Failed to connect to metadata port 80: Connection refused
* Closing connection 0
curl: (7) Failed to connect to metadata port 80: Connection refused