I'm having some issues when running Tectonic 1.8 deployed on AWS.
I'm able to get the cluster up and running using terraform however, certain command such as kubectl logs do not work due to a DNS resolution failure.
I've noticed several long threads on GitHub about this, but haven't really found a solution.
For example:
Error from server: Get
https://ip-172-31-xxx-xxx:10250/containerLogs/tectonic-system/tectonic-identity-97d874c57-fxz25/tectonic-identity: dial tcp: lookup ip-172-31-xxx-xxx on
172.31.0.2:53: no such host
Seems that for kubectl logs, the master uses the hostname rather than fqdn for lookups and for some reason go can't resolve it. The hostname is resolvable from the master and worker nodes as well as on my client machine using both host and dig commands. /etc/resolv.conf is configured to use Route53 DNS and includes the search suffix for the <region>.compute.internal domain name as well as the domain name used for the cluster.
In case it matters, I'm using a private hosted zone and only have internal ingress enabled in the Terraform variables file. I had previously followed a similar deployment model on Azure and did not experience this issue there.
Has anyone else run into this before, or know what a (scalable) workaround might be?
Thanks!
Adam