VeeamON 2024 - Use Code "COMMUNITY10" for 10% Off!
It is related to the upgrade to Ubuntu server 22.04. But I’m not sure if it’s OS related, or a problem with the updated kubernetes/microk8s version.After the upgrade the version was 1.23.9. I also tried now on 1.24.3 and 1.25.0.In 1.24.3 I have the same error as in the post above, in 1.25.0 it’s a different error in the kanister job:Snapshotting k10-admin@38d76db0-99c5-43e1-8217-6108ef485b1a.postgres-postgresql.postgres-bp:/kanister-backups ...pg_dumpall: error: could not translate host name "postgres-postgresql.authentication.svc.cluster.local" to address: Try again
i have the same problem
I could finally solve the problem. Don’t know what’s the reason for it, but i had to completely delete and then deploy my workloads again.After that the kanister backups now also work with the current version 4.5.14 agian.
Hi @Satish, after some testing on my test environment I can now tell, that the problem is with version 4.5.13 (it worked until v4.5.12). . Possibly the actual error could be right after that. Can you provide more info about it This is the last log entry, that occurs multiple times in the log process followed to upgrade ?( if helm please provide the command used to upgrade) helm repo updatehelm upgrade k10 kasten/k10 --namespace=kasten-io --version 4.5.13 -f helm-kasten.yamlI also tried it before with the command from the documentation:helm repo update && \ helm get values k10 --output yaml --namespace=kasten-io > k10_val.yaml && \ helm upgrade k10 kasten/k10 --namespace=kasten-io -f k10_val.yamlError yaml of snapshot job:cause: cause: cause: cause: cause: message: context deadline exceeded file: kasten.io/k10/kio/poll/poll.go:95 function: kasten.io/k10/kio/poll.waitWithBackoffWithRetriesHelper linenumber: 95
This shouldn’t take that much time for few MB. You can look at the kanister-svc logs to see if there are any clues. Ok, I found something in that logs that could explain the problem, but I have no clue how to solve that: { "Container": "kanister-sidecar", "File": "pkg/format/format.go", "Function": "github.com/kanisterio/kanister/pkg/format.LogWithCtx", "Line": 61, "Out": "kopia: error: unable to list sources: 401 Unauthorized, try --help", "Pod": "Pod", "cluster_name": "cluster_name", "hostname": "kanister-svc-5fc6fbf99d-9cdwh", "level": "info", "msg": "Pod Update", "time": "2022-04-21T10:58:58.534152852Z", "version": "4.5.13"}
It’s just a few MB that are copied on that snapshot, only took a few seconds before the upgrade.Are there any logs that i can investiagte?
Thanks for the quick reply, this solved the issue with that pod not starting. Now it’s running again!Unfortunatelly it doesn’t solve the problems that i have with volume snapshots since the upgrade:cause: cause: cause: cause: cause: message: context deadline exceeded file: kasten.io/k10/kio/poll/poll.go:95 function: kasten.io/k10/kio/poll.waitWithBackoffWithRetriesHelper linenumber: 95 message: Context done while polling fields: - name: duration value: 44m59.963318481s file: kasten.io/k10/kio/poll/poll.go:65 function: kasten.io/k10/kio/poll.waitWithBackoffWithRetries linenumber: 65 message: Timeout while polling fields: - name: actionSet value: k10-backuptoserver-k10-deployment-generic-volume-2.0.17-piznnbl file: kasten.io/k10/kio/kanister/operation.go:284 function: kasten.io/k10/kio/kanister.(*Operation).waitForActionSetCompletion linenumber: 284 message: Error wait
I did set up my own CA now, which is also capable of creating certificates for the ingress resource (step ca). Now it is working.
Thanks, I was not aware that i’m in the wrong group. Created the topic in the k10 support
Already have an account? Login
Enter your username or e-mail address. We'll send you an e-mail with instructions to reset your password.
Sorry, we're still checking this file's contents to make sure it's safe to download. Please try again in a few minutes.
Sorry, our virus scanner detected that this file isn't safe to download.