Comments (6)
Potentially related / improved / fixed already on main:
It was not cherry-picked back (only for CRS to fix CI related flakes) because we were not able to reproduce it for the other controllers in real.
Would be awesome if this could be checked if this also resolves this issue. That would need more details though how to exactly reproduce it.
from cluster-api.
Question: is the issue for you persistent or resolves itself after ~17 mins?
from cluster-api.
/triage accepted
remote/ClusterCacheTracker: Couldn't find controller pod metadata, the ClusterCacheTracker will always access clusters using the regular apiserver endpoint
Is because we are missing https://github.com/fabriziopandini/cluster-api/blob/0f47a19e038ee6b0d3b1e7675a62cdaf84face8c/controlplane/kubeadm/config/manager/manager.yaml#L28-L40 on the CABPK manifest (probably a leftover of a PR that started using ClusterCacheTracker in CAPBK)
This should be fixed in main and backported as far as possible
NOTE: This issue doesn't cause any issues to users, just degraded performance since we are using an additional client + client cache for self-hosted clusters.
Unable to retrieve Node status" err="failed to create cluster accessor: failed to get lock for cluster: cluster is locked already
Is not an issue
It is a transient condition that happens when two threads/reconcile loops running in parallel are trying to get a client from ClusterCacheTracker, and the client does not exist yet.
This error can appear many times if the underlying connection is not stable, and the reconcile loops/ClusterCacheTracker continuously tries to recreate connections, and each operation takes 10s before timing out (during those 10s other reconcile loops/ClusterCacheTracker gets the lock error).
from cluster-api.
/good-first-issue
from cluster-api.
@fabriziopandini:
This request has been marked as suitable for new contributors.
Guidelines
Please ensure that the issue body includes answers to the following questions:
- Why are we solving this issue?
- To address this issue, are there any code changes? If there are code changes, what needs to be done in the code and what places can the assignee treat as reference points?
- Does this issue have zero to low barrier of entry?
- How can the assignee reach out to you for help?
For more details on the requirements of such an issue, please see here and ensure that they are met.
If this request no longer meets these requirements, the label can be removed
by commenting with the /remove-good-first-issue
command.
In response to this:
/good-first-issue
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
from cluster-api.
/assign
from cluster-api.
Related Issues (20)
- conditions Utility to let specify the entire condition HOT 21
- Instance in machine pool failed to join cluster withe error bootstrap token not found HOT 2
- Tasks to bump to Kubernetes v1.32 HOT 1
- Consider filtering out finalizer warnings in controller logs HOT 8
- flaking unit test in `TestReconcileMachinePoolMachines` HOT 8
- Bump golangci-lint to v1.60.2 HOT 7
- Allow Clusterctl upgrade plan/apply to use --config to upgrade to specific version HOT 9
- Tasks for v1.9 release cycle HOT 5
- CAPI controllers fail to set ownerReference HOT 3
- [WIP] Tracking issue for Improving status in CAPI resources HOT 2
- Improve "Waiting for Node volumes to be detached" log message HOT 6
- Feature: ClusterResourceSet HOT 3
- Feature: RuntimeSDK
- Feature: KubeadmBootstrapFormatIgnition HOT 1
- Feature: MachineSetPreflightChecks HOT 4
- Improve calculation of NodeDrainTimeout & NodeVolumeDetachTimeout exceeded HOT 7
- flakes in clusterctl upgrade tests HOT 9
- Drain not being performed for KCP machines with K8s v1.31.x HOT 2
- Allow capi-controller-manager running even if Runtime Extensions are down HOT 6
- Timed out after 180.001s. waiting for cluster deletion timed out HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cluster-api.