-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alerts - OCS Cluster and Cluster Nodes health #1
Comments
@shtripat How do we get these metrics? |
I'm hesitant to get these items from Anthill. It will have its own view of each, but we then get a dependency... If the operator is down or malfunctioning, the alerts are potentially wrong. I would expect many of these to come via data from gluster-prometheus or health checks on labeled pods. The benefit of using g-p is that as long as 1 gd2 pod is ready, the exporter should be available through the gd2 client service. |
It can come from K8s(node exporter). We can add a recording rule and set it under a gluster namespace.
It can come from K8s. but I don't know how useful this will be.
It can be provided by glusterd2 API
It can be provided by glusterd2 api /ping endpoint
It can be provided by v1/cluster/{cluster_id}/status |
Need following status alerts:
Node status (Up/Down)
Container status (Up/Down)
Gluster peer in cluster status (Connected/Disconnected)
Glusterd2 service status (Up/Down)
Cluster status
The text was updated successfully, but these errors were encountered: