Commit 87a2eea

add must-gather in README + fix status and acronyms in README
Signed-off-by: oraz <[email protected]>
1 parent dccc134 commit 87a2eea

1 file changed: +19 -19 lines changed
README.md

Lines changed: 19 additions & 19 deletions
@@ -1,7 +1,7 @@
-# Node Maintenance Operator
+# Node Maintenance Operator (NMO)
 
-The node-maintenance-operator is an operator generated from the [operator-sdk](https://github.com/operator-framework/operator-sdk).
-The purpose of this operator is to watch for new or deleted custom resources called `NodeMaintenance` which indicate that a node in the cluster should either:
+The node-maintenance-operator (NMO) is an operator generated from the [operator-sdk](https://github.com/operator-framework/operator-sdk).
+The purpose of this operator is to watch for new or deleted custom resources (CRs) called `NodeMaintenance` which indicate that a node in the cluster should either:
 - `NodeMaintenance` CR created: move node into maintenance, cordon the node - set it as unschedulable, and evict the pods (which can be evicted) from that node.
 - `NodeMaintenance` CR deleted: remove pod from maintenance and uncordon the node - set it as schedulable.
 
@@ -19,7 +19,7 @@ There are two ways to run the operator:
 After every PR merge to master images were build and pushed to `quay.io`.
 For deployment of NMO using these images you need:
 
-- a running OpenShift cluster, or a Kubernetes cluster with OLM installed.
+- a running OpenShift cluster, or a Kubernetes cluster with Operator Lifecycle Manager (OLM) installed.
 - `operator-sdk` binary installed, see https://sdk.operatorframework.io/docs/installation/.
 - a valid `$KUBECONFIG` configured to access your cluster.
 
@@ -33,7 +33,7 @@ Follow the instructions [here](https://sdk.operatorframework.io/docs/building-op
 
 ### Set Maintenance on - Create a NodeMaintenance CR
 
-To set maintenance on a node a `NodeMaintenance` CustomResource should be created.
+To set maintenance on a node a `NodeMaintenance` custom resource should be created.
 The `NodeMaintenance` CR spec contains:
 - nodeName: The name of the node which will be put into maintenance mode.
 - reason: The reason why the node will be under maintenance.
@@ -90,38 +90,38 @@ $ kubectl get nm nodemaintenance-sample -o yaml
 apiVersion: nodemaintenance.medik8s.io/v1beta1
 kind: NodeMaintenance
 metadata:
-  creationTimestamp: "2022-02-24T14:37:25Z"
-  finalizers:
-  - foregroundDeleteNodeMaintenance
-  generation: 1
   name: nodemaintenance-sample
-  resourceVersion: "1267741"
-  uid: 83cece87-f05c-41e8-bc22-5e6e0114f4b7
 spec:
   nodeName: node02
   reason: Test node maintenance
 status:
   evictionPods: 5
+  lastError: 'Last failure message'
   pendingPods:
-  - router-default-7548cf6fb5-6c6ws
-  - alertmanager-main-1
-  - prometheus-adapter-7b5bf59787-ccf5w
-  - prometheus-k8s-1
-  - thanos-querier-6dffd47d65-h4d5c
+  - pod-A
+  - pod-B
+  - pod-C
+  - pod-D
+  - pod-E
   phase: Running
   totalpods: 19
 ```
 
-`phase` is the representation of the maintenance progress and can hold a string value of: Running|Succeeded.
-The phase is updated for each processing attempt on the CR.
+`evictionPods` is the total number of pods up for eviction from the start.
 
 `lastError` represents the latest error if any for the latest reconciliation.
 
 `pendingPods` PendingPods is a list of pending pods for eviction.
 
+`phase` is the representation of the maintenance progress and can hold a string value of: Running|Succeeded.
+The phase is updated for each processing attempt on the CR.
+
 `totalPods` is the total number of all pods on the node from the start.
 
-`evictionPods` is the total number of pods up for eviction from the start.
+## Debug
+### Collecting cluster data with must-gather
+
+Use NMO's must-gather from [here](https://github.com/medik8s/node-maintenance-operator/tree/master/must-gather) to collect related debug data.
 
 ## Tests
 