Feat: adds a status check for NCs #2716

abel2-code · 2024-04-29T19:01:00Z

Reason for Change:
This is part of my intern project to help improve error messages.

Issue Fixed:
N/A

Requirements:

uses conventional commit messages
includes documentation
adds unit tests
relevant PR labels added

Notes:

nddq · 2024-04-29T21:58:26Z

cns/restserver/restserver.go

@@ -135,6 +137,21 @@ type containerstatus struct {
 VfpUpdateComplete bool // True when VFP programming is completed for the NC
 }

+type multiContainerStatus map[string]containerstatus


why do we have to create a type for this?

+1. This does not need to be a type

This variable is actually not needed as you are simply duplicating 'service.state.ContainerStatus'

I somewhat disagree with the sentiment in this thread. This should be an error implementation if anything, but the problem as it currently stands is that GetUnsuccessfulStatusErrors can return "" making it a poor error (since in its typical usage err != nil, but it is, in fact, nil).

So the idea of having some function determine whether or not there were errors is a good one, but it should actually produce an error or nil--not a string.

It should read something like this:

err := anyFailed(service.state.ContainerStatus) if err != nil { // create the response but use `err.Error()` }

You could have an error type like this to better deal with this:

type MultiContainerError map[string]v1alpha1.NCStatus func (m *MultiContainerError) Error() string { out := bytes.NewBufferString("multiple NCs failed: ") for ncid, status := range m { fmt.Fprintf(out, "%s (%s) ") } return out.String() }

nddq · 2024-04-29T21:59:53Z

cns/restserver/restserver.go

@@ -135,6 +137,21 @@ type containerstatus struct {
 VfpUpdateComplete bool // True when VFP programming is completed for the NC
 }

+type multiContainerStatus map[string]containerstatus
+
+func (mcs *multiContainerStatus) GetUnsuccessfulStatusErrors() string {


instead of making this a struct method, can we just make it a func that accepts `map[string]containerstatus instead?

Suggested change

func (mcs *multiContainerStatus) GetUnsuccessfulStatusErrors() string {

func GetUnsuccessfulStatusErrors(mcs map[string]containerstatus) string {

(1/2) also not a fan of this function returning a string, maybe just return a list of NCs that doesn't have the NCUpdateSuccess status, and then we can do the processing in the caller?

The purpose of the method is the create the CNI Error message so it is ok to return the string here directly.

nddq · 2024-04-29T22:11:22Z

cns/restserver/ipam.go

@@ -116,6 +116,23 @@ func (service *HTTPRestService) RequestIPConfigHandler(w http.ResponseWriter, r
 return
 }

+ // Check the status of the NC
+ ncStatuses := multiContainerStatus(service.state.ContainerStatus)
+ if unsuccessfulStatuses := ncStatuses.GetUnsuccessfulStatusErrors(); unsuccessfulStatuses != "" {


(2/2) and then we can just check for the length of the list returned.

nairashu · 2024-04-29T23:48:23Z

cns/restserver/ipam.go

+ reserveResp := &cns.IPConfigResponse{
+ Response: cns.Response{
+ ReturnCode: types.FailedToAllocateIPConfig,
+ Message: unsuccessfulStatuses,


All you are trying to do is get the NC Status and create an error message from it. So change the name of the method to 'GetNcStatusErrorMessage'

nairashu · 2024-04-29T23:55:00Z

cns/restserver/restserver.go

+func (mcs *multiContainerStatus) GetUnsuccessfulStatusErrors() string {
+ var unsuccessfulStatuses []string
+ for ncID := range *mcs {
+ ncStatus := (*mcs)[ncID].CreateNetworkContainerRequest.NCStatus
+ if ncStatus != cns.NCUpdateSuccess {
+ unsuccessfulStatus := fmt.Sprintf("Expected status for NC %s to be %s but got %s", ncID, string(cns.NCUpdateSuccess), string(ncStatus))
+ unsuccessfulStatuses = append(unsuccessfulStatuses, unsuccessfulStatus)
+ }
+ }
+
+ return strings.Join(unsuccessfulStatuses, "\n")
+}


Suggested change

func (mcs *multiContainerStatus) GetUnsuccessfulStatusErrors() string {

var unsuccessfulStatuses []string

for ncID := range *mcs {

ncStatus := (*mcs)[ncID].CreateNetworkContainerRequest.NCStatus

if ncStatus != cns.NCUpdateSuccess {

unsuccessfulStatus := fmt.Sprintf("Expected status for NC %s to be %s but got %s", ncID, string(cns.NCUpdateSuccess), string(ncStatus))

unsuccessfulStatuses = append(unsuccessfulStatuses, unsuccessfulStatus)

}

}

return strings.Join(unsuccessfulStatuses, "\n")

}

func GetNcStatusErrorMessages(mcs map[string]containerstatus) string {

var unsuccessfulStatuses []string

for ncID, containerStatus := range mcs {

ncStatus := containerStatus.CreateNetworkContainerRequest.NCStatus

if ncStatus != cns.NCUpdateSuccess {

unsuccessfulStatuses = append(unsuccessfulStatuses, fmt.Sprintf("Expected status for NC %s to be %s but got %s. ", ncID, string(cns.NCUpdateSuccess), string(ncStatus)))

}

}

return strings.Join(unsuccessfulStatuses, "\n")

}

nairashu · 2024-04-30T00:07:36Z

cns/restserver/ipam.go

+ // Check the status of the NC
+ ncStatuses := multiContainerStatus(service.state.ContainerStatus)
+ if unsuccessfulStatuses := ncStatuses.GetUnsuccessfulStatusErrors(); unsuccessfulStatuses != "" {
+ // If the status is anything other than success, we send a response back with the actual status and NC ID.
+ reserveResp := &cns.IPConfigResponse{
+ Response: cns.Response{
+ ReturnCode: types.FailedToAllocateIPConfig,
+ Message: unsuccessfulStatuses,
+ },
+ }
+
+ w.Header().Set(cnsReturnCode, reserveResp.Response.ReturnCode.String())
+ err = service.Listener.Encode(w, &reserveResp)
+ logger.ResponseEx(service.Name+operationName, ipconfigRequest, reserveResp, reserveResp.Response.ReturnCode, err)
+ return
+ }


By checking the NCStatus early you are blocking the call to the service. I don't think it is the right behavior here. This will block IP allocations from the IPs that are available on the node itself and are getting reclaimed while the NC status is SubnetFull or something else. We should not be determining the response state from the state of the NC. Basically it should be more of a helper method to add to the response message on line 153.

timraymond · 2024-05-02T14:35:29Z

cns/restserver/restserver.go

+ for ncID := range *mcs {
+ ncStatus := (*mcs)[ncID].CreateNetworkContainerRequest.NCStatus
+ if ncStatus != cns.NCUpdateSuccess {
+ unsuccessfulStatus := fmt.Sprintf("Expected status for NC %s to be %s but got %s", ncID, string(cns.NCUpdateSuccess), string(ncStatus))


This is good output for a test, but not for an error. Generally, it should be concise and lower-case. As in my other comment, I think something like this would be more typical:

multiple NCs failed: uuid1 (status1), uuid2 (status2), uuid3 (status3)

It's implied that the expectation was success, so logging that just consumes log disk space. Though that is perfectly fine (and desirable!) to mention in go test output.

timraymond · 2024-05-02T14:36:25Z

cns/restserver/restserver.go

+ }
+ }
+
+ return strings.Join(unsuccessfulStatuses, "\n")


Also, it's unusual to join by newlines for errors since the log lines themselves are newline separated

github-actions · 2024-05-17T00:01:14Z

This pull request is stale because it has been open for 2 weeks with no activity. Remove stale label or comment or this will be closed in 7 days

github-actions · 2024-05-25T00:01:16Z

Pull request closed due to inactivity.

Feat: adds a status check for NCs

c5c986b

abel2-code requested a review from a team as a code owner April 29, 2024 19:01

abel2-code requested a review from rbtr April 29, 2024 19:01

nddq reviewed Apr 29, 2024

View reviewed changes

nairashu reviewed Apr 29, 2024

View reviewed changes

nairashu reviewed Apr 30, 2024

View reviewed changes

timraymond reviewed May 2, 2024

View reviewed changes

github-actions bot added the stale Stale due to inactivity. label May 17, 2024

github-actions bot closed this May 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: adds a status check for NCs #2716

Feat: adds a status check for NCs #2716

abel2-code commented Apr 29, 2024

nddq Apr 29, 2024

nairashu Apr 29, 2024

nairashu Apr 29, 2024

timraymond May 2, 2024

nddq Apr 29, 2024

nddq Apr 29, 2024

nairashu Apr 29, 2024

nddq Apr 29, 2024

nairashu Apr 29, 2024

nairashu Apr 29, 2024

nairashu Apr 30, 2024

timraymond May 2, 2024

timraymond May 2, 2024

github-actions bot commented May 17, 2024

github-actions bot commented May 25, 2024

	func (mcs *multiContainerStatus) GetUnsuccessfulStatusErrors() string {
	func GetUnsuccessfulStatusErrors(mcs map[string]containerstatus) string {

Feat: adds a status check for NCs #2716

Feat: adds a status check for NCs #2716

Conversation

abel2-code commented Apr 29, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

github-actions bot commented May 17, 2024

github-actions bot commented May 25, 2024