-
Notifications
You must be signed in to change notification settings - Fork 123
RSDK-9284 - automate CLI flag parsing #4581
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RSDK-9284 - automate CLI flag parsing #4581
Conversation
camelFormattedName = matchAllCap.ReplaceAllString(camelFormattedName, "${1}-${2}") | ||
camelFormattedName = strings.ToLower(camelFormattedName) | ||
|
||
return ctx.Value(camelFormattedName) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It seems like some guardrails might be nice here, but I'm not sure that we can say at compile time what's safe and what's not. Even if an argument is non-optional, I expect there are cases where a nil value is normal and expected.
cli/app.go
Outdated
type foo struct { | ||
FooFoo string | ||
Bar int | ||
} | ||
|
||
func doFoo(foo foo, ctx *cli.Context) error { | ||
fmt.Printf("FooFoo is %s and Bar is %v.", foo.FooFoo, foo.Bar) | ||
return nil | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This will get deleted in the final project obviously, but I wanted to show an example of what new development would look like. Also, it highlights how we successfully set a field FooFoo
with a flag of foo-foo
fyi @dgottlieb since we discussed this idea a bit during the scope doc process. |
This seems like a good improvement -- I like the flags object (hopefully people will catch name-mismatches manually). However, If you're interested in other options, I've enjoyed this pattern for some of my personal projects. It takes a bit of naming-awareness, but it is fairly simple. This uses func main() {
cmd := &cli.Command{
Name: "sudoku",
Commands: []*cli.Command{
createStatsCli(),
createDataCli(),
createGraphCli(),
createEvaluateCli(),
createExperimentCli(),
lambda.CreateLambdaCli(),
},
}
if err := cmd.Run(context.Background(), os.Args); err != nil {
log.Fatalln("error:", err)
}
} example func createStatsCli() *cli.Command {
graphPath := ""
printUnfinishedSteps := false
action := func(_ context.Context, _ *cli.Command) error {
// SOME VERY SHORT FUNCTION
// (that uses the vars in the parent scope)
graph, err := LoadGraphFromFile(graphPath)
if err != nil {
return err
}
var pStats []ProblemStats
for _, p := range graph.Problems {
pStats = append(pStats, p.GenStats(graph))
}
PrintStatsInfo(pStats, printUnfinishedSteps)
return nil
}
return &cli.Command{
Name: "stats",
Arguments: []cli.Argument{
&cli.StringArg{
Name: "graph",
Destination: &graphPath,
},
},
Flags: []cli.Flag{
&cli.BoolFlag{
Name: "print-unfinished-steps",
Destination: &printUnfinishedSteps,
},
},
Action: action,
// can support nested Commands via
// Commands: []*cli.Command{moreCreateInvocations()}
}
} This has the benefit of not using runtime reflection -- though it does scatter a lot of the cli tree out into different functions instead of one giant object. I like it for personal projects because the No pressure to use this though! Just wanted to throw something out there to consider while you restructure the flag parsing. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for thinking about this + putting a PoC together @stuqdog! I really like how this method does not massively change how we're adding new pieces to the highest level cli.App
struct. I do have three issues with it:
- You are storing values on contexts
a. I can totally get over this one, but it is generally an anti-pattern in Go in the sense that it should be a "last-resort" strategy (this might be a "last-resort" case) - You are using runtime reflection
a. Your reflection code (getValFromContext
) will run for each command at each invocation. Putting reflection in "hot paths" is usually a bad idea because reflection is relatively slow - Your fuzzy searching allows multiple ways of specifying a flag
a. Most CLIs I have interacted with only allow one or two ways of specifying a flag (usually-c
and-count
or something similar) and are pretty stringent about using hyphens between words instead of underscores. I think users will be surprised by the flexibility here especially if it is undocumented
I like the idea of @zaporter-work's solution, but it looks like we are expecting developers to use the Destination
pattern (which is a good one!) in their create*CLI
functions. If they forget to do that (which I think they will, honestly,) then all we've done is, as Zack mentioned, obfuscate the definition of the top-level cli.App
.
I realize I am only criticizing and not offering other solutions 😮💨 ; let me take a stab at something locally and get back to you guys. If we really want to get Viam developers to add ActionFunc
s in a more consistent manner, I think we'll have to design a new API for adding a Command
, and a custom struct that wraps a *cli.App
.
Hey! I think there's some slight misunderstanding specifically around points 1 and 3, messaged you offline to chat about it :) |
Yeah I like @zaporter-work's suggestion a lot but I do worry it expects too much of developers in terms of remembering the right way to do things. @benjirewis and I are speaking this afternoon and will come up with some thoughts on how to proceed from there! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@stuqdog and I spoke offline, and he assuaged all three of my previous concerns. I cannot come up with some clever way to do this without any runtime reflection, but we both agree that the performance hit of using reflection here is fine given the usage of this code in CLI (user is almost always entering one command and waiting for a response.) We did want to create a ticket for a subsequent PR that wraps the cli.App
in a custom ViamCLIApp
struct that introduces a bit more type safety around adding new commands.
I do have a couple nits and one question about your handling of types beyond string
/optional types.
cli/app.go
Outdated
@@ -125,6 +127,11 @@ const ( | |||
cpFlagPreserve = "preserve" | |||
) | |||
|
|||
var ( | |||
matchFirstCap = regexp.MustCompile("(.)([A-Z][a-z]+)") | |||
matchAllCap = regexp.MustCompile("([a-z0-9])([A-Z])") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
[nit] Could you add comments above each of these regexes giving an example of a pattern they would match? I find that to be extremely helpful when reading through code and seeing (well-named but otherwise) random regular expressions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ooh yeah, good call. Will add comments!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Upon looking I have concluded that matchFirstCap
is a bit extraneous. That's what I get for copying someone else's regex here! Removed it, added an explanatory comment for matchAllCap
.
cli/app.go
Outdated
@@ -224,6 +231,49 @@ var dataTagByFilterFlags = append([]cli.Flag{ | |||
}, | |||
commonFilterFlags...) | |||
|
|||
func getValFromContext(name string, ctx *cli.Context) interface{} { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
func getValFromContext(name string, ctx *cli.Context) interface{} { | |
func getValFromContext(name string, ctx *cli.Context) any { |
[nit] Prefer any
to interface{}
post Golang 1.18 (here and elsewhere.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🙏 thanks for the tip, will replace interface{}
with any
!
cli/app.go
Outdated
|
||
type foo struct { | ||
FooFoo string | ||
Bar int |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm surprised you can have int
fields here. I thought the value stored on the cli.Context
would be a string
. How do you not get a runtime error on L260 above when you try to set the Bar int
value to a string
? Similarly, if a user does not specify -bar
flag, would we not try to set the Bar int
value to nil
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good question! So the value stored on the cli.Context
is of any type, we can see this being used in the existing code here e.g., where we're getting a uint out of the context. What this is doing is grabbing out the requested value as an interface
, and then using reflection to determine the expected type and plug it into the struct.
If we don't pass a flag then we get nil, which is parsed as the zero value. So, Bar
becomes 0
and foo
becomes ""
. Which, now that I think about it, is probably not ideal. Probably we should have the struct setup such that optional values are pointers, and we do a if val != nil
check when setting them in createCommandWithT
.
For non-optional types, the existing process of marking a flag as Required
will still be sufficient to enforce its use.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Gotcha! Thanks for the explanation; makes sense.
Probably we should have the struct setup such that optional values are pointers, and we do a if val != nil check when setting them in createCommandWithT.
I wonder if Bar *int
, at least, is an important field to use a pointer with to distinguish between the user passing 0
and passing nothing.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pretty much LGTM! Curious whether you'll make the *Flags
structs have pointer types or just string
, int
, etc.
cli/app.go
Outdated
|
||
type foo struct { | ||
FooFoo string | ||
Bar int |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Gotcha! Thanks for the explanation; makes sense.
Probably we should have the struct setup such that optional values are pointers, and we do a if val != nil check when setting them in createCommandWithT.
I wonder if Bar *int
, at least, is an important field to use a pointer with to distinguish between the user passing 0
and passing nothing.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good! My only question is that I know we're looking at alternative libraries for CLI tooling that offers more features/flexibility, but that is faaaaar outside the scope of this current PR.
Can this pattern be migrated over to the new library should we ever move to that ourselves?
I suspect so, yes! Probably some massaging would be necessary depending on the library but I don't think this pattern is terribly complicated or unique to this library. I do also think that we have ways of achieving what we want within the confines of |
e6a4e94
to
b709165
Compare
&cli.StringSliceFlag{ | ||
Name: dataFlagBboxLabels, | ||
Usage: "bbox labels filter. " + | ||
"accepts string labels corresponding to bounding boxes within images", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(flyby) this flag was missing here but is accepted elsewhere where we want to construct a filter
. Adding it here makes it easier to have shared arg
groups for various Data
methods, avoiding the need for repetitive struct definitions. Spoke with @dmhilly who confirmed this was acceptable to add.
}, | ||
Action: DataAddToDatasetByFilter, | ||
commonFilterFlags...), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(flyby) we were using all the commonFilterFlags
here but rearticulating them with identical descriptions. Removed, moved to appending the commonFilterFlags
.
&cli.StringFlag{ | ||
Name: mlTrainingFlagURL, | ||
Usage: "url of Github repository associated with the training scripts", | ||
Required: false, | ||
}, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(flyby) added this flag, which was being searched for in the withUpload
method but wasn't settable.
@etai-shuchatowitz @tahiyasalam I copied the flag details from other commands, if you have any issues with the shape of the flag here, please let me know!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
great, thank you!
cli/app.go
Outdated
&cli.StringFlag{ | ||
Name: generalFlagOrgID, | ||
Usage: "org ID to train and save ML model in", | ||
Required: true, | ||
}, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(flyby) added this flag, which was being searched for in the withUpload
method but wasn't settable.
@etai-shuchatowitz @tahiyasalam I copied the flag details from other commands, if you have any issues with the shape of the flag here (usage text, etc.), please let me know!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah looking through this now, I am realizing there's actually a bug in our existing withUpload
method.
The trainFlagModelOrgID
here and here should actually in fact be the generalFlagOrgID
, where the generalFlagOrgID
is the "org ID to save the custom training script in". I hate to sneak in too many changes here, so I'm also happy with removing this from this PR & taking on as a follow up for my team or merging in that fix first for you to pull down into your branch. Let me know!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hey, I think that's not a problem to do here! I just made the change, let me know if it still looks off :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me! :)
415fb64
to
ed6c00c
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Didn't review line-by-line, but the general concept seems good to me. Thanks for this big overhaul @stuqdog !
// TODO(RSDK-9447) - We don't support pointers in this. The problem is that when getting a value | ||
// from a context for a supported flag, the context will default to populating with the zero value. | ||
// When getting a value from the context, though, we currently have no way of know if that's going | ||
// to a concrete value, going to a pointer and should be a nil value, or going to a pointer but should | ||
// be a pointer to that default value. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sounds good; I'm a little confused about the explanation here but totally fine revisiting this as part of a separate ticket.
Action: OrganizationDisableBillingServiceAction, | ||
Action: createCommandWithT[organizationDisableBillingServiceArgs](OrganizationDisableBillingServiceAction), | ||
}, | ||
{ | ||
Name: "update", | ||
Usage: "update the billing service update for an organization", | ||
Flags: []cli.Flag{ | ||
&cli.StringFlag{ | ||
Name: generalFlagOrgID, | ||
Required: true, | ||
Usage: "the org to update the billing service for", | ||
}, | ||
&cli.StringFlag{ | ||
Name: organizationBillingAddress, | ||
Required: true, | ||
Usage: "the stringified address that follows the pattern: line1, line2 (optional), city, state, zipcode", | ||
}, | ||
}, | ||
Action: UpdateBillingServiceAction, | ||
Action: createCommandWithT[updateBillingServiceArgs](UpdateBillingServiceAction), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fyi adding flag back as per offline convo @RoxyFarhad
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ML training changes look good to me. I commented on the things that I verified, but let me know if you need me to look at anything else.
&cli.StringFlag{ | ||
Name: mlTrainingFlagPath, | ||
Usage: "path to ML training scripts for upload", | ||
Required: true, | ||
}, | ||
&cli.StringFlag{ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Makes sense to me, thank you!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🙏 thanks!
Note: everything below the line was written when this PR was a POC and may be out of date.
Adds typed argument structs to all CLI commands, creates a simple interface for developers to do the same for new commands.
Testing: confirmed locally that all existing flag types (int, uint, string, string slice, bool, duration, path) can be parsed and set correctly. Confirmed values in several commands.
Note to reviewers: this is currently a POC and definitely not ready for prime time. Before I go ahead and make changes to all existing methods, I wanted to get buy-in from folks on this as an approach. Once we have agreement on the shape of this implementation, I'll do the (verbose, but mechanical) work of changing existing methods which should hopefully be trivial to review despite having an estimated large loc diff.
The nice thing about this approach is it provides safe, easy, typeful access to flag data while requiring minimal change in how developers create actions (they have to define a struct with their flag fields and the
Action
field is now populated slightly differently), but there should be an easily replicable pattern in the code base such that this is not harmful.A notable shortcoming of this approach is that the names of the fields in the struct must match (or fuzzy match, taking account for differences in snake/camel/kebab case) the names of the flag. Since a developer is defining the struct, we don't currently have a way to enforce that things are correct. I do think it would be possible using reflection to have some sort of assert that the flags and the fields of the struct fuzzy match, but I fear this will require some decent refactoring and is more of a "programmatic enforcement" issue rather than a "automate flag parsing" issue and so should be done in a future PR.
One other (minor) restriction: the fields on the struct must be public. The reflection library can't access them if they're private, leading to a runtime error.
One other thing to note: I don't think we can get away with not requiring a
CLI.context
in thestruct
ful methods that we're asking users to now define, as certain existing methods (DownloadModuleAction
, e.g.) use fields on thectx
within the method. This means that if a user wants to access flags the old way, we can't stop them. Again, this might be resolvable but probably should go in a separate ticket.