Add `--git-metadata` flag to `buf push` #2953

doriable · 2024-05-07T02:32:00Z

This PR adds the --git-metadata flag to buf push:

this flag is a "meta-flag" that uses Git source control state to help set --label flag(s), --source-control-url, and --create-default-label to the HEAD branch of the default Git remote
this flag is only compatible with checkouts of Git source repositories
this flag does not allow you to set --source-control-url, --create-default-label, --label, --tag, --branch, or --draft labels alongside

This PR also changes the default visibility for --create-visibility to "private". This means that users are no longer required to specify the --create-visibility flag when calling buf push --create -- it will default to creating a private repository if one does not already exist.

Also, this PR includes a changelog entry for all the changes above, as well as the --label, --create-default-label, and --source-control-url flags.

private/buf/cmd/buf/command/push/push.go

pkwarren

Overall looks good. I think we could benefit from some tests on the git logic (we could easily run git init in a temp dir and test various scenarios).

CHANGELOG.md

private/buf/bufcli/flags_args.go

private/buf/cmd/buf/command/push/push.go

private/pkg/git/git.go

private/buf/cmd/buf/command/push/push.go

CHANGELOG.md

…metadata

private/buf/cmd/buf/command/push/push.go

…metadata

bufdev

All of the generic logic needs to be put into the appropriate pkg, bufpkg, or bufpackages, in this case mostlygit`

private/buf/cmd/buf/command/push/push.go

bufdev · 2024-05-13T19:51:40Z

private/buf/cmd/buf/command/push/push.go

+ // with a git checkout.
+ return nil, fmt.Errorf("unable to check input %q, please ensure this is a Git repository checkout: %w", input, err)
+ }
+ if len(uncommittedFiles) > 0 {


Why is this a blocker?

This has always been a part of the product spec -- we do not allow users to push unchecked/uncommitted references. This makes sense, since we are tagging this with Git commit information through the VCS -- if there are uncommitted changes being pushed, that commit information would not make sense.

private/buf/cmd/buf/command/push/push.go

…metadata

doriable · 2024-05-14T19:02:25Z

Resolved old conversations around URL parsing and erroring behaviours (based on today's conversation), refactored git commands for metadata to pkg/git, and added some tests around source control url parsing.

CHANGELOG.md

private/pkg/git/url.go

private/pkg/git/ur_test.go

private/pkg/git/url.go

private/buf/cmd/buf/command/push/push.go

saquibmian

Looks good, just a couple stylistic things.

private/pkg/git/git.go

private/pkg/git/remote.go

Co-authored-by: Saquib Mian <[email protected]>

private/buf/bufcli/flags_args.go

private/buf/cmd/buf/command/push/push.go

pkwarren · 2024-05-15T16:34:53Z

private/buf/cmd/buf/command/push/push.go

+// If the remote hostname contains github (e.g. github.mycompany.com or github.com) or gitlab
+// (e.g. gitlab.mycompany.com or gitlab.com) then it uses the route /commit for the git
+// commit sha.
+func getGitMetadataSourceControlURLUploadOption(


I think it would be good to move the building of source control URLs to the git package in order to keep the logic in here purely around upload. Perhaps a SourceControlURL(commitSHA string) on the git.Remote interface?

Hmm, yeah, I went back and forth a little bit on where specifically this logic should live, since it's kind of "business-y" (rather than "generically git"), but also, the RemoteKind type already breaks that a little bit. So given that, it might make sense to put there.

It is a little strange to have this function take commitSHA, but I think that is reasonable. I'll make it clear that it just accepts the commitSHA as a string but doesn't do any validation against that string.

Actually, in this case, we wouldn't need to expose RemoteKind at all. I think I'm going to unexport the enum and keep it since it is useful for keeping the parsing logic clean (and independent from the initial parsing of the URL).

Co-authored-by: Philip K. Warren <[email protected]>

private/pkg/git/remote.go

bufdev · 2024-05-15T20:27:00Z

private/buf/cmd/buf/command/push/push.go

+ return nil, err
+ }
+ runner := command.NewRunner()
+ uncommittedFiles, err := git.CheckForUncommittedGitChanges(ctx, runner, input)


To what I can see, there is no verification here that input is in fact a directory, however it is expected by git.CheckForUncommitedGitChanges that input is a path to a local directory. This validation needs to be done, and should be obvious in the context of this function.

Added validation here.
It is worth noting that git.CheckForUncommittedGitChanges would return an error that indicates if the provided input is not a valid directory and was captured in the error handling below, but it's safer to more explicit with the validation here.

private/buf/cmd/buf/command/push/push.go

private/pkg/git/remote.go

private/pkg/git/metadata.go

private/buf/cmd/buf/command/push/push.go

bufdev · 2024-05-16T00:33:21Z

private/buf/cmd/buf/command/push/push.go

+ }
+ sourceControlURL := originRemote.SourceControlURL(currentGitCommit)
+ if sourceControlURL == "" {
+ return nil, appcmd.NewInvalidArgumentError("unable to determine source control URL for this repository; only GitHub/GitLab/BitBucket are supported")


Why does it matter? This is a severe limitation, and excludes any github enterprise customer. We should be able to parse a source control url for any http/https/ssh endpoint

There are two potential questions here so will answer both.

Why is it an error if we can't infer a source control url?
If the user provided --git-metadata then they are expecting us to automatically set some values. If we can't set the values they are expecting, we should error. This means during setup, they will catch the problem early, and then later if, due to a code host migration, it breaks, they will know instantly.

Why are we limited to GitHub/Bitbucket/GitLab?
The specification for Remote.SourceControlURL() (which is in the interface documentation in this PR) is designed to work for enterprise customers as long as the name of their code host is in the remote url. If it isn't, we don't know what code host we are dealing with and therefore we don't know how to correctly construct the user facing url because the routes are different depending on the code host.

On GitHub and GitLab, routes look like https://<host><repo-path>/commit/<commit-sha>
and on Bitbucket, routes look like https://<host><repo-path>/commits/<commit-sha>
(note /commit/ vs /commits).

The code quite literally checks for if the hostname includes github, gitlab, bitbucket, and just with a strings.Contains on the hostname. This both requires hostnames to include github, for example (this isn't a hard requirement), and it would also accept things like foogithubbar, which doesn't feel like what this filter wants to do in the first place.

This limitation isn't acceptable for our enterprise customers. Additionally, my impression was that "source control URL" wasn't mean to link to a specific point-in-time on the repository, it was just meant to link to the repository itself.

We can discuss deferring this until after release, but I consider this a bug - limiting acceptable hostnames to strings.Contains on one of github, gitlab, bitbucket isn't acceptable

private/pkg/git/git.go

private/pkg/git/remote.go

nicksnyder · 2024-05-16T01:23:15Z

private/buf/cmd/buf/command/push/push.go

+ }
+ sourceControlURL := originRemote.SourceControlURL(currentGitCommit)
+ if sourceControlURL == "" {
+ return nil, appcmd.NewInvalidArgumentError("unable to determine source control URL for this repository; only GitHub/GitLab/BitBucket are supported")


There are two potential questions here so will answer both.

Why is it an error if we can't infer a source control url?
If the user provided --git-metadata then they are expecting us to automatically set some values. If we can't set the values they are expecting, we should error. This means during setup, they will catch the problem early, and then later if, due to a code host migration, it breaks, they will know instantly.

Why are we limited to GitHub/Bitbucket/GitLab?
The specification for Remote.SourceControlURL() (which is in the interface documentation in this PR) is designed to work for enterprise customers as long as the name of their code host is in the remote url. If it isn't, we don't know what code host we are dealing with and therefore we don't know how to correctly construct the user facing url because the routes are different depending on the code host.

On GitHub and GitLab, routes look like https://<host><repo-path>/commit/<commit-sha>
and on Bitbucket, routes look like https://<host><repo-path>/commits/<commit-sha>
(note /commit/ vs /commits).

doriable added 2 commits May 6, 2024 22:14

Add --git-metadata flag to buf push

78ba531

Add changelog entries

135da98

doriable requested review from saquibmian, bufdev, unmultimedio and pkwarren May 7, 2024 02:32

doriable commented May 7, 2024

View reviewed changes

private/buf/cmd/buf/command/push/push.go Show resolved Hide resolved

pkwarren reviewed May 7, 2024

View reviewed changes

saquibmian reviewed May 7, 2024

View reviewed changes

unmultimedio reviewed May 7, 2024

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

doriable added 7 commits May 7, 2024 16:59

Address some comments

0fa254f

Merge remote-tracking branch 'origin/dev' into BSR-3747-BSR-3751-git-…

e366003

…metadata

Implement check for uncommitted changes

acc958e

Print warning for more than one Git remote

dca5a9f

Handle various Git URLs

928afa2

Check for empty URL

f7a22c9

Merge remote-tracking branch 'origin/dev' into BSR-3747-BSR-3751-git-…

a0ee045

…metadata

saquibmian reviewed May 13, 2024

View reviewed changes

Merge remote-tracking branch 'origin/dev' into BSR-3747-BSR-3751-git-…

d37e336

…metadata

bufdev reviewed May 13, 2024

View reviewed changes

doriable added 6 commits May 13, 2024 16:05

Separate stdout/stderr buffers

7b10d67

Refactor git metadata commands to pkg/git

518f8cb

Merge remote-tracking branch 'origin/dev' into BSR-3747-BSR-3751-git-…

a960cdb

…metadata

Add source control url parsing tests

314f2c2

make generate

8062dc3

Merge remote-tracking branch 'origin/dev' into BSR-3747-BSR-3751-git-…

e5685e2

…metadata

saquibmian reviewed May 14, 2024

View reviewed changes

doriable added 2 commits May 14, 2024 16:30

Address smaller comments

c8ce56d

Refactor the git package and add scp-like url parsing

daeaaca

saquibmian reviewed May 15, 2024

View reviewed changes

private/pkg/git/git.go Outdated Show resolved Hide resolved

private/pkg/git/remote.go Outdated Show resolved Hide resolved

private/pkg/git/remote.go Show resolved Hide resolved

private/pkg/git/remote.go Outdated Show resolved Hide resolved

doriable and others added 2 commits May 15, 2024 12:28

Apply suggestions from code review

cb9e36f

Co-authored-by: Saquib Mian <[email protected]>

Address comments

67d51ac

pkwarren reviewed May 15, 2024

View reviewed changes

doriable and others added 3 commits May 15, 2024 12:51

Apply suggestions from code review

c819dae

Co-authored-by: Philip K. Warren <[email protected]>

Update private/buf/bufcli/flags_args.go

5a43017

Co-authored-by: Philip K. Warren <[email protected]>

Address comments

8cc3b88

pkwarren approved these changes May 15, 2024

View reviewed changes

private/pkg/git/remote.go Outdated Show resolved Hide resolved

Address comments

fa2df32

pkwarren approved these changes May 15, 2024

View reviewed changes

bufdev added 3 commits May 15, 2024 16:19

Merge branch 'dev' into BSR-3747-BSR-3751-git-metadata

2623430

Merge branch 'dev' into BSR-3747-BSR-3751-git-metadata

483d1ea

Update CHANGELOG.md

7e67b26

bufdev reviewed May 15, 2024

View reviewed changes

bufdev and others added 4 commits May 15, 2024 16:39

merge

66b4a1f

Address most comments

92f66f9

Add input validation

38d44d2

Improve input validation

37a1e88

bufdev reviewed May 16, 2024

View reviewed changes

nicksnyder reviewed May 16, 2024

View reviewed changes

doriable added 4 commits May 15, 2024 22:15

Add dirRef validation and git rev-parse check

ca6075d

Address other comments

8316881

Clean-up

9aff322

Clean up docstring

67b924f

bufdev approved these changes May 16, 2024

View reviewed changes

bufdev merged commit 6004e28 into dev May 16, 2024
9 checks passed

bufdev deleted the BSR-3747-BSR-3751-git-metadata branch May 16, 2024 16:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `--git-metadata` flag to `buf push` #2953

Add `--git-metadata` flag to `buf push` #2953

doriable commented May 7, 2024

pkwarren left a comment

bufdev left a comment

bufdev May 13, 2024

doriable May 13, 2024

doriable commented May 14, 2024

saquibmian left a comment

pkwarren May 15, 2024

doriable May 15, 2024

doriable May 15, 2024

bufdev May 15, 2024 •

edited

doriable May 15, 2024

bufdev May 16, 2024

nicksnyder May 16, 2024 •

edited

bufdev May 16, 2024

nicksnyder May 16, 2024 •

edited

Add --git-metadata flag to buf push #2953

Add --git-metadata flag to buf push #2953

Conversation

doriable commented May 7, 2024

pkwarren left a comment

Choose a reason for hiding this comment

bufdev left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

doriable commented May 14, 2024

saquibmian left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bufdev May 15, 2024 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nicksnyder May 16, 2024 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nicksnyder May 16, 2024 • edited

Choose a reason for hiding this comment

Add `--git-metadata` flag to `buf push` #2953

Add `--git-metadata` flag to `buf push` #2953

bufdev May 15, 2024 •

edited

nicksnyder May 16, 2024 •

edited

nicksnyder May 16, 2024 •

edited