
some bug patches + torch no_grad forward pass #157

Open · wants to merge 3 commits into master

Conversation

healeyq3

Continuing the work that @PTNobel and I started in the diffcp repository to address cvxpy/cvxpy#2485.

Specifically, this PR uses the new diffcp solve_only_batch call chain in the torch CvxpyLayer when gradients are not needed for reverse-mode autodiff. This functionality can be accessed by:

  1. Setting all parameter tensors which will be passed into the CvxpyLayer to not require a gradient.
  2. Making the layer(param1, param2, ...) call inside a with torch.no_grad() block.

For examples, see the last two tests in torch/test_cvxpylayer.py.
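
For readers who just want the shape of the pattern, here is a minimal sketch (the problem, sizes, and variable names are illustrative, not taken from the tests):

```python
import cvxpy as cp
import torch
from cvxpylayers.torch import CvxpyLayer

# A tiny parametrized problem to pass through a CvxpyLayer.
n = 5
x = cp.Variable(n)
b = cp.Parameter(n)
problem = cp.Problem(cp.Minimize(cp.sum_squares(x - b)))
layer = CvxpyLayer(problem, parameters=[b], variables=[x])

# 1. The parameter tensor does not require a gradient.
b_th = torch.randn(n, requires_grad=False)

# 2. The layer call happens inside torch.no_grad(), so the
#    gradient-free solve path described above can be taken.
with torch.no_grad():
    (x_star,) = layer(b_th)
```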

Additionally, I patched two errors in the test file. Note that the tests

  1. test_basic_gp
  2. test_lml
  3. test_simple_batch_socp

failed prior to the CvxpyLayer additions that I made. The GP failure appears to be due to a difference in the solutions obtained by cvxpylayers and pure cvxpy. The latter two failures are due to small(ish) Jacobian mismatches.

The next steps (I think) to complete issue cvxpy/cvxpy#2485 are:

  1. Release the new version of diffcp with the solve_only functionality (I used my local copy to make these cvxpylayer changes.)
  2. See if this update provides any meaningful computational enhancements.
  3. Implement this no_grad functionality for JAX and TensorFlow layers.

However, please let me know if there are more suggestions and/or edits I need to make to this PR. Thanks!

```python
        As, bs, cs, cone_dicts, **solver_args)
    )
else:
    xs, _, _ = diffcp.cone_program.solve_only_batch(
```
Member

Can you open a PR on diffcp to re-export this function from __init__.py so we don't need to access it from the cone_program namespace?
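
For reference, a re-export like this is usually a one-line change in diffcp's __init__.py; a sketch (the actual contents of that file are not shown in this thread):

```python
# diffcp/__init__.py (sketch): expose the function at the package top level
from diffcp.cone_program import solve_only_batch
```

Callers could then write diffcp.solve_only_batch(...) directly.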

Author

Yup - will do now!

Author

Just opened the PR

Member

Suggested change:

```diff
- xs, _, _ = diffcp.cone_program.solve_only_batch(
+ xs, _, _ = diffcp.solve_only_batch(
```

Let's use the soon-to-be-exported name.

@PTNobel (Member) commented Jul 29, 2024

Alright; once I have time to fix the diffcp release pipeline and we pull that off, I'll merge this and do the CVXPYlayers release.

@michaelamir2151

Just wanted to say thanks, this PR is very helpful. I am interested in CVXPYLayers for batch computation, but computing the gradients heavily slows things down.

Author

Only the first half of the notebook (Background) + the Huber reformulation appendix is ready for review.

The implementation half is almost ready: I just need to

  1. Update the code to match the learning section explanation.
  2. Update some notation in the function docstrings.
  3. (Most importantly) Fix the differentiation error for the robust smoother. I fixed it in a JAX implementation by putting the problem in epigraph form, but that doesn’t seem to be working here. I also made those changes in this PyTorch implementation a bit late into the day, so I might’ve made a small typo.
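
For context on item 3, the epigraph trick replaces a problematic objective with an auxiliary scalar variable; a generic cvxpy sketch (illustrative only, not the notebook's exact robust-smoothing model):

```python
import cvxpy as cp

# minimize f(x)  becomes  minimize t  subject to  f(x) <= t.
x = cp.Variable(5)
t = cp.Variable()
f = cp.sum(cp.huber(x, M=1.0))  # a Huber-type robust penalty, for illustration
problem = cp.Problem(cp.Minimize(t), [f <= t])
```

The two forms share the same optimal x, but the epigraph version can be friendlier to differentiate through.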

With respect to the write-up, I completely understand if we need to cut a good chunk of it. I wrote a lot of it for my own understanding, and I was also enjoying the process.

In the write-up, the one bit I feel less sure about from a mathematical standpoint is the auto-tuning (learning) problem section, specifically the part about extended-value extensions. I would definitely appreciate a sanity check there.

Thank you for looking through this example! Well, the first half that is. I’m planning on returning to the code this weekend. Targeting total example completion by end of the coming week.

Oh, I also included the html file in this commit just to make it easier to review the write-up. Obviously we'll discard it before merge.

Author

Almost finished with the code fixes/updates for this learned_state_estimation.ipynb example. Aiming to make another push (ideally) by EOD 08/13, though it may slip to EOD 08/14.

Member

Sounds great! I've started reading this PR...

@PTNobel (Member) left a comment

Reviewed the code changes. Looks good! Thanks for the cleanups of old code too...

Looking forward to reviewing the notebook soon.


```python
        A_th.t() @ A_th +
        torch.eye(n).double())[0]
    b): return torch.linalg.solve(
        A.t() @ A + torch.eye(n).double(),
```
Member

Suggested change:

```diff
- A.t() @ A + torch.eye(n).double(),
+ A.t() @ A + torch.eye(n, dtype=torch.float64),
```
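
(Presumably the motivation: torch.eye(n, dtype=torch.float64) allocates the identity in float64 directly, whereas torch.eye(n).double() first builds a default-dtype float32 tensor and then casts it, costing an extra allocation.)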

Author

Thanks, Parth! I'll make these changes locally and push them up with the updated example right now.

@healeyq3 (Author)

(Regarding the most recent commit, minus the requested changes.)
The new learned_state_estimation example is basically complete.

This new push includes a good number of updates to the code, including some basic tuning instantiation at the bottom of the Implementation section (which comes with some new graphs!).

Current todos just include finishing up the cross-validation function and related data objects, as well as the appendix write-up on creating the selector matrix. I'm hoping that using the cross-validation function with the robust smoother will yield more interesting tuning results than what I'm currently seeing.

I’m definitely open to any suggestions on tweaking the learning function to make the robust smoother tuning more effective.

Finally, as a note, the error I was experiencing when differentiating through the robust problem was resolved by switching conda environments. I'll look into this more before we officially merge this example into cvxpylayers/examples.

I have some obligations tomorrow (Aug 15) which will prevent me from working on this example, but I'll be back to these final todos on Friday. (And, non-example related: I'll also work on creating some bigger problems to test the new no_grad functionality on.)

Also let me just say - I'm totally open to any and all comments/criticisms! I really want this example to match the style/quality that is expected of cvxpylayer examples.

@PTNobel (Member) left a comment

All the CVXPYlayers changes look great!

Tests look fine, would be great to test the performance. I'll read the example soon.

@healeyq3 (Author)

> Tests look fine, would be great to test the performance. I'll read the example soon.

Ah, my bad on the wording of my last comment - that was very sloppy. I should've said "I'll create some bigger problems to test the new no_grad performance on." To do this, unless otherwise directed, I'll write a .py script that solves batches of large problems from a few different classes (LP, QP, SOCP, GP), then run that script with the released CVXPYlayers and with my local versions of the to-be-released CVXPYlayers and diffcp.
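
As a rough sketch of what such a benchmark script might look like (a single QP-style family with made-up sizes; the real script would cover the classes listed above):

```python
import time
import cvxpy as cp
import torch
from cvxpylayers.torch import CvxpyLayer

# Hypothetical batched benchmark: time the forward pass with and
# without gradient tracking.
m, n, batch = 200, 100, 64
x = cp.Variable(n)
A = cp.Parameter((m, n))
b = cp.Parameter(m)
prob = cp.Problem(cp.Minimize(cp.sum_squares(A @ x - b)), [x >= 0])
layer = CvxpyLayer(prob, parameters=[A, b], variables=[x])

A_th = torch.randn(batch, m, n, dtype=torch.float64)
b_th = torch.randn(batch, m, dtype=torch.float64)

for label, track_grads in [("with grad", True), ("no_grad", False)]:
    A_run = A_th.clone().requires_grad_(track_grads)
    b_run = b_th.clone().requires_grad_(track_grads)
    start = time.perf_counter()
    if track_grads:
        (sol,) = layer(A_run, b_run)
    else:
        # Gradient-free path introduced in this PR.
        with torch.no_grad():
            (sol,) = layer(A_run, b_run)
    print(f"{label}: {time.perf_counter() - start:.3f} s")
```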

Thanks for checking those CVXPYlayers changes so quickly!
