Missing operations transpose, swapaxis and basic arithmetic operation #101
I think we can generalize all the functions that reduce the shape, as well as the functions that keep the shape fixed. We already have reduce_over_samples_block, but I think this can be generalized even more; operations like add that do not change the shape would not need the last two parameters.
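As a rough illustration of what such a generalized shape-preserving operation could look like, here is a minimal sketch. The `Block` class and `map_block` helper are hypothetical stand-ins, not the equistore API; a real implementation would also carry the Labels metadata.

```python
import numpy as np

# Hypothetical stand-in for a TensorBlock: values plus one gradient array per
# parameter. Labels metadata is omitted to keep the sketch short.
class Block:
    def __init__(self, values, gradients=None):
        self.values = values                # ndarray (samples x properties)
        self.gradients = gradients or {}    # parameter name -> ndarray

def map_block(block, operation):
    """Apply the same shape-preserving operation to the values and to every
    gradient. This keeps the block consistent only for linear operations
    (e.g. scalar multiplication), which commute with differentiation."""
    return Block(
        operation(block.values),
        {param: operation(grad) for param, grad in block.gradients.items()},
    )

# Example: multiply everything in the block by 2.
block = Block(np.ones((4, 3)), {"positions": np.ones((4, 3, 3))})
doubled = map_block(block, lambda array: 2.0 * array)
```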
We should never apply operations only on the values or only on the gradients, otherwise the values and gradients are no longer in sync (and the "gradients" are not actually gradients of the values). Doing this inside a function is fine, as long as the data we give back to the user is fully consistent.
I don't think it is worth generalizing too much in the user-facing API; this could easily become confusing. In general, not all numpy operations have a direct equivalent in equistore, because we are imposing more structure and meaning than a pure numpy array carries.
Right, the consistency is important. I think I posted too much of a brain dump here, sorry for that. I was thinking about functions that might be used for standardizing features, but not all of them are needed: basic arithmetic would already make this possible together with the current reduce function.
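For instance, standardization could then be composed from a reduction plus arithmetic. A sketch with a plain numpy array standing in for a block's values (the `mean`/`std` calls here play the role that dedicated sample reductions would play in equistore):

```python
import numpy as np

values = np.random.rand(10, 5)   # (samples, properties) of a single block

mean = values.mean(axis=0)       # what a mean-over-samples reduction would give
std = values.std(axis=0)         # likewise for a std-over-samples reduction

standardized = (values - mean) / std
```

Note that if mean and std are treated as fixed constants, the two arithmetic steps affect gradients differently: the subtraction leaves them unchanged, while the division rescales them by 1/std, which is exactly the consistency question discussed below.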
Doesn't it result in the loss of the meaning of the gradient, because the gradient is the derivative of the properties with respect to the parameter?
I think that what @Luthaf means is that any operation should be applied so that it leaves properties and gradients in a consistent state. E.g. scalar multiplication should be applied to both values and gradients, scalar addition of a constant should be applied only to the values, etc. It is not entirely trivial to formalize this, and it would be crucial to do it in a way that is consistent with autograd, e.g. in torch.
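A worked example of this rule, with numpy arrays standing in for a block's values and its gradient with respect to some parameter (identical shapes are assumed here for simplicity; the real gradient arrays have additional dimensions):

```python
import numpy as np

values = np.random.rand(4, 3)   # block values
grad = np.random.rand(4, 3)     # d(values)/d(parameter)

# Scalar multiplication: y = a * x implies dy/dp = a * dx/dp,
# so it must be applied to both values and gradients.
a = 2.5
values_mul, grad_mul = a * values, a * grad

# Adding a constant: y = x + c implies dy/dp = dx/dp,
# so only the values change and the gradients stay as they are.
c = 1.0
values_add, grad_add = values + c, grad
```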
You're right! This is somewhat related to #100: the second argument to dot is "transposed" compared to the first (the properties of the first are the samples of the second argument), which also explains why having gradients there is harder.
I split this issue into sub-issues.
I think all these cases share the same problem: sometimes we want to apply the operation to everything in the TensorBlock equally (including the gradients), and sometimes we want to apply the operation consistently with the gradients, i.e. X → f(X) and ∇X → f′(X) ∇X. I thought about adding an optional boolean parameter to choose between the two.
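In code, the second mode is just the chain rule applied element-wise; a sketch with f = exp (whose derivative is again exp), using numpy stand-ins rather than the actual equistore types:

```python
import numpy as np

values = np.random.rand(4, 3)
grad = np.random.rand(4, 3)   # ∇X with respect to some parameter

f = np.exp    # X -> f(X)
df = np.exp   # f'(X); for exp the derivative is exp itself

new_values = f(values)
new_grad = df(values) * grad   # ∇X -> f'(X) * ∇X (chain rule)
```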
I'll close this; transpose and dot are tracked by separate issues.