Description
Describe what you are looking for
MaxSim was introduced in the ColBERT paper and is used by almost all late-interaction models, such as ColBERT, ColPali, etc.
Unfortunately, there is no native or fast implementation (at least none that I am aware of) to compute MaxSim over a batch of documents.
I already tried to optimize it a bit. Here is my approach using NumPy:
import numpy as np

def max_sim(q, d):
    # Flatten the document tokens and do one large matmul; a single
    # contiguous GEMM tends to be faster than looping over documents.
    K, M, D = d.shape
    Q = q.shape[0]
    scores = d.reshape(-1, D) @ q.T                       # (K*M, Q)
    max_scores = np.max(scores.reshape(K, M, Q), axis=1)  # (K, Q): max over document tokens
    return np.sum(max_scores, axis=1)                     # (K,): one MaxSim score per document
Shape q: (#query_tokens, dim)
Shape d: (batch_size, #document_tokens, dim).
The benchmark I ran used the following config on a dedicated Hetzner Cloud machine (CCX23):
q: (32, 128)
d: (1000, 666, 128)
Avg. over 10 runs: 86ms
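For anyone reproducing this, here is a minimal self-contained sketch that checks the batched version against a naive per-document loop. The helper names (`max_sim_batched`, `max_sim_naive`) and the small shapes are mine, chosen for a quick correctness check rather than the benchmark config above.

```python
import numpy as np

def max_sim_batched(q, d):
    # Batched MaxSim: one big matmul over all documents, as in the issue.
    K, M, D = d.shape
    scores = d.reshape(-1, D) @ q.T                    # (K*M, Q)
    max_scores = scores.reshape(K, M, -1).max(axis=1)  # (K, Q)
    return max_scores.sum(axis=1)                      # (K,): one score per document

def max_sim_naive(q, d):
    # Reference: per document, sum over query tokens of the max token similarity.
    return np.array([(doc @ q.T).max(axis=0).sum() for doc in d])

rng = np.random.default_rng(0)
q = rng.standard_normal((8, 16)).astype(np.float32)   # (query_tokens, dim)
d = rng.standard_normal((5, 12, 16)).astype(np.float32)  # (batch, doc_tokens, dim)

batched = max_sim_batched(q, d)
naive = max_sim_naive(q, d)
```

Both paths should agree up to float32 rounding, so the batched version can be timed with confidence that it computes the same scores as the loop.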
Would love to see if we can do better!
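One variant worth benchmarking is a single `np.einsum` call instead of reshape + matmul. Whether it is actually faster depends on the BLAS backend and the `optimize` contraction path, so this is something to measure, not a claimed speedup; the function names here are mine.

```python
import numpy as np

def max_sim_einsum(q, d):
    # Same MaxSim, expressed as one einsum contraction over the shared dim.
    sim = np.einsum('kmd,qd->kmq', d, q, optimize=True)  # (K, M, Q)
    return sim.max(axis=1).sum(axis=1)                   # (K,)

def max_sim_matmul(q, d):
    # The reshape + matmul formulation, for comparison.
    K, M, D = d.shape
    scores = d.reshape(-1, D) @ q.T                      # (K*M, Q)
    return scores.reshape(K, M, -1).max(axis=1).sum(axis=1)

rng = np.random.default_rng(1)
q = rng.standard_normal((8, 16)).astype(np.float32)
d = rng.standard_normal((4, 10, 16)).astype(np.float32)
```

If neither wins on CPU, the next lever is probably chunking the document batch to keep the `(K, M, Q)` score buffer in cache.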
Can you contribute to the implementation?
- I can contribute
Is your feature request specific to a certain interface?
It applies to everything
Contact Details
No response
Is there an existing issue for this?
- I have searched the existing issues
Code of Conduct
- I agree to follow this project's Code of Conduct