Preprocessing - What do we need? What do we have? #1
What do we have right now from that list?
I wonder how open the Rust language team(s) would be to a …
That's a good list! I'm not sure if it necessarily fits in 'preprocessing', but I would add tools for model selection:
If all estimator models implemented the same traits, we could use the same cross-validation framework over arbitrary learners. This could also include a common set of classification/regression metrics for model evaluation -- again, I'm not sure this is exactly 'preprocessing', but it definitely cuts across multiple areas of concern. Another possible thing to add: pipeline management. I haven't used the sklearn pipeline tools personally, but some mechanism to let users easily pipe a set of transformations and an estimator together might be useful.
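To make the shared-trait idea concrete, here is a minimal sketch of what a common estimator interface plus a generic k-fold cross-validation routine could look like. All names here (`Estimator`, `MeanModel`, `k_fold_indices`, `cross_val_mse`) are invented for illustration; nothing in the ecosystem defines them yet, and a real design would be generic over array types rather than `&[f64]`.

```rust
/// Hypothetical minimal supervised-estimator interface.
trait Estimator {
    fn fit(&mut self, x: &[f64], y: &[f64]);
    fn predict(&self, x: &[f64]) -> Vec<f64>;
}

/// A trivial model that always predicts the training-target mean.
struct MeanModel {
    mean: f64,
}

impl Estimator for MeanModel {
    fn fit(&mut self, _x: &[f64], y: &[f64]) {
        self.mean = y.iter().sum::<f64>() / y.len() as f64;
    }
    fn predict(&self, x: &[f64]) -> Vec<f64> {
        vec![self.mean; x.len()]
    }
}

/// Assign `n` sample indices round-robin into `k` folds.
fn k_fold_indices(n: usize, k: usize) -> Vec<Vec<usize>> {
    let mut folds: Vec<Vec<usize>> = vec![Vec::new(); k];
    for i in 0..n {
        folds[i % k].push(i);
    }
    folds
}

/// Mean squared error on each held-out fold, averaged across folds.
/// Works for *any* type implementing `Estimator`.
fn cross_val_mse<E: Estimator>(model: &mut E, x: &[f64], y: &[f64], k: usize) -> f64 {
    let folds = k_fold_indices(x.len(), k);
    let mut total = 0.0;
    for test in &folds {
        // Train on everything outside the current fold.
        let train: Vec<usize> = (0..x.len()).filter(|i| !test.contains(i)).collect();
        let (tx, ty): (Vec<f64>, Vec<f64>) = train.iter().map(|&i| (x[i], y[i])).unzip();
        model.fit(&tx, &ty);
        // Evaluate on the held-out fold.
        let px: Vec<f64> = test.iter().map(|&i| x[i]).collect();
        let py: Vec<f64> = test.iter().map(|&i| y[i]).collect();
        let preds = model.predict(&px);
        let mse = preds.iter().zip(&py).map(|(p, t)| (p - t).powi(2)).sum::<f64>()
            / py.len() as f64;
        total += mse;
    }
    total / k as f64
}

fn main() {
    let x: Vec<f64> = (0..10).map(|i| i as f64).collect();
    let y = x.clone(); // identity target
    let mut model = MeanModel { mean: 0.0 };
    println!("5-fold mean MSE: {:.3}", cross_val_mse(&mut model, &x, &y, 5));
}
```

The point is that `cross_val_mse` never mentions `MeanModel`; any learner implementing the trait plugs in unchanged, and the same shape would work for a shared metrics module.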
For missing-data representation, I feel like that should be handled at the DataFrame level (especially since the dataframe will likely be at least partially backed by Arrow, which already does this via a null bitmask), with imputation handled in the preprocessing library. The representation does get a bit tricky. I implemented a simple masked array in …
Relatedly, this NumPy missing-data proposal from 2011 is an interesting read, reviewing many of the issues that come up when implementing missing data.
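For anyone unfamiliar with the bitmask approach, here is a toy illustration of the idea: values are stored densely, and a separate validity bitmap (one bit per value, set = present) marks which entries are real. This is the general scheme Arrow uses, but the code below is an invented sketch (`MaskedColumn` is not a real Arrow or dataframe API), with a `mean` that skips nulls standing in for what an imputation step would consume.

```rust
/// Toy column: dense f64 values plus an Arrow-style validity bitmask.
/// Bit i set (LSB-first within each byte) means "value i is present".
struct MaskedColumn {
    values: Vec<f64>,
    validity: Vec<u8>,
}

impl MaskedColumn {
    fn from_options(data: &[Option<f64>]) -> Self {
        let mut values = Vec::with_capacity(data.len());
        let mut validity = vec![0u8; (data.len() + 7) / 8];
        for (i, v) in data.iter().enumerate() {
            match v {
                Some(x) => {
                    values.push(*x);
                    validity[i / 8] |= 1u8 << (i % 8); // mark valid
                }
                None => values.push(0.0), // placeholder under a null bit
            }
        }
        MaskedColumn { values, validity }
    }

    fn is_valid(&self, i: usize) -> bool {
        self.validity[i / 8] & (1u8 << (i % 8)) != 0
    }

    /// Mean over the non-null entries only.
    fn mean(&self) -> Option<f64> {
        let (mut sum, mut n) = (0.0, 0usize);
        for (i, v) in self.values.iter().enumerate() {
            if self.is_valid(i) {
                sum += *v;
                n += 1;
            }
        }
        if n == 0 { None } else { Some(sum / n as f64) }
    }
}

fn main() {
    let col = MaskedColumn::from_options(&[Some(1.0), None, Some(3.0)]);
    assert!(!col.is_valid(1));
    println!("mean ignoring nulls: {:?}", col.mean());
}
```

The nice property is that the values buffer stays a plain contiguous array (SIMD/BLAS-friendly), with nullness tracked out-of-band at one bit per element.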
A good list for a starting point.
The current ndarray-linalg lacks the components needed to implement PCA. As the scikit-learn documentation says, we need both a full SVD and a truncated SVD to implement the various types of PCA, but ndarray-linalg does not have a truncated SVD. I think these belong in linalg and are not limited to ML; ndarray-linalg could accept them.
Implementing the randomized truncated SVD solver would be quite useful. In scikit-learn it's the default solver for PCA and TruncatedSVD, and it is based on the paper by Halko et al., 2009. I think in practice it will often be faster for ML applications than a full SVD solver. Another topic is support for sparse data: TruncatedSVD in scikit-learn is often used on sparse data. In Rust, such a solver could be implemented e.g. on top of the sprs crate.
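To show the flavor of "compute only the leading components without a full SVD", here is the smallest possible sketch: power iteration recovering just the top singular value and right singular vector. To be clear, this is *not* the Halko et al. randomized algorithm (which uses a random range-finder plus QR and a small dense SVD), and `top_singular` is an invented name; it's only meant to illustrate why a truncated solver can touch the matrix through matrix-vector products alone, which is also exactly what makes sparse support natural.

```rust
// Dense matrix as rows of Vec<f64>, for illustration only.

/// y = A x
fn matvec(a: &[Vec<f64>], x: &[f64]) -> Vec<f64> {
    a.iter()
        .map(|row| row.iter().zip(x).map(|(r, v)| r * v).sum::<f64>())
        .collect()
}

/// y = A^T x
fn matvec_t(a: &[Vec<f64>], x: &[f64]) -> Vec<f64> {
    let cols = a[0].len();
    (0..cols)
        .map(|j| a.iter().zip(x).map(|(row, v)| row[j] * v).sum::<f64>())
        .collect()
}

/// Normalize in place, returning the original 2-norm.
fn normalize(v: &mut [f64]) -> f64 {
    let norm = v.iter().map(|x| x * x).sum::<f64>().sqrt();
    for x in v.iter_mut() {
        *x /= norm;
    }
    norm
}

/// Top singular value and right singular vector via power iteration
/// on A^T A. A truncated solver generalizes this idea to the top-k
/// subspace (and the randomized variant starts from k+p random vectors).
fn top_singular(a: &[Vec<f64>], iters: usize) -> (f64, Vec<f64>) {
    let mut v = vec![1.0; a[0].len()];
    normalize(&mut v);
    let mut sigma = 0.0;
    for _ in 0..iters {
        let mut av = matvec(a, &v); // A v
        sigma = normalize(&mut av); // ||A v|| estimates sigma_1
        v = matvec_t(a, &av); // A^T u
        normalize(&mut v);
    }
    (sigma, v)
}

fn main() {
    // diag(3, 1): the top singular value is 3, top right vector (1, 0).
    let a = vec![vec![3.0, 0.0], vec![0.0, 1.0]];
    let (sigma, v) = top_singular(&a, 50);
    println!("sigma ~ {:.4}, v ~ {:?}", sigma, v);
}
```

Since the matrix is only ever used through `matvec`/`matvec_t`, swapping in a sparse matrix-vector product (e.g. from sprs) would change nothing else in the solver.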
I think that type is already in the Rust language - it's …
Definitely - defining a …
Gotcha - I think it makes sense to spec out exactly what we need for each of those algorithms, and then we can start working on implementing the required primitives.
I managed to have a look at the NumPy document - if my understanding is correct, the …
Hey, I just wanted to add that MFCC/MFSC are common preprocessing steps for machine learning in the context of audio processing. If you want to build an ASR system, this decorrelates your pitch and formant functions and reduces the data complexity. They are also used in room classification, instrument detection - actually, anything that has to do with natural sound sources. A crate which does the windowing, transformation, etc. would be great!
I am not very familiar with the problem space, @bytesnake - could you provide some references and resources we can have a look at?
Here are some introductions to MFCCs
I wrote an MFCC library for a class recently; you can find it here: https://github.com/bytesnake/mfcc
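As a taste of what such a crate involves, here is one small piece of the MFCC pipeline: the Hz-to-mel conversion used when laying out the mel filterbank. The formulas are the standard HTK-style ones (mel = 2595 log10(1 + f/700)); everything else in the pipeline (framing, windowing, FFT, log energies, DCT) is omitted, and `mel_filter_centers` is an invented helper name, not an API from the mfcc crate above.

```rust
/// Convert a frequency in Hz to the mel scale (HTK-style formula).
fn hz_to_mel(hz: f64) -> f64 {
    2595.0 * (1.0 + hz / 700.0).log10()
}

/// Inverse mapping, mel back to Hz.
fn mel_to_hz(mel: f64) -> f64 {
    700.0 * (10f64.powf(mel / 2595.0) - 1.0)
}

/// Center frequencies (in Hz) of `n` triangular filters spaced
/// evenly on the mel scale between `f_min` and `f_max`.
fn mel_filter_centers(f_min: f64, f_max: f64, n: usize) -> Vec<f64> {
    let (m_lo, m_hi) = (hz_to_mel(f_min), hz_to_mel(f_max));
    (1..=n)
        .map(|i| m_lo + (m_hi - m_lo) * i as f64 / (n + 1) as f64)
        .map(mel_to_hz)
        .collect()
}

fn main() {
    // Filters bunch up at low frequencies, mirroring human pitch perception.
    let centers = mel_filter_centers(0.0, 8000.0, 10);
    println!("10 mel filter centers between 0 and 8 kHz: {:?}", centers);
}
```

The perceptual point shows up directly in the output: consecutive centers are much closer together at low frequencies than at high ones.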
Hey there, super interested in talking about ML in Rust!
Context: see rust-ml/discussion#1.
This is meant to be a list of functionality we want to implement (a roadmap?) - I have refrained from including more sophisticated methods, limiting it to what I believe to be a set of "core" routines we should absolutely offer.
For each piece of functionality I'd like to document what is already available in the Rust ecosystem.
This is meant to be a WIP list, so feel free to chip in, @jblondin, and edit/add things I might have missed.