OpenMined/rds-chat-analysis

RDS Chat Analysis

Requirements

Setup

uv venv -p 3.12
uv sync
source .venv/bin/activate

ollama pull gemma3:1b-it-qat
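
To confirm the model was pulled, the output of `ollama list` can be checked for its name. The sketch below is not part of this repository: `has_model` is a hypothetical helper, and it assumes `ollama list` prints a header line followed by one row per model with the model name in the first column.

```shell
# Hypothetical helper: reads `ollama list` output on stdin and succeeds
# only if the first column of some non-header row equals the given name.
has_model() {
  awk -v model="$1" '
    NR > 1 && $1 == model { found = 1 }   # skip the header row (NR == 1)
    END { exit !found }                   # exit 0 if found, 1 otherwise
  '
}
```

With the helper defined, `ollama list | has_model gemma3:1b-it-qat || ollama pull gemma3:1b-it-qat` pulls the model only when it is missing.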

Run against a local syftbox server

Clone syftbox and run:

just run-minio
just run-server -f config/server.dev.yaml

# Launch the data owner (DO) and data scientist (DS) clients
just run-client-simple [email protected]
just run-client-simple [email protected]

export RDS_DO_CONFIG=~/.syftbox/clients/[email protected]/config.json
export RDS_DS_CONFIG=~/.syftbox/clients/[email protected]/config.json

Run against a mock setup

cd notebooks/v2

# Point the data owner (DO) and data scientist (DS) at their mock config files
export RDS_DO_CONFIG=./.rds/wildchat/data_owner_config.json
export RDS_DS_CONFIG=./.rds/wildchat/data_scientist_config.json

# Create a mock syftbox config for the data owner and data scientist
python -m syft_rds.cli init-test-datasite --email [email protected] --data-dir ./.rds/wildchat/ --config-path ${RDS_DO_CONFIG}
python -m syft_rds.cli init-test-datasite --email [email protected] --data-dir ./.rds/wildchat/ --config-path ${RDS_DS_CONFIG}
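
Both setups rely on `RDS_DO_CONFIG` and `RDS_DS_CONFIG` pointing at real files, and the mock paths are relative, so they only resolve from inside `notebooks/v2`. A minimal sketch of a sanity check follows. `check_rds_config` is a hypothetical helper, not something provided by syft-rds.

```shell
# Hypothetical helper (not part of syft-rds): report whether each config
# variable is set and names an existing file. Returns non-zero on any problem.
check_rds_config() {
  status=0
  for name in RDS_DO_CONFIG RDS_DS_CONFIG; do
    # Indirect lookup: read the value of the variable whose name is in $name.
    eval "path=\${$name:-}"
    if [ -z "$path" ]; then
      echo "$name is not set"
      status=1
    elif [ ! -f "$path" ]; then
      echo "$name points to a missing file: $path"
      status=1
    else
      echo "$name OK: $path"
    fi
  done
  return $status
}
```

Running `check_rds_config && python -m syft_rds.cli server --syftbox-config ${RDS_DO_CONFIG}` from `notebooks/v2` starts the server only when both files are found.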

Running the notebooks

The /notebooks folder has two flows:

  • notebooks/v1 is a rough implementation of embedding, clustering, and visualization of WildChat data.
  • notebooks/v2 contains the full pipeline to set up and use syft-RDS for WildChat data.

To start the RDS server:

cd notebooks/v2

python -m syft_rds.cli server --syftbox-config ${RDS_DO_CONFIG}

Next, to run the notebooks:

jupyter notebook

Development

pre-commit install

About

Chat log analysis over SyftBox RDS
