Skip to content

Latest commit

 

History

History
23 lines (16 loc) · 731 Bytes

USAGE.md

File metadata and controls

23 lines (16 loc) · 731 Bytes

Installation

  1. Download the latest release
  2. Untar it tar xzf buzz-X.X.X.tar.gz
  3. Change dir to buzz-X.X.X.tar.gz
  4. Run buzz

Preparing a dataset for Buzz

Datasets should be exported to an Apache Arrow compatible format, such as Feather.

When using Pandas in Python, a DataFrame can be exported to a Feather file by running:

df.to_feather('output.feather', compression='uncompressed')

The dataframe should contain a column named index, which should be a column running from 0 to the length of the dataframe.

Columns can be of two types:

  • feature, the column name should have prefix feat_,
  • or meta data, the column name should have prefix meta_.