Walmart AI Training Data Generator

This project generates training data for an AI model designed to optimize Walmart's operations. It analyzes product trends, inventory management, sales predictions, and customer behavior insights based on provided datasets. After data processing model for fine-tuned using google's AI studio

Features

Generates diverse prompts for various retail scenarios.
Creates data-driven answers based on actual Walmart data.
Performs correlation analysis across different datasets.
Exports training data in both JSON and CSV formats.

Prerequisites

Python 3.x
Pandas library
NumPy library
Google genai library

Installation

Clone the repository: git clone [repository-url]
Install required libraries: pip install -r requirements.txt

Usage

Ensure your Walmart datasets are in the appropriate directory.
Run the main script: python generate_training_data.py
Find the generated training data in the ../data/processed/ directory.

Project Organization

├── LICENSE            <- Open-source license if one is chosen
├── Makefile           <- Makefile with convenience commands like `make data` or `make train`
├── README.md          <- The top-level README for developers using this project.
├── data
│   ├── external -      <- Data from third party sources.
│   ├── raw  or external          <- Contains raw datasets for analysis.
│   │    ├── df_Trends.csv
│   │    ├── df_customerSegmentation.csv
│   │    └── df_WallmartSales.csv
│   ├── processed      <- The final, canonical data sets for modeling.
│        ├── walmart_training_data1.json
│        └── walmart_training_data1.csv
├── models             <- Contains script to test tuned model.
├── notebooks          <- Jupyter notebooks. Naming convention is a number (for ordering),
│                         the creator's initials, and a short `-` delimited description, e.g.
│                         `1.0-jqp-initial-data-exploration`.
├── pyproject.toml     <- Project configuration file with package metadata for 
│                         wallmart and configuration for tools like black.
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
├── requirements.txt   <- The requirements file for reproducing the analysis environment, e.g.
│                         generated with `pip freeze > requirements.txt`
├── setup.cfg          <- Configuration file for flake8.

Output

walmart_training_data1.json: Contains the generated prompts and answers in JSON format.
walmart_training_data1.csv: Contains the generated prompts and answers in CSV format.

Notes

Ensure that your input datasets are up-to-date for accurate insights.
The generated training data can be used to fine-tune language models for retail-specific tasks.

Contributors

[kev0-4]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Walmart AI Training Data Generator

Features

Prerequisites

Installation

Usage

Project Organization

Output

Notes

Contributors

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
models		models
notebooks		notebooks
references		references
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg

kev0-4/Wallmart-operations-optimizer-backend-AI

Folders and files

Latest commit

History

Repository files navigation

Walmart AI Training Data Generator

Features

Prerequisites

Installation

Usage

Project Organization

Output

Notes

Contributors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages