multi-armed-bandit

This repository contains an End to End Real time 🕰️ Machine Learning Pipeline to predict star ⭐️ rating of product reviews. This project uses AWS Sagemaker, Kinesis, Lambda, S3, Redshift, Athena, and Step functions. Deployment of multiple models for AB testing and Bandit testing is also included.

nlp docker aws lambda streaming real-time deep-learning athena s3-bucket embeddings kinesis-firehose kinesis-stream ingestion ab-testing redshift data-pipeline multi-armed-bandit sagemaker bert-model

Updated Nov 24, 2023
Jupyter Notebook

alexandrulita91 / multi-armed-bandit

Star

The Multi-armed bandit problem is one of the classical reinforcements learning problems that describe the friction between the agent's exploration and exploitation.

reinforcement-learning thompson-sampling multi-armed-bandit

Updated Sep 22, 2020
Python

erfunmirzaei / Multi-Agent-Bandit

Star

Compared Non-stationary Multi-armed Bandits in Single-Agent to Multi-Agents Scenarios- Distributed Optimization and Learning(DOL) Course Project

multi-armed-bandit non-stationary multi-agent-reinforcement-learning decentralized-online-optimization

Updated Sep 14, 2022
Jupyter Notebook

jakemaz66 / RecoveringSleepingBandit

Star

A Novel Multi-Arm Bandit Optimization Implementation using reinforcement learning in Python for selecting Notifications.

reinforcement-learning optimization duolingo multi-armed-bandit

Updated Apr 26, 2024
Python

alxndrTL / RL-essais-cliniques

Star

reinforcement-learning clinical-trials multi-armed-bandit exploration-exploitation epsilon-greedy-exploration ucb-algorithm essais-cliniques

Updated Sep 6, 2022

TheoKanning / kaggle-2020-multi-armed-bandit

Star

My silver medal winning entry in Kaggle's 2020 Christmas Competition

keras kaggle multi-armed-bandit

Updated Feb 23, 2021
Jupyter Notebook

mriffaud / Introduction-to-Multi-Armed-Bandit

Star

This is a project to build a multi armed bandit from scratch based on the Kaggle Christmas 2020 Competition.

machine-learning kaggle agents multi-armed-bandit fromscratch rewards-probabilities

Updated Aug 7, 2023
Jupyter Notebook

VladMarianCimpeanu / OLA_project

Star

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

reinforcement-learning pricing thompson-sampling multi-armed-bandit montecarlo-simulation mab ucb1 online-learning-applications

Updated Oct 30, 2022
Jupyter Notebook

mweglowski / bandit_problem_simulator

Star

🦾🤖 Visual and interactive simulator of multi-armed bandit problem.

javascript css html machine-learning reinforcement-learning algorithms reactjs multi-armed-bandit tailwindcss

Updated May 7, 2024
JavaScript

amanraj209 / multi-armed-bandit-problem

Star

In probability theory, the multi-armed bandit problem is a problem in which a fixed limited set of resources must be allocated between competing (alternative) choices in a way that maximizes their expected gain, when each choice's properties are only partially known at the time of allocation, and may become better understood as time passes or by…

python reinforcement-learning jupyter-notebook multi-armed-bandit