
Commit 7f7b676

lyakhovvvllyakho
andauthored
Synchronizing this notebook with published blog (aws#3479)
* Added link to the blog
* Improved message and provided link to the blog
* Add optional cells to the notebook from the blog
* reformat using black-nb -l 100 SageMaker_Keyspaces_ml_example.ipynb
* reformat using black-nb -l 100 SageMaker_Keyspaces_ml_example.ipynb
* commit to address reviewer feedback

Co-authored-by: Vadim Lyakhovich <[email protected]>
1 parent 18d1526 commit 7f7b676

2 files changed: +188 -57 lines changed


ingest_data/sagemaker-keyspaces/README.md

Lines changed: 15 additions & 11 deletions
@@ -1,24 +1,28 @@
-# Train Machine Learning Models using Amazon Keyspaces as a Data Source
+# Train Machine Learning Models using Amazon Keyspaces as a Data Source
 
-In this notebook we will provide step-by-step instruction to use SageMaker to ingest customer data from Amazon Keyspaces and train a clustering model that allowed you to segment customers. You could use this information for targeted marketing, greatly improving your business KPI.
+Please read [Train machine learning models using Amazon Keyspaces as a data source](https://aws.amazon.com/blogs/machine-learning/train-machine-learning-models-using-amazon-keyspaces-as-a-data-source/) blog for more detailed instructions to run this notebook.
+
+
+We provides step-by-step instructions to use SageMaker to ingest customer data from Amazon Keyspaces and train a clustering model that enables you to segment customers. This information can be used for targeted marketing, greatly improving your business KPI.
 
 1. First, we install Sigv4 driver to connect to Amazon Keyspaces
 
 > The Amazon Keyspaces SigV4 authentication plugin for Cassandra client drivers enables you to authenticate calls to Amazon Keyspaces ***using IAM access keys instead of user name and password***. To learn more about how the Amazon Keyspaces SigV4 plugin enables [`IAM users, roles, and federated identities`](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles.html) to authenticate in Amazon Keyspaces API requests, see [`AWS Signature Version 4 process (SigV4)`](https://docs.aws.amazon.com/general/latest/gr/signature-version-4.html)
 
-2. Next, we establish a connection to Amazon Keyspaces
-3. Next, we create new Keyspace ***blog_(yyyymmdd)*** and a new table ***online_retail***
-3. Next, we will download retail data about customers.
-3. Next, we will ingest retail data about customers into Keyspaces.
-3. Next, we use a notebook available within SageMaker Studio to collect data from Keyspaces database, and prepare data for training using KNN Algorithm. Most of our customers use SageMaker Studio for end to end development of ML Use Cases. They could use this notebook as a base and customize it quickly for their use case. Additionally, they will be able to share this with other collaborators without requiring them to install any additional software.
-3. Next, we will train the data for clustering.
-3. After the training is complete, we can view the mapping between customer and their associated cluster.
-3. And finally, Cleanup Step to drop Keyspaces table to avoid future charges.
+2. Next, we establish a connection to Amazon Keyspaces
+3. Next, we create new Keyspace ***blog_(yyyymmdd)*** and a new table ***online_retail***
+3. Next, we download retail data about customers.
+3. Next, we ingest retail data about customers into Keyspaces.
+3. Next, we use a notebook available within SageMaker Studio to collect data from the Keyspaces database, and prepare data for training using KNN Algorithm. Most of our customers use SageMaker Studio for end-to-end development of ML Use Cases. They use this notebook as a starting point and customize it for their use case. Also, they are able to share this with other collaborators without requiring them to install any additional software.
+3. Next, we train the data for clustering.
+3. When the training is completed, we can view the mapping between customers and their associated clusters.
+3. And finally, we run a Cleanup Step to drop Keyspaces table to avoid future charges.
 
-Contributers
+Contributers
 - `Vadim Lyakhovich (AWS)`
 - `Ram Pathangi (AWS)`
 - `Parth Patel (AWS)`
+- `Arvind Jain (AWS)`
 
 ### Note
 The notebook execution role must include permissions to access Amazon Keyspaces and Assume the role.
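
For readers of this change, the first README steps (install the SigV4 driver, connect to Amazon Keyspaces, name the keyspace `blog_(yyyymmdd)`) might look roughly like the sketch below. It is not taken from the notebook: the helper names `keyspace_name`, `keyspaces_endpoint`, and `connect` are hypothetical, the certificate path `sf-class2-root.crt` and the region are assumptions, and `connect` assumes the `cassandra-driver`, `cassandra-sigv4`, and `boto3` packages are installed and that the execution role has the permissions described in the Note.

```python
from datetime import date


def keyspace_name(today=None):
    """Build a keyspace name of the form blog_yyyymmdd, as described in the README."""
    today = today or date.today()
    return f"blog_{today:%Y%m%d}"


def keyspaces_endpoint(region):
    """Return the Amazon Keyspaces service endpoint for an AWS region."""
    return f"cassandra.{region}.amazonaws.com"


def connect(region, ca_cert_path="sf-class2-root.crt"):
    """Open a SigV4-authenticated session (needs network access and AWS credentials)."""
    # Third-party imports stay local so the helpers above work without them.
    import ssl

    import boto3
    from cassandra.cluster import Cluster
    from cassandra_sigv4.auth import SigV4AuthProvider

    # Amazon Keyspaces requires TLS; trust the root certificate at ca_cert_path
    # (the file name is an assumption, download it separately).
    ssl_context = ssl.SSLContext(ssl.PROTOCOL_TLSv1_2)
    ssl_context.load_verify_locations(ca_cert_path)
    ssl_context.verify_mode = ssl.CERT_REQUIRED

    # Sign requests with the IAM credentials of the current session
    # instead of a user name and password.
    auth_provider = SigV4AuthProvider(boto3.Session(region_name=region))

    cluster = Cluster(
        [keyspaces_endpoint(region)],
        ssl_context=ssl_context,
        auth_provider=auth_provider,
        port=9142,  # TLS port used by Amazon Keyspaces
    )
    return cluster.connect()
```

A call such as `session = connect("us-east-1")` would then return a driver session on which the keyspace named by `keyspace_name()` can be created.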
