Welcome to STM32AI - TAO!
This repository provides a collection of example scripts that can be used to Train, Adapt and Optimize for given use cases and then easily pass the resulting AI models through STEdgeAI Dev Cloud to deploy them on your favourite STM32 boards.
The notebook scripts in classification_tf2 provide a complete life cycle of the model training, optimization and benchmarking using NVIDIA TAO Toolkit and STEdgeAI Developer Cloud.
While, the notebook scripts in classification_pretrained lets you download and use one of the pretrained models from pretrained classification model repository of Nvidia TAO. The models come under creative commons license as mentioned on the explainability link for these models. The notebook in this folder provides a demo example of how a user can download mobilenet_v2 model from the repository, convert it to onnx, and then quantize it, and finally, benchmarking using STEdgeAI Developer Cloud.
NVIDIA Train Adapt Optimize (TAO) Toolkit is a simple and easy-to-use Python based AI toolkit for taking purpose-built AI models and customizing them with users' own data.
STEdgeAI Developer Cloud is a free-of-charge online platform and services allowing the creation, optimization, benchmarking, and generation of AI models for the STM32 microcontrollers. It is based on the STEdgeAI core technology.
Bring Your Own Model (BYOM) is a Python based package that converts any open source ONNX model to TAO compatible model. All you need is to export any model from a deep learning framework of your choice (e.g. PyTorch) to ONNX and run TAO BYOM converter.
With the Jupyter notebooks in classification_tf2, you will learn how to leverage the simplicity and convenience of TAO to take a pretrained model, finetune it on a sample dataset and then:
- Prune the finetuned model,
- Retrain the pruned model to recover lost accuracy,
- Export the pruned model as an onnx model,
- Quantize the model using onnxruntime,
- Run Benchmarking of the quantized onnx model (finetuned, pruned, retrained, and quantized) using STEdgeAI Developer Cloud to know the footprints and embeddability of the models. At the end, you will have generated a trained and optimized classification model which was imported from outside TAO Toolkit, and that may be deployed via STEdgeAI Developer Cloud.
While with the notebook in classification_pretrained you can
- download a pretrained model,
- convert it to onnx and quantize it using qdq quantization with onnxruntime,
- run inference on float and quantized models, and
- Run Benchmarking of the quantized onnx model (finetuned, pruned, retrained, and quantized) using STEdgeAI Developer Cloud to know the footprints and embeddability of the models. At the end, you will have generated a trained and optimized classification model which was imported from outside TAO Toolkit, and that may be deployed via STEdgeAI Developer Cloud.
LICENSE :
This software component is licensed by ST under BSD-3-Clause license, the "License";
You may not use this file except in compliance with the License.
You may obtain a copy of the License at: https://opensource.org/licenses/BSD-3-Clause
Copyright (c) 2023 STMicroelectronics. All rights reserved.
Copyright (c) 2023 NVIDIA. All rights reserved.
This project contains two folders:
- classification_tf2
- byom_converters : contains Jupyter Notebooks to convert the PyTorch models to .tltb files (a proprietary template from NVIDIA for BYOM (bring your own model))
- byom_person : contains Jupyter Notebooks to train the BYOM models generated using byom_converters for person-detection use case as well as the configurations for running these files in folders:
- tao_person : contains a jupyter notebook
stm32ai_tao_efficientnet_b0.ipynb
to fine-tune the pretrainedefficientnet_b0
model obtained from NGC model zoo, for person-detection use case as well as the configurations for running this file in folder specs.
- classification_pretrained
- Notebook to work with the pretrained image classification models from classification model repository of Nvidia TAO, to run the inference on them, quantize the model using onnxruntime, and then benchmark it using the STEdgeAI Developer Cloud.
- LICENSE.md
- SECURITY.md
- CODE_OF_CONDUCT.md
- CONTRIBUTING.md
- Create an account on myST and then sign in to STEdgeAI Developer Cloud to be able access the service.
- If you don't have python already installed, you can download and install it from here, a 3.6 <= Python Version <= 3.8 is required to be able to use NVIDIA TAO and the scripts provided here, we recommend to use Python v3.8.16. (For Windows systems make sure to check the Add python.exe to PATH option during the installation process).
- Install Jupyter server and notebook libraries to run the Jupyter notebooks.
- Clone this repository using the following command:
git clone https://github.com/STMicroelectronics/stm32ai-tao.git
- The delivery contains three types of notebooks
byom_converter_***.ipynb
,stm32ai_tao_***.ipynb
, andtao_image_classification.ipynb
.
- The users need to create seperate Python environments using conda or pyenv to run these notebooks. The Python environments can be created using following commands named as
byom_dev
(for the byom_converter_***.ipynb) andbyom_launcher
(for all the rest):
cd stm32ai-tao
python -m venv <env-name>
- Activate your virtual environment, on Windows run:
<env-name>\Scripts\activate.bat
On Unix or MacOS, run:
source <env-name>/bin/activate
NOTE: The names of the environments are just a suggestion and users can choose whichever name they prefer.
The running of Jupyter notebooks requires:
- activate the
byom_dev
environment for all thebyom_converter***.ipynb
notebooks to convert your onnx models to byom model (.tltb), and - activate the
byom_launcher
environment for thestm32ai_tao_***.ipynb
andtao_image_classification.ipynb
notebooks to adopt, optimize, benchmark and to convert your byom models in optimized c code for STM32 projects.- an internet connection is needed to run some cells of the notebooks
- to download the dataset,
- pretrained mdoels from ngc cloud,
- to get the models from torch.hub, and
- to connect to STEdgeAI developer cloud.
- an internet connection is needed to run some cells of the notebooks