English | 简体中文
A collection of ready-to-run Jupyter notebooks for learning and experimenting with the OpenVINO™ Toolkit. The notebooks provide an introduction to OpenVINO basics and teach developers how to leverage our API for optimized deep learning inference.
Check out the latest notebooks that show how to optimize and deploy popular models on Intel CPU and GPU.
Notebook | Description | Preview | Complementary Materials |
---|---|---|---|
YOLOv8 - Optimization: Object Detection Instance Segmentation Keypoints Detection |
Optimize YOLOv8 using NNCF PTQ API | Blog - How to get YOLOv8 Over 1000 fps with Intel GPUs? | |
SAM - Segment Anything Model |
Prompt based object segmentation mask generation using Segment Anything and OpenVINO™ | Blog - SAM: Segment Anything Model — Versatile by itself and Faster by OpenVINO | |
ControlNet - Stable-Diffusion |
A Text-to-Image Generation with ControlNet Conditioning and OpenVINO™ | Blog - Control your Stable Diffusion Model with ControlNet and OpenVINO | |
Stable Diffusion v2 |
Text-to-Image Generation and Infinite Zoom with Stable Diffusion v2 and OpenVINO™ | Blog - How to run Stable Diffusion on Intel GPUs with OpenVINO | |
Whisper - Subtitles generation |
Generate subtitles for video with OpenAI Whisper and OpenVINO | ||
CLIP - zero-shot-image-classification |
Perform Zero-shot Image Classification with CLIP and OpenVINO | Blog - Generative AI and Explainable AI with OpenVINO | |
BLIP - Visual-language-processing |
Visual Question Answering and Image Captioning using BLIP and OpenVINO™ | Blog - Multimodality with OpenVINO — BLIP | |
Instruct pix2pix - Image-editing |
Image editing with InstructPix2Pix | Blog - Generative AI and Explainable AI with OpenVINO | |
DeepFloyd IF - Text-to-Image generation |
Text-to-Image Generation with DeepFloyd IF and OpenVINO™ | ||
ImageBind |
Binding multimodal data using ImageBind and OpenVINO™ | ||
Dolly v2 |
Instruction following using Databricks Dolly 2.0 and OpenVINO™ | ||
Stable Diffusion XL and Segmind Stable Diffusion 1B (SSD-1B) |
Image generation with Stable Diffusion XL and Segmind Stable Diffusion 1B (SSD-1B) and OpenVINO™ | ||
MusicGen |
Controllable Music Generation with MusicGen and OpenVINO™ | ||
Tiny SD |
Image Generation with Tiny-SD and OpenVINO™ | ||
ZeroScope Text-to-video synthesis |
Text-to-video synthesis with ZeroScope and OpenVINO™ | A panda eating bamboo on a rock | |
LLM chatbot |
Create LLM-powered Chatbot using OpenVINO™ | ||
QA over Document |
Create LLM-powered RAG system using OpenVINO™ and LangChain | ||
Bark Text-to-Speech |
Text-to-Speech generation using Bark and OpenVINO™ | ||
LLaVA Multimodal Chatbot |
Visual-language assistant with LLaVA and OpenVINO™ | ||
BLIP-Diffusion - Subject-Driven Generation |
Subject-driven image generation and editing using BLIP Diffusion and OpenVINO™ | ||
DeciDiffusion |
Image generation with DeciDiffusion and OpenVINO™ | ||
Fast Segment Anything |
Object segmentations with FastSAM and OpenVINO™ | ||
SoftVC VITS Singing Voice Conversion |
SoftVC VITS Singing Voice Conversion and OpenVINO™ | ||
Latent Consistency Models: the next generation of Image Generation models |
Image generation with Latent Consistency Models (LCM) and OpenVINO™ | ||
Speedup ControlNet pipeline with LCM LoRA |
Text-to-Image Generation with LCM LoRA and ControlNet Conditioning | ||
QR Code Monster |
Generate creative QR codes with ControlNet QR Code Monster and OpenVINO™ | ||
Würstchen |
Text-to-image generation with Würstchen and OpenVINO™ | ||
Distil-Whisper |
Automatic speech recognition using Distil-Whisper and OpenVINO™ | ||
FILM |
Frame interpolation with FILM and OpenVINO™ | ||
Audio LDM 2 |
Sound Generation with AudioLDM2 and OpenVINO™ | ||
SDXL-Turbo |
Single-step image generation using SDXL-turbo and OpenVINO | ||
Stable-Zephyr chatbot |
Use Stable-Zephyr as chatbot assistant with OpenVINO | ||
Efficient-SAM |
Object segmentation with EfficientSAM and OpenVINO | ||
LLM Instruction following pipeline |
Usage variety of LLM models for answering questions using OpenVINO | ||
Stable Diffusion with IP-Adapter |
Image conditioning in Stable Diffusion pipeline using IP-Adapter | ||
MobileVLM |
Mobile language assistant with MobileVLM and OpenVINO | ||
DepthAnything |
Monocular Depth estimation with DepthAnything and OpenVINO | ||
Kosmos-2: Grounding Multimodal Large Language Models |
Kosmos-2: Grounding Multimodal Large Language Model and OpenVINO™ | ||
PhotoMaker |
Text-to-image generation using PhotoMaker and OpenVINO | ||
OpenVoice |
OpenVoice a versatile instant voice tone transferring and generating speech in various languages. | ||
InstantID |
InstantID: Zero-shot Identity-Preserving Image Generation using OpenVINO |
- 🚀 AI Trends - Notebooks
- Table of Contents
- 📝 Installation Guide
- 🚀 Getting Started
- ⚙️ System Requirements
- 💻 Run the Notebooks
- 🧹 Cleaning Up
⚠️ Troubleshooting- 🧑💻 Contributors
- ❓ FAQ
OpenVINO Notebooks require Python and Git. To get started, select the guide for your operating system or environment:
Windows | Ubuntu | macOS | Red Hat | CentOS | Azure ML | Docker | Amazon SageMaker |
---|
The Jupyter notebooks are categorized into four classes, select one related to your needs or give them all a try. Good Luck!
NOTE: The main branch of this repository was updated to support the new OpenVINO 2023.3 release. To upgrade to the new release version, please run pip install --upgrade -r requirements.txt
in your openvino_env
virtual environment. If you need to install for the first time, see the Installation Guide section below. If you wish to use the previous release version of OpenVINO, please checkout the 2023.2 branch. If you wish to use the previous Long Term Support (LTS) version of OpenVINO check out the 2022.3 branch.
If you need help, please start a GitHub Discussion.
Brief tutorials that demonstrate how to use OpenVINO's Python API for inference.
001-hello-world |
002-openvino-api |
003-hello-segmentation |
004-hello-detection |
---|---|---|---|
Classify an image with OpenVINO | Learn the OpenVINO Python API | Semantic segmentation with OpenVINO | Text detection with OpenVINO |
Tutorials that explain how to optimize and quantize models with OpenVINO tools.
Notebook | Description |
---|---|
101-tensorflow-classification-to-openvino |
Convert TensorFlow models to OpenVINO IR |
102-pytorch-to-openvino |
Convert PyTorch models to OpenVINO IR |
103-paddle-to-openvino |
Convert PaddlePaddle models to OpenVINO IR |
104-model-tools |
Download, convert and benchmark models from Open Model Zoo |
105-language-quantize-bert |
Optimize and quantize a pre-trained BERT model |
106-auto-device |
Demonstrate how to use AUTO Device |
107-speech-recognition-quantization |
Quantize speech recognition models using NNCF PTQ API |
108-gpu-device | Working with GPUs in OpenVINO™ |
109-performance-tricks | Performance tricks in OpenVINO™ |
110-ct-segmentation-quantize |
Quantize a kidney segmentation model and show live inference |
112-pytorch-post-training-quantization-nncf | Use Neural Network Compression Framework (NNCF) to quantize PyTorch model in post-training mode (without model fine-tuning) |
113-image-classification-quantization |
Quantize Image Classification model |
115-async-api |
Use Asynchronous Execution to Improve Data Pipelining |
116-sparsity-optimization |
Improve performance of sparse Transformer models |
117-model-server | Introduction to model serving with OpenVINO™ Model Server (OVMS) |
118-optimize-preprocessing |
Improve performance of image preprocessing step |
119-tflite-to-openvino |
Convert TensorFlow Lite models to OpenVINO IR |
120-tensorflow-object-detection-to-openvino |
Convert TensorFlow Object Detection models to OpenVINO IR |
121-convert-to-openvino |
Learn OpenVINO model conversion API |
122-quantizing-model-with-accuracy-control | Quantizing with Accuracy Control using NNCF |
123-detectron2-to-openvino |
Convert Detectron2 models to OpenVINO IR |
124-hugging-face-hub |
Load models from Hugging Face Model Hub with OpenVINO™ |
125-torchvision-zoo-to-openvino Classification Semantic Segmentation |
Convert torchvision classification and semantic segmentation models to OpenVINO IR |
126-tensorflow-hub |
Convert TensorFlow Hub models to OpenVINO IR |
127-big-transfer-quantization | BiT Image Classification OpenVINO IR model Quantization with NNCF |
Demos that demonstrate inference on a particular model.
Notebook | Description | Preview |
---|---|---|
201-vision-monodepth |
Monocular depth estimation with images and video | |
202-vision-superresolution-image |
Upscale raw images with a super resolution model | → |
202-vision-superresolution-video |
Turn 360p into 1080p video using a super resolution model | → |
203-meter-reader |
PaddlePaddle pre-trained models to read industrial meter's value | |
204-segmenter-semantic-segmentation |
Semantic Segmentation with OpenVINO™ using Segmenter | |
205-vision-background-removal |
Remove and replace the background in an image using salient object detection | |
206-vision-paddlegan-anime |
Turn an image into anime using a GAN | → |
207-vision-paddlegan-superresolution |
Upscale small images with superresolution using a PaddleGAN model | |
208-optical-character-recognition |
Annotate text on images using text recognition resnet | |
209-handwritten-ocr |
OCR for handwritten simplified Chinese and Japanese | 的人不一了是他有为在责新中任自之我们 |
210-slowfast-video-recognition |
Video Recognition using SlowFast and OpenVINO™ | |
211-speech-to-text |
Run inference on speech-to-text recognition model | |
212-pyannote-speaker-diarization |
Run inference on speaker diarization pipeline | |
213-question-answering |
Answer your questions basing on a context | |
214-grammar-correction | Grammatical Error Correction with OpenVINO | Input text: I'm working in campany for last 2 yeas. Generated text: I'm working in a company for the last 2 years. |
215-image-inpainting |
Fill missing pixels with image in-painting | |
216-attention-center |
The attention center model with OpenVINO™ | |
218-vehicle-detection-and-recognition |
Use pre-trained models to detect and recognize vehicles and their attributes with OpenVINO | |
219-knowledge-graphs-conve |
Optimize the knowledge graph embeddings model (ConvE) with OpenVINO | |
220-books-alignment-labse |
Cross-lingual Books Alignment With Transformers and OpenVINO™ | |
221-machine-translation |
Real-time translation from English to German | |
222-vision-image-colorization |
Use pre-trained models to colorize black & white images using OpenVINO | |
223-text-prediction |
Use pretrained models to perform text prediction on an input sequence | |
224-3D-segmentation-point-clouds |
Process point cloud data and run 3D Part Segmentation with OpenVINO | |
225-stable-diffusion-text-to-image |
Text-to-image generation with Stable Diffusion method | |
226-yolov7-optimization |
Optimize YOLOv7 using NNCF PTQ API | |
227-whisper-subtitles-generation |
Generate subtitles for video with OpenAI Whisper and OpenVINO | |
228-clip-zero-shot-image-classification |
Perform Zero-shot Image Classification with CLIP and OpenVINO | |
229-distilbert-sequence-classification |
Sequence Classification with OpenVINO | |
230-yolov8-object-detection |
Optimize YOLOv8 object detection using NNCF PTQ API | |
230-yolov8-instance-segmentation |
Optimize YOLOv8 instance segmentation using NNCF PTQ API | |
230-yolov8-keypoint-detection |
Optimize YOLOv8 keypoint detection using NNCF PTQ API | |
231-instruct-pix2pix-image-editing |
Image editing with InstructPix2Pix | |
232-clip-language-saliency-map |
Language-Visual Saliency with CLIP and OpenVINO™ | |
233-blip-visual-language-processing |
Visual Question Answering and Image Captioning using BLIP and OpenVINO™ | |
234-encodec-audio-compression |
Audio compression with EnCodec and OpenVINO™ | |
235-controlnet-stable-diffusion |
A Text-to-Image Generation with ControlNet Conditioning and OpenVINO™ | |
236-stable-diffusion-v2 |
Text-to-Image Generation and Infinite Zoom with Stable Diffusion v2 and OpenVINO™ | |
237-segment-anything |
Prompt based segmentation using Segment Anything and OpenVINO™. | |
238-deep-floyd-if |
Text-to-Image Generation with DeepFloyd IF and OpenVINO™ | |
239-image-bind |
Binding multimodal data using ImageBind and OpenVINO™ | |
240-dolly-2-instruction-following |
Instruction following using Databricks Dolly 2.0 and OpenVINO™ | |
241-riffusion-text-to-music |
Text-to-Music generation using Riffusion and OpenVINO™ | |
242-freevc-voice-conversion |
High-Quality Text-Free One-Shot Voice Conversion with FreeVC and OpenVINO™ | |
243-tflite-selfie-segmentation |
Selfie Segmentation using TFLite and OpenVINO™ | |
244-named-entity-recognition |
Named entity recognition with OpenVINO™ | |
245-typo-detector |
English Typo Detection in sentences with OpenVINO™ | |
246-depth-estimation-videpth |
Monocular Visual-Inertial Depth Estimation with OpenVINO™ | |
247-code-language-id |
Identify the programming language used in an arbitrary code snippet | |
248-stable-diffusion-xl |
Image generation with Stable Diffusion XL and OpenVINO™ | |
249-oneformer-segmentation |
Universal segmentation with OneFormer and OpenVINO™ | |
250-music-generation |
Controllable Music Generation with MusicGen and OpenVINO™ | |
251-tiny-sd-image-generation |
Image Generation with Tiny-SD and OpenVINO™ | |
252-fastcomposer-image-generation |
Image generation with FastComposer and OpenVINO™ | |
253-zeroscope-text2video |
Text-to-video synthesis with ZeroScope and OpenVINO™ | A panda eating bamboo on a rock |
254-llm-chatbot |
Create LLM-powered Chatbot using OpenVINO™ | |
255-mms-massively-multilingual-speech |
MMS: Scaling Speech Technology to 1000+ languages with OpenVINO™ | |
256-bark-text-to-audio |
Text-to-Speech generation using Bark and OpenVINO™ | |
257-llava-multimodal-chatbot |
Visual-language assistant with LLaVA and OpenVINO™ | |
258-blip-diffusion-subject-generation |
Subject-driven image generation and editing using BLIP Diffusion and OpenVINO™ | |
259-decidiffusion-image-generation |
Image generation with DeciDiffusion and OpenVINO™ | |
260-pix2struct-docvqa |
Document Visual Question Answering using Pix2Struct and OpenVINO™ | |
261-fast-segment-anything |
Object segmentations with FastSAM and OpenVINO™ | |
262-softvc-voice-conversion |
SoftVC VITS Singing Voice Conversion and OpenVINO™ | |
263-latent-consistency-models-image-generation |
Image generation with Latent Consistency Models (LCM) and OpenVINO™ | |
264-qrcode-monster |
Generate creative QR codes with ControlNet QR Code Monster and OpenVINO™ | |
265-wuerstchen-image-generation |
Text-to-image generation with Würstchen and OpenVINO™ | |
266-speculative-sampling |
Text Generation via Speculative Sampling, KV Caching, and OpenVINO™ | |
267-distil-whisper-asr |
Automatic speech recognition using Distil-Whisper and OpenVINO™ | |
268-table-question-answering |
Table Question Answering using TAPAS and OpenVINO™ | |
269-film-slowmo |
Frame interpolation with FILM and OpenVINO™ | |
270-sound-generation-audioldm2 |
Sound Generation with AudioLDM2 and OpenVINO™ | |
271-sdxl-turbo |
Single-step image generation using SDXL-turbo and OpenVINO | |
272-paint-by-example |
Exemplar based image editing using diffusion models, Paint-by-Example, and OpenVINO™ | |
273-stable-zephyr-3b-chatbot |
Use Stable-Zephyr as chatbot assistant with OpenVINO | |
274-efficient-sam |
Object segmentation with EfficientSAM and OpenVINO™ | |
275-llm-question-answering |
LLM Instruction following pipeline | |
276-stable-diffusion-torchdynamo-backend |
Image generation with Stable Diffusion and OpenVINO™ torch.compile feature |
|
277-amused-lightweight-text-to-image |
Lightweight image generation with aMUSEd and OpenVINO™ | |
278-stable-diffusion-ip-adapter |
Image conditioning in Stable Diffusion pipeline using IP-Adapter | |
279-mobilevlm-language-assistant |
Mobile language assistant with MobileVLM and OpenVINO | |
280-depth-anything |
Monocular Depth Estimation with DepthAnything and OpenVINO | |
281-kosmos2-multimodal-large-language-model |
Kosmos-2: Multimodal Large Language Model and OpenVINO™ | |
282-siglip-zero-shot-image-classification |
Zero-shot Image Classification with SigLIP | |
283-photo-maker |
Text-to-image generation using PhotoMaker and OpenVINO | |
284-openvoice |
OpenVoice a versatile instant voice tone transferring and generating speech in various languages. | |
285-surya-line-level-text-detection |
Line-level text detection with Surya | |
286-instant-id |
InstantID: Zero-shot Identity-Preserving Image Generation using OpenVINO |
Tutorials that include code to train neural networks.
Notebook | Description | Preview |
---|---|---|
301-tensorflow-training-openvino | Train a flower classification model from TensorFlow, then convert to OpenVINO IR | |
301-tensorflow-training-openvino-nncf | Use Neural Network Compression Framework (NNCF) to quantize model from TensorFlow | |
302-pytorch-quantization-aware-training | Use Neural Network Compression Framework (NNCF) to quantize PyTorch model | |
305-tensorflow-quantization-aware-training |
Use Neural Network Compression Framework (NNCF) to quantize TensorFlow model |
Live inference demos that run on a webcam or video files.
Notebook | Description | Preview |
---|---|---|
401-object-detection-webcam |
Object detection with a webcam or video file | |
402-pose-estimation-webcam |
Human pose estimation with a webcam or video file | |
403-action-recognition-webcam |
Human action recognition with a webcam or video file | |
404-style-transfer-webcam |
Style Transfer with a webcam or video file | |
405-paddle-ocr-webcam |
OCR with a webcam or video file | |
406-3D-pose-estimation-webcam |
3D display of human pose estimation with a webcam or video file | |
407-person-tracking-webcam |
Person tracking with a webcam or video file |
If you run into issues, please check the troubleshooting section, FAQs or start a GitHub discussion.
Notebooks with and buttons can be run without installing anything. Binder and Google Colab are free online services with limited resources. For the best performance, please follow the Installation Guide and run the notebooks locally.
The notebooks run almost anywhere — your laptop, a cloud VM, or even a Docker container. The table below lists the supported operating systems and Python versions.
Supported Operating System | Python Version (64-bit) |
---|---|
Ubuntu 20.04 LTS, 64-bit | 3.8 - 3.10 |
Ubuntu 22.04 LTS, 64-bit | 3.8 - 3.10 |
Red Hat Enterprise Linux 8, 64-bit | 3.8 - 3.10 |
CentOS 7, 64-bit | 3.8 - 3.10 |
macOS 10.15.x versions or higher | 3.8 - 3.10 |
Windows 10, 64-bit Pro, Enterprise or Education editions | 3.8 - 3.10 |
Windows Server 2016 or higher | 3.8 - 3.10 |
If you wish to launch only one notebook, like the Monodepth notebook, run the command below.
jupyter 201-vision-monodepth.ipynb
jupyter lab notebooks
In your browser, select a notebook from the file browser in Jupyter Lab using the left sidebar. Each tutorial is located in a subdirectory within the notebooks
directory.
-
Shut Down Jupyter Kernel
To end your Jupyter session, press
Ctrl-c
. This will prompt you toShutdown this Jupyter server (y/[n])?
entery
and hitEnter
.
-
Deactivate Virtual Environment
To deactivate your virtualenv, simply run
deactivate
from the terminal window where you activatedopenvino_env
. This will deactivate your environment.To reactivate your environment, run
source openvino_env/bin/activate
on Linux oropenvino_env\Scripts\activate
on Windows, then typejupyter lab
orjupyter notebook
to launch the notebooks again.
-
Delete Virtual Environment (Optional)
To remove your virtual environment, simply delete the
openvino_env
directory:
-
On Linux and macOS:
rm -rf openvino_env
-
On Windows:
rmdir /s openvino_env
-
Remove
openvino_env
Kernel from Jupyterjupyter kernelspec remove openvino_env
If these tips do not solve your problem, please open a discussion topic or create an issue!
- To check some common installation problems, run
python check_install.py
. This script is located in the openvino_notebooks directory. Please run it after activating theopenvino_env
virtual environment. - If you get an
ImportError
, double-check that you installed the Jupyter kernel. If necessary, choose theopenvino_env
kernel from the Kernel->Change Kernel menu in Jupyter Lab or Jupyter Notebook. - If OpenVINO is installed globally, do not run installation commands in a terminal where
setupvars.bat
orsetupvars.sh
are sourced. - For Windows installation, it is recommended to use Command Prompt (
cmd.exe
), not PowerShell.
Made with contrib.rocks
.
- Which devices does OpenVINO support?
- What is the first CPU generation you support with OpenVINO?
- Are there any success stories about deploying real-world solutions with OpenVINO?
* Other names and brands may be claimed as the property of others.