uvgSaliencyCoding

A framework for saliency-guided High Efficiency Video Coding (HEVC), based on Region of Interest (ROI) detection and tracking.

Features

Real-time performance
Energy-efficient
Accurate ROI tracking capabilities
Optimized bitrates

Installation

Prerequisites

C/C++ development tools (gcc, make)
Autotools (autoconf, automake, libtool)
Python 3.6+
Git
ZeroMQ

apt-get install libzmq3-dev

Kvazaar installation

# Initialize and update the Kvazaar submodule
git submodule init
git submodule update

# Apply the required patch
cd kvazaar
git apply ../patches/kvazaar_framework_7fe86344.patch
cd ..

# Compile and install
cd kvazaar
./autogen.sh
./configure
make -j
sudo make install
sudo ldconfig
cd ..

NOTE: For the most up-to-date installation information, please refer to the Kvazaar repository.

Python Dependencies

pip install -r requirements.txt

Energy measurement

Important: For granting Intel Energy Permission, run:

sudo chmod -R a+r /sys/class/powercap/intel-rapl

Pretrained Models

The pretrained weights are provided in the weights directory of this repository. Download our pretrained weights to quickly get started with ROI detection and tracking:

Model	Type	Trained On	Size
yolov8n-head	Detection	HollywoodHeads	24.5MB
light-tracker-head	Tracking	UVG-ROI dataset	40.8kB
enhanced-tracker-head	Tracking	UVG-ROI dataset	19.4MB

Usage

Training a new tracker model

python train_tracker.py

Training a new detection model

NOTE: For the most up-to-date training instructions and parameters, please refer to the Ultralytics documentation.

python train_yolo.py

Testing the tracker model

python test_tracker.py

ROI-based video encoding with Kvazaar

python kvazaar_roi_encode.py

Standard video encoding with Kvazaar (for comparison)

python kvazaar_traditional_encode.py

Citation

If you use this framework in your research, please cite this paper:

@ARTICLE{10820524,
  author={Partanen, Tero and Hoang, Minh and Mercat, Alexandre and Sainio, Joose and Vanne, Jarno},
  journal={IEEE Journal on Emerging and Selected Topics in Circuits and Systems}, 
  title={Energy-Efficient Saliency-Guided Video Coding Framework for Real-Time Applications}, 
  year={2025},
  volume={15},
  number={1},
  pages={44-57},
  keywords={Encoding;Streaming media;Video coding;Image coding;Saliency detection;Energy efficiency;Energy consumption;Visualization;Object tracking;Computational modeling;Saliency-guided encoding;region-of-interest (ROI);ROI tracking;deep learning (DL);motion vector (MV)},
  doi={10.1109/JETCAS.2024.3525339}}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
assets		assets
evaluation		evaluation
kvazaar @ 999c512		kvazaar @ 999c512
models		models
patches		patches
utils		utils
weights		weights
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
config.py		config.py
gt_generation_yolo.py		gt_generation_yolo.py
hollywood-heads.yaml		hollywood-heads.yaml
kvazaar_roi_encode.py		kvazaar_roi_encode.py
kvazaar_traditional_encode.py		kvazaar_traditional_encode.py
onnx_export.py		onnx_export.py
requirements.txt		requirements.txt
test_tracker.py		test_tracker.py
train_tracker.py		train_tracker.py
train_yolo.py		train_yolo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

uvgSaliencyCoding

Features

Installation

Prerequisites

Kvazaar installation

Python Dependencies

Energy measurement

Pretrained Models

Usage

Training a new tracker model

Training a new detection model

Testing the tracker model

ROI-based video encoding with Kvazaar

Standard video encoding with Kvazaar (for comparison)

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

ultravideo/uvgSaliencyCoding

Folders and files

Latest commit

History

Repository files navigation

uvgSaliencyCoding

Features

Installation

Prerequisites

Kvazaar installation

Python Dependencies

Energy measurement

Pretrained Models

Usage

Training a new tracker model

Training a new detection model

Testing the tracker model

ROI-based video encoding with Kvazaar

Standard video encoding with Kvazaar (for comparison)

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages