Skip to content

voidsy-gmbh/pyThermoNDT

Repository files navigation

PyThermoNDT

PyThermoNDT is a Python package for manipulating thermographic data in Non-Destructive Testing (NDT) applications. It provides various methods to load, transform, visualize, and write thermographic data, making it easier and more efficient to work with thermal imaging in NDT contexts.

Features

  • Multi-source Data Loading: Read thermographic data seamlessly from local files and S3 storage
  • Hierarchical Data Structure: Store and access thermographic data, metadata, and ground truth in a common format
  • Remote Data Caching: Optionally cache data from remote sources for improved performance
  • Composable Transforms: Build custom processing pipelines with reusable transform components
  • PyTorch Integration: Datasets compatible with PyTorch DataLoader for training deep learning models

Quick Example

from torch.utils.data import DataLoader

from pythermondt import LocalReader, S3Reader
from pythermondt import transforms as T
from pythermondt.dataset import ThermoDataset, container_collate

# Load data from different sources
local_reader = LocalReader("./examples/example_data/**/*.hdf5", recursive=True)
s3_reader = S3Reader("ffg-bp", "example2_writing_data", download_files=True)

# Create optimized transform pipeline (deterministic transforms first for better caching)
transform = T.Compose([
    T.ApplyLUT(),                  # Convert raw data to temperatures
    T.RemoveFlash(),               # Remove flash frames
    T.NonUniformSampling(64),      # Resample data to 64 frames
    T.CropFrames(96, 96),          # Center crop the frames to 96x96
    T.MinMaxNormalize()            # Normalize data
])

# 1.) Access individual files using readers
container = local_reader[0]
processed = transform(container)

# 2.) Analyse processed data
processed.show_frame(frame_number=10)
processed.analyse_interactive()

# 3.) Combine sources in a dataset for training workflows
dataset = ThermoDataset([local_reader, s3_reader], transform=transform)

# 4.) Build cache for faster training (splits pipeline at first random transform)
dataset.build_cache("immediate")

# 5.) Use with PyTorch DataLoader for model training to be used in your training loop
collate_fn = container_collate('/Data/Tdata', '/GroundTruth/DefectMask')
dataloader = DataLoader(dataset, batch_size=4, shuffle=True, collate_fn=collate_fn)

for epoch in range(50):
    print(f"Epoch {epoch + 1}")
    for thermal_data, ground_truth in dataloader:
        print(f"Thermal data shape: {thermal_data.shape}")    # [4, 96, 96, 64]
        print(f"Ground truth shape: {ground_truth.shape}")    # [4, 96, 96]

From here?

PyThermoNDT is yours to use! You can start by exploring the examples directory for more detailed usage scenarios. The package is designed to be flexible and extensible, so feel free to modify and adapt it to your specific needs.

Installation

From PyPI (Recommended)

Install the latest stable release from PyPI:

pip install pythermondt

From GitHub

Install the latest development version from the main branch:

pip install git+https://github.com/voidsy-gmbh/pyThermoNDT.git

From Source

Clone the repository and install locally:

git clone https://github.com/voidsy-gmbh/pyThermoNDT.git
cd pyThermoNDT
pip install .

Documentation

For detailed usage examples, check out the Jupyter Notebooks in the examples directory.

Contributing

Contributions are welcome! Please see the Contributing Guidelines for details on setting up a development environment, coding standards, and the pull request process.

Funding

This project was partially funded by

  • the Austrian Research Promotion Agency (FFG) under grant numbers 920062 and 901177 as part of the project 'Thermal tomography'
  • the Austrian Research Promotion Agency (FFG) under grant numbers 921380 as part of the project 'FLARE'

About

No description, website, or topics provided.

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages