Decoding multiple frames without decoder create/destroy #306

slugwarz05 · 2024-03-04T21:08:33Z

Similar to #86 for encoding, it would be great if an enhancement was made to implement a function similar to charls_jpegls_encoder_rewind, except on the decoding side of things.

The ideal state would be if, when decoding multiple same-sized images, no additional new/delete calls are made. This would be achieved in part by not needing to create and destroy a charls_jpegls_decoder object for each image that needs decoding.

Or, if it is possible to calculate memory needs for the encode/decode operation, enabling custom allocators would solve this as well. But that seems like a longer-term enhancement.

The text was updated successfully, but these errors were encountered:

vbaderks · 2024-03-05T19:58:32Z

Hi,

The included test application CharLSTest can be used to run performance tests. With -decodeperformance:10 it is for example possible to decode the same image 10 times.
If I execute this perf test in a profiler, 99% of the CPU is spend in the method decode_lines() or one of its children.

Do you have a specific scenario for which you see a significant performance improvement by re-using the same decoder instance?

Note: internal CharLS will also perform 2 new calls:

1 call to allocate a buffer to store 2 scan lines (size depends on the size of the image that needs to be decoded)
1 call to construct the scan_decoder

slugwarz05 · 2024-03-08T15:35:25Z

Hello,

The specific scenario I have is one where thousands of monochrome, 16 bit-depth, and same-sized images are compressed with JPEG-LS and placed into a binary file. Very similar to the basic operating principle of MJPEG, except other structures besides imagery are also embedded in the file. Due to the high quantity of imagery, it is prudent to minimize superfluous calls to new/delete.

I believe I was able to accomplish this on the encoding side through use of charls_jpegls_encoder_rewind, but would like a way to do this for decoding as well.

vbaderks · 2024-03-12T21:21:51Z

I will see if it is possible to create a proof of concept for charls_jpegls_decoder_rewind.

vbaderks · 2024-03-17T20:59:04Z

I have created in the branch rewind-poc a version that has a method charls_jpegls_decoder_rewind.
If you see see a significant performance improvement, let me know.

Note: deep inside the decoding there are still 2 allocations done. It might be possible to reuse these also.

slugwarz05 · 2024-03-18T22:40:16Z

Much appreciated! I pulled the branch down and found that my rather old C++ compiler (VS2015) does not support CharLS 3.x, with its migration to C++17. I was originally working from the 2.4.2 tag.

I will attempt a manual merge/transcription the commit's substantive changes for throughput testing.

slugwarz05 · 2024-04-02T14:13:12Z

I completed a manual merge/transcription into the 2.4.2 baseline (the primary inconsistency is omission of thresholds/reset_value checks in the new function is_compatible due to where the function lives in the decoder of 2.4.2), and ran a test case where 36,338 frames of 640x512 (16 bit monochrome) resolution were, in sequence:

Encoded, and the same encoded buffer was used to...
Decode by creating/destroying a charls_jpegls_decoder object
Decode by maintaining a charls_jpegls_decoder object and calling charls_jpegls_decoder_rewind after each decode

Memory usage went up by 100kB (but that's the only resolution I had on task manager) for the duration of the test, and the average decode time difference was negligible for my case (~0.4% slower, and interestingly decoding with rewind was slower).

I'll defer to you for the determination on whether these results warrant being part of the next tag.

vbaderks added the enhancement label Mar 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decoding multiple frames without decoder create/destroy #306

Decoding multiple frames without decoder create/destroy #306

slugwarz05 commented Mar 4, 2024

vbaderks commented Mar 5, 2024

slugwarz05 commented Mar 8, 2024

vbaderks commented Mar 12, 2024

vbaderks commented Mar 17, 2024

slugwarz05 commented Mar 18, 2024

slugwarz05 commented Apr 2, 2024

Decoding multiple frames without decoder create/destroy #306

Decoding multiple frames without decoder create/destroy #306

Comments

slugwarz05 commented Mar 4, 2024

vbaderks commented Mar 5, 2024

slugwarz05 commented Mar 8, 2024

vbaderks commented Mar 12, 2024

vbaderks commented Mar 17, 2024

slugwarz05 commented Mar 18, 2024

slugwarz05 commented Apr 2, 2024