Commits an alternative Zeiss CZI Reader #4092

NicoKiaru · 2023-09-08T13:20:30Z

Hello,

This PR is a draft which ultimate goal would be to rewrite the logic of the CZI reader.

The goal is to achieve a compatilibility as good as the original reader while begin more efficient in terms of initial metadata parsing speed as well as being more memory efficient for big files (Tb).

I clearly do not want to get this merged right away - rather, I'd like to know if you could run some tests on your internal set of files and report if some file types are problematic or not.

Related links:

dgault · 2023-09-19T13:04:19Z

@NicoKiaru, I included this PR in the full CI test suite last night, though the initialisation of the tests failed with a number of different exceptions: https://merge-ci.openmicroscopy.org/jenkins/job/BIOFORMATS-test-folder/lastFailedBuild/console

I believe the below are all of the unique exceptions that are see:

NullPointerException: null:

   [testng] java.lang.NullPointerException: null
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer$Corner.fromBlocks(ZeissCZIReader.java:3124)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.setSpaceAndTimeInformation(ZeissCZIReader.java:3310)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.initializeMetadata(ZeissCZIReader.java:2127)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:1286)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.ImageReader.setId(ImageReader.java:875)

NullPointerException: null:

   [testng] java.lang.NullPointerException: null
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.setSpaceAndTimeInformation(ZeissCZIReader.java:3256)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.initializeMetadata(ZeissCZIReader.java:2127)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:1286)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.ImageReader.setId(ImageReader.java:875)

IndexOutOfBoundsException: Index 0 out of bounds for length 0:

   [testng] java.lang.IndexOutOfBoundsException: Index 0 out of bounds for length 0
   [testng] 	at java.base/jdk.internal.util.Preconditions.outOfBounds(Preconditions.java:64)
   [testng] 	at java.base/jdk.internal.util.Preconditions.outOfBoundsCheckIndex(Preconditions.java:70)
   [testng] 	at java.base/jdk.internal.util.Preconditions.checkIndex(Preconditions.java:248)
   [testng] 	at java.base/java.util.Objects.checkIndex(Objects.java:372)
   [testng] 	at java.base/java.util.ArrayList.get(ArrayList.java:459)
   [testng] 	at ome.xml.model.OME.getImage(OME.java:680)
   [testng] 	at ome.xml.meta.OMEXMLMetadataImpl.setImageROIRef(OMEXMLMetadataImpl.java:7699)
   [testng] 	at ome.xml.meta.FilterMetadata.setImageROIRef(FilterMetadata.java:1216)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.translateLayers(ZeissCZIReader.java:3827)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.translateMetadata(ZeissCZIReader.java:2320)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.readXMLMetadata(ZeissCZIReader.java:2284)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.initializeMetadata(ZeissCZIReader.java:2116)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:1286)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.in.ZeissCZIReader.addSlidePreviewIfExists(ZeissCZIReader.java:1346)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:1223)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.ImageReader.setId(ImageReader.java:875)

IOException: ZISRAWDIRECTORY segment expected, found ZISRAWFILE instead:

   [testng] java.io.IOException: ZISRAWDIRECTORY segment expected, found ZISRAWFILE instead.
   [testng] 	at loci.formats.in.libczi.LibCZI.getSubBlockDirectorySegment(LibCZI.java:123)
   [testng] 	at loci.formats.in.ZeissCZIReader$CZISegments.<init>(ZeissCZIReader.java:1994)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:971)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.ImageReader.setId(ImageReader.java:875)

ArithmeticException: / by zero:

   [testng] java.lang.ArithmeticException: / by zero
   [testng] 	at loci.formats.in.ZeissCZIReader.setOriginAndSize(ZeissCZIReader.java:1436)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:1155)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.ImageReader.setId(ImageReader.java:875)

EOFException: Attempting to read beyond end of file.:

   [testng] java.io.EOFException: Attempting to read beyond end of file.
   [testng] 	at loci.common.NIOFileHandle.readInt(NIOFileHandle.java:415)
   [testng] 	at loci.common.RandomAccessInputStream.readInt(RandomAccessInputStream.java:564)
   [testng] 	at loci.formats.in.libczi.LibCZI.getBlock(LibCZI.java:416)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.getSubBlockMeta(ZeissCZIReader.java:3076)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.setSpaceAndTimeInformation(ZeissCZIReader.java:3373)
   [testng] 	at loci.formats.in.ZeissCZIReader$MetadataInitializer.initializeMetadata(ZeissCZIReader.java:2127)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:1286)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.ImageReader.setId(ImageReader.java:875)

IllegalArgumentException: No dimension C found:

   [testng] java.lang.IllegalArgumentException: No dimension C found
   [testng] 	at loci.formats.in.ZeissCZIReader$ModuloDimensionEntries.getDimension(ZeissCZIReader.java:1836)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:1270)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.in.ZeissCZIReader.addSlidePreviewIfExists(ZeissCZIReader.java:1346)
   [testng] 	at loci.formats.in.ZeissCZIReader.initFile(ZeissCZIReader.java:1223)
   [testng] 	at loci.formats.FormatReader.setId(FormatReader.java:1496)
   [testng] 	at loci.formats.ImageReader.setId(ImageReader.java:875)

NicoKiaru · 2023-09-19T15:00:42Z

Ok, 35 files not working, and I cound a bit more unique errors - some of them may be multiple (arithmetic exception) because there's no stack trace:

-no dim C found:
- "/data/ayako/Mouse_MosaiX_RGB.czi"
- "/data/demolitionb/hzdr-sims.czi"
- "/data/Image Attachment of CZI is not read correctly with BioFormats 5.3.4 #2791/testwell96_withAttachment_S1-3.czi"
NPE at loci.formats.in.ZeissCZIReader$MetadataInitializer.setSpaceAndTimeInformation(ZeissCZIReader.java:3442)
- "/data/greg/Experiment-519-export.czi"
- "/data/greg/Experiment-519.czi"
- "/data/qa-7601/AxioCamICc5_Gray16_12ValidBits_1C_8Z.czi"
- "/data/qa-7601/AxioCamICc5_bgr24_24ValidBits_1C_8Z.czi"
- "/data/qa-7601/AxioCamICc5_bgr48_36ValidBits_1C_8Z.czi"
- "/data/qa-7601/AxioCamMRc_Gray16_12ValidBits_1C_9Z.czi"
- "/data/qa-7601/AxioCamMRc_bgr48_36ValidBits_1C_9Z.czi"
read beyond EOF at loci.formats.in.libczi.LibCZI.getBlock(LibCZI.java:416)
- "/data/qa-10301/141016_OP43adult_Worm3_DAPI_head.czi"
/ by zero at loci.formats.in.ZeissCZIReader.setOriginAndSize(ZeissCZIReader.java:1436)
- "/data/qa-11007/SCC47 EGF647 LAMP1488 - EGF only 3D with 405_PALM.czi"
- "/data/qa-11007/SCC47 EGF647 LAMP1488 - LAMP1 only 3D with 405 2_PALM - KAthrin.czi"
NPE at loci.formats.in.ZeissCZIReader$MetadataInitializer.setSpaceAndTimeInformation(ZeissCZIReader.java:3173)
- "/data/qa-11104/Jumping_dom_brain3_slide1.czi"
- "/data/stephane/AO5.czi"
- "/data/zeiss/zen-2012/Axio Scan.Z1/Intestine_3color_RAC.czi"
- "/data/zeiss/zen-2012/Kidney_RAC_3color.czi"
java.lang.ArithmeticException: null unspecified
- "/data/qa-17616/abeta647-dilution1milliontimesnanomolar.czi"
- "/data/qa-9410/cav_PALM_drift_correct_Convert to Image.czi"
- "/data/zeiss/ZEN 2.1 black LSM 880 -May 2015/Image 5_PALM_verrechnet.czi"
- "/data/zeiss/ZEN 2.1 black LSM 880 -May 2015/PALM_OnlineVerrechnet.czi"
- "/data/zeiss/dvds/lsm-disk1/CZI-Example Data LSM/PALM/Palm_mitDrift.czi"
ZISRAWDIRECTORY segment expected, found ZISRAWFILE instead at loci.formats.in.libczi.LibCZI.getSubBlockDirectorySegment(LibCZI.java:123)
- "/data/qa-7271/bioformats_multifile.czi"
- "/data/qa-7276/Experiment-297.czi"
NPE at loci.formats.in.ZeissCZIReader$MetadataInitializer.setSpaceAndTimeInformation(ZeissCZIReader.java:3256)
- "/data/vito/36bit cast to 48.czi"
- "/data/vito/36bit.czi"
NPE unspecified
- "/data/vito/lena 10k x 10k bnw 16bit.czi"
- "/data/vito/lena 10k x 10k bnw 8bit.czi"
NPE at loci.formats.in.ZeissCZIReader$MetadataInitializer$Corner.fromBlocks(ZeissCZIReader.java:3124)
- "/data/zeiss/dvds/lsm-disk1/CZI-Example Data LSM/Misc/Channel-ZStack-LineScan-Bidirectional-Averaging.czi"
- "/data/zeiss/dvds/lsm-disk1/CZI-Example Data LSM/ScanModes/LineScan_T3500.czi"
- "/data/zeiss/dvds/lsm-disk1/CZI-Example Data LSM/ScanModes/LineScan_T80_Z25.czi"
- "/data/zeiss/dvds/lsm-disk1/CZI-Example Data LSM/ScanModes/LineScan_Z200.czi"
xml issue
- "/data/zeiss/dvds/wf-disk1/CZI Test WF/CZI-Example Data WF/Annotation/winnt.czi"
java.lang.IndexOutOfBoundsException: Index 0 out of bounds for length 0
- "/data/zeiss/zen-2012/Young_mouse.czi"

So, it's not perfect but, by looking at these erros, it does not seem impossible. That leaves of course the crucial question of accessing similar files for me. Are there any of these which could be shared ?

I see some Axiocam issue - these kind of files I could try to generate with one of our systems. However there are some PALM dataset that I could not access and I see that they cause errors. The data are from 2012, so for these sort of files I'll clearly have a hard time finding a new sample file.

dgault · 2023-09-22T16:31:01Z

I will have to take some time to go through the datasets and see if there is any way we can provide you with a way to reproduce and test, it may take me a few weeks to do so though.

NicoKiaru · 2023-10-19T11:38:04Z

Don't you think there's a possibility to directly share these files ? I will not use them for anything else than fixing bugs, and will not updload them publicly anywhere.

NicoKiaru · 2023-10-20T15:36:39Z

I know you're not super keen on changing the reader, but without the possibility for me to get the failing image, I just can't move forward. I'd just like to point out that these issues:

#4110
#4103
#4102
#3785

(and probably these ones)
#3919
#3790

are not happenning in the quick start reader.

I do not know what's the best way forward, but either:

you having a look at the alternative reader
or me being able to be able to fix these remaining issues
are options maybe worth considering ?

JavascriptMick · 2023-10-23T02:36:31Z

Hi Guys,

Thank you so much for working on this issue. FWIW, we went to a lot of trouble to ensure that this file we uploaded (https://zenodo.org/record/8423633) as an example is completely free of I.P and we are happy for it to be public, part of your test suite etc....

NicoKiaru · 2023-10-23T09:03:48Z

Hello @JavascriptMick ,

I'll try to answer in the forum, but I think that what you request is a new feature, and it's not a bug of the reader. I think your file behaves correctly in bio-formats.

There's no way to stitch all scenes as a single image in the reader currently (only all tiles from a scene are stitched). And even if it was, it's a problem on the OMERO side... How to deal with a czi file which can be opened in two different ways ? That breaks somehow the 'immutability' concept of what is an image raw data.

dgault · 2023-10-23T13:53:18Z

@NicoKiaru unfortunately the sample files that are currently failing the tests are from files provided private that we are able to share. I realise that it will make it very difficult to move forward with resolving the failures. As a next step I will try and spend some time reviewing the PR over the next week to see if I can resolve the initial failures and unlock the next steps.

Thanks to @JavascriptMick for making that sample data publicly available, that is always a tremendous help. For reference the relevant forum post for that data is https://forum.image.sc/t/zoom-from-overview-to-detailed-scan-for-imported-czi-files/85002/1

JavascriptMick

Hi, Michael Dausmann from CMRI here. I raised an issue in the forums and I think maybe this PR is a result so I spent some time to do a review. Hope you don't mind. I was only able to pickout things that are maybe mistakes rather than really review the structure of what is done here. Thanks again for looking at our issue.

JavascriptMick · 2023-10-23T23:16:29Z