
Concat_images result not consistent between load/save cycles #986

Open
spolcyn opened this issue Jan 2, 2021 · 0 comments

spolcyn commented Jan 2, 2021

Describe the bug
After using concat_images to concatenate two NIfTI images, then saving the new image to disk and loading it back into memory, the values of the data arrays are no longer equal.

In my case, the source and destination NIfTI files both have uint16 data, so there should be no floating-point precision loss: any conversion to float32 or wider along the way can represent every uint16 value exactly.

To Reproduce

  1. Use concat_images on two NIfTI images (one of which should have an integer data type, e.g., uint16)
  2. Save the result
  3. Load the result
  4. Compare the images' data arrays with np.array_equal

See the test written into the pull request; removing the patch in funcs.py triggers the failure. I'd expect this issue to occur generally with NIfTIs that use an integer data format.

Alternatively, use the script in this gist with your own NIfTI file.

Expected behavior
The post-concat in-memory representation of the data should be equal to the data after it's been saved/re-loaded from disk.

Actual behavior
The data arrays are no longer equal after the save/load round trip.

Likely Reason
This is almost certainly related to the datatype of the dataobj after concat_images. My analysis of the reason is as follows:

  1. Images with uint16 dataobjs enter the function
  2. out_data is created with the correct shape, but with the default dtype=np.float64
  3. out_data receives the data, upcasting it to float64
  4. out_data is used to form the new image
  5. When the new image is written to disk, there is a conflict between the datatype it should have on disk (uint16) and the current dataobj type (float64).
  6. To get around this, a scaling slope is applied -- however, because the header stores the slope as a 32-bit float, the scaling is not exact (at least with the values in my test images).
    a. Note: This can be verified by examining the NIfTI header binary directly. For my test I found an scl_slope of 0x3d4e00ce = 0.0502937361598, and each data element was off by exactly 19 + 5/6. Doing the math, 1/(19 + 5/6) = 0.050420168067227, which is close to but not exactly the scl_slope value. (This may suggest the choice of scl_slope in the save() routine could be tweaked, but without really digging into that code myself, I'm not sure -- the most accurate float32 approximation of that number appears to be 0.0504201687872409820556640625, or 0x3d4e8561 in hex.)
  7. Accordingly, when the image is re-read, the float64 result is slightly off (slightly less than the true value for my test images), and the uint16 result is also incorrect: the values work out slightly below the true value, so the cast floors them to the integer just beneath it.
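The mechanism in steps 2-7 can be simulated in plain NumPy (the slope choice here -- mapping the data max to the uint16 max -- is an assumption for illustration, not nibabel's exact algorithm; the values are hypothetical):

```python
import numpy as np

# Hypothetical uint16 voxel values standing in for the real image data.
data = np.array([1000, 2017, 3296], dtype=np.uint16)

# Steps 2-3: out_data is allocated with the default dtype (float64) and
# the integer data is upcast into it.
out_data = np.empty(data.shape)  # dtype=np.float64 by default
out_data[...] = data

# Steps 5-6: on save, the float64 values must be squeezed back into uint16,
# so a scaling slope is computed and stored as a 32-bit float in the header.
slope = np.float32(out_data.max() / np.iinfo(np.uint16).max)
raw = np.round(out_data / slope).astype(np.uint16)

# Step 7: on re-read, raw * slope does not reproduce the original integers,
# because the float32 slope is not the exact reciprocal of the scale factor.
recovered = raw * np.float64(slope)
print(np.array_equal(recovered, data))  # False
```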

Potential Fix
Passing the dtype explicitly -- out_data = np.empty(out_shape, dtype=img0.header.get_data_dtype()) -- in funcs.py fixes the problem (alternatively, .dataobj.dtype could be used, but going through the header seems more maintainable).
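The difference between the two allocations is easy to see in isolation (img0.header.get_data_dtype() would return uint16 for the images in question; a literal dtype stands in for it here):

```python
import numpy as np

out_shape = (4, 4, 4, 2)
buggy = np.empty(out_shape)                   # defaults to float64 -> upcast
fixed = np.empty(out_shape, dtype=np.uint16)  # proposed fix: preserve the dtype

print(buggy.dtype, fixed.dtype)  # float64 uint16
```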

I've put in a PR with a test modification and my proposed fix; happy to discuss it either there or here.

Related to #1001
