• zerofk@lemm.ee · 2 days ago

    “And while Spectral JPEG XL dramatically reduces file sizes, its lossy approach may pose drawbacks for some scientific applications.”

    This is the part that confuses me. First of all, many applications that need spectral data need it to be as accurate as possible; lossy compression may simply not be acceptable there.

    More interestingly (and I’ll read the actual paper for this): which data gets compressed the most? Simply put, JPEG achieves its best compression by keeping brightness and discarding colour detail. Which dimensions of which spectral space do the researchers think can be compressed more aggressively than others? In this case there is no human visual system to base that decision on.
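
    For context, the trade-off classic JPEG makes for human viewers can be sketched in a few lines of Python (a minimal sketch, assuming NumPy; the BT.601 constants are the ones baseline JPEG uses): convert RGB to one luma and two chroma channels, keep luma at full resolution, and subsample the chroma. The open question above is which axis gets to play the “chroma” role when there is no viewer.

        import numpy as np

        rgb = np.random.rand(8, 8, 3)  # toy RGB image, floats in [0, 1]

        # BT.601 RGB -> YCbCr, the internal colour space of classic JPEG
        r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
        y = 0.299 * r + 0.587 * g + 0.114 * b   # luma: kept at full resolution
        cb = 0.564 * (b - y)                    # chroma: human vision is less
        cr = 0.713 * (r - y)                    # sensitive, so JPEG subsamples it

        # 4:2:0 subsampling: keep every 2nd chroma sample in both directions
        cb_sub, cr_sub = cb[::2, ::2], cr[::2, ::2]
        print(y.size + cb_sub.size + cr_sub.size, "samples instead of", rgb.size)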

    • rice@lemmy.org · 2 days ago

      JPEG XL does support lossless, and their 69-page paper does mention this, so I’m unsure why they put the lossy side of it up against their “lossless ZIP COMPRESSION of OpenEXR”.

      Page 51 has more detail on the compression stuff. OpenEXR also supports lossy compression. Anyway, I think pages 51–52 would answer it for someone who knows more about OpenEXR, which I sure don’t.

      Their comparison images do clearly show data being lost as well, so they aren’t even using the visually lossless mode of JPEG XL; they’re going full lossy. Must be some use case somewhere?

    • hera@feddit.uk · 2 days ago

      I literally can’t think of a scientific use case where lossy compression would be acceptable?

  • rdri@lemmy.world · 1 day ago

    Last I checked, JPEG XL takes a lot of time and resources to encode an image if you actually want the result to be far better optimized than plain JPEG.

  • AbouBenAdhem@lemmy.world · 2 days ago

    Spectral JPEG XL utilizes a technique used with human-visible images, a math trick called a discrete cosine transform (DCT), to make these massive files smaller […] it then applies a weighting step, dividing higher-frequency spectral coefficients by the overall brightness (the DC component), allowing less important data to be compressed more aggressively.

    This all sounds like standard jpeg compression. Is it just jpeg with extra channels?
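
    A minimal sketch of that DCT-plus-weighting idea as the article describes it (not the authors’ actual code; assumes SciPy, and the quantization step size is made up):

        import numpy as np
        from scipy.fft import dct, idct

        cube = np.random.rand(4, 4, 31).astype(np.float32)  # H x W x 31 bands

        # DCT along the spectral axis packs most energy into low frequencies
        coeffs = dct(cube, axis=-1, norm="ortho")

        # Weighting step: divide higher-frequency (AC) coefficients by the
        # DC component so they can be quantized more aggressively
        dc = coeffs[..., :1]
        w = np.maximum(np.abs(dc), 1e-6)
        ac = coeffs[..., 1:] / w

        step = 0.02                            # hypothetical quantization step
        ac = np.round(ac / step) * step        # the lossy part: rounding

        # Decode: undo the weighting, then invert the DCT
        recon = idct(np.concatenate([dc, ac * w], axis=-1), axis=-1, norm="ortho")
        print("max abs error:", float(np.abs(recon - cube).max()))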

    • Prok@lemmy.world · 2 days ago

      Yeah, though it also compresses better, and JPEG XL can be configured to compress losslessly, which I imagine would also work here.

        • Cocodapuf@lemmy.world · 2 days ago

          In my experience, as you increase the quality level of a JPEG, the compression ratio drops significantly, much more than with some other formats, notably PNG. I’d be curious to see comparisons with PNG and GIF. I wouldn’t be surprised if the new JPEG compresses better at some resolutions but not all, or only with some kinds of images.
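
          That’s easy to measure yourself; a quick sketch with Pillow (exact numbers depend heavily on the test image, and “photo.png” is a placeholder):

              import io
              from PIL import Image

              img = Image.open("photo.png").convert("RGB")

              for q in (50, 75, 90, 95, 100):
                  buf = io.BytesIO()
                  img.save(buf, format="JPEG", quality=q)
                  print(f"JPEG quality {q}: {buf.tell()} bytes")

              buf = io.BytesIO()
              img.save(buf, format="PNG")  # lossless baseline
              print(f"PNG: {buf.tell()} bytes")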

          • rice@lemmy.org · 2 days ago

            JPEG XL grew out of FLIF, which has been in development for like 15 years; there are tons of comparisons all over, even live ones on YouTube.

    • wischi@programming.dev · 2 days ago (edited)

      It’s not just JPEG with extra channels. It’s technically far superior: it supports lossless compression, and the way its progressive decoding works would make thumbnails obsolete. It can even recompress existing JPEGs into smaller files without additional generation loss. It’s hard to describe what a major step this format would be without getting very technical. A lot of operating systems and software already support it, but the Google Chrome team is practically preventing widespread adoption because of company politics.

      https://issues.chromium.org/issues/40168998
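
      The JPEG recompression part is easy to try with the libjxl reference tools; a sketch driving them from Python (assumes cjxl/djxl are on your PATH and that cjxl defaults to lossless JPEG transcoding for JPEG input; check your version):

          import filecmp
          import subprocess

          # Repack an existing JPEG as JPEG XL; the DCT coefficients are
          # transcoded directly, so there is no extra generation loss
          subprocess.run(["cjxl", "photo.jpg", "photo.jxl"], check=True)

          # Reconstruct the original JPEG bit stream from the .jxl
          subprocess.run(["djxl", "photo.jxl", "roundtrip.jpg"], check=True)

          print("bit-identical:", filecmp.cmp("photo.jpg", "roundtrip.jpg", shallow=False))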

        • wischi@programming.dev · 1 day ago (edited)

          JPEG does not support lossless compression in practice. There was a lossless extension to the standard in 1993, but most encoders and decoders don’t implement it, and it never took off. With JPEG XL you get more bang for your buck: the same visual quality gets you a smaller file. And there would be no more need for thumbnails, thanks to its improved progressive decoding.

          https://youtu.be/UphN1_7nP8U

          • uis@lemm.ee · 15 hours ago

            Then the same can be said about JPEG LS and JPEG XL: most browsers don’t implement those either.

    • zerofk@lemm.ee · 2 days ago

      Kind of, but JPEG converts image data to its own internal three-channel colour space before applying the DCT. It is not compressing the R, G and B channels of most images directly. So multichannel compression is not just compressing each channel separately.

      • AbouBenAdhem@lemmy.world · 2 days ago

        Yeah, JPEG converts to Lab (or something similar, I think). But the dimensionality is the same: one channel for lightness, and then one fewer channel than the total number of sampled frequencies to capture the rest of the colour space.

  • pelya@lemmy.world · 2 days ago

    What, pickle.dump your enormous Numpy array not good enough for you anymore? Not even fancy zlib.compress(pickle.dumps(enormousNumpyArray)) will satisfy you? Are you a scientist or a spectral data photographer?

    • KingRandomGuy@lemmy.world · 1 day ago

      I guess part of the reason is to have a standardized format for multi- and hyperspectral images, especially for storing things like metadata. Simply storing a NumPy array may not be ideal if you don’t keep metadata on what is being stored and in what order (i.e. axis order, which channel corresponds to which frequency band, etc.). Plus they extend lossy compression to this modality, which could be useful in some circumstances (though for scientific use you’d probably want lossless).

      If compression isn’t the concern, certainly other formats could work to store metadata in a standardized way. FITS, the image format used in astronomy, comes to mind.
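
      For example, with astropy a spectral cube plus standardized header keywords is only a few lines (BUNIT/CRVAL3/CDELT3 are real FITS keywords; the values here are illustrative):

          import numpy as np
          from astropy.io import fits

          cube = np.random.rand(31, 64, 64).astype(np.float32)  # bands x H x W

          hdu = fits.PrimaryHDU(cube)
          hdu.header["BUNIT"] = "W/m2/sr/nm"   # physical units of the samples
          hdu.header["CRVAL3"] = 400.0         # wavelength of the first band (nm)
          hdu.header["CDELT3"] = 10.0          # spacing between bands (nm)
          hdu.writeto("cube.fits", overwrite=True)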

      • pelya@lemmy.world · 1 day ago

        Saving arbitrary metadata is the exact use case for the pickle module: you just put it into a tuple together with your NumPy array, as in the sketch below. The JPEG format has support for storing metadata, but it’s an afterthought, like .mp3 tags; half of all applications don’t support it.

        I can imagine multichannel jpeg to be used in photo editing software, so you can effortlessly create false-color plots of your infrared data, maybe even apply a beauty filter to your Eagle Nebula microwave scans.
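
        (The tuple pattern from above, spelled out as a minimal sketch; the metadata keys are whatever you decide they are, which is exactly the problem being discussed:)

            import pickle
            import zlib

            import numpy as np

            cube = np.random.rand(31, 64, 64)
            meta = {"axes": ("band", "y", "x"),
                    "wavelengths_nm": list(range(400, 710, 10))}

            blob = zlib.compress(pickle.dumps((cube, meta)))
            cube2, meta2 = pickle.loads(zlib.decompress(blob))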

        • KingRandomGuy@lemmy.world · 1 day ago

          I agree that pickle works well for storing arbitrary metadata, but my main gripe is that there’s no exact standard for how the metadata should be formatted. In FITS, for example, there are keywords for metadata such as the row order, CFA matrices, etc. that all FITS processing and display programs need to follow to read the image properly. So to make working with multi-spectral data easier, it’d definitely be helpful to have a standard set of keywords and a standard encoding format.

          It would be interesting to see if photo editing software will pick up multichannel JPEG. As of right now there are very few sources of multi-spectral imagery for consumers, so I’m not sure what the target use case would be though. The closest thing I can think of is narrowband imaging in astrophotography, but normally you process those in dedicated astronomy software (i.e. Siril, PixInsight), though you can also re-combine different wavelengths in traditional image editors.

          I’ll also add that HDF5 and Zarr are good options for storing arrays in Python if standardized metadata isn’t a big deal. Both have the benefit of user-specified chunk sizes, so they work well for tasks like ML where you may have random access patterns.
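
          For instance, with h5py a cube can be chunked and compressed so random band/tile reads don’t load the whole array (the chunk shape here is just an example):

              import h5py
              import numpy as np

              cube = np.random.rand(31, 512, 512).astype(np.float32)

              with h5py.File("cube.h5", "w") as f:
                  dset = f.create_dataset(
                      "cube", data=cube,
                      chunks=(1, 128, 128),    # one band, 128x128 tiles per chunk
                      compression="gzip",
                  )
                  dset.attrs["wavelengths_nm"] = np.arange(400, 710, 10)

              with h5py.File("cube.h5", "r") as f:
                  tile = f["cube"][5, :128, :128]  # reads only the chunks it needs
                  print(tile.shape)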