AI language models can exceed PNG and FLAC in lossless compression, says study

FlickOfTheBean@beehaw.org · 1 year ago

AI language models can exceed PNG and FLAC in lossless compression, says study

Heresy_generator@kbin.social · 1 year ago

It’s neat from a research and proof-of-concept perspective but practically speaking I’d like to see the CPU cycles required for the LLM compression compared to PNG or FLAC compression. We’ve always known we can increase compression by throwing more computing power at the problem but we settle on a happy medium at the intersection of “good enough” for compression and performance.

CanadaPlus@lemmy.sdf.org · 1 year ago

Bad. The time performance is bad. Even if it was O(n) there’s billions of parameters involved in an LLM.

skip0110@lemm.ee · edit-2 1 year ago

I think this model has billions of weights. So I believe that means the model itself is quite large. Since the receiver needs to already have this model, I’d suggest that rather than compressing the data, we have instead pre encoded it, embedded it in the model weights, and thus the “compression” is just basically passing a primary key that points to the data to be compressed in the model.

It’s like, if you already have a copy of a book, I can “compress” any text in that book into 2 numbers: a page offset, and a word offset on that page. But that’s cheating because, at some point, we had to transfer to book too!

puttputt@beehaw.org · 1 year ago

Yeah, it’s like saying I can “compress” a png of the Mona Lisa to just the string “Mona Lisa” because I have a database of art.

EdgeOfToday@lemm.ee · 1 year ago

With a neural network, you wouldn’t be able to mathematically prove that the signal is perfectly recovered 100% of the time for all possible inputs. That is the case with PNG and FLAC. If you’re just listening to music and need a good compression ratio, then sure, it won’t be a big deal if a couple of bits are wrong. But that’s also why we have lossy compression. If the goal is to make signal degradation imperceptible to a human, then you could get a much better compression ratio using neural networks. If it’s truly critical that the signal isn’t corrupted, it would probably be better to just use the original method.

astraeus@programming.dev · 1 year ago

Seems like another “hey, what if we used LLMs for this” scenarios. It might be more effective, but exactly how many more resources are being used to make it do the same work as current compression algorithms? Effective doesn’t mean efficient and I think for lossless applications efficient is truly more important.

christophski@feddit.uk · 1 year ago

Ok but what if we used LLM AND blockchain for this

ezures@lemmy.wtf · 1 year ago

Im sure we can squeeze an nft in there somewhere

astraeus@programming.dev · 1 year ago

Our company has been looking for a brilliant innovator like you, how would you like to apply for a new position called professional cool sounding tech peddler, I mean director of creative technology?

christophski@feddit.uk · 1 year ago

I want 200k and 30% of the company

astraeus@programming.dev · 1 year ago

Best I can do is $125k and $300k in company stock over 4 years