Data Compression Explained (2012)

176 points29 comments4 days ago
rurban

The leader boards are from the pre Fabrice Bellard days, btw. Neural network modeling helped finding better patterns in text.

Also, you could say the same for the related data search problem. How to prepare data, so that it can most efficiently searched. Smallest encoding vs fastest search. Databases are mostly very, very stupid compared to more data-specific tuned algorithms. Like factor 1000 slower and bigger.

show comments
dang

Related:

Data Compression Explained (2011) - https://news.ycombinator.com/item?id=40631931 - June 2024 (1 comment)

Data Compression Explained - https://news.ycombinator.com/item?id=5931493 - June 2013 (14 comments)

Data Compression Explained by Matt Mahoney - https://news.ycombinator.com/item?id=1179242 - March 2010 (1 comment)

Bnjoroge

I wonder what if anything has changed ever since this article. Is llm-based compression more mainstream?

usernametaken29

Isn’t the idea of AI precisely to find universal compression from arbitrary input data, at least with LLMs?

show comments
firesteelrain

Matt Mahoney is one of the best when it comes to compression. He is retired now.

blastro

i was told that middle-out was best

brownpoints

I say transformers are the best compression systems

wps

This is the guy who created Zpaq btw. Super interesting but niche backup/archive software.

blobbers

Matt is a great guy to explain this kind of stuff. He's very helpful.

show comments
NooneAtAll3

does anyone have any sources to read about ai-based compression?

I remember hearing a lot about "compression is a lot about prediction", but I don't remember reading any practical result

show comments