NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Can gzip be a language model? (nathan.rs)
nathan-barry 3 days ago [-]
LLMs are very good at lossless compression via arithmetic coding. But I didn't know that it was possible to go the reverse direction (do language modeling via a compressor). It's not super great quality, but I'm surprised it worked! Other compression algorithms (like PPMd) use variable n-grams under the hood, and should be much better (although less interesting due to already containing basic language models internally).
eventualcomp 2 days ago [-]
Reminds me of this youtube video: https://m.youtube.com/watch?v=jkdWzvMOPuo

I liked the comments explaining why this worked.

chinallm_ai 3 days ago [-]
[flagged]
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 12:36:09 GMT+0000 (Coordinated Universal Time) with Vercel.