• Home
  • AI
  • DeepSeek-R1-beating perf in a 32B package? El Reg digs its claws into Alibaba’s QwQ

Hands on How to tame its hypersensitive hyperparameters and get it running on your PC

How much can reinforcement learning – and a bit of extra verification – improve large language models, aka LLMs? Alibaba’s Qwen team aims to find out with its latest release, QwQ.

Share this post

Subscribe to our newsletter

Keep up with the latest blog posts by staying updated. No spamming: we promise.
By clicking Sign Up you’re confirming that you agree with our Terms and Conditions.

Related posts

🎙 AI Assistant(voice)