Tech News — Latest News

All EN RU

Faster AI training by quietly cloning the model

A new paper introduces a method to speed up reward-based fine-tuning by having the model generate a cheap, compressed copy of itself to draft text, wh…

rlposttraining efficiency training

A 35-billion-parameter agent that punches like a trillion-parameter model

A 35-billion-parameter model called Agents-A1 matches trillion-parameter models on multi-step agent tasks, according to a new paper from Shanghai AI L…

research agents mixtureofexperts training

LLMs believe false statements even after explicit warnings that they're false

Fine-tuning tests show "bias ... toward confidently representing the claims as true."

AI falsehoods LLMs research training

Anthropic blames dystopian sci-fi for training AI models to act “evil”

But training on "synthetic stories" that model good AI behavior can help.

AI Anthropic clause ethics morals sci-fi stories training

Study: AI models that consider user's feeling are more likely to make errors

Overtuning can cause models to "prioritize user satisfaction over truthfulness.”

AI Oxford science study training tuning warmth