Tech News — Latest News

Faster AI training by quietly cloning the model

A new paper introduces a method to speed up reward-based fine-tuning by having the model generate a cheap, compressed copy of itself to draft text, wh…

rlposttraining efficiency training

A 35-billion-parameter agent that punches like a trillion-parameter model

A 35-billion-parameter model called Agents-A1 matches trillion-parameter models on multi-step agent tasks, according to a new paper from Shanghai AI L…

research agents mixtureofexperts training

LLMs believe false statements even after explicit warnings that they're false

Fine-tuning tests show "bias ... toward confidently representing the claims as true."

AI falsehoods LLMs research training

GymStats: built for myself, open to everyone who goes to the gym

I was at the gym yet again, doing my exercises while glancing at the previous values in my notes and writing the new ones in the same place. Back home, I updat…

gym спортзал тренировки training

Anthropic blames dystopian sci-fi for training AI models to act “evil”

But training on "synthetic stories" that model good AI behavior can help.

AI Anthropic clause ethics morals sci-fi stories training

Study: AI models that consider users' feelings are more likely to make errors

Overtuning can cause models to "prioritize user satisfaction over truthfulness."

AI Oxford science study training tuning warmth