Как научить языковую модель читать транзакции: превращаем историю платежей в базу знаний
Меня зовут Дмитрий Валов, я тимлид команды «Инструменты для банка (агенты)» в Sber AI Lab — Центре практического искусственного интеллекта Сбера. Боль…
Latest Team Management news from Tech News
Меня зовут Дмитрий Валов, я тимлид команды «Инструменты для банка (агенты)» в Sber AI Lab — Центре практического искусственного интеллекта Сбера. Боль…
Most RAG demos answer "what's the right chunk?" Very few can answer the two questions a regulator or an auditor will actually ask: Replay this decisio…
What I built A small business owner needed an automated customer support system that works 24/7, answering questions based only on their internal poli…
In the previous post , we talked about context windows. The model has a fixed-size desk and everything has to fit on it at once. When too much is on t…
A support agent tells a customer their plan is still Enterprise, even though finance downgraded it last week. A coding copilot forgets a repo conventi…
Pattern Defined Precise Definition: Context Compression is an inference pattern that utilizes a specialized "selector" model or a ranker to distill la…
I've been working on a log analysis platform that helps debug Spring Boot applications by analyzing logs and stack traces using RAG. https://loganalyz…
In the healthcare industry, data is both an organization's most valuable asset and its most heavily guarded liability. While industries like e-commerc…
LLM Cost Attribution with OTel, Next.js for AI Agents, LLM Security Testing Today's Highlights This week, we delve into practical strategies for manag…
A common failure pattern in a retrieval-augmented generation (RAG) system is a progressive decline in performance. This decline, which can be difficul…
Agent Orchestration & Workflow Automation: Dynamic Workflows, Robust Agent Patterns, and On-Commit AI Code Review Today's Highlights This week's h…
NiDaan: Building an Offline AI Diagnostic Assistant for Rural Health Workers in India Building AI that works without internet in places where it matte…
I once submitted an essay with three citations that I hadn't personally verified. The AI had suggested them, and they sounded right. None of them exis…
These days, everyone seems to be building AI agents. So I figured I should probably build one too. But instead of another generic demo, I wanted to so…
RAG SOTA, Agent Harnessing, and Langfuse Observability for AI Frameworks Today's Highlights Today's top stories delve into optimizing RAG performance …
RAG SOTA: I Tested 7 Pipelines and Built SEQUOIA (Open Source) After 20+ hours of compute time on local hardware, I benchmarked 7 RAG configurations a…
Most RAG frameworks today assume: a huge dependency graph mandatory LLM orchestration opinionated pipelines complex configuration But many real-world …
Problem Statement We have a misinformation problem. But more specifically, we have a speed problem. A journalist spots a suspicious claim. They search…
When people talk about “AI agents,” they imagine something autonomous, intelligent, and reliable. In reality, most agents collapse under their own wei…
Key Takeaways Storing vectors in an Oracle VECTOR column alongside content, metadata, and provenance means retrieval happens inside the database. Exis…
Many developers are jumping directly into AI agent frameworks like: LangChain CrewAI LangGraph But without understanding the foundations first, things…
I almost didn’t start this project. I kept thinking: “I should learn more backend first.” “I should understand AI properly first.” “I should probably …
By this point, KernelMind had already evolved far beyond the original “embeddings over code” idea. The system now had: AST-aware chunking fully qualif…
AI Agents Observability, Python Logging for OTel, & PySpark Code Linter Today's Highlights This week's highlights focus on critical tooling and ob…
The premise is simple: compiled knowledge is more reusable than retrieved fragments. RAG became the default answer to a straightforward question - how…
Most modern knowledge systems optimize retrieval, and that is understandable. Search is visible, easy to demo, and feels magical when it works. Type a…
It is May 2026, and the field has stopped pretending hallucinations are going to disappear. What has happened instead is more interesting. Researchers…
The Silent Failure of Pipeline RAG Every article in this series has been trying to answer the same question: how do we make retrieval better? Better c…
LLM Persistent Memory & Python Tooling Elevate AI Agent Workflows Today's Highlights This week highlights practical advances in AI agent developme…
Sliding Window Chunking Sliding Window Chunking is a more intensive chunking mechanism. In this method, a window size is defined based on a character …