Переименовал две колонки и поймал два инцидента
Про безопасные миграции написано уже тысячу раз. Мы все наизусть знаем и про expand/contract, и про обратную совместимость, и про то, что схему нельзя…
Latest Web news from Tech News
Про безопасные миграции написано уже тысячу раз. Мы все наизусть знаем и про expand/contract, и про обратную совместимость, и про то, что схему нельзя…
I'm a big fan of aviation, and one lesson from aviation safety has always stuck with me: accidents rarely happen because of a single mistake. Instead,…
I'll never forget my first post-mortem meeting. I was a junior engineer, and the server had crashed at 3 AM during a holiday sale. The report started …
Tales from the Bare Metal — Episode 05 In the early hours of 10 March 2021, a fire began in a power room in Strasbourg. By morning an entire data cent…
Series intro. I'm a non-CS solo dev who built and shipped a production stock screener almost entirely by "vibe coding" with an AI agent. The site work…
I'm an SRE at Sony Interactive Entertainment. After a week where my teammate had four incidents (and four RCAs), I built something for the blank-page …
The agent had a list. I asked it to pick an item. It refused. Element not found Refresh. Same. So I opened DevTools and pasted in: document . querySel…
20 мая в 06:01:55 МСК Watchtower по расписанию проверил 14 контейнеров на нашем VPS, нашёл 5 обновлений и пересоздал. Среди обновлённых - n8n, который…
Postmortem: A Vercel Edge Function Timeout Caused Our Global API to Fail for 30 Minutes On October 17, 2024, at 14:22 UTC, our global API experienced …
Postmortem: How a Corrupted Node Modules Folder Caused 3-Hour Outage for Our CI Pipeline Published: October 26, 2024 | Author: DevOps Team | 5 min rea…
In Q3 2024, a production AI incident classifier mislabeled 42% of critical security incidents as 'low priority' over 72 hours, causing $2.1M in SLA br…
Postmortem: A Corrupted Loki 2.10 Log Store Caused 3 Days of Lost Debug Data Overview On October 12, 2024, at 09:15 UTC, our observability team detect…
Postmortem: How a LangGraph 0.1 Multi-Agent Bug Broke Our 2026 Customer Support Bot Executive Summary On October 12, 2026, our production customer sup…
On October 12, 2024, a misconfigured Kafka 3.8 broker idempotent write setting caused a 20-minute total outage that delayed 1,047,892 order messages, …