DevOps — Tech News

EN

I almost burned ₹4,000 on Claude API overnight — so I built llm-cost-guard

Last month I wrote what I thought was a harmless script. Batch-process 847 …

claude llm monitoring showdev

EN

Cron Job Monitoring Tools Compared: From DIY to Fully Managed

Cron's biggest problem isn't scheduling — it's silence. A cron job can fail every night for a month, and unless you're manually checking logs on the s…

cron monitoring devops webdev

EN

Using TimescaleDB and Collectd for observability

I have used many time-series databases over the years and have found most to be wanting: often overcomplicated and underperformant. PostgreSQL with Ti…

postgres timescaledb monitoring devops

EN

OpenTelemetry Observability Guide: How to Optimize Metrics, Logs, and Traces at Scale

Introduction: Modern cloud-native systems generate an enormous amount of telemetry data every second. Applications, containers, Kubernetes clusters, AP…

devops distributedsystems monitoring tutorial

EN

I built a self-hosted log search tool for my team

The backstory: Some time ago I adopted Quickwit at my company. For anyone who hasn't used it: Quickwit is a search engine that runs full-text search di…

opensource devops monitoring showdev

EN

Errors, traces, logs, metrics: when to reach for what

When should I reach for a log, a trace, or a metric? I hit that question constantly when I instrument code, and I watch coding agents hit it too. It s…

monitoring devops logging observability

EN

Monitor Medium Publications and Newsletter Feeds via API

Readers follow collections (Towards Data Science, niche newsletters), not just individual write…

medium api newsletter monitoring

EN

I Built My Own Analytics Platform Instead of Paying for PostHog

Introduction: Let me clear one thing up right out of the gate: this is not a teardown of PostHog. In fact, PostHog is an incredible piece of software a…

development monitoring api analytics

EN

Power BI Visual Monitoring: Automatically Detecting Broken Visuals in Power BI Reports

Key use cases: Power BI Visual Monitoring can be used for Power BI visual monitoring, Power BI report visual monitoring, visual regression testing for P…

programming monitoring datascience

EN

Netdata's Intrusive Cloud Account Prompts: How to Disable and Regain Focus on System Monitoring

Introduction: Netdata, a once-revered open-source monitoring tool, has increasingly compromised its core functionality through aggressive and intrusive…

monitoring userexperience opensource cloud

RU

The Simple Yet Complex VictoriaMetrics

Hi, I'm Sergey Istomin, a DevOps engineer at KTS. Below is my story about building a multi-tenant set of VictoriaMetrics clusters with different period…

monitoring мониторинг victoriametrics victoria metrics vmstorage vmcluster grafana kubernetes мультитенантность

RU

IncidentRelay: self-hosted on-call, alert routing, and notifications without SaaS or Canadian phone numbers

Hi, Habr! We are developing IncidentRelay, a self-hosted system for on-call scheduling, alert routing, and notification delivery. …

monitoring alertmanager on-call incident management duty

EN

How to add Honeycomb traces to your AI Slack bot

Pipa is our agent for studio operations at Lunch Pail Labs. She lives in Slack, is powered by E2B sandboxes, and uses OpenCode for the harness. When …

agents ai monitoring tutorial

EN

Setting Up Alerts and Notifications for Performance Bottlenecks

TL;DR: Alert on symptoms, not causes: users feel latency and errors, not high CPU. Alert on p95 latency and error rates, not internal metrics. Use SLO…

devops monitoring performance sre

EN

Building an Application Log Analytics Platform with Amazon S3 Tables: Cost Optimization by Migrating from CloudWatch Logs

Introduction: Our team had been using CloudWatch Logs as the log storage layer for our identity management system, but as the service grew, the associa…

architecture aws infrastructure monitoring

EN

From Eclipses to P95 Latency: What the Joseon Dynasty Can Teach Us About Incident Response

The Joseon Dynasty ruled Korea for more than five centuries…

devops monitoring performance sre

EN

Why I Built MentionFox Instead of Just Using Mention.com

Most founders who build a competitor to an existing tool do it because they couldn't afford the original. That wasn't my situation. I was paying for M…

monitoring socialmedia serverless claude

EN

Benchmarking time-series databases for ecommerce infrastructure monitoring

Time-series database performance under ecommerce load: real benchmark results. Your monitoring stack becomes your worst enemy during traffic spikes if …

timeseries monitoring database performance

EN

From a Student Idea to a Smart Monitoring System: The Mustawsaf Journey

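At the heart of every great innovation lies an inspiring human story, a story of passion, challenges, and unyielding determination. That is the essence of the journey of the "Mustawsaf" team, which began as an ambitious graduation project idea and turned …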

ai career monitoring startup

EN

Enterprise Log Management in OCI: Moving Logs from OCI Logging to Object Storage Using Service Connector Hub

Introduction: Logs are one of the most valuable sources of information in any cloud environment. Whether you're troubleshooting application failures, i…

serviceconnector loggroup monitoring

EN

Why Your Website Can Be "Up" And Still Broken: A Deep Dive Into Latency Phases

Most uptime monitors tell you one thing: is the server responding? But that binary answer misses the ful…

webdev performance monitoring devops

EN

5 Uptime Monitoring Mistakes That Cost Developers Hours of Debugging

I've been building and maintaining web applications for years, and I've watched t…

webdev devops monitoring productivity

EN

Building a Public Status Page: What to Show and What to Hide

A public status page is one of the highest-leverage things you can do for user trust. Whe…

webdev devops monitoring javascript

EN

Kubelet Metrics: How cAdvisor and CRI Collect Kubernetes Stats

This article was originally published on LearnKube. TL;DR: This article dissects the Kubernetes metrics pipeline through kubelet, cAdvisor, and CRI to …

kubernetes devops architecture monitoring

EN

Unlocking Insights with Observability: My Journey with OpenTelemetry

As a Full Stack Engineer specializing in DevOps, AI Infrastructure, and Cloud, I'…

observability devops monitoring

RU

Zabbix Antipatterns in Large Infrastructure: A Catalog of Basic Pitfalls

Two scenes I have seen at different companies over the past year. Scene one. A big TV hangs on the wall of the IT director's office, on…

zabbix monitoring operation framework infrastucture

EN

I got tired of guessing which model holds my VRAM, so I built a tiny dashboard

Quick story. I run a small homelab — one box, an NVIDIA card, around ten Docker containers, and a couple of local model servers (Ollama mostly, vLLM w…

ai docker monitoring showdev

RU

Inspector v3: How I Built My Own Kubernetes Control Center on an Old Laptop

Hi, Habr! My name is Artyom, and I work at YADRO as an infrastructure engineer: virtualization, monitoring, and containerization are my daily routine. I also …

k8s monitoring kubernetes инференс мониторинг

EN

One container to replace Grafana + Loki + Tempo + Prometheus

The standard observability stack: Grafana + Loki + Tempo + Prometheus. Four services to deploy, four configs to learn, dashboards to set up before you…

opensource dotnet docker monitoring

RU

Heartbeat Monitoring for Cron Jobs: A Dead-Man Switch on FastAPI

Ordinary uptime monitoring checks whether a service responds to requests. A cron job does not respond to anything: it starts once every N hours, does its work, and silently …

cron heartbeat dead-man-switch monitoring alerting bash FastAPI Celery devops linux