AGTP: A Home for Your Agents
You have built agents. They are in production. Some of them are doing important work. You are mostly sure of that. What you are less sure of: how many…
Latest Web news from Tech News
You have built agents. They are in production. Some of them are doing important work. You are mostly sure of that. What you are less sure of: how many…
I almost burned ₹4,000 on Claude API overnight — so I built llm-cost-guard Last month I wrote what I thought was a harmless script. Batch-process 847 …
Cron's biggest problem isn't scheduling — it's silence. A cron job can fail every night for a month, and unless you're manually checking logs on the s…
I have used many timescale databases over the years and have found most to be wanting. Often over complicated and under performant. PostgreSQL with Ti…
Introduction Modern cloud-native systems generate an enormous amount of telemetry data every second. Applications, containers, Kubernetes clusters, AP…
The backstory Some time ago I adopted Quickwit at my company. For anyone who hasn't used it: Quickwit is a search engine that runs full-text search di…
When should I reach for a log, a trace, or a metric? I hit that question constantly when I instrument code, and I watch coding agents hit it too. It s…
Monitor Medium Publications and Newsletter Feeds via API Readers follow collections —Towards Data Science, niche newsletters—not just individual write…
Introduction Let me clear one thing up right out of the gate: this is not a teardown of PostHog. In fact, PostHog is an incredible piece of software a…
Key Use Cases Power BI Visual Monitoring can be used for: power bi visual monitoring power bi report visual monitoring visual regression testing for P…
Introduction Netdata, a once-revered open-source monitoring tool, has increasingly compromised its core functionality through aggressive and intrusive…
Pipa is our agent for studio operations at Lunch Pail Labs . She lives in Slack, is powered by E2B sandboxes, and uses OpenCode for the harness. When …
TL;DR Alert on symptoms, not causes – users feel latency and errors, not high CPU. Alert on p95 latency and error rates, not internal metrics. Use SLO…
Introduction Our team had been using CloudWatch Logs as the log storage layer for our identity management system, but as the service grew, the associa…
From Eclipses to P95 Latency: What the Joseon Dynasty Can Teach Us About Incident Response The Joseon Dynasty ruled Korea for more than five centuries…
Most founders who build a competitor to an existing tool do it because they couldn't afford the original. That wasn't my situation. I was paying for M…
Time-series database performance under ecommerce load: real benchmark results Your monitoring stack becomes your worst enemy during traffic spikes if …
Introduction Logs are one of the most valuable sources of information in any cloud environment. Whether you're troubleshooting application failures, i…
Why Your Website Can Be "Up" And Still Broken Most uptime monitors tell you one thing: is the server responding? But that binary answer misses the ful…
5 Uptime Monitoring Mistakes That Cost Developers Hours of Debugging I've been building and maintaining web applications for years, and I've watched t…
Building a Public Status Page: What to Show and What to Hide A public status page is one of the highest-leverage things you can do for user trust. Whe…
This article was originally published on LearnKube TL;DR: This article dissects the Kubernetes metrics pipeline through kubelet, cAdvisor, and CRI to …
Unlocking Insights with Observability: My Journey with OpenTelemetry As a Full Stack Engineer specializing in DevOps, AI Infrastructure, and Cloud, I'…
Quick story. I run a small homelab — one box, an NVIDIA card, around ten Docker containers, and a couple of local model servers (Ollama mostly, vLLM w…
The standard observability stack: Grafana + Loki + Tempo + Prometheus. Four services to deploy, four configs to learn, dashboards to set up before you…
Обычный uptime-мониторинг проверяет, отвечает ли сервис на запросы. Cron-job ничего не отвечает — он запускается раз в N часов, делает работу и молча …
Full Example YAML Here’s a deployment using all three Kubernetes probes: containers : - name : api image : my-api:latest startupProbe : httpGet : path…
Book: LLM Observability Pocket Guide: Picking the Right Tracing & Evals Tools for Your Team Also by me: Thinking in Go (2-book series) — Complete …
Most security tooling works by asking you to define what "bad" looks like upfront. Falco gives you YAML rules. OSSEC has signatures. Wazuh has a 5,000…
If you’ve shipped an app using Lovable, Bolt, Replit, or similar tools — what happens when something breaks in production? Specifically curious about:…