How I built a Bluesky scraper using the AT Protocol API (and published it on Apify)
Bluesky hit 40 million users earlier this year, and unlike Twitter, it runs on an open protocol — the AT Protocol — where public data is genuinely pub…
Tech news from the best sources
Bluesky hit 40 million users earlier this year, and unlike Twitter, it runs on an open protocol — the AT Protocol — where public data is genuinely pub…
Note: This is a cross-post. Canonical version (full long-form) lives on my blog: https://blog.spinov.online/blog/ethical-scraping-is-a-rate-limit-ques…
How to set up refresh-token-only OAuth for a multi-tenant Apify Actor (Gmail, 10 minutes) If you're shipping an Apify Actor that calls a per-user Goog…
Когда в 2023-2024 году Яндекс и Google запустили генеративные ответы поверх поисковой выдачи, классические SEO-метрики начали ломаться по одной. Позиц…
Why my Reddit scraper went from 92% to 61% success rate in 30 days (and how I fixed it in one config flag) I publish a small Reddit scraper actor on t…
If your brand competes for Chinese consumers and you're not actively monitoring conversations on Weibo, RedNote, Bilibili, Douban, and Xueqiu, you're …
I pulled a 100-row sample of Sitemap to see whether the dataset is rich enough to support pipeline health checks, content auditing, structured-data va…