Tech News — Latest News

RU

Open LLMs in Production: 8 Takeaways on llama.cpp, Gemma, and Qwen

A lot has been written about open language models, and almost all of it covers first impressions or, at best, the "honeymoon" phase of use. Benchmarks, …

ollama

EN

Building an Autonomous Agent on an M1 Mac, by Choice

For about 3 months I've been running an autonomous agent — one that thinks up and writes its own social media posts and comments — unattended, 4 sessi…

discuss ollama llm agents

RU

From Streams to WebSockets: How I Fought Buffering and Finally Won

Hi. My name is Nikolay Piskunov. I head the Big Data practice and serve as an expert for the Cloud DevSecOps secure development course from Академия вАЙТИ…

spring ai spring boot java websocket stomp server-sent events sse ollama llm typescript

EN

Popular open source AI developer tool Ollama raises $65M, grows to nearly 9M users

Benchmark-backed Ollama has amassed 176,000 stars and nearly 17,000 forks on GitHub by helping developers easily run AI on their PCs.

AI Startups TC Benchmark Partners Exclusive ollama theory ventures

EN

Does a Second GPU Increase Ollama's Context Window? (Quadro P2000 + RTX 3090 Tested)

TL;DR Short version: no. I dropped a much older GPU (Quadro P2000, 5GB, Pascal, 2016) next to an RTX 3090 (24GB, Ampere) on the same box, ran the sa…

llm ollama vllm gpu

EN

LLM Quantization Levels Compared: Q4_K_M vs Q8_0 vs FP16 [2026]

Originally published at kunalganglani.com — read it there for inline code, hero image, and live links. LLM Quantization Levels Compared: Q4_K_M vs Q8_…

localllm quantization gguf ollama

RU

How I Hacked the Job Market: Building an All-in-One AI Tool for Auto-Applying on HH.ru

Hi everyone! If you have looked for an IT job even once in the past year, you know the market is merciless to newcomers. You have to apply to hundreds of openings, and…

python playwright hh.ru ollama llama3 automation job search artificial intelligence parsing career

RU

[Translation] Building a Cluster-Aware AI Agent with Kubernetes, Argo CD, and GitOps

The VK Cloud team has translated a walkthrough of running a self-hosted (deployed on your own infrastructure), read-only AI agent inside a Kubernetes cluster, where the entire …

vk cloud kubernetes argo cd gitops llm ai agent ollama mistral rbac observability

RU

Secure AI Monitoring of Oracle in an Isolated Environment with Python, Ollama, and V$WAITCLASSMETRIC

Introduction: In the existing monitoring …

oracle database ollama

EN

Running a Whole RAG Agent Offline: LangGraph + Ollama + Embedded Qdrant (Zero API Keys)

Most RAG tutorials open with "set your OPENAI_API_KEY." This one doesn't need it. In Part 1 I claimed the LLM and embeddings are behind a swappable b…

langchain llm rag ollama

EN

Building a Local-First Voice Copilot for the Shell with HoldSpeak and Ollama

The Promise: A Private, Voice-Activated Shell The dream of a voice-activated command line is compelling: speak a command, see it executed. But for man…

python cli voice ollama

EN

I Built an AI Content Team That Posts to My Blog While I Sleep

I used to write blog posts the old way. Open a blank page. Stare at it. Write something. Rewrite it three times. Publish. Repeat every two weeks when …

ai automation productivity ollama

RU

Where the LLM Call Ends and the Backend System Begins: Local RAG with FastAPI and Ollama

I wanted to figure out where a simple call to a local LLM ends and a backend system begins. At first everything looked simple: the frontend sends a question…

rag llm fastapi ollama python backend embeddings vector store request_id local llm

RU

Artificial Intelligence with LangChain: Developing AI Agents in Python

Introducing a new hands-on course on AI agents in Python from Vladimir Dronov, a master of instructional books. The book will surely be of interest to all…

langchain ai llm python neural networks programming rag gigachat yandexgpt ollama

EN

Hermes-Crew Hybrid: A Hybrid Architecture for Secure Multi-Agent AI Workflows

I built a hybrid system that combines a central orchestrator (Hermes) wi…

ai security crewai ollama

EN

I Built a Private AI Brain on My Laptop for $0

Last week I couldn't shake an idea: what if I had an AI that knew everything I know? Not ChatGPT — something on my hardware, holding my knowledge, an…

ai selfhosted ollama docker

RU

Anthropic, Fable 5, Claude Code, and the Great Toy Confiscation

On June 9, Anthropic rolled out Claude Fable 5, also known as Mythos 5 in the closed environment. On June 12, access to both versions was pulled. And between those dates…

claude fable 5 mythos 5 anthropic llm local models vllm ollama hugging face dependency management ai security

EN

Giving Your Local LLM Safe Filesystem Access With Ollama Tool Use

A local LLM that can read your files is genuinely useful. A local LLM that can read your files without guardrails is a path-traversal bug with a chat …

ai ollama typescript security

EN

I Replaced My $20/mo AI Tools With Local Models: My Full Stack

I was paying $20/mo for Copilot and reaching for the Claude API on every side project. Then I added it up. Copilot, a code-review SaaS trial, the occa…

ai ollama productivity webdev

RU

Evolution of an Ollama Client: From PostgreSQL to MongoDB

Hi. My name is Nikolay Piskunov. I head the Big Data practice and serve as an expert for the Cloud DevSecOps secure development course from Академия вАЙТИ Be…

java spring boot postgresql ollama llm artificial intelligence react typescript code review code review ai

EN

How I fixed silent Ollama failures in my local AI Assistant

Neo-AI is an offline assistant with episodic memory, running entirely on-device using Olla…

python opensource machinelearning ollama

RU

Local LLMs on Arch Linux and How to Speed Up Generation 20x

Greetings to all Habr readers. In this article I want to share my experience running local LLMs and test the operability of …

arch linux llama.cpp ollama qwen3.6 gemma4 github huggingface intel arc b580

RU

Helix Agent Ai: A Russian Self-Learning AI Agent. A Complete Guide to Deployment and Use in 2026

Title: Helix, a Russian self-learning AI agent with MCP support: a complete guide to deployment and use in 2026

helix ai-agent self-hosted ollama mcp langgraph on-premise privacy python

EN

Open Notebook Review: Self-Hosted NotebookLM Alternative

Originally published on andrew.ooo — visit the original for any updates, code snippets that aged out, or follow-up posts. TL;DR Open Notebook (by Luis…

opennotebook notebooklmalternative selfhosted ollama

EN

Doubling Qwen3.6-27B on One RTX 3090: ollama → llama.cpp + MTP, Lever by Lever (35.7 → 80.2 tok/s)

A reader on my last post said Ollama was leaving a lot on the table — that a tuned backend with multi-token prediction (MTP) could roughly double my 3…

ollama llm performance machinelearning

RU

NeoBrain: A Local Alternative to Character.AI

🧠 NeoBrain: a local alternative to Character.AI. Run AI characters on your own PC, with no internet, VPN, or tracking. 🤔 The problem: Character.AI is good, but: ❌ Block…

ai opensource fastapi ollama

EN

Fitting WhisperX large-v3 + a 24B LLM on one 3090: a reproducible context-capping recipe

This is the technical, reproducible version of a fix I shipped on my own homelab. If you want the narrative version, that's on Medium. This one is the…

homelab ollama localllm devops

EN

Do You Have a Homelab? Secure Your Local LLM Artifacts

We used to build homelabs around Linux servers, Docker containers, and NAS drives. It was about uptime, RAID levels, and monitoring CPU temps. Now, th…

homelab llmsecurity sbom ollama

RU

The Mathematics of Large Numbers: From a Zero-Sum Game to a Growing-Sum Game

Algorithmic trading strategies are a zero-sum game. You earn…

calculus correlation finance research artificial intelligence machine learning parser osint osint tools ollama

EN

Local-first: a Model on Your Own Machine, Zero Cloud

This is the concrete, runnable walkthrough for Post 1 of the Portway series. The goal: stand up a single model behind an OpenAI-compatible endpoint o…

ollama python ai llm