Chunking is the most under-estimated hyperparameter in Retrieval-Augmented Generation (RAG):
it silently determines what your LLM “sees”,
how expensive ingestion becomes,
and how much of the LLM’s context window you burn per answer.
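A minimal sketch of the trade-off: fixed-size chunking with overlap, so sentences that straddle a boundary land in both neighbouring chunks. The function name and default sizes here are illustrative, not from any particular library.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character chunks with overlap.

    Bigger chunks mean fewer embeddings (cheaper ingestion) but more
    context-window burn per retrieved chunk; overlap guards against
    answers being cut in half at a boundary.
    """
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks
```

Tuning `chunk_size` and `overlap` is where most of the quality-versus-cost trade-off in a RAG pipeline actually lives.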
Production-focused guide to building RAG systems: chunking, vector stores, hybrid retrieval, reranking, evaluation, and when to choose RAG over fine-tuning.
Strategic guide to hosting large language models locally with Ollama, llama.cpp, vLLM, or in the cloud. Compare tools, performance trade-offs, and cost considerations.
Running large language models locally gives you privacy, offline capability, and zero API costs.
This benchmark shows exactly what you can expect from 14 popular LLMs running under Ollama on an RTX 4080.
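The headline number in such benchmarks is decode throughput in tokens per second. A generic measurement harness (names illustrative) that works with any streaming token iterator; a real run would wrap the model's streaming output rather than the dummy generator shown here.

```python
import time


def tokens_per_second(token_stream) -> float:
    """Consume a token iterator and report decode throughput."""
    start = time.perf_counter()
    count = sum(1 for _ in token_stream)
    elapsed = time.perf_counter() - start
    return count / elapsed if elapsed > 0 else 0.0


def dummy_stream(n_tokens: int, delay_s: float):
    """Stand-in for a real model's streaming output."""
    for _ in range(n_tokens):
        time.sleep(delay_s)
        yield "tok"
```

Measuring this way, end to end over the stream, captures what users actually feel, including any per-token overhead in the serving layer.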
The Rust ecosystem is exploding with innovative projects, particularly in AI coding tools and terminal applications.
This overview analyzes the top trending Rust repositories on GitHub this month.
The Go ecosystem continues to thrive with innovative projects spanning AI tooling, self-hosted applications, and developer infrastructure. This overview analyzes the top trending Go repositories on GitHub this month.
This comprehensive guide provides background and a detailed comparison of Anaconda, Miniconda, and Mamba: three powerful tools that have become essential for Python developers and data scientists working with complex dependencies and scientific computing environments.
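All three tools consume the same environment specification, which is what makes them interchangeable; they differ mainly in bundled packages and solver speed. A minimal `environment.yml` (package versions and the PyPI dependency are illustrative) works unchanged with `conda env create -f environment.yml` or the faster `mamba env create -f environment.yml`:

```yaml
name: sci-env
channels:
  - conda-forge
dependencies:
  - python=3.11
  - numpy
  - pandas
  - pip
  - pip:
      - some-pypi-only-package  # hypothetical PyPI-only dependency
```

Keeping the spec in a file like this, rather than installing packages ad hoc, is what makes environments reproducible across the three tools.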
Melbourne’s tech community continues to thrive in 2026 with an impressive lineup of conferences, meetups, and workshops spanning software development, cloud computing, AI, cybersecurity, and emerging technologies.
vLLM is a high-throughput, memory-efficient inference and serving engine for Large Language Models (LLMs) developed by UC Berkeley’s Sky Computing Lab.
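Much of vLLM's throughput comes from continuous batching: requests join and leave the running batch every decode step, instead of the whole batch draining before new work is admitted. A toy simulation of that scheduling idea (this is an illustrative sketch, not vLLM code):

```python
from collections import deque


def continuous_batching(requests: dict, max_batch: int):
    """Toy scheduler: `requests` maps request id -> tokens to generate.
    Each step decodes one token for every running request; a finished
    request frees a slot that a waiting request fills immediately."""
    waiting = deque(requests.items())
    running = {}            # id -> tokens remaining
    steps = 0
    completion_order = []
    while waiting or running:
        # admit waiting requests into free batch slots
        while waiting and len(running) < max_batch:
            rid, n = waiting.popleft()
            running[rid] = n
        # one decode step for the whole batch
        steps += 1
        for rid in list(running):
            running[rid] -= 1
            if running[rid] == 0:
                del running[rid]
                completion_order.append(rid)
    return steps, completion_order
```

The payoff is that short requests finish and release capacity without waiting for long ones, which keeps the GPU busy under mixed workloads.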
The proliferation of AI-generated content has created a new challenge: distinguishing genuine human writing from “AI slop”: low-quality, mass-produced synthetic text.
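One crude, purely illustrative signal (nowhere near a real detector): heavily templated text tends to reuse the same phrases, so a high repeated-trigram ratio can flag mass-produced prose. The function name and any threshold you pick are assumptions for illustration.

```python
from collections import Counter


def repeated_trigram_ratio(text: str) -> float:
    """Fraction of word trigrams that occur more than once.
    Higher values suggest repetitive, templated text."""
    words = text.lower().split()
    trigrams = [tuple(words[i:i + 3]) for i in range(len(words) - 2)]
    if not trigrams:
        return 0.0
    counts = Counter(trigrams)
    repeated = sum(c for c in counts.values() if c > 1)
    return repeated / len(trigrams)
```

Real detectors combine many such statistical signals with learned models; any single heuristic like this one is trivially gamed.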
When working with Large Language Models in production, getting structured, type-safe outputs is critical.
Two popular frameworks, BAML and Instructor, take different approaches to solving this problem.
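Both frameworks build on the same underlying pattern: validate the model's raw output against a typed schema and retry on failure. A stdlib-only sketch of that pattern (no BAML or Instructor specifics; the `Invoice` schema and raw JSON string are stand-ins for a real LLM response):

```python
import json
from dataclasses import dataclass


@dataclass
class Invoice:
    vendor: str
    total_cents: int


def parse_llm_output(raw_json: str) -> Invoice:
    """Parse and type-check an LLM's JSON reply. A real framework
    would feed the validation error back to the model and retry."""
    data = json.loads(raw_json)
    inv = Invoice(**data)
    if not isinstance(inv.vendor, str) or not isinstance(inv.total_cents, int):
        raise TypeError("field has wrong type")
    return inv
```

Where the frameworks differ is in how the schema is expressed (BAML uses its own schema language; Instructor uses Pydantic models) and in how retries and prompting are handled, not in this core validate-or-retry loop.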