Browse 37 articles on AI engineering, machine learning, LLMs, MLOps, and modern software development. Written by Misha Lubich, AI Engineer & Technical Leader with experience at Apple, GitHub, and cutting-edge AI startups.
AI UX gets weird fast when every feature invents its own loading state, confidence label, and apology paragraph. A design system keeps the product trustworthy.
Category: AI Products | Reading time: 3 min | Tags: UX, Design Systems, AI Products, Interaction, Frontend
June's AI market feels less like a breakthrough month and more like the morning after a very expensive demo party. The winners are building boring control systems.
Category: Hot Takes | Reading time: 4 min | Tags: AI Industry, Agents, Strategy, Startups, LLMOps
A practical runbook from debugging an agent stack where HTTP was green, dashboards were calm, and the agent was quietly doing interpretive dance with tool calls.
A RAG answer can be correct and still lose the user. I learned that the boring way: by watching a good retrieval pipeline feel slow enough to be broken.
Category: AI Products | Reading time: 3 min | Tags: RAG, Latency, Performance, Architecture, Product
Long-context models tempt teams to treat the prompt as a database. That works until you need auditable state, incremental updates, and retrieval that survives a page refresh.
Category: AI Architecture | Reading time: 2 min | Tags: Context Windows, RAG, State, LLM, Architecture
Demos optimize for applause. Production optimizes for margin, rollback, and an angry user with a spreadsheet. Here is the checklist I use before calling something shipped.
Category: AI Products | Reading time: 1 min | Tags: Product, Launch, LLMOps, Governance, Checklist
The model says the row was updated. The audit log disagrees. Until you treat tool I/O like distributed systems, agents will keep shipping confident lies.
Category: MLOps | Reading time: 1 min | Tags: Agents, Tool Calling, Reliability, Observability, Production
OpenClaw turned personal AI agents from a demo into something people actually run: a local-or-near-local control plane wired into the apps you already live in. Here is how it works, where it shines, and where it can burn you—illustrated with architecture diagrams and honest tradeoffs.
Category: Open Source | Reading time: 9 min | Tags: OpenClaw, AI Agents, Messaging, Security, Self-Hosting
The fastest way to lose trust in an AI feature is shipping it with vibes and no evaluation harness. In 2026, release quality is mostly decided before launch day.
Category: AI Architecture | Reading time: 2 min | Tags: AI Evaluation, LLM, Quality, Testing, Production
I wired up my first MCP server on a Sunday. By Tuesday I believed I'd solved tool calling forever. A month later I was drawing boxes on a whiteboard about auth, gateways, and who exactly gets sued if the agent deletes the wrong row.
Category: AI Architecture | Reading time: 4 min | Tags: MCP, API Design, Security, Agents, Production
Everyone's talking about agentic coding in 2026. The charts look great. But if you actually ask engineers what they're willing to hand off end-to-end, the room gets quiet. That gap isn't hypocrisy — it's the whole story.
Category: Engineering Culture | Reading time: 5 min | Tags: Agentic AI, Claude, Engineering Leadership, AI Tools, Process
Hot takes declared RAG dead. Long-context models were supposed to replace it. But in early 2026, Cursor is shipping RAG pipelines, engineers are still optimizing chunking, and retrieval is evolving — not dying. Here's what's actually happening.
Category: AI Architecture | Reading time: 6 min | Tags: RAG, Long Context, LLM, Architecture
After 6 months of using Cursor as my primary IDE, my velocity has tripled. Agentic workflows—not comment-driven inline gen—are what actually moved the needle.
Category: AI Products | Reading time: 3 min | Tags: Cursor, AI Coding, Developer Tools, IDE, Productivity
CrewAI, AutoGen, and LangGraph promise autonomous agent teams. I deployed all three to production. Here's the unvarnished truth about what works and what's pure marketing.
Category: AI Architecture | Reading time: 3 min | Tags: CrewAI, Multi-Agent, AutoGen, LangGraph, Production
I've tried every AI coding CLI — Aider, Mentat, GPT-Engineer. Claude Code is the first one I trust to make changes across a real codebase without supervision.
Category: AI Products | Reading time: 3 min | Tags: Claude Code, Anthropic, Terminal, AI Coding, Developer Tools
We've gotten incredibly good at building AI systems. We're still terrible at knowing whether they actually work. Evals are the bottleneck nobody's fixing.
Category: MLOps | Reading time: 3 min | Tags: Evaluation, Testing, Quality, MLOps, Best Practices
90% of 'AI-powered' startups are solutions searching for problems. The graveyard of AI products is full of technically brilliant ideas that nobody needed.
Category: AI Products | Reading time: 2 min | Tags: Product, Startups, Strategy, Market Fit
Every company will need fine-tuned models within 18 months. The problem is that 95% of fine-tuning efforts fail because teams treat it like training from scratch.
Category: AI Architecture | Reading time: 2 min | Tags: Fine-Tuning, LLM, Training, Best Practices