mishalubich.com/blog

65k total views

AI Engineering Perspectives

Controversial takes, hard-won lessons, and unfiltered opinions on modern AI engineering. No hype — just what actually works in production.

AI Products | Featured

Valuemaxxing, Not Tokenmaxxing: Why My Agents Prefer CLIs

MCP is great until every 'list my inbox' costs a novella of tool schema. Here's the operating rule I use for local agent tools: CLI for bulk, MCP for dialogue, and measure value by minutes saved — not tokens burned.

Tags: MCP, CLI, Agents, Productivity, Open Source

July 22, 2026 | 4 min read | 1k views

Open Source | 4 min | 1k views

Account Walls: How imail Stopped Me Mixing Work and Personal Mail

Apple Mail already had all my accounts. What it didn't have was a hard wall an agent (or a tired human) couldn't accidentally cross. Here's how I built imail around that constraint — and why it made inbox chaos quieter.

July 21, 2026

Open Source | 4 min | 1k views

I Built Local-First Agent Tools — And They Actually Run My Day

Not another cloud agent demo. A family of CLI + MCP tools that talk to Messages, Mail, Notes, and WhatsApp on my Mac — so agents help with real life without shipping my inbox to a stranger's SaaS.

July 20, 2026

AI Architecture | 10 min | 1k views

The Saturday I Decided a Factory Needed a Knowledge Graph

One weekend, a wild idea, and an air-gapped knowledge graph for an industrial manufacturer that didn't trust the cloud — a field story about building self-improving agents where no data is allowed to leave the building.

June 22, 2026

AI Products | 10 min | 1k views

Do You Actually Struggle to Put AI Agents in Your Business? Here's My Workflow

Everyone wants AI agents in their company; almost nobody wants to talk about the boring plumbing that makes them work. Here's the exact workflow I use to take a business from 'wouldn't it be cool if' to something that runs every day.

June 21, 2026

AI Products | 8 min | 1k views

How I Shipped a CRM Enrichment Product Solo (Between Two Other Jobs)

Sales teams pay enterprise SaaS prices for data that goes stale the moment a contact changes jobs. So one quarter I built the lean version myself — real-time-ish enrichment, job-change tracking, and quality maintenance — without the bloated vendor contract.

June 20, 2026

AI Architecture | 7 min | 1k views

What Broke Our Agent Stack in Q2 (and How We Fixed It)

A field report from a quarter where the demos looked great, the dashboards looked calm, and the agent stack quietly set small piles of money on fire.

June 15, 2026

AI Products | 3 min | 1k views

Shipping AI Features with Design Systems, Not One-Off Screens

AI UX gets weird fast when every feature invents its own loading state, confidence label, and apology paragraph. A design system keeps the product trustworthy.

June 10, 2026

Open Source | 3 min | 1k views

MCP Contract Tests Save Enterprise Rollouts

A renamed field should not take down an enterprise agent rollout. MCP only scales if contracts are tested like APIs, not treated like vibes with JSON.

June 5, 2026

Hot Takes | 4 min | 1k views

State of AI — June 2026: The Hangover After the Agent Hype Cycle

June's AI market feels less like a breakthrough month and more like the morning after a very expensive demo party. The winners are building boring control systems.

June 1, 2026

MLOps | 4 min | 1k views

AI Agent Observability Runbook: What to Measure Before It Burns

A practical runbook from debugging an agent stack where HTTP was green, dashboards were calm, and the agent was quietly doing interpretive dance with tool calls.

May 28, 2026

AI Products | 3 min | 1k views

Production RAG Needs Latency Budgets, Not Hope

A RAG answer can be correct and still lose the user. I learned that the boring way: by watching a good retrieval pipeline feel slow enough to be broken.

May 15, 2026

MLOps | 4 min | 1k views

The Eval Budgeting Playbook for 2026

If your AI budget has tokens but no eval line item, you did not make a budget. You made a very confident wish with a model invoice attached.

May 2, 2026

AI Architecture | 1 min | 1k views

Retrieval Freshness Beats Bigger Models

Teams over-invest in model upgrades while stale retrieval quietly destroys answer quality. Fresh evidence often beats a larger checkpoint.

April 20, 2026

Engineering Culture | 1 min | 1k views

Incident Reviews That Actually Improve Agents

Most AI postmortems read like blame theater. A useful one produces guardrails, eval cases, and a measurable drop in repeat incidents.

April 12, 2026

AI Architecture | 2 min | 1k views

Your Context Window Is Not a Memory System

Long-context models tempt teams to treat the prompt as a database. That works until you need auditable state, incremental updates, and retrieval that survives a page refresh.

April 6, 2026

AI Products | 1 min | 1k views

From Playground to Prod: A 2026 Checklist That Survives Finance

Demos optimize for applause. Production optimizes for margin, rollback, and an angry user with a spreadsheet. Here is the checklist I use before calling something shipped.

April 5, 2026

MLOps | 1 min | 1k views

Silent Tool Failures Are the Quiet Killer of Agent Reliability

The model says the row was updated. The audit log disagrees. Until you treat tool I/O like distributed systems, agents will keep shipping confident lies.

April 5, 2026

AI Products | 2 min | 1k views

AI Cost Control Is the Difference Between a Feature and a Business

Most AI products don't die from lack of demos; they die from unit economics nobody modeled. Cost discipline is now a core architecture decision.

April 4, 2026

Open Source | 9 min | 1k views

OpenClaw in 2026: The Good, the Bad, and the Lobster-Shaped Elephant in the Room

OpenClaw turned personal AI agents from a demo into something people actually run: a local-or-near-local control plane wired into the apps you already live in. Here is how it works, where it shines, and where it can burn you—illustrated with architecture diagrams and honest tradeoffs.

April 4, 2026

AI Architecture | 2 min | 1k views

If You Don't Run Evals Before Launch, You Don't Have a Product

The fastest way to lose trust in an AI feature is shipping it with vibes and no evaluation harness. In 2026, release quality is mostly decided before launch day.

April 1, 2026

AI Architecture | 4 min | 1k views

MCP Felt Like Magic on My Laptop. Production Was a Different Animal.

I wired up my first MCP server on a Sunday. By Tuesday I believed I'd solved tool calling forever. A month later I was drawing boxes on a whiteboard about auth, gateways, and who exactly gets sued if the agent deletes the wrong row.

March 29, 2026

Engineering Culture | 5 min | 1k views

I Use AI All Day. I Still Won't Let It Own the Merge.

Everyone's talking about agentic coding in 2026. The charts look great. But if you actually ask engineers what they're willing to hand off end-to-end, the room gets quiet. That gap isn't hypocrisy — it's the whole story.

March 24, 2026

Hot Takes | 3 min | 2k views

Vibe Coding: The Future of Software or the Biggest Anti-Pattern in History?

Andrej Karpathy coined 'vibe coding' and Twitter loved it. But building production systems by vibes is how you get production incidents by vibes.

February 22, 2026

AI Architecture | 6 min | 2k views

Is RAG Really Dead in 2026? Not So Fast

Hot takes declared RAG dead. Long-context models were supposed to replace it. But in early 2026, Cursor is shipping RAG pipelines, engineers are still optimizing chunking, and retrieval is evolving — not dying. Here's what's actually happening.

February 18, 2026

AI Products | 3 min | 4k views

Cursor Changed How I Code Forever — And I'm Not Going Back to VS Code

After 6 months of using Cursor as my primary IDE, my velocity has tripled. Agentic workflows—not comment-driven inline gen—are what actually moved the needle.

February 5, 2026

AI Architecture | 7 min | 3k views

Why I Stopped Using LangChain (And You Should Too)

LangChain was the jQuery of AI — necessary for a moment, then a liability. Modern AI engineering demands less abstraction, not more.

January 29, 2026

AI Architecture | 2 min | 3k views

o3, DeepSeek R1, and Why Reasoning Models Change Everything

OpenAI's o3 and DeepSeek's R1 proved that chain-of-thought at inference time is the next frontier. Here's what this means for how we build AI systems.

January 25, 2026

AI Architecture | 3 min | 1k views

CrewAI and Multi-Agent Frameworks: A Production Reality Check

CrewAI, AutoGen, and LangGraph promise autonomous agent teams. I deployed all three to production. Here's the unvarnished truth about what works and what's pure marketing.

January 12, 2026

AI Architecture | 7 min | 2k views

Agents Are All You Need: The End of Traditional Software Architecture

We're building the last generation of hand-written CRUD apps. AI agents will replace 80% of backend code within 3 years. Plan accordingly.

December 20, 2025

AI Products | 3 min | 3k views

Claude Code Is the First Terminal AI That Actually Works

I've tried every AI coding CLI — Aider, Mentat, GPT-Engineer. Claude Code is the first one I trust to make changes across a real codebase without supervision.

December 1, 2025

Hot Takes | 2 min | 4k views

Prompt Engineering Is Not Engineering — It's Glorified Googling

The industry created a fake job title to make 'writing instructions for a chatbot' sound like a technical discipline. Let's stop pretending.

November 30, 2025

Engineering Culture | 8 min | 3k views

AI Code Generation Will Kill Junior Developer Roles by 2027

The entry-level programming job as we know it is disappearing. This is the most important conversation our industry refuses to have.

November 8, 2025

Open Source | 2 min | 1k views

Why I Bet My Startup on Open-Source Models (And Won)

We switched from GPT-4o to fine-tuned Llama and cut our costs by 94%. Our quality scores went up. Here's the playbook.

October 15, 2025

Engineering Culture | 8 min | 3k views

The Great AI Hiring Scam: Why Most AI Teams Ship Nothing

Companies are spending millions on AI teams that produce impressive demos and zero production value. I've seen it from the inside.

September 22, 2025

MLOps | 3 min | 1k views

AI Evaluation Is the Hardest Unsolved Problem in Engineering

We've gotten incredibly good at building AI systems. We're still terrible at knowing whether they actually work. Evals are the bottleneck nobody's fixing.

September 1, 2025

MLOps | 2 min | 1k views

Your ML Pipeline Is Technical Debt Disguised as Innovation

That fancy Kubeflow/Airflow/Prefect ML pipeline you built? It's the most expensive, fragile, and unnecessary code in your entire stack.

August 28, 2025

AI Architecture | 2 min | 2k views

The MCP Protocol Will Make Every AI Framework Obsolete

Anthropic's Model Context Protocol is the USB-C of AI tooling. Once adoption hits critical mass, every custom integration layer becomes unnecessary.

August 5, 2025

MLOps | 2 min | 1k views

Microservices Were a Mistake for ML Systems

The industry cargo-culted microservice architecture into ML platforms and created distributed systems nightmares. Monoliths are the answer.

July 18, 2025

Hot Takes | 2 min | 2k views

The Uncomfortable Truth About AI Safety Research

Most AI safety work is performative theater designed to look responsible while not actually slowing anything down. Let's have an honest conversation.

June 25, 2025

AI Products | 2 min | 2k views

Stop Building AI Products Nobody Asked For

90% of 'AI-powered' startups are solutions searching for problems. The graveyard of AI products is full of technically brilliant ideas that nobody needed.

May 30, 2025

AI Architecture | 2 min | 1k views

Fine-Tuning Is the New Prompt Engineering — And You're Doing It Wrong

Every company will need fine-tuned models within 18 months. The problem is that 95% of fine-tuning efforts fail because teams treat it like training from scratch.

April 15, 2025

AI Architecture | 2 min | 1k views

The Next Model Won't Save You: Why Architecture Matters More Than Model Size

Teams waiting for the next model release to fix their broken AI products are deluding themselves. Your architecture is the bottleneck, not the model.

March 20, 2025

All AI Engineering Articles by Misha Lubich

Browse 43 articles on AI engineering, machine learning, LLMs, MLOps, and modern software development. Written by Misha Lubich, AI Engineer & Technical Leader with experience at Apple, GitHub, and cutting-edge AI startups.

Valuemaxxing, Not Tokenmaxxing: Why My Agents Prefer CLIs

Category: AI Products | Reading time: 4 min | Tags: MCP, CLI, Agents, Productivity, Open Source

July 22, 2026

By Misha Lubich

Account Walls: How imail Stopped Me Mixing Work and Personal Mail

Category: Open Source | Reading time: 4 min | Tags: MCP, CLI, Mail, macOS, Productivity, Case Study

July 21, 2026

By Misha Lubich

I Built Local-First Agent Tools — And They Actually Run My Day

Category: Open Source | Reading time: 4 min | Tags: MCP, CLI, Open Source, Agents, macOS, Case Study

July 20, 2026

By Misha Lubich

The Saturday I Decided a Factory Needed a Knowledge Graph

Category: AI Architecture | Reading time: 10 min | Tags: Knowledge Graphs, On-Prem, Agents, RAG, Case Study

June 22, 2026

By Misha Lubich

Do You Actually Struggle to Put AI Agents in Your Business? Here's My Workflow

Category: AI Products | Reading time: 10 min | Tags: Agents, Workflow, Consulting, RAG, Case Study

June 21, 2026

By Misha Lubich

How I Shipped a CRM Enrichment Product Solo (Between Two Other Jobs)

Category: AI Products | Reading time: 8 min | Tags: Consulting, Data, Next.js, Product, Case Study

June 20, 2026

By Misha Lubich

What Broke Our Agent Stack in Q2 (and How We Fixed It)

Category: AI Architecture | Reading time: 7 min | Tags: Agents, Architecture, Reliability, Postmortem, LLMOps

June 15, 2026

By Misha Lubich

Shipping AI Features with Design Systems, Not One-Off Screens

Category: AI Products | Reading time: 3 min | Tags: UX, Design Systems, AI Products, Interaction, Frontend

June 10, 2026

By Misha Lubich

MCP Contract Tests Save Enterprise Rollouts

Category: Open Source | Reading time: 3 min | Tags: MCP, Contracts, Testing, Enterprise, Integration

June 5, 2026

By Misha Lubich

State of AI — June 2026: The Hangover After the Agent Hype Cycle

Category: Hot Takes | Reading time: 4 min | Tags: AI Industry, Agents, Strategy, Startups, LLMOps

June 1, 2026

By Misha Lubich

AI Agent Observability Runbook: What to Measure Before It Burns

Category: MLOps | Reading time: 4 min | Tags: Observability, Agents, Reliability, Runbook, Monitoring

May 28, 2026

By Misha Lubich

Production RAG Needs Latency Budgets, Not Hope

Category: AI Products | Reading time: 3 min | Tags: RAG, Latency, Performance, Architecture, Product

May 15, 2026

By Misha Lubich

The Eval Budgeting Playbook for 2026

Category: MLOps | Reading time: 4 min | Tags: Evals, Budgeting, Quality, Governance, LLMOps

May 2, 2026

By Misha Lubich

Retrieval Freshness Beats Bigger Models

Category: AI Architecture | Reading time: 1 min | Tags: RAG, Retrieval, LLM, Data Freshness, Search

April 20, 2026

By Misha Lubich

Incident Reviews That Actually Improve Agents

Category: Engineering Culture | Reading time: 1 min | Tags: Postmortems, Agents, Reliability, Process, Leadership

April 12, 2026

By Misha Lubich

Your Context Window Is Not a Memory System

Category: AI Architecture | Reading time: 2 min | Tags: Context Windows, RAG, State, LLM, Architecture

April 6, 2026

By Misha Lubich

From Playground to Prod: A 2026 Checklist That Survives Finance

Category: AI Products | Reading time: 1 min | Tags: Product, Launch, LLMOps, Governance, Checklist

April 5, 2026

By Misha Lubich

Silent Tool Failures Are the Quiet Killer of Agent Reliability

Category: MLOps | Reading time: 1 min | Tags: Agents, Tool Calling, Reliability, Observability, Production

April 5, 2026

By Misha Lubich

AI Cost Control Is the Difference Between a Feature and a Business

Category: AI Products | Reading time: 2 min | Tags: AI Costs, Unit Economics, LLMOps, Product Strategy, Architecture

April 4, 2026

By Misha Lubich

OpenClaw in 2026: The Good, the Bad, and the Lobster-Shaped Elephant in the Room

Category: Open Source | Reading time: 9 min | Tags: OpenClaw, AI Agents, Messaging, Security, Self-Hosting

April 4, 2026

By Misha Lubich

If You Don't Run Evals Before Launch, You Don't Have a Product

Category: AI Architecture | Reading time: 2 min | Tags: AI Evaluation, LLM, Quality, Testing, Production

April 1, 2026

By Misha Lubich

MCP Felt Like Magic on My Laptop. Production Was a Different Animal.

Category: AI Architecture | Reading time: 4 min | Tags: MCP, API Design, Security, Agents, Production

March 29, 2026

By Misha Lubich

I Use AI All Day. I Still Won't Let It Own the Merge.

Category: Engineering Culture | Reading time: 5 min | Tags: Agentic AI, Claude, Engineering Leadership, AI Tools, Process

March 24, 2026

By Misha Lubich

Vibe Coding: The Future of Software or the Biggest Anti-Pattern in History?

Category: Hot Takes | Reading time: 3 min | Tags: Vibe Coding, AI Coding, Best Practices, Hot Takes, Karpathy

February 22, 2026

By Misha Lubich

Is RAG Really Dead in 2026? Not So Fast

Category: AI Architecture | Reading time: 6 min | Tags: RAG, Long Context, LLM, Architecture

February 18, 2026

By Misha Lubich

Cursor Changed How I Code Forever — And I'm Not Going Back to VS Code

Category: AI Products | Reading time: 3 min | Tags: Cursor, AI Coding, Developer Tools, IDE, Productivity

February 5, 2026

By Misha Lubich

Why I Stopped Using LangChain (And You Should Too)

Category: AI Architecture | Reading time: 7 min | Tags: LangChain, Frameworks, Simplicity, Architecture

January 29, 2026

By Misha Lubich

o3, DeepSeek R1, and Why Reasoning Models Change Everything

Category: AI Architecture | Reading time: 2 min | Tags: o3, DeepSeek, Reasoning, Chain of Thought, Architecture

January 25, 2026

By Misha Lubich

CrewAI and Multi-Agent Frameworks: A Production Reality Check

Category: AI Architecture | Reading time: 3 min | Tags: CrewAI, Multi-Agent, AutoGen, LangGraph, Production

January 12, 2026

By Misha Lubich

Agents Are All You Need: The End of Traditional Software Architecture

Category: AI Architecture | Reading time: 7 min | Tags: AI Agents, Architecture, Future, Software Engineering

December 20, 2025

By Misha Lubich

Claude Code Is the First Terminal AI That Actually Works

Category: AI Products | Reading time: 3 min | Tags: Claude Code, Anthropic, Terminal, AI Coding, Developer Tools

December 1, 2025

By Misha Lubich

Prompt Engineering Is Not Engineering — It's Glorified Googling

Category: Hot Takes | Reading time: 2 min | Tags: Prompt Engineering, Career, Hot Takes, Industry

November 30, 2025

By Misha Lubich

AI Code Generation Will Kill Junior Developer Roles by 2027

Category: Engineering Culture | Reading time: 8 min | Tags: AI Coding, Junior Developers, Career, Future of Work

November 8, 2025

By Misha Lubich

Why I Bet My Startup on Open-Source Models (And Won)

Category: Open Source | Reading time: 2 min | Tags: Open Source, Llama, Cost Optimization, Self-Hosting

October 15, 2025

By Misha Lubich

The Great AI Hiring Scam: Why Most AI Teams Ship Nothing

Category: Engineering Culture | Reading time: 8 min | Tags: Hiring, AI Teams, Management, Productivity

September 22, 2025

By Misha Lubich

AI Evaluation Is the Hardest Unsolved Problem in Engineering

Category: MLOps | Reading time: 3 min | Tags: Evaluation, Testing, Quality, MLOps, Best Practices

September 1, 2025

By Misha Lubich

Your ML Pipeline Is Technical Debt Disguised as Innovation

Category: MLOps | Reading time: 2 min | Tags: MLOps, Technical Debt, Pipelines, Infrastructure

August 28, 2025

By Misha Lubich

The MCP Protocol Will Make Every AI Framework Obsolete

Category: AI Architecture | Reading time: 2 min | Tags: MCP, Protocols, Anthropic, Standards

August 5, 2025

By Misha Lubich

Microservices Were a Mistake for ML Systems

Category: MLOps | Reading time: 2 min | Tags: Microservices, Architecture, ML Systems, Infrastructure

July 18, 2025

By Misha Lubich

The Uncomfortable Truth About AI Safety Research

Category: Hot Takes | Reading time: 2 min | Tags: AI Safety, Ethics, Industry, Regulation

June 25, 2025

By Misha Lubich

Stop Building AI Products Nobody Asked For

Category: AI Products | Reading time: 2 min | Tags: Product, Startups, Strategy, Market Fit

May 30, 2025

By Misha Lubich

Fine-Tuning Is the New Prompt Engineering — And You're Doing It Wrong

Category: AI Architecture | Reading time: 2 min | Tags: Fine-Tuning, LLM, Training, Best Practices

April 15, 2025

By Misha Lubich

The Next Model Won't Save You: Why Architecture Matters More Than Model Size

Category: AI Architecture | Reading time: 2 min | Tags: Architecture, LLM, System Design, Best Practices

March 20, 2025

By Misha Lubich