close

DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Mental Framework for Unlocking Agentic Workflows

The Mental Framework for Unlocking Agentic Workflows

Image Image 3
Comments
11 min read
Defluffer promises -45% tokens. I measured the semantic cost of that savings and it's uncomfortable

Defluffer promises -45% tokens. I measured the semantic cost of that savings and it's uncomfortable

Comments
8 min read
I Wrote a Python Interpreter in Python. What I Learned Has Nothing to Do With Python

I Wrote a Python Interpreter in Python. What I Learned Has Nothing to Do With Python

Comments
8 min read
AI Agents That Pass Your Tests. That's the Problem.

AI Agents That Pass Your Tests. That's the Problem.

Comments
9 min read
Why AI feature rollouts fail before the model does

Why AI feature rollouts fail before the model does

Comments
8 min read
Aprenda avaliar a qualidade do seu agente de AI, RAG e LLM

Aprenda avaliar a qualidade do seu agente de AI, RAG e LLM

Image Image Image 5
Comments
22 min read
Lakera Guard Was Acquired for $300M. Here Is the Free Alternative We Built for Developers.

Lakera Guard Was Acquired for $300M. Here Is the Free Alternative We Built for Developers.

Comments
4 min read
MCP Security in 2026: How to Protect Your AI Agents from Prompt Injection

MCP Security in 2026: How to Protect Your AI Agents from Prompt Injection

Comments
7 min read
Defluffer promete -45% en tokens. Yo medí el costo semántico del ahorro y es incómodo

Defluffer promete -45% en tokens. Yo medí el costo semántico del ahorro y es incómodo

Comments
9 min read
When Your LLM Provider Pulls the Rug: Lessons from Anthropic's OAuth Shutdown

When Your LLM Provider Pulls the Rug: Lessons from Anthropic's OAuth Shutdown

Comments
2 min read
Stop prompting "write me an API" — teach the LLM the shape first

Stop prompting "write me an API" — teach the LLM the shape first

Comments
2 min read
Cloudflare Workers HTML to Markdown on the Free Plan

Cloudflare Workers HTML to Markdown on the Free Plan

Comments
5 min read
llama.cpp Speculative Checkpointing, Ollama Multimodal Tool, MLX vs GGUF for Gemma 4

llama.cpp Speculative Checkpointing, Ollama Multimodal Tool, MLX vs GGUF for Gemma 4

Comments
4 min read
The Model Doesn't Matter. The Harness Does.

The Model Doesn't Matter. The Harness Does.

Comments
4 min read
The Rise of Inference Optimization: The Real LLM Infra Trend Shaping 2026

The Rise of Inference Optimization: The Real LLM Infra Trend Shaping 2026

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.