Red-teaming an LLM app before users do
Jailbreaks, prompt injection and PII leakage — a checklist mapped to the OWASP LLM Top 10.
SecurityRed-teamOWASP
Field notes on shipping AI you can trust — evals, RAG, agents, and the engineering discipline around them.
Jailbreaks, prompt injection and PII leakage — a checklist mapped to the OWASP LLM Top 10.
Tool-use orchestration with guardrails on every hop, and how to keep the bill predictable.
When to retry retrieval, when to abstain, and how to wire the feedback loop.
The traces, metrics and cost signals worth wiring from day one — not after the incident.