Protect the whole system
LLM security is not only model safety. The system includes prompts, retrieval, tools, user permissions, logs, secrets, output handling, and human review.
- Test indirect prompt injection through documents and webpages.
- Validate tool arguments and permissions outside the model.
- Log security-relevant traces without exposing sensitive data broadly.