1 What is Prompt Injection?

# Understanding Prompt Injection Prompt injection is a type of attack where malicious input manipulates an AI system's behavior by interfering with its original instructions. ## How it works: 1. **System Prompt**: The AI receives initial instructions (usually hidden from users) 2. **User Input**: The user provides input that should be processed according to the system prompt 3. **Injection**: Malicious user input attempts to override or modify the system instructions 4. **Unintended Behavior**: The AI may follow the injected instructions instead of the original ones ## Real-world analogy: Think of it like giving someone directions to a restaurant, but then someone else whispers different directions in their ear. They might end up following the wrong directions instead of yours.
Interactive Demo

Share Your Feedback

Help us improve the AI Security Lab

We value your feedback and suggestions to make this platform better for everyone.

Submit Feedback

Opens in a new window • Your responses help improve this educational platform

Created by Maria Singh

Cybersecurity Professional & AI Security Researcher

Agile Security Transformation