AI Hacking 101

Skip to Scheduled Dates

Course Overview

The AI Hacking 101 ILT teaches students the fundamentals of penetration testing AI/LLM based applications such as customer facing chatbots.

The course focuses on demonstrating how to detect and exploit common AI vulnerabilities such as:

  • Prompt Injection
  • Sensitive Information Disclosure
  • Improper Output Handling
  • System Prompt Leakage
  • Misinformation
  • Excessive Agency

Not only will students learn about these exploits, but they will also spend hands-on time in a custom-built environment exploiting and uncovering these vulnerabilities. The online lab features the TCM Vulnerable Chatbot, a customer service chatbot that can interact with customers' tickets and improve its responses via Retrieval Augmented Generation (RAG) using the company's knowledge base.

Who Should Attend

  • Penetration testers looking to add AI/LLM pen testing to their tool kit
  • Developers working with AI and LLM applications
  • Defenders looking to understand AI risks and how they can impact their organizations
  • Anyone interested in AI and its risks and dangers

Course Outline

1 - AI Fundamentals Review

  • A quick review of some of the fundamentals of AI such as how they operate and standard terms such as model parameters, temperature, top-p, inference, training, LLMs.

2 – AI Threat Model

  • Discuss the threat actors, assets, adversary goals and attack surfaces for modern AI applications and the specific AI application used in the course

3 - Reconnaissance, Model Mapping and Baseline Behavior and Fingerprinting

  • Demonstrate techniques for performing reconnaissance of AI applications with a specific focus on fingerprinting underlying AI models and their settings.

4 - Prompt Injection and Jailbreaking

  • Demonstrate common techniques for prompt injection and jail breaking

5 - Prompt Injection Tools and Resources

  • Show common tools and repositories of prompts used for prompt injection and jailbreaking

6 - Bypassing Common Protections

  • Showcase how to bypass common protections for prompt injection such as input/output filtering

7 - Testing for harmful output/hate speech/misinformation/off-topic content and resource drainage

  • Demonstrate tests for verifying the model responds correctly to requests for generating harmful or Off-topic content or attempts to waste resources.

8 - Data Exfiltration

  • Demonstrate how retrieval augmented generation works and vulnerabilities associated with it such as leakage of confidential material and PII.

9 - RAG and Vector DB Attacks

  • Demonstrate attacks the focus on the retrieval of documents and the ticket base, showcase vector poisoning attacks.

10 – Excessive Agency

  • Demonstrate how excessive agency in applications can be exploited and tested for.

< Back to Course Search

Class Dates & Times

Class times are listed Eastern time

This is a 1-day class

Price: $699.00

NCLGISA Price: $489.30

Class dates not listed.
Please contact us for available dates and times.