Confuse, Obfuscate, Disrupt: Using Adversarial Techniques for Better AI and True Anonymity
What if a single pixel could trick your AI into seeing a cat as a dog? Learn how adversarial attacks can expose hidden flaws and build more resilient systems.
#1 - about 1 minute
The importance of explainable AI and data quality
AI models are only as good as their training data, which is often plagued by bias, noise, and inaccuracies that explainable AI helps to uncover.
#2 - about 3 minutes
Identifying common data inconsistencies in AI models
Models can be compromised by issues like annotation errors, data imbalance, and adversarial samples, which can be measured with tools like Captum.
#3 - about 2 minutes
The dual purpose of adversarial AI attacks
Intentionally crafted adversarial inputs serve two purposes: defensively, to probe a model's boundaries and harden it, and disruptively, to obfuscate data and shield personal information from automated analysis.
#4 - about 3 minutes
How to confuse NLP models with creative inputs
Natural language processing models can be disrupted using techniques like encoding, code-switching, misspellings, and even metaphors to prevent accurate interpretation.
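The encoding trick above can be sketched with a toy example (all names hypothetical, pure Python): swapping Latin letters for visually identical Cyrillic homoglyphs leaves the text readable to humans but unrecognizable to an exact-match word filter.

```python
# Hypothetical sketch: disrupt a naive NLP pipeline by swapping Latin letters
# for visually identical Cyrillic homoglyphs. A human reads the text normally,
# but exact-match tokenizers and word lists no longer recognize the words.

HOMOGLYPHS = {"a": "\u0430", "e": "\u0435", "o": "\u043e", "p": "\u0440", "c": "\u0441"}

def obfuscate(text: str) -> str:
    """Replace selected ASCII letters with Cyrillic look-alikes."""
    return "".join(HOMOGLYPHS.get(ch, ch) for ch in text)

BLOCKLIST = {"cat", "dog"}

def naive_filter(text: str) -> bool:
    """Toy content filter: True if any blocklisted word appears verbatim."""
    return any(word in BLOCKLIST for word in text.lower().split())

original = "my cat sleeps"
disguised = obfuscate(original)

print(naive_filter(original))   # True  - the filter catches the plain text
print(naive_filter(disguised))  # False - homoglyphs evade the exact match
```

The same principle underlies the code-switching and misspelling techniques: the model's lookup fails while the human meaning survives.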
#5 - about 4 minutes
Visualizing model predictions with the Captum library
The Captum library for PyTorch helps visualize which parts of an input, like words in a sentence or pixels in an image, contribute most to a model's final prediction.
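Captum's real API runs on PyTorch models; the core idea behind one of its attribution methods, occlusion, can be sketched framework-free with a toy bag-of-words scorer (all names and weights hypothetical): remove one input feature at a time and record how much the model's score changes.

```python
# Framework-free sketch of occlusion attribution, the idea behind one of the
# methods Captum provides for PyTorch. The toy "model" below is a
# hypothetical bag-of-words sentiment scorer, not a real network.

WEIGHTS = {"great": 2.0, "good": 1.0, "boring": -2.0}

def model(tokens: list) -> float:
    """Toy sentiment score: sum of per-word weights."""
    return sum(WEIGHTS.get(t, 0.0) for t in tokens)

def occlusion_attribution(tokens: list) -> dict:
    """Attribution of each token = score drop when that token is removed."""
    base = model(tokens)
    return {
        t: base - model(tokens[:i] + tokens[i + 1:])
        for i, t in enumerate(tokens)
    }

attr = occlusion_attribution(["a", "great", "but", "boring", "movie"])
print(attr)  # 'great' -> 2.0, 'boring' -> -2.0, filler words -> 0.0
```

Visualizing these per-token (or per-pixel) scores as a heatmap is exactly what the talk demonstrates with Captum.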
#6 - about 6 minutes
Manipulating model outputs with subtle input changes
Simple misspellings can flip a sentiment analysis result from positive to negative, and adding a single pixel can cause an image classifier to misidentify a cat as a dog.
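The single-pixel effect can be illustrated with a deliberately fragile toy classifier (hypothetical, not the model from the talk): when an input sits near the decision boundary, overwriting one pixel is enough to flip the label.

```python
# Hypothetical sketch of a single-pixel attack. The toy "model" labels a
# tiny grayscale image "dark" or "bright" from its mean pixel value; pushing
# one pixel to an extreme flips the label when the image is near the
# decision boundary.

def classify(image: list) -> str:
    """Toy classifier: label by mean brightness (0-255), threshold 128."""
    pixels = [p for row in image for p in row]
    return "bright" if sum(pixels) / len(pixels) >= 128 else "dark"

def one_pixel_attack(image: list, row: int, col: int, value: int) -> list:
    """Return a copy of the image with a single pixel overwritten."""
    attacked = [list(r) for r in image]
    attacked[row][col] = value
    return attacked

# A 2x2 image just below the "bright" threshold (mean 127.5 -> "dark").
image = [[127, 128], [127, 128]]
print(classify(image))                               # dark
print(classify(one_pixel_attack(image, 0, 0, 255)))  # bright
```

Real attacks search for the most effective pixel and value instead of picking them by hand, but the mechanism is the same.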
#7 - about 2 minutes
Using an adversarial pattern t-shirt to evade detection
A t-shirt printed with a specific adversarial pattern can disrupt a real-time person detection model, effectively making the wearer invisible to the AI system.
#8 - about 2 minutes
Techniques for defending models against adversarial attacks
Defenses against NLP attacks include input normalization and grammar checks, while vision attacks can be mitigated with image blurring, bit-depth reduction, or adversarial training on perturbed examples generated with methods like FGSM.
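Bit-depth reduction, one of the vision defenses mentioned, is simple enough to sketch in a few lines (toy values, hypothetical names): quantizing pixels to fewer bits erases small adversarial perturbations while keeping the coarse image intact.

```python
# Hypothetical sketch of bit-depth reduction ("feature squeezing") as a
# defense: quantizing 8-bit pixel values down to fewer bits snaps nearby
# values to the same level, wiping out small adversarial nudges.

def reduce_bit_depth(pixels: list, bits: int = 3) -> list:
    """Quantize 8-bit pixel values (0-255) down to `bits` bits."""
    step = 256 // (2 ** bits)
    return [(p // step) * step for p in pixels]

clean = [100, 101, 102]
perturbed = [100, 108, 102]  # small adversarial nudge on the middle pixel

print(reduce_bit_depth(clean))      # [96, 96, 96]
print(reduce_bit_depth(perturbed))  # [96, 96, 96] - the nudge is squeezed out
```

The trade-off is a loss of fine detail, which is why such defenses are usually combined with others rather than used alone.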
#9 - about 2 minutes
Defeating a single-pixel attack with image blurring
Applying a simple Gaussian blur to an image containing an adversarial pixel smooths out the manipulation, allowing the model to correctly classify the image.
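A minimal sketch of the blurring idea (shown in 1D for brevity, with a 3-tap Gaussian-style kernel as an assumption): averaging each pixel with its neighbours pulls a lone adversarial pixel back toward the values around it.

```python
# Hypothetical sketch of blurring as a defense: convolving with a small
# Gaussian-style kernel averages each pixel with its neighbours, so a lone
# adversarial pixel is smoothed back toward its surroundings. Shown in 1D;
# images apply the same kernel in two dimensions.

KERNEL = [0.25, 0.5, 0.25]  # 3-tap approximation of a Gaussian

def blur(row: list) -> list:
    """Convolve a pixel row with KERNEL, replicating edge pixels."""
    n = len(row)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(KERNEL, start=-1):
            j = min(max(i + k, 0), n - 1)  # clamp index at the borders
            acc += w * row[j]
        out.append(acc)
    return out

row = [100, 100, 255, 100, 100]  # the adversarial pixel sits at index 2
print(blur(row))  # the spike at index 2 drops from 255 to 177.5
```

After the spike is flattened, the classifier sees values close to the clean image again, which is why the simple defense works against the single-pixel attack from the talk.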
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
03:05 MIN
Understanding security risks from adversarial attacks on models
Explainable machine learning explained
04:38 MIN
Fundamental AI vulnerabilities and malicious misuse
A hundred ways to wreck your AI - the (in)security of machine learning systems
06:27 MIN
Deconstructing AI attacks from evasion to model stealing
A hundred ways to wreck your AI - the (in)security of machine learning systems
09:15 MIN
Navigating the new landscape of AI and cybersecurity
From Monolith Tinkering to Modern Software Development
02:03 MIN
Q&A on creating patterns and de-poisoning images
Hacking AI - how attackers impose their will on AI
02:28 MIN
Understanding the core principles of hacking AI systems
Hacking AI - how attackers impose their will on AI
03:43 MIN
AI privacy concerns and prompt engineering
Coffee with Developers - Cassidy Williams
05:17 MIN
Manipulating AI with prompt injection and hidden commands
WeAreDevelopers LIVE - Is Software Ever Truly Accessible?
Panel Discussion: Responsible AI in Practice - Real-World Examples and Challenges
In the ever-evolving landscape of artificial intelligence, the concept of "responsible AI" has emerged as a cornerstone for ethical and practical AI implementation. During the WWC24 Panel discussion, three eminent experts - Mina, Bjorn Brin...
Chris Heilmann
WWC24 Talk - Scott Hanselman - AI: Superhero or Supervillain?
Join Scott Hanselman at WWC24 to explore AI's role as a superhero or supervillain. Scott shares his 32 years of experience in software engineering, discusses AI myths, ethical dilemmas, and tech advancements. Engage with his live demos and insights o...
Chris Heilmann
Exploring AI: Opportunities and Risks for Developers
In today's rapidly evolving tech landscape, the integration of Artificial Intelligence (AI) in development presents both exciting opportunities and notable risks. This dynamic was the focus of a recent panel discussion featuring industry experts Kent...
Chris Heilmann
Dev Digest 116 - WWWAI?
This time, learn how to un-AI Google's search results, what's new on the web, avoid a new security hole and go back to BASICS with us. News and Articles: What a week. Google, Microsoft, OpenAI and many others had their big flagship events announcing th...