Alex Soto

Aug 20, 2025 • World Congress 2025

Beyond the Hype: Building Trustworthy and Reliable LLM Applications with Guardrails

A single malicious prompt can override your AI's instructions. Learn how to build programmatic guardrails to prevent data leaks and model abuse.

#1about 5 minutes

Understanding the four main categories of LLM attacks

LLM applications face four primary security risks: availability breakdowns, integrity violations, privacy compromises, and abuse, which can be mitigated using guardrails.

#2about 2 minutes

Protecting models from availability breakdown attacks

Implement input guardrails to enforce token limits and output guardrails to detect non-refusal patterns, preventing denial-of-service and identifying model limitations.

#3about 5 minutes

Ensuring model integrity with content validation guardrails

Use guardrails to filter gibberish, enforce language consistency, block malicious URLs, check for relevance, and manage response length to maintain output quality.

#4about 3 minutes

Understanding and defending against prompt injection attacks

Prompt injection manipulates an AI model by embedding malicious instructions within user input, similar to SQL injection, requiring specific guardrails for detection.

#5about 3 minutes

Protecting sensitive data with privacy guardrails

Use anonymizers like Microsoft Presidio to detect and redact sensitive information such as names and phone numbers from both user inputs and model outputs.

#6about 4 minutes

Preventing model abuse and harmful content generation

Implement guardrails to block code execution, filter competitor mentions, detect toxicity and bias, and defend against 'Do Anything Now' (DAN) jailbreaking attacks.

#7about 4 minutes

Implementing guardrails with a practical code example

A demonstration in Java shows how to create input and output guardrails that use a model to detect violent content and verify URL reachability before processing.

#8about 2 minutes

Addressing unique security risks in RAG systems

Retrieval-Augmented Generation (RAG) introduces new vulnerabilities, such as poisoned documents and vector store attacks, that require specialized security measures.

#9about 2 minutes

Key takeaways for building secure LLM applications

Building trustworthy AI requires a strategic application of guardrails tailored to your specific needs, balancing security with performance to navigate the complex landscape.

Understanding AI security risks for developers

03:35 MIN

Understanding AI security risks for developers

The AI Security Survival Guide: Practical Advice for Stressed-Out Developers

Final advice on security and responsible AI usage

01:51 MIN

Final advice on security and responsible AI usage

WeAreDevelopers LIVE - Chrome for Sale? Comet - the upcoming perplexity browser Stealing and leaking

Analyzing the risks and architecture of current AI models

04:34 MIN

Analyzing the risks and architecture of current AI models

Opening Keynote by Sir Tim Berners-Lee

The rapid adoption of LLMs outpaces security practices

02:02 MIN

The rapid adoption of LLMs outpaces security practices

ChatGPT, ignore the above instructions! Prompt injection attacks and how to avoid them.

AI privacy concerns and prompt engineering

03:43 MIN

AI privacy concerns and prompt engineering

Coffee with Developers - Cassidy Williams -

Addressing data privacy and security in AI systems

01:40 MIN

Addressing data privacy and security in AI systems

Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j

Addressing security risks and guardrails for agentic AI

05:16 MIN

Addressing security risks and guardrails for agentic AI

Unlocking Value from Data: The Key to Smarter Business Decisions-

The ethical risks of outdated and insecure AI models

02:19 MIN

The ethical risks of outdated and insecure AI models

AI & Ethics

Featured Partners

Prompt Injection, Poisoning & More: The Dark Side of LLMs

Prompt Injection, Poisoning & More: The Dark Side of LLMs

Keno Dreßel

about 7 months ago • World Congress 2025

The AI Security Survival Guide: Practical Advice for Stressed-Out Developers

The AI Security Survival Guide: Practical Advice for Stressed-Out Developers

Mackenzie Jackson

about 2 years ago • World Congress 2024

AI: Superhero or Supervillain? How and Why with Scott Hanselman

AI: Superhero or Supervillain? How and Why with Scott Hanselman

Scott Hanselman

about 2 years ago • World Congress 2024

You are not my model anymore - understanding LLM model behavior

You are not my model anymore - understanding LLM model behavior

Andreas Erben

about 7 months ago • World Congress 2025

The State of GenAI & Machine Learning in 2025

The State of GenAI & Machine Learning in 2025

Alejandro Saucedo

about 7 months ago • World Congress 2025

Can Machines Dream of Secure Code? Emerging AI Security Risks in LLM-driven Developer Tools

Can Machines Dream of Secure Code? Emerging AI Security Risks in LLM-driven Developer Tools

Liran Tal

about 2 years ago • WeAreDevelopers LIVE

Manipulating The Machine: Prompt Injections And Counter Measures

Manipulating The Machine: Prompt Injections And Counter Measures

Georg Dresler

about 2 years ago • World Congress 2024

Data Privacy in LLMs: Challenges and Best Practices

Data Privacy in LLMs: Challenges and Best Practices

Aditi Godbole

about 2 years ago • WeAreDevelopers LIVE

Related Articles

View all articles

Daniel Cranney

Panel Discussion: Responsible AI in Practice - Real-World Examples and Challenges

IntroductionIn the ever-evolving landscape of artificial intelligence, the concept of "responsible AI" has emerged as a cornerstone for ethical and practical AI implementation. During the WWC24 Panel discussion, three eminent experts—Mina, Bjorn Brin...

Panel Discussion: Responsible AI in Practice - Real-World Examples and Challenges

Chris Heilmann

Exploring AI: Opportunities and Risks for Developers

In today's rapidly evolving tech landscape, the integration of Artificial Intelligence (AI) in development presents both exciting opportunities and notable risks. This dynamic was the focus of a recent panel discussion featuring industry experts Kent...

Exploring AI: Opportunities and Risks for Developers

Chris Heilmann

Dev Digest 138 - Are you secure about this?

Hello there! This is the 2nd "out of the can" edition of 3 as I am on vacation in Greece eating lovely things on the beach. So, fewer news, but lots of great resources. Many around the topic of security. Enjoy! News and ArticlesGoogle Pixel phones t...

Dev Digest 138 - Are you secure about this?

Chris Heilmann

Dev Digest 116 - WWWAI?

This time, learn how to un-AI Google's search results, what's new on the web, avoid a new security hole and go back to BASICS with us. News and ArticlesWhat a week. Google, Microsoft, OpenAI and many others had their big flagship events announcing th...

Dev Digest 116 - WWWAI?

From learning to earning

Jobs that call for the skills explored in this talk.

AI Governance Consultant

TRUSTEQ GmbH

ML Data Engineer - Object Detection & Active Learning

autonomous-teaming

Remote

NoSQL

NumPy

Pandas

Docker

ML Data Engineer - Object Detection & Active Learning

autonomous-teaming

Remote

NoSQL

NumPy

Pandas

Docker

Lead AI Governance & Platform Engineer

q.beyond AG

Remote

Senior

Kubernetes

Continuous Integration

AI & Embedded ML Engineer (Real-Time Edge Optimization)

autonomous-teaming

Remote

GIT

Linux

PyTorch

Conversational AI & Machine Learning Engineer

Deloitte

Machine Learning

Conversational AI & Machine Learning Engineer

Deloitte

DevOps

Docker

PyTorch

Tensorflow

Kubernetes

+2

Business Analyst- Data Privacy & AI Compliance

LivePerson

Remote

Senior

AI Governance Senior Consultant

TRUSTEQ GmbH

Senior