Jodie Burchell

Oct 30, 2024 • WeAreDevelopers LIVE

Lies, Damned Lies and Large Language Models

What if 40% of your LLM's answers are just plain wrong? Learn how to measure factuality and build more reliable AI applications.

#1about 2 minutes

Understanding the dual nature of large language models

LLMs can generate both creative, coherent text and factually incorrect "hallucinations," posing a significant challenge for real-world applications.

#2about 4 minutes

The architecture and evolution of LLMs

The combination of the scalable Transformer architecture and massive text datasets enables models like GPT to develop "parametric knowledge" as they grow in size.

#3about 3 minutes

How training data quality influences model behavior

The quality of web-scraped datasets like Common Crawl, even after filtering, directly contributes to model hallucinations by embedding misinformation.

#4about 2 minutes

Differentiating between faithfulness and factuality hallucinations

Hallucinations are categorized as either faithfulness errors, which contradict a given source text, or factuality errors, which stem from incorrect learned knowledge.

#5about 3 minutes

Using the TruthfulQA dataset to measure misinformation

The TruthfulQA dataset provides a benchmark for measuring an LLM's tendency to repeat common misconceptions and conspiracy theories across various categories.

#6about 6 minutes

A practical guide to benchmarking LLM hallucinations

A step-by-step demonstration shows how to use Python, LangChain, and Hugging Face Datasets to run the TruthfulQA benchmark on a model like GPT-3.5 Turbo.

#7about 4 minutes

Exploring strategies to reduce LLM hallucinations

Key techniques to mitigate hallucinations include careful prompt crafting, domain-specific fine-tuning, output evaluation, and retrieval-augmented generation (RAG).

#8about 4 minutes

A deep dive into retrieval-augmented generation

RAG reduces hallucinations by augmenting prompts with relevant, up-to-date information retrieved from a vector database of document embeddings.

#9about 2 minutes

Overcoming challenges with advanced RAG techniques

Naive RAG can fail due to poor retrieval or generation, but advanced methods like Rowan selectively apply retrieval to significantly improve factuality.

Understanding the problem of LLM hallucinations

02:29 MIN

Understanding the problem of LLM hallucinations

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Addressing the key challenges of large language models

02:55 MIN

Addressing the key challenges of large language models

Large Language Models ❤️ Knowledge Graphs

Explaining how large language models work and why they hallucinate

05:49 MIN

Explaining how large language models work and why they hallucinate

Innovating Developer Tools with AI: Insights from GitHub Next

Why web data is essential for training large language models

01:27 MIN

Why web data is essential for training large language models

How to scrape modern websites to feed AI agents

Addressing the core challenges of large language models

05:18 MIN

Addressing the core challenges of large language models

Accelerating GenAI Development: Harnessing Astra DB Vector Store and Langflow for LLM-Powered Apps

Understanding the risks of large language models

06:47 MIN

Understanding the risks of large language models

Inside the Mind of an LLM

Understanding the limitations of large language models

02:20 MIN

Understanding the limitations of large language models

Knowledge graph based chatbot

Demonstrating LLM hallucinations with tricky questions

06:55 MIN

Demonstrating LLM hallucinations with tricky questions

Give Your LLMs a Left Brain

Featured Partners

Creating Industry ready solutions with LLM Models

Creating Industry ready solutions with LLM Models

Vijay Krishan Gupta & Gauravdeep Singh Lotey

about 2 years ago • WeAreDevelopers LIVE

What do language models really learn

What do language models really learn

Tanmay Bakshi

about 6 years ago • WeAreDevelopers LIVE

Data Privacy in LLMs: Challenges and Best Practices

Data Privacy in LLMs: Challenges and Best Practices

Aditi Godbole

about 2 years ago • WeAreDevelopers LIVE

How to Avoid LLM Pitfalls - Mete Atamel and Guillaume Laforge

How to Avoid LLM Pitfalls - Mete Atamel and Guillaume Laforge

Meta Atamel & Guillaume Laforge

about 11 months ago • Coffee With Developers

Large Language Models ❤️ Knowledge Graphs

Large Language Models ❤️ Knowledge Graphs

Michael Hunger

about 2 years ago • World Congress 2024

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon

about 11 months ago • WeAreDevelopers LIVE

A beginner’s guide to modern natural language processing

A beginner’s guide to modern natural language processing

Jodie Burchell

about 2 years ago • WeAreDevelopers LIVE

Inside the Mind of an LLM

Inside the Mind of an LLM

Emanuele Fabbiani

about 7 months ago • World Congress 2025

Related Articles

View all articles

Luis Minvielle

What Are Large Language Models?

Developers and writers can finally agree on one thing: Large Language Models, the subset of AIs that drive ChatGPT and its competitors, are stunning tech creations. Developers enjoying the likes of GitHub Copilot know the feeling: this new kind of te...

What Are Large Language Models?

Daniel Cranney

How machine learning can help us tell fact from fiction

A decade ago, machine learning was everywhere. While the rise of generative AI has meant artificial intelligence has stolen the spotlight to some degree, it’s machine learning (ML) that silently powers its most impressive achievements.From chatbots t...

How machine learning can help us tell fact from fiction

Krissy Davis

The Best Large Language Models on The Market

Large language models are sophisticated programs that enable machines to comprehend and generate human-like text. They have been the foundation of natural language processing for almost a decade. Although generative AI has only recently gained popula...

The Best Large Language Models on The Market

Chris Heilmann

Dev Digest 137 - AI'm not sure about this

Hello fellow developer, this is the 1st "out of the can" edition of 3 as I am on vacation in Greece going "whee are you cute" at donkeys. So, fewer news, but lots of great resources. Enjoy! News and ArticlesOpenAI has been the big topic winning in th...

Dev Digest 137 - AI'm not sure about this

From learning to earning

Jobs that call for the skills explored in this talk.

Machine Learning & Data Engineer

vengine GmbH
Hamburg, Germany

Junior

Intermediate

Python

Project Manager, Electrical (Machine Learning)

Linesight

Machine Learning

Conversational AI & Machine Learning Engineer

Deloitte

Machine Learning

Conversational AI & Machine Learning Engineer

Deloitte

DevOps

Docker

PyTorch

Tensorflow

Kubernetes

+2

Machine Learning Expert (Llms)

NLP People
Municipality of Madrid, Spain

Senior

Machine Learning

Machine Learning Engineer

JD.COM INTERNATIONAL UK LTD

Machine Learning

Machine Learning Engineer

JD.COM INTERNATIONAL UK LTD

Machine Learning

Phd Position On "human-centered Design And Evaluation Of Learning Analytics And Ai Tools In Edu[...]

Universidad De Valladolid
Municipality of Valladolid, Spain

€17K

Data analysis

Machine Learning

AI Knowledge Base Developer

4Com GmbH & Co. KG

Docker

GraphQL

Kubernetes

Data analysis

Machine Learning

+1