Phil Nash

Aug 20, 2024 • World Congress 2024

Build RAG from Scratch

You don't need complex tools to start with RAG. This session builds a surprisingly effective system from scratch using basic vectorization and cosine similarity.

#1about 3 minutes

Why large language models need retrieval augmented generation

Large language models have knowledge cutoffs and lack access to private data, a problem solved by providing relevant context at query time using RAG.

#2about 1 minute

How similarity search and vector embeddings power RAG

RAG relies on similarity search, not keyword search, which captures meaning by converting text into numerical representations called vector embeddings.

#3about 6 minutes

Building a simple bag-of-words vectorizer from scratch

A basic vector embedding can be created by tokenizing text, building a vocabulary of unique words, and representing each document as a vector of word counts.

#4about 8 minutes

Comparing document vectors using cosine similarity

Cosine similarity measures the angle between two vectors to determine their semantic closeness by focusing on direction (meaning) rather than magnitude.

#5about 3 minutes

Understanding the limitations of a bag-of-words model

The simple bag-of-words model is sensitive to vocabulary, slow to scale, and fails to capture nuanced semantic meaning like word order or synonyms.

#6about 4 minutes

Using professional embedding models and vector databases

Production RAG systems use sophisticated embedding models and specialized vector databases for efficient, accurate, and scalable similarity search.

#7about 2 minutes

Exploring advanced RAG techniques and other applications

Beyond basic similarity search, techniques like ColBERT and knowledge graphs can improve retrieval accuracy, and vector search can power features like related content recommendations.

Understanding retrieval-augmented generation systems

03:45 MIN

Understanding retrieval-augmented generation systems

AI Model Management Life Circles: ML Ops For Generative AI Models From Research to Deployment

Powering real-time AI with retrieval augmented generation

02:42 MIN

Powering real-time AI with retrieval augmented generation

Scrape, Train, Predict: The Lifecycle of Data for AI Applications

Understanding Retrieval-Augmented Generation (RAG)

02:53 MIN

Understanding Retrieval-Augmented Generation (RAG)

Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j

How retrieval-augmented generation (RAG) works

01:19 MIN

How retrieval-augmented generation (RAG) works

Make it simple, using generative AI to accelerate learning

Understanding retrieval-augmented generation (RAG)

05:31 MIN

Understanding retrieval-augmented generation (RAG)

Exploring LLMs across clouds

What is Retrieval Augmented Generation (RAG)?

01:59 MIN

What is Retrieval Augmented Generation (RAG)?

Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB

Enhancing AI responses with retrieval augmented generation

01:49 MIN

Enhancing AI responses with retrieval augmented generation

Bringing the power of AI to your application.

A deep dive into retrieval-augmented generation

04:10 MIN

A deep dive into retrieval-augmented generation

Lies, Damned Lies and Large Language Models

Featured Partners

Building Blocks of RAG: From Understanding to Implementation

Building Blocks of RAG: From Understanding to Implementation

Ashish Sharma

about a year ago • WeAreDevelopers LIVE

Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation

Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation

Carl Lapierre

about a year ago • World Congress 2024

Large Language Models ❤️ Knowledge Graphs

Large Language Models ❤️ Knowledge Graphs

Michael Hunger

about 2 years ago • World Congress 2024

Make it simple, using generative AI to accelerate learning

Make it simple, using generative AI to accelerate learning

Duan Lightfoot

about 2 years ago • World Congress 2024

Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB

Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB

Dieter Flick

about 2 years ago • World Congress 2023

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon

about 11 months ago • WeAreDevelopers LIVE

Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j

Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j

about a year ago

Using LLMs in your Product

Using LLMs in your Product

Daniel Töws

about 2 years ago • World Congress 2024

Related Articles

View all articles

Daniel Cranney

How to Use Generative AI to Accelerate Learning to Code

It’s undeniable that generative-AI and LLMs have transformed how developers work. Hours of hunting Stack Overflow can be avoided by asking your AI-code assistant, multi-file context can be fed to the AI from inside your IDE, and applications can be b...

How to Use Generative AI to Accelerate Learning to Code

Daniel Cranney

Panel Discussion: Responsible AI in Practice - Real-World Examples and Challenges

IntroductionIn the ever-evolving landscape of artificial intelligence, the concept of "responsible AI" has emerged as a cornerstone for ethical and practical AI implementation. During the WWC24 Panel discussion, three eminent experts—Mina, Bjorn Brin...

Panel Discussion: Responsible AI in Practice - Real-World Examples and Challenges

Daniel Cranney

Stephan Gillich - Bringing AI Everywhere

In the ever-evolving world of technology, AI continues to be the frontier for innovation and transformation. Stephan Gillich, from the AI Center of Excellence at Intel, dove into the subject in a recent session titled "Bringing AI Everywhere," sheddi...

Stephan Gillich - Bringing AI Everywhere

Eli McGarvie

16 Ways Developers Can Use ChatGPT-4 and GPT-4o

ChatGPT has been busy getting new designations. If you’ve been scrolling on 𝕏 over the last week, then you’ve seen the ChatGPT-4o announcement and probably thought of Joaquin Phoenix’s virtual girlfriend on Her.Beyond the references to flicks, the la...

16 Ways Developers Can Use ChatGPT-4 and GPT-4o

From learning to earning

Jobs that call for the skills explored in this talk.

Part-Time - AI Operations Support (Voice AI & Automation)

Auralinx

Remote

AI Engineer - Generative AI /pixelhead)

Conrad Electronic SE

AI & Embedded ML Engineer (Real-Time Edge Optimization)

autonomous-teaming

Remote

GIT

Linux

PyTorch

Conversational AI & Machine Learning Engineer

Deloitte

Machine Learning

Conversational AI & Machine Learning Engineer

Deloitte

DevOps

Docker

PyTorch

Tensorflow

Kubernetes

+2

Fullstack Web Entwickler - Next.js & AI

Rocken AG

Next.js

TypeScript

Product Owner Generative AI

univativ GmbH & Co. KG

€88-98K

JIRA

Confluence

Continuous Integration

AI Software Engineer

Ratbacher GmbH

Remote

€60K

GIT

Machine Learning

Generative AI Lead (Legacy Code Conversion)

Amdocs
Kontich, Belgium

Senior

Terraform

Kubernetes

Machine Learning

Continuous Integration