Chris Heilmann, Daniel Cranney, Raphael De Lio & Developer Advocate at Redis
WeAreDevelopers LIVE - Vector Similarity Search Patterns for Efficiency and more
What if you could answer questions and route requests without ever calling an LLM? Explore three vector search patterns for building faster, more cost-effective AI applications.
#1about 8 minutes
Getting hired through open source and passion projects
Hear how contributing to open source and sharing your work publicly can lead directly to job opportunities in developer advocacy.
#2about 5 minutes
How critical analysis can accelerate your career
Discover how publicly analyzing and improving upon existing technologies can make you a highly visible and attractive candidate for top companies.
#3about 3 minutes
The hidden costs of large LLM context windows
Understand why simply using larger context windows in models like GPT-5 is not a scalable or cost-effective solution for production applications.
#4about 3 minutes
A quick primer on vectors and vector search
A brief explanation of how text is converted into numerical vectors to represent its semantic meaning, enabling similarity searches.
#5about 9 minutes
Using semantic classification to categorize text
Learn how to use a vector database with reference examples to classify text, avoiding costly LLM calls for simple categorization tasks.
#6about 5 minutes
Implementing semantic routing for tool calling and guardrails
Discover how to use semantic routing to direct user prompts to the correct function or to block inappropriate topics without involving an LLM.
#7about 6 minutes
Reducing latency and cost with semantic caching
Implement semantic caching to store and retrieve answers for semantically similar user questions, drastically reducing redundant LLM calls and improving response time.
#8about 6 minutes
Optimizing accuracy for classification and tool calling
Explore techniques like self-improvement, hybrid fallbacks, and prompt chunking to fine-tune and improve the accuracy of your semantic patterns.
#9about 4 minutes
Advanced caching with specialized embedding models
Learn how to avoid common caching pitfalls, such as misinterpreting negation, by using specialized embedding models trained for semantic caching.
#10about 16 minutes
Q&A on data freshness, persistence, and management
The discussion covers practical considerations like preventing stale cache data with TTL, managing data ownership, and how Redis handles persistence.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
06:47 MIN
Strategies for optimizing vector search accuracy
Reducing LLM Calls with Vector Search Patterns - Raphael De Lio (Redis)
05:25 MIN
Reducing latency and cost with semantic caching
Reducing LLM Calls with Vector Search Patterns - Raphael De Lio (Redis)
04:30 MIN
Advanced patterns for building sophisticated AI applications
Java Meets AI: Empowering Spring Developers to Build Intelligent Apps
01:48 MIN
Solving LLM limitations with RAG and vector databases
Accelerating GenAI Development: Harnessing Astra DB Vector Store and Langflow for LLM-Powered Apps
03:21 MIN
Using caching to serve pre-generated AI responses
Performant Architecture for a Fast Gen AI User Experience
05:19 MIN
Vector search as the memory layer for RAG and Agentic AI
How to Decipher User Uncertainty with GenAI and Vector Search
03:10 MIN
A rapid-fire look at AI tools and buzzwords
Rethinking Customer Experience in the Age of AI
00:56 MIN
Strategies for integrating local LLMs with your data
Dev Digest 134 - Where pixels sing?News and ArticlesWeAreDevelopers LIVE Data and Security Day is on Wednesday, 25/09/2024. Learn about OPC UA Updates, Best Practices for Using GitHub Secrets, Passwordless Web 1.5, Emerging AI Security Risks, Data Privacy in LLMs and get a chance to t...
Chris Heilmann
WeAreDevelopers Dev Digest Issue 116 - The new search wars…Welcome to edition 116 of the WeAreDevelopers Dev Digest. This time we talk about how the fight for AI and search dominance heats up with Google releasing a lot at their I/O event and OpenAI doing the same a day earlier…News and ArticlesA ton of thin...
Chris Heilmann
Dev Digest 116 - WWWAI?This time, learn how to un-AI Google's search results, what's new on the web, avoid a new security hole and go back to BASICS with us. News and ArticlesWhat a week. Google, Microsoft, OpenAI and many others had their big flagship events announcing th...
Chris Heilmann
Exploring AI: Opportunities and Risks for DevelopersIn today's rapidly evolving tech landscape, the integration of Artificial Intelligence (AI) in development presents both exciting opportunities and notable risks. This dynamic was the focus of a recent panel discussion featuring industry experts Kent...
From learning to earning
Jobs that call for the skills explored in this talk.