Vision for Websites: Training Your Frontend to See
Build web apps that see. Learn how to implement powerful visual search with vector embeddings in just a few lines of code.
#1about 1 minute
Defining vision as the ability to deduce and understand
The concept of vision for websites is redefined from simply seeing to the ability to deduce, understand, and act on information.
#2about 4 minutes
Demo of a multimodal e-commerce search application
A live demonstration showcases an e-commerce store where users can search for products using both text queries and by uploading images.
#3about 2 minutes
What is multimodality in artificial intelligence?
Multimodality enables search queries to use multiple media types like text, images, and audio to capture more context and improve user interaction.
#4about 2 minutes
Why multimodal AI creates richer user experiences
Multimodal interfaces provide more natural and context-aware interactions, moving beyond simple keyword searches to a more intuitive experience.
#5about 4 minutes
Differentiating generative AI from embedding models
Embedding models encapsulate information into numerical representations (vectors), unlike generative models which create new data.
#6about 4 minutes
How vector search works by measuring distance
Vector search operates by converting a query into an embedding and finding the closest, most semantically similar items in a multidimensional space.
#7about 2 minutes
Creating a unified space for multimodal search
Different data types like text, images, and audio are processed by specific encoders and plotted into a single, unified vector space for cross-modal queries.
#8about 9 minutes
Implementing text-based image search with Weaviate
A code walkthrough demonstrates how to build a text-to-image search feature using a Next.js frontend and a Weaviate backend with a `nearText` query.
#9about 4 minutes
Implementing visual search with an image query
The code for an image-to-image search is explained, showing how a base64 image is sent to the backend to perform a `nearImage` vector search.
#10about 2 minutes
Expanding vision to other creative applications
Beyond e-commerce, multimodal vision can be applied to creative use cases like movie recommenders, educational tools, and map navigation.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
08:03 MIN
Exploring modern tools for web interaction and analysis
WeAreDevelopers LIVE - the weekly developer show with Chris Heilmann and Daniel Cranney
12:31 MIN
Discussing modern web development news and trends
WeAreDevelopers LIVE - GraalVM in action, Static Analysis insights and more
01:14 MIN
The future of on-device AI in web development
Generative AI power on the web: making web apps smarter with WebGPU and WebNN
01:57 MIN
Presenting live web scraping demos at a developer conference
Tech with Tim at WeAreDevelopers World Congress 2024
00:52 MIN
Will AI replace developers? An AI-built demo
From Syntax to Singularity: AI’s Impact on Developer Roles
01:57 MIN
The future of web development is faster and simpler
The Eternal Sunshine of the Zero Build Pipeline
10:29 MIN
Exploring the future of AI in FinTech
OpenAI for FinTech: Building a Stock Market Advisor Chatbot
02:46 MIN
A demo of client-side AI using the NPU
Privacy-first in-browser Generative AI web apps: offline-ready, future-proof, standards-based
The Best Upcoming IT WebinarsNow that you already know what IT webinars are and how they can help you level up your professional appeal, you might want actually to get into one. Live tech webinars are one of the best ways to stay on top of the latest trends and tools because eit...
Luis Minvielle
10 Developer Websites in 2023As a web developer, you're always investigating how to level up your skills and streamline your workflow. That's why we've gathered a collection of 10 innovative tools that are guaranteed to boost your productivity, enhance your coding abilities, ele...
Chris Heilmann
Dev Digest 116 - WWWAI?This time, learn how to un-AI Google's search results, what's new on the web, avoid a new security hole and go back to BASICS with us. News and ArticlesWhat a week. Google, Microsoft, OpenAI and many others had their big flagship events announcing th...
Chris Heilmann
WeAreDevelopers LIVE days are changing - get ready to take partStarting with this week's Web Dev Day edition of WeAreDevelopers LIVE Days, we changed the the way we run these online conferences. The main differences are:Shorter talks (half an hour tops)More interaction in Q&AA tips and tricks "Did you know" sect...
From learning to earning
Jobs that call for the skills explored in this talk.