Generative AI power on the web: making web apps smarter with WebGPU and WebNN
What if your web app could run generative AI without cloud costs or latency? Discover how WebGPU and the upcoming WebNN API make on-device AI a reality.
#1about 1 minute
Generative AI use cases and cloud provider limitations
Cloud-based AI faces challenges like required internet connectivity, data privacy risks, and high costs, creating a need for local alternatives.
#2about 13 minutes
Running large language models locally with Web LLM
Web LLM enables running multi-gigabyte language models like Llama 3 directly in the browser for offline use, despite initial download and initialization times.
#3about 2 minutes
The technology behind in-browser AI execution
In-browser AI performance is accelerated by combining WebAssembly for efficient computation and the new WebGPU API for direct access to the system's GPU.
#4about 4 minutes
Boosting performance with the upcoming WebNN API
The Web Neural Network (WebNN) API provides access to dedicated Neural Processing Units (NPUs) for even faster, more efficient on-device model inference.
#5about 6 minutes
Solving model duplication with the new Prompt API
The experimental Prompt API addresses the issue of redundant model downloads by allowing websites to access a single, shared OS-level model like Gemini Nano.
#6about 3 minutes
Using the Prompt API for on-device data extraction
A demonstration shows how the Prompt API can use a local model to accurately extract structured data from unstructured text, highlighting its practical application.
#7about 2 minutes
Generating images in the browser with WebSD
WebSD brings text-to-image generation to the browser by running Stable Diffusion models locally using WebGPU, enabling creative AI tasks without cloud dependency.
#8about 1 minute
Weighing the pros and cons of local AI models
Local AI models offer superior privacy, offline availability, and low cost, but come with trade-offs like lower quality, high system requirements, and slower performance.
#9about 1 minute
The future of on-device AI in web development
While cloud-based models are currently superior, the trend towards more compact open-source models and OS-integrated AI suggests a growing role for local AI in specialized web applications.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
01:41 MIN
Two primary approaches for browser-based AI
Prompt API & WebNN: The AI Revolution Right in Your Browser
02:08 MIN
The future of on-device AI hardware and APIs
From ML to LLM: On-device AI in the Browser
02:51 MIN
Introducing the Web Neural Network (WebNN) standard
Privacy-first in-browser Generative AI web apps: offline-ready, future-proof, standards-based
03:24 MIN
Running on-device AI in the browser with Gemini Nano
Exploring Google Gemini and Generative AI
04:04 MIN
Accelerating performance with the WebNN API
Prompt API & WebNN: The AI Revolution Right in Your Browser
04:03 MIN
Leveraging hardware like the CPU, GPU, and NPU
Privacy-first in-browser Generative AI web apps: offline-ready, future-proof, standards-based
01:55 MIN
Key benefits of running AI in the browser
From ML to LLM: On-device AI in the Browser
03:12 MIN
Exploring the future of the WebAssembly ecosystem
WebAssembly Revolution: Elevating JavaScript's Reach and Performance
How AI Will Eat The World 🤖Of generative-AI-for-everything and synthetic pleasuresRemember the web3 hype? Tech bros with easy access to cheap liquidity wanted to create a decentralised, peer-to-peer internet powered by blockchain technology. Spoiler alert, it did not work. And...
Chris Heilmann
Dev Digest 116 - WWWAI?This time, learn how to un-AI Google's search results, what's new on the web, avoid a new security hole and go back to BASICS with us. News and ArticlesWhat a week. Google, Microsoft, OpenAI and many others had their big flagship events announcing th...
Daniel Cranney
How to Use Generative AI to Accelerate Learning to CodeIt’s undeniable that generative-AI and LLMs have transformed how developers work. Hours of hunting Stack Overflow can be avoided by asking your AI-code assistant, multi-file context can be fed to the AI from inside your IDE, and applications can be b...
Daniel Cranney
Stephan Gillich - Bringing AI EverywhereIn the ever-evolving world of technology, AI continues to be the frontier for innovation and transformation. Stephan Gillich, from the AI Center of Excellence at Intel, dove into the subject in a recent session titled "Bringing AI Everywhere," sheddi...
From learning to earning
Jobs that call for the skills explored in this talk.