Christian Liebel

Aug 20, 2024 • World Congress 2024

Generative AI power on the web: making web apps smarter with WebGPU and WebNN

What if your web app could run generative AI without cloud costs or latency? Discover how WebGPU and the upcoming WebNN API make on-device AI a reality.

#1 • about 1 minute

Generative AI use cases and cloud provider limitations

Cloud-based AI faces challenges like required internet connectivity, data privacy risks, and high costs, creating a need for local alternatives.

#2 • about 13 minutes

Running large language models locally with Web LLM

Web LLM enables running multi-gigabyte language models like Llama 3 directly in the browser for offline use, despite initial download and initialization times.
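To give a feel for the "multi-gigabyte" download, here is a back-of-the-envelope sketch. It assumes an 8-billion-parameter model quantized to 4 bits per weight and a 100 Mbit/s connection; these figures are illustrative assumptions, not numbers from the talk, and the estimate ignores tokenizer files and other overhead.

```typescript
// Rough download-size estimate for a quantized model.
// All numbers used below are illustrative assumptions, not measurements.

/** Approximate weight size in gigabytes (1 GB = 1e9 bytes). */
function estimateModelSizeGB(parameterCount: number, bitsPerWeight: number): number {
  const bytes = (parameterCount * bitsPerWeight) / 8;
  return bytes / 1e9;
}

/** Approximate download time in seconds at a given link speed in megabits per second. */
function estimateDownloadSeconds(sizeGB: number, linkMbps: number): number {
  const megabits = sizeGB * 8 * 1000; // 1 GB = 8,000 megabits
  return megabits / linkMbps;
}

const sizeGB = estimateModelSizeGB(8e9, 4); // assumed: 8 billion parameters, 4 bits each
const seconds = estimateDownloadSeconds(sizeGB, 100); // assumed: 100 Mbit/s link
console.log(`${sizeGB.toFixed(1)} GB, about ${Math.round(seconds / 60)} min to download`);
// prints "4.0 GB, about 5 min to download"
```

The download is typically a one-time cost if the weights are cached, but initialization still happens on every load.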

#3 • about 2 minutes

The technology behind in-browser AI execution

In-browser AI gets its speed from two technologies: WebAssembly for efficient computation on the CPU and the new WebGPU API for direct access to the system's GPU.
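A minimal sketch of how an app might choose between the two, assuming it falls back to WebAssembly whenever WebGPU is missing. `navigator.gpu.requestAdapter()` is the standard WebGPU entry point; the `NavigatorLike` type and the `detectBackend` helper are hypothetical names introduced here so the sketch can run outside a browser.

```typescript
// Minimal structural types so the sketch compiles without WebGPU type packages.
interface GpuLike {
  requestAdapter(): Promise<object | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

type Backend = "webgpu" | "wasm";

/**
 * Returns "webgpu" when the navigator exposes `gpu` and hands out an adapter,
 * otherwise "wasm" (CPU execution through WebAssembly).
 */
async function detectBackend(nav: NavigatorLike): Promise<Backend> {
  if (!nav.gpu) return "wasm"; // API not exposed at all
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter ? "webgpu" : "wasm"; // null means no suitable GPU was found
  } catch {
    return "wasm"; // treat any failure as "no GPU available"
  }
}
```

In a page, the global `navigator` object would be passed in; libraries such as Web LLM may perform a similar check internally.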

#4 • about 4 minutes

Boosting performance with the upcoming WebNN API

The Web Neural Network (WebNN) API provides access to dedicated Neural Processing Units (NPUs) for even faster, more efficient on-device model inference.
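The following sketch shows one way an app could prefer an NPU and fall back to other hardware. It assumes the `navigator.ml.createContext({ deviceType })` shape found in WebNN drafts; the specification is still evolving, so the option name and its accepted values should be treated as assumptions. Browsers may also treat the device type as a hint and fall back silently. `MlLike` and `pickWebNNDevice` are hypothetical names.

```typescript
// Structural stand-ins for the WebNN entry point (assumed shape, may change).
type DeviceType = "npu" | "gpu" | "cpu";
interface MlLike {
  createContext(options: { deviceType: DeviceType }): Promise<object>;
}

/** Tries device types in preference order and returns the first that yields a context. */
async function pickWebNNDevice(
  ml: MlLike | undefined,
  preference: DeviceType[] = ["npu", "gpu", "cpu"],
): Promise<DeviceType | null> {
  if (!ml) return null; // WebNN is not exposed in this environment
  for (const deviceType of preference) {
    try {
      await ml.createContext({ deviceType });
      return deviceType;
    } catch {
      // This device type is unavailable; try the next one.
    }
  }
  return null;
}
```

A result of `null` would signal that the app should use another path, such as WebGPU or WebAssembly.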

#5 • about 6 minutes

Solving model duplication with the new Prompt API

The experimental Prompt API avoids redundant model downloads by letting websites share a single built-in model, such as Gemini Nano, provided by the browser or operating system.
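A hedged sketch of calling such a shared model. The Prompt API is experimental and its surface has changed several times; the `availability()`, `create()`, and `prompt()` names below follow a recent `LanguageModel` shape and are assumptions that may not match the version shown in the talk. `LanguageModelLike` and `promptLocalModel` are hypothetical names.

```typescript
// Assumed shape of the experimental Prompt API; it may differ between browser versions.
type Availability = "unavailable" | "downloadable" | "downloading" | "available";
interface SessionLike {
  prompt(input: string): Promise<string>;
}
interface LanguageModelLike {
  availability(): Promise<Availability>;
  create(): Promise<SessionLike>;
}

/** Prompts the shared on-device model, or returns null when it is not ready to use. */
async function promptLocalModel(
  lm: LanguageModelLike | undefined,
  input: string,
): Promise<string | null> {
  if (!lm) return null; // API not exposed
  // This sketch only proceeds when the model is already on the device,
  // so it never starts a download on its own.
  if ((await lm.availability()) !== "available") return null;
  const session = await lm.create();
  return session.prompt(input);
}
```

Because the model is shared, the site ships no weights of its own; a `null` result is the cue to fall back to a cloud model or disable the feature.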

#6 • about 3 minutes

Using the Prompt API for on-device data extraction

A demonstration shows how the Prompt API can use a local model to accurately extract structured data from unstructured text, highlighting its practical application.
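The extraction step can be sketched as two small helpers: one builds the instruction sent to the model, the other recovers a JSON object from its reply. This is an illustration under assumptions, not the code from the demo; the prompt wording, the field names, and both function names are hypothetical.

```typescript
/** Builds an instruction asking the model to answer with a single JSON object. */
function buildExtractionPrompt(text: string, fields: string[]): string {
  return [
    `Extract the following fields from the text: ${fields.join(", ")}.`,
    "Answer with one JSON object only. Use null for fields that are not mentioned.",
    `Text: """${text}"""`,
  ].join("\n");
}

/** Parses the span from the first "{" to the last "}" in a reply; returns null on failure. */
function parseExtraction(reply: string): Record<string, unknown> | null {
  const start = reply.indexOf("{");
  const end = reply.lastIndexOf("}");
  if (start === -1 || end <= start) return null;
  try {
    const parsed: unknown = JSON.parse(reply.slice(start, end + 1));
    return typeof parsed === "object" && parsed !== null && !Array.isArray(parsed)
      ? (parsed as Record<string, unknown>)
      : null;
  } catch {
    return null; // the model did not return valid JSON
  }
}

// Hypothetical reply with chatter around the JSON payload.
const sampleReply = 'Sure! {"firstName": "Ada", "city": null} Hope that helps.';
console.log(parseExtraction(sampleReply)?.firstName); // prints "Ada"
```

Small local models often wrap their answer in extra text, which is why the parser tolerates content before and after the JSON object.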

#7 • about 2 minutes

Generating images in the browser with WebSD

WebSD brings text-to-image generation to the browser by running Stable Diffusion models locally using WebGPU, enabling creative AI tasks without cloud dependency.

#8 • about 1 minute

Weighing the pros and cons of local AI models

Local AI models offer better privacy, offline availability, and low running costs, but the trade-offs are lower output quality, high system requirements, and slower responses.

#9 • about 1 minute

The future of on-device AI in web development

While cloud-based models are currently superior, the trend towards more compact open-source models and OS-integrated AI suggests a growing role for local AI in specialized web applications.
