Why your local AI app feels slow (and it’s not your GPU)

You open the app, write a query, and hit enter. Nothing happens for a second, and then the first output appears. After that, it starts streaming smoothly. When you look at the metrics, you observe that the GPU isn’t pinned at 100%, tokens per second are healthy, and your local AI model runs without breaking a sweat.

Onufriy Likarchuk
Ukraine

Wear OS 7 will keep track of deliveries and sports scores on your wrist

A Pixel Watch 4 running Wear OS 6, not Wear OS 7. | Photo by Amelia Holowaty Krales / The Verge Amid the flurry of today's Google I/O announcements, Google shared details about Wear OS 7, the next major update to its smartwatch platform. To help you keep track of...

Sophia Wilson
Atlanta

Lossless Scaling is the only Steam Deck plugin worth your time, and here’s why

It's no secret that I love my Steam Deck, and one of the things that makes it better is the plugins supplied by Decky Loader. It's even better on more powerful hardware like the ROG Ally X if you install SteamOS, because it gives you more computing grunt to run...

Emily Brown
Houston

VS Code’s terminal isn’t just convenient — it’s the only editor that remembers your workflow

I have been trying to move away from VS Code and its forks, like Cursor and Antigravity, for the same reasons many developers are experimenting with newer editors. I switched to a leaner editor like Zed because it felt lighter, cleaner, and noticeably faster once I spent a few hours...

Daniel Martinez
Dallas

Demis Hassabis said this might be the ‘foothills of the singularity.’ What?

Welcome to a "profound moment for humanity," according to Google DeepMind CEO Demis Hassabis, who closed out Google I/O's keynote presentation on Tuesday, saying: Google's cutting-edge research and products will help unlock AGI's incredible potential for the benefit of the entire world. When we look back at this time, I...

Daniel Martinez
Dallas

I almost bought a used Nvidia Tesla GPU for my home lab, then I read what owners actually deal with

A used Tesla GPU listing on a used marketplace is one of the most tempting things for home labbers shopping around. 24 GB of VRAM on a card that originally sold for over $5,000, now hovering around the $300 mark on eBay. For anyone trying to run local LLMs or...

Anita Trost
Germany

Plex is tripling the price of a lifetime pass to $750 after doubling it last year

I am dying to know how much money Plex is about to make the next six weeks charging people to stream their own video from their own homes. Today, it's giving every prospective customer until July 1st to lock in a lifetime subscription at today's rates - before it triples...

Sophia Wilson
Atlanta

I stopped chasing Ultra and my mid-range GPU became a powerhouse

In the tech community right now, it feels like there's a massive wave of ultra-settings fatigue. Between unoptimized PC ports, VRAM anxieties, and the sheer cost of flagship cards as well as other PC components, users are begging for someone to tell them it's okay to slide that graphics preset...

Sophia Wilson
Atlanta

Nintendo’s $500 Switch 2 bundle includes a game, and it’s available now

Mario Kart World is one of three games you can choose from in the bundle. | Photo by Amelia Holowaty Krales / The Verge Nintendo recently teased the “Choose Your Game” Switch 2 console and digital game bundle, coming in early June. However, multiple retailers (including Nintendo itself) are already...

Cynthia Delgado
Mexico

I ignored NotebookLM Tools tags for too long, and now I cannot work without them

I had 19 NotebookLM notebooks on my dashboard, and some of their titles didn’t tell me exactly what was in them. Some notebooks were abandoned, while others remained active. I’d try to spot a notebook by its icon, but that didn’t always work either, since some were random ones I’d...

Jane Smith
Los Angeles

We react to Google I/O 2026

What better way to unwind from a two-hour keynote presentation than to pore over the weirdest and wildest details, from a Gmail bot you can converse with to DeepMind's leader saying the singularity is near. The Vergecast went live right after the show, with senior AI reporter Hayden Field joining...

Sophia Wilson
Atlanta

Google’s Gemini Omni can generate “anything from any input,” including video

Among a flurry of AI-related announcements at I/O 2026, including more Search AI integration, Gemini 3.5, a new Gemini Spark AI personal assistant, Google revealed Gemini Omni, an AI model that can "create anything from any input — starting with video." The platform's first model, Gemini Omni Flash, is rolling...

Michael Johnson
Chicago

The future of Google is a search box that does everything

Last year, after watching Google's I/O keynote, I wrote that it felt like Google's future was Google googling. After watching this year's I/O keynote on Tuesday, I don't think Google just wants to google for you - I think it wants to do everything for you, all from a search...

William Garcia
Boston

Previous Story Next Story

Language

Why your local AI app feels slow (and it’s not your GPU)

Wear OS 7 will keep track of deliveries and sports scores on your wrist

Lossless Scaling is the only Steam Deck plugin worth your time, and here’s why

VS Code’s terminal isn’t just convenient — it’s the only editor that remembers your workflow

Demis Hassabis said this might be the ‘foothills of the singularity.’ What?

I almost bought a used Nvidia Tesla GPU for my home lab, then I read what owners actually deal with

Plex is tripling the price of a lifetime pass to $750 after doubling it last year

I stopped chasing Ultra and my mid-range GPU became a powerhouse

Nintendo’s $500 Switch 2 bundle includes a game, and it’s available now

I ignored NotebookLM Tools tags for too long, and now I cannot work without them

We react to Google I/O 2026

Google’s Gemini Omni can generate “anything from any input,” including video

The future of Google is a search box that does everything

Express your creativity and start building your project