Articles IA — Page 53

How fast is 10 tokens per second really?

How fast is 10 tokens per second really?

How fast is 10 tokens per second really? Neat little HTML app by Mike Veerman (source code here) which simulates LLM token output speeds from 5/second to 800/second. Useful if you see a model advertised as "30 tokens/second" and want to get a feel for what that actually looks like. Via Hacker News Tags: ai, generative-ai, llms

20 mai 2026

Google I/O, Gemini Spark, Antigravity

Google I/O, Gemini Spark, Antigravity

It's hard to find much to write about Google I/O this year because I have a policy of not writing about anything that I can't try out myself, and a lot of the big announcements are "coming soon". I actually prefer to write about things that are in general availability, because I've had instances in the past where the previews didn't match what was released to the general public later on. Aside from Gemini 3.5 Flash the most interesting announcement looks to be Google's upcoming OpenClaw…

20 mai 2026

IA

Could generative AI turn out to be the tech industry’s Vietnam? And could public backlash lead AI to a better place?

We live in interesting times

20 mai 2026

Marcus on AI

datasette-agent-charts 0.1a1

datasette-agent-charts 0.1a1

Release: datasette-agent-charts 0.1a1 More color! Bar and waffle charts without a color column are shaded by magnitude with a sequential color scheme; color columns holding text values use the observable10 categorical scheme. #2 Now checks execute-sql permission before running the query to find the column names. Charts now display interactive tooltips. Fixed a bug where waffleY charts were not described to the agent. Tags: datasette, datasette-agent

20 mai 2026

The Agent Stack Bet

IA

The Agent Stack Bet

The following article originally appeared on the Elevate newsletter and is being reposted here with the author’s permission. Peek under the hood of most “production agents” shipping today and you won’t find intelligence. You’ll find custom plumbing, fragile session logic, shared service accounts, and a security model held together by hope. This can be so […]

20 mai 2026

O'Reilly Radar — AI/ML

llm-gemini 0.32

llm-gemini 0.32

Release: llm-gemini 0.32 New model gemini-3.5-flash for Gemini 3.5 Flash. See also my notes on Gemini 3.5 Flash, and the pelican I drew using this upgrade to the plugin. Tags: gemini, llm

19 mai 2026

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Today at Google I/O, Google released Gemini 3.5 Flash. This one skipped the -preview modifier and went straight to general availability, and Google appear to be using it for a whole lot of their key products: 3.5 Flash is available today to billions of people globally: For everyone via the Gemini app and AI Mode in Google Search For developers in our agent-first development platform Google Antigravity and Gemini API in Google AI Studio and Android Studio For enterprises in Gemini Enterprise…

19 mai 2026

Don't Worry About the Vase

IA

Childhood And Education #19: Letting Kids Be Kids #2

I cannot emphasize enough the need to let kids be kids.

19 mai 2026

datasette-llm-accountant 0.1a4

datasette-llm-accountant 0.1a4

Release: datasette-llm-accountant 0.1a4 Fixed bug tracking chains of responses. Refs datasette-llm#7 Tags: llm, datasette

19 mai 2026

Daily Dose of Data Science

IA

The top Hermes integrations

... to give your agent superpowers

19 mai 2026

llm-gemini 0.32a0

llm-gemini 0.32a0

Release: llm-gemini 0.32a0 Compatible with llm>=0.32a0 alpha - adds the ability to stream reasoning tokens. Tags: gemini, llm

19 mai 2026

datasette-llm 0.1a8

datasette-llm 0.1a8

Release: datasette-llm 0.1a8 Fix for bug where llm_prompt_context() hook did not fully collect chains of responses. #7

19 mai 2026

When an Agent Deletes the Production Database

IA

When an Agent Deletes the Production Database

Another day, another example of an AI Agent “running rogue” and doing something the human operator didn’t want it to do. The tl;dr is that Jeremy (Jer) Crane, founder of PocketOS, was using Claude to perform some routine DB maintenance. Claude then proceeded to delete the production database and all backups hosted at their cloud […]

19 mai 2026

O'Reilly Radar — AI/ML

IA

AI Artifact Catalogs: Durable Standards Worth Institutional Investment

Companies everywhere are trying to leverage AI to boost internal productivity metrics. Some, like Ramp and Intercom, are succeeding. Many are failing. To make matters more complicated, the narrative around what tooling enables these gains is constantly shifting. For software engineers, auto-complete via GitHub Copilot was the bleeding-edge tool of choice in 2024. Then it […]

19 mai 2026

O'Reilly Radar — AI/ML