Flux
Toutes les catégories

Programmation

1557 articles

Gemma 4 audio with MLX

Gemma 4 audio with MLX

Thanks to a tip from Rahim Nathwani, here's a uv run recipe for transcribing an audio file on macOS using the 10.28 GB Gemma 4 E2B model with MLX and mlx-vlm: uv run --python 3.13 --with mlx_vlm --with torchvision --with gradio \ mlx_vlm.generate \ --model google/gemma-4-e2b-it \ --audio file.wav \ --prompt "Transcribe this audio" \ --max-tokens 500 \ --temperature 1.0 Your browser does not support the audio element. I tried it on this 14 second .wav file and it output the following: This front…

Simon Willison's Weblog
SQLite 3.53.0

SQLite 3.53.0

SQLite 3.53.0 SQLite 3.52.0 was withdrawn so this is a pretty big release with a whole lot of accumulated user-facing and internal improvements. Some that stood out to me: ALTER TABLE can now add and remove NOT NULL and CHECK constraints - I've previously used my own sqlite-utils transform() method for this. New json_array_insert() function and its jsonb equivalent. Significant improvements to CLI mode, including result formatting. The result formatting improvements come from a new library, the…

Simon Willison's Weblog
Axios Supply Chain Attack Reaches OpenAI macOS Signing Pipeline, Forces Certificate Rotation

Axios Supply Chain Attack Reaches OpenAI macOS Signing Pipeline, Forces Certificate Rotation

The recent compromise of the widely used Axios library has now been linked to downstream impact on software distribution pipelines, after OpenAI disclosed that a malicious version of the package was executed inside its macOS app signing workflow. The incident adds a new dimension to the ongoing supply chain campaign that has already targeted high-impact Node.js maintainers through a social engineering campaign attributed to North Korean actors. Malicious Axios Version Executed in CI # According…

Socket
ChatGPT voice mode is a weaker model

ChatGPT voice mode is a weaker model

I think it's non-obvious to many people that the OpenAI voice mode runs on a much older, much weaker model - it feels like the AI that you can talk to should be the smartest AI but it really isn't. If you ask ChatGPT voice mode for its knowledge cutoff date it tells you April 2024 - it's a GPT-4o era model. This thought inspired by this Andrej Karpathy tweet about the growing gap in understanding of AI capability based on the access points and domains people are using the models with: [...] It…

Simon Willison's Weblog
The PHP Podcast 2026.04.09

The PHP Podcast 2026.04.09

🎙️ The PHP Podcast April 9, 2026 | Guest Hosts: Joe Ferguson & Sara Golemon 🎙️ Guest Hosts Joe Ferguson Senior Developer at PHP Architect Running for PHP 8.6 Release Manager (hands-on position, third attempt). Working on PHP infrastructure with Derek using Ansible and Proxmox. Fixed emoji Unicode support on people.php.net. @joepferguson Sara Golemon […] The post The PHP Podcast 2026.04.09 appeared first on PHP Architect.

PHP Architect
Community Corner: I Said No With Dr Jen Fry

Community Corner: I Said No With Dr Jen Fry

In this episode, Scott talks with Dr Jen Fry about Sports Geography, Saying NO, and her keynote at https://phptek.io/ (tickets still available). Links: Our Discord – https://discord.gg/aMTxunVx Buy our shirts – https://store.phparch.com/products/community-corner-podcast-t-shirt Jen’s Links: LinkedIn – https://www.linkedin.com/in/jenfry13/ Website – https://jenfrytalks.com/ Scott’s Links: Website – https://scott.keck-warren.com/ Bluesky – https://bsky.app/profile/scottkeckwarren.bsky.social…

PHP Architect
Don't Kill the Goose That Lays the Golden Eggs

Don't Kill the Goose That Lays the Golden Eggs

March 2026 was a bad month. Back-to-back supply chain attacks with incident response teams running nonstop, real damage across ecosystems that millions of developers depend on, and legitimate questions about how we secure critical infrastructure. What it didn't have was an excuse to write an obituary for open source. In the wake of the attacks, familiar criticism has been making the rounds. It's a hot take dressed up as a reckoning. The argument goes something like this: open source is…

Socket