Chronicle AI Brief, May 3, 2026

Last 3 hours(10)

‘This is fine’ creator says AI startup stole his art

AI startup accused of using artist's work without permission in billboards

TechCrunch AI·2026-05-03 20:16 UTC·news0.83(n 0.89 · t 0.72)
In Harvard study, AI offered more accurate emergency room diagnoses than two human doctors

AI models show higher accuracy than human doctors in ER diagnoses per Harvard study

TechCrunch AI·2026-05-03 18:00 UTC·news0.82(n 0.87 · t 0.72)
Gemma 4 E2B runs surprisingly well on my 8GB Android phone, so I built a private voice notes app around it.

Gemma 4 E2B runs on 8GB Android phone with JSON output

r/LocalLLaMA·2026-05-03 18:17 UTC·model release0.78(n 0.91 · t 0.50)
BYOMesh – New LoRa mesh radio offers 100x the bandwidth

Hacker News (AI-filtered)·2026-05-03 18:03 UTC0.71(n 0.93 · t 0.65)
Help with personal MLflow project [P]

Request for MLflow project data for CLI tool

r/MachineLearning·2026-05-03 19:58 UTC·discussion0.68(n 0.91 · t 0.55)
One bash permission slipped...

How? It kept getting chained bash commands wrong, with wrong escapes. So it created many bad directories, and tried "fixing" its mistake. It offered to run a large bash command, with rm -rf inside, and stupid me missed it. I'm glad I push everything often. But the disruption is massive. FAQ: No, I don't run this on my personal computer. It's an isolated proxmox VM for coding with LLMs. submitted by /u/TheQuantumPhysicist [link] [comments]

r/LocalLLaMA·2026-05-03 19:12 UTC·discussion0.67(n 0.93 · t 0.50)
I Trained an AI to Beat Final Fight… Here’s What Happened [p]

AI trained to beat Final Fight via behavior cloning

r/MachineLearning·2026-05-03 19:45 UTC·discussion0.67(n 0.88 · t 0.55)
First time GPU buyer. Got a RTX 5000 Pro. Was it a bad decision compared to two 3090s?

RTX 5000 Pro vs 3090s GPU performance comparison

r/LocalLLaMA·2026-05-03 18:01 UTC·discussion0.67(n 0.92 · t 0.50)
What a time to be alive from 1tk/sec to 20-100tk/sec for huge models

https://www.reddit.com/r/LocalLLaMA/comments/1eb6to7/llama_405b_q4_k_m_quantization_running_locally/ https://www.reddit.com/r/LocalLLaMA/comments/1ebbgkr/llama_31_405b_q5_k_m_running_on_amd_epyc_9374f

r/LocalLLaMA·2026-05-03 17:46 UTC·discussion0.63(n 0.81 · t 0.50)
Mistral Medium 3.5 on AMD Strix Halo

Mistral Medium 3.5 performance on AMD Strix Halo

r/LocalLLaMA·2026-05-03 18:49 UTC·discussion0.60(n 0.68 · t 0.50)

Earlier today(35)

Cloudflare Builds High-Performance Infrastructure for Running LLMs

Cloudflare has recently announced new infrastructure designed to run large AI language models across its global network. As these models rely on costly hardware and must handle large volumes of incomi

InfoQ AI/ML/Data·2026-05-03 10:58 UTC·tutorial0.83(n 0.89 · t 0.78)
Evolving Deep Learning Optimizers [R]

Genetic algorithm framework for optimizer discovery

r/MachineLearning·2026-05-03 12:13 UTC·paper0.78(n 0.93 · t 0.55)
Built a efficient and fast MRI compression program called KMRI [P]

KMRI: Efficient MRI compression tool using Zstd

r/MachineLearning·2026-05-03 10:03 UTC·tool0.77(n 0.90 · t 0.55)
Local LLM Benchmark about Backend Generation by Function Calling (GLM vs Qwen vs DeepSeek)

Backend generation benchmark: GLM vs Qwen vs DeepSeek

r/LocalLLaMA·2026-05-03 13:59 UTC·discussion0.75(n 0.84 · t 0.50)
GLaDOS TTS Build Kit: Train GLaDOS Voice if You Own Portal 1 and 2

GLaDOS TTS build kit for training voice from Portal games

r/LocalLLaMA·2026-05-03 03:05 UTC·tool0.74(n 0.90 · t 0.50)
Local image generation on Mac: 10 models compared (SD 1.5 → Flux dev → Qwen-Image → Gemini)

10 image models compared on M1 Max for quality and speed

r/LocalLLaMA·2026-05-03 01:08 UTC·discussion0.73(n 0.89 · t 0.50)
Quoting Anthropic

Simon Willison discusses Anthropic's statements without detailed analysis

Simon Willison·2026-05-03 15:13 UTC·discussion0.73(n 0.80 · t 0.90)
Same prompt, different morals: how frontier AI models diverge on ethical dilemmas

A new benchmark puts leading language models through 100 everyday ethical scenarios, from data misuse in sales to protocol violations in oncology. Behind the results lies a bigger question: who decide

The Decoder·2026-05-03 07:00 UTC·news0.70(n 0.90 · t 0.74)
Microsoft caught sneaking "Co-Authored-by Copilot" into VS Code commits - even with AI off

Microsoft quietly slipped a "Co-Authored-by Copilot" line into Git commits in Visual Studio Code - even for developers who had turned off the AI features entirely. The article Microsoft caught sneakin

The Decoder·2026-05-03 09:31 UTC·news0.69(n 0.87 · t 0.74)
AI music is flooding streaming services — but who wants it?

This is The Stepback, a weekly newsletter breaking down one essential story from the tech world. For more on how AI is changing music and the music industry, follow Terrence O'Brien. The Stepback arri

The Verge AI·2026-05-03 12:00 UTC·news0.69(n 0.91 · t 0.68)
China is falling behind in the AI race, according to a US government benchmark

A US government agency says China is now eight months behind in the AI race, but independent data doesn't back that up. And while US labs keep chasing smarter models, the price edge from Deepseek and

The Decoder·2026-05-03 08:12 UTC·news0.69(n 0.86 · t 0.74)
Xiaomi's open-weight MiMo-V2.5-Pro takes aim at Claude Opus with hours-long autonomous coding

Xiaomi's new MiMo-V2.5-Pro nearly matches Anthropic's Claude Opus 4.6 on coding benchmarks while burning 40 to 60 percent fewer tokens, according to the company. The release pushes Xiaomi deeper into

The Decoder·2026-05-03 07:24 UTC·news0.68(n 0.82 · t 0.74)
UAI Reviews disappeared [D]

Report of missing UAI reviews on submissions

r/MachineLearning·2026-05-03 15:48 UTC·discussion0.67(n 0.90 · t 0.55)
OpenAI's o1 correctly diagnosed 67% of ER patients vs. 50-55% by triage doctors

Hacker News (AI-filtered)·2026-05-03 00:30 UTC0.67(n 0.91 · t 0.65)
A Qwen finetune, that feels VERY human

Hello guys, So TL;DR, I was asked by multiple people to make an Assistant_Pepe_32B version, but the best base model contender was Qwen3-32B, a model that is very hard to tune on anything other than ST

r/LocalLLaMA·2026-05-03 17:20 UTC·discussion0.66(n 0.91 · t 0.50)
Should I follow-up with the editor for a TMLR paper awaiting final decision? [D]

Query about following up on TMLR paper's final decision

r/MachineLearning·2026-05-03 10:42 UTC·discussion0.66(n 0.91 · t 0.55)
Thoughts on independent researcher affiliation? [D]

Do you discount papers with independent researcher affiliation? I am between jobs and have completed a side research project not affiliated with my new upcoming role or my previous role so I cannot list either affiliation. Will listing independent researcher (solo author) with Gmail domain for the preprint discount the paper’s credibility? For context, I have published at A* venues and have prior solo author papers as well. submitted by /u/Pure-Ad9079 [link] [comments]

r/MachineLearning·2026-05-03 09:48 UTC·discussion0.66(n 0.91 · t 0.55)
[Paper on Hummingbird+: low-cost FPGAs for LLM inference] Qwen3-30B-A3B Q4 at 18 t/s token-gen, 24GB, expected $150 mass production cost

submitted by /u/ayake_ayake [link] [comments]

r/LocalLLaMA·2026-05-03 12:55 UTC·discussion0.65(n 0.91 · t 0.50)
RTX A5000 Pro Balckwell 48GB

RTX A5000 Pro 48GB for training and inference

r/LocalLLaMA·2026-05-03 13:31 UTC·discussion0.65(n 0.90 · t 0.50)
Anyone submit ML articles to ACM journals (eg. TOPML or TIST)? [D]

Have any of you submitted ML articles to ACM journals (eg. TOPML or TIST)? How long did the process take, and were the reviews high-quality? How does it compare to other journals (eg. TMLR) in terms of difficulty? Thanks. submitted by /u/random_sydneysider [link] [comments]

r/MachineLearning·2026-05-03 02:18 UTC·discussion0.65(n 0.93 · t 0.55)
Does the "6 months gap" still hold?

Hi. It is quite a consensus that the "jump" in quality of agentic development happened sometime in December 2025, transforming from "nice to have", to actually performing. It was also long discussed that open source models lag the state of the art by 6 to 12 months. Now, does it mean that to get the equivalence of Dec 2025 frontier performance (Opus 4.5?) from Open source models, we should still wait a few months? What has your experiences been like? submitted by /u/ihatebeinganonymous [link] [comments]

r/LocalLLaMA·2026-05-03 09:32 UTC·discussion0.65(n 0.92 · t 0.50)
Open Weights Models Hall of Fame

I read a lot of "whengguf" type posts. I think we should sometimes stop and be grateful. I want to say big thanks to all of the people and companies who gave us so much fun and productivity, sacrifici

r/LocalLLaMA·2026-05-03 13:45 UTC·discussion0.64(n 0.88 · t 0.50)
Built a Voice Agents from Scratch GitHub tutorial: mic > Whisper > local LLM (GGUF) > Kokoro > speaker, fully local, no API keys

Been building this for a while and finally cleaned it up enough to share. voice-agents-from-scratch is a numbered, chapter-by-chapter repo that walks the full real-time pipeline: Microphone capture Wh

r/LocalLLaMA·2026-05-03 16:06 UTC·discussion0.64(n 0.85 · t 0.50)
3xR9700 for semi-autonomous research and development - looking for setup/config ideas.

3xR9700 setup for semi-autonomous AI research

r/LocalLLaMA·2026-05-03 13:17 UTC·discussion0.64(n 0.85 · t 0.50)
Karpathy's MicroGPT running at 50,000 tps on an FPGA

Sure, it's only 4,192 parameters, but it's a start. Project write-up here: https://v2.talos.wtf/ and github repository here: https://github.com/Luthiraa/TALOS-V2 Some of the speed comes from having the weights onboard, rather than in external memory. Onboard ROM means with 16 bit weights current FPGAs max out at 20-30 million parameters, but maybe this and Taalas (https://taalas.com/ - similar names are unlikely a coincidence) will lead to more onboard ROM appearing in FPGAs or FPGAs dedicated to SLMs. submitted by /u/jawondo [link] [comments]

r/LocalLLaMA·2026-05-03 01:54 UTC·discussion0.63(n 0.92 · t 0.50)
GPT 5.5 just leaked its chain of thought to me in codex, and it looks like an idea from 5 months ago in this sub.

https://www.reddit.com/r/LocalLLaMA/comments/1p0lnlo/make_your_ai_talk_like_a_caveman_and_decrease/ In the middle of a project I'm working on, I got this output from GPT 5.5-medium via codex: Implemented the narrower fix in Homm3ImportUnitPreviewModelHook.cs? Need absolute path. Need know cwd absolute. v:... Use markdown. final with path. Need avoid bogus path. Use Homm3ImportUnitPreviewModelHook.cs? Format requires /abs/path. Windows abs maybe v:.... Use angle. Final no too long. Need include uncommitted. Proceed. submitted by /u/Homeschooled316 [link] [comments]

r/LocalLLaMA·2026-05-03 01:35 UTC·discussion0.63(n 0.91 · t 0.50)
If you've been waiting to try local AI development, please try it

I have snobbishly long felt that the local models were not 'up to my standards' for local development, or otherwise able to compete with GHCP, Claude Code, Cursor etc. Boy was I wrong. With the rapid

r/LocalLLaMA·2026-05-03 11:00 UTC·discussion0.63(n 0.84 · t 0.50)
Qwen3.6-27B vs 35B, I prefer 35B but more people here post about 27B...

I've had better results quality wise with 35B AND it's much faster than 27B. Just curious cause I see lots of people post about 27B. Am I doing something wrong with 27B? Use cases are multi-stage pipe

r/LocalLLaMA·2026-05-02 23:51 UTC·discussion0.60(n 0.84 · t 0.50)
Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_100 · Hugging Face

I can't believe my luck! one of my next research steps was going to be on vector based model steering, and look at the gift that qwen gave us. You can learn about this here https://youtu.be/5L_tYKt2ENo submitted by /u/FaustAg [link] [comments]

r/LocalLLaMA·2026-05-02 21:45 UTC·discussion0.60(n 0.84 · t 0.50)
CAISI releases evaluation report: DeepSeek V4 becomes the most powerful model in China, but still lags about 8 months behind the US frontier

https://preview.redd.it/pz8qeln0auyg1.png?width=1400&format=png&auto=webp&s=00ee5218734cfae4783d702411d63e3a4c6bbc60 https://preview.redd.it/hem9mad5auyg1.png?width=1184&format=png&auto=webp&s=2a26fec2b49204e64b44a78b30902ab80f7df53c https://preview.redd.it/s0d8qkd6auyg1.png?width=1400&format=png&auto=webp&s=1db808f9749870c8a06854e555b21259473546a6 https://preview.redd.it/gp6zy6k7auyg1.png?width=1400&format=png&auto=webp&s=094023d03d424808e708a601b61f2ba0343feca6 https://www.nist.gov/news-events/news/2026/05/caisi-evaluation-deepseek-v4-pro submitted by /u/External_Mood4719 [link] [comments]

r/LocalLLaMA·2026-05-03 03:10 UTC·discussion0.60(n 0.80 · t 0.50)
Qwen3.6-27B vs Coder-Next

Burned about 20 hours of side-by-side compute on my two RTX PRO 6000 Blackwells trying to get a definitive answer on which of these two models was clearly better. As with many things in life, after ma

r/LocalLLaMA·2026-05-03 03:30 UTC·discussion0.57(n 0.71 · t 0.50)
AI-generated actors and scripts are now ineligible for Oscars

Oscars ban AI-generated actors and scripts from eligibility

TechCrunch AI·2026-05-02 21:54 UTC·news0.40(n 0.00 · t 0.72)
Toy experiment: frozen Pythia-70M can use a forward-derived fast memory for contextual one-shot symbolic recall [D]

Toy experiment on Pythia-70M memory for recall

r/MachineLearning·2026-05-02 22:32 UTC·discussion0.36(n 0.00 · t 0.55)
Tinygrad Driver testing!

Boutta Thrash some MoE speeds on a blackwell + m3 Ultra RDMA cluster. Theres a bit less than 2tb of ram here. I want to exchange ideas with you guys and make some cool experiments. what benches would you guys like to see? EDIT: Given all the interest on this post, I will be streaming this on the sub’s discord. Let me know what you guys want to do and I’ll add these to the list! Follow me on x @mlx_reaper submitted by /u/Street-Buyer-2428 [link] [comments]

r/LocalLLaMA·2026-05-02 23:09 UTC·discussion0.35(n 0.00 · t 0.50)
Ban phrases on llama.cpp with this script.

Check the README for setup instructions: https://github.com/BigStationW/llama-cpp-phrase-ban submitted by /u/Total-Resort-3120 [link] [comments]

r/LocalLLaMA·2026-05-02 21:22 UTC·discussion0.35(n 0.00 · t 0.50)

Yesterday & older(9)

Most Companies Aren't Anywhere Near Ready for AI

Companies struggle to define AI needs, leading to implementation challenges

Daniel Miessler·2026-05-02 19:00 UTC·opinion0.41(n 0.00 · t 0.78)
fabrica - A terminal-based minimal coding agent harness

Comments

Lobsters (AI tag)·2026-05-02 19:49 UTC0.40(n 0.00 · t 0.70)
The best AI dictation apps, tested and ranked

Review of top AI dictation apps for email, notes, and coding

TechCrunch AI·2026-05-02 16:00 UTC·tutorial0.39(n 0.00 · t 0.72)
Even the latest AI models make three systematic reasoning errors, ARC-AGI-3 analysis shows

The ARC Prize Foundation analyzed 160 game runs of OpenAI's GPT-5.5 and Anthropic's Opus 4.7 on the ARC-AGI-3 benchmark. Three systematic error patterns explain why both models stay below 1 percent on

The Decoder·2026-05-02 13:31 UTC·news0.39(n 0.00 · t 0.74)
Disneyland Now Uses Face Recognition on Visitors

Plus: The NSA tests Anthropic’s Mythos Preview to find vulnerabilities, a Finnish teen is charged over the Scattered Spider hacking spree, and more.

WIRED AI·2026-05-02 10:30 UTC·news0.39(n 0.00 · t 0.76)
xAI's new Custom Voices feature turns a minute of speech into a usable voice clone

xAI now lets developers clone their own voices for AI applications. The new "Custom Voices" feature builds on the recently launched Grok Speech-to-Text and Text-to-Speech APIs. The article xAI's new C

The Decoder·2026-05-02 12:14 UTC·news0.39(n 0.00 · t 0.74)
Nvidia CEO Jensen Huang calls out tech leaders' "god complex" over reckless AI job loss predictions

AI scaremongering costs jobs instead of protecting them, says Nvidia CEO Jensen Huang. Talking young people out of future careers, he argues, does real harm to society. The article Nvidia CEO Jensen H

The Decoder·2026-05-02 10:32 UTC·news0.39(n 0.00 · t 0.74)
Elon Musk calls himself a fool for giving OpenAI $38 million that became an $800 billion company

Elon Musk called himself a "fool" in court, warned of a "Terminator" future, and admitted that xAI taps OpenAI's models for its own AI training. Week one of Musk's trial against Sam Altman delivered p

The Decoder·2026-05-02 09:27 UTC·news0.38(n 0.00 · t 0.74)
ChatGPT now tracks users for ads by default as OpenAI looks for new revenue

OpenAI has turned on marketing cookies by default for free ChatGPT users in countries where ads are running. Tracking is automatically active for free accounts but not for paying subscribers. You can

The Decoder·2026-05-02 08:30 UTC·news0.38(n 0.00 · t 0.74)

You're caught upNext refresh follows the public schedule.

Chronicle AI Brief, May 3, 2026

Previous editions