Chronicle 54 items · updated 2026-05-03 20:27 UTC

Chronicle AI Brief, May 3, 2026

The latest in AI, clustered and ranked. Repeated hype gets pushed down so the actual signal stays up top.

Top News

In Harvard study, AI offered more accurate emergency room diagnoses than two human doctors

Harvard study finds AI outperforms ER doctors in diagnostic accuracy

A Harvard study compared AI models to emergency room doctors in diagnosing patients. At least one large language model achieved higher accuracy than human doctors in real-world cases, suggesting potential for AI in clinical decision-making.

TechCrunch AI·2026-05-03 18:00 UTC·news·0.82
Viewing 2026-05-03
Last 3 hours(10)
  1. One bash permission slipped...

    How? It kept getting chained bash commands wrong, with wrong escapes. So it created many bad directories, and tried "fixing" its mistake. It offered to run a large bash command, with rm -rf inside, and stupid me missed it. I'm glad I push everything often. But the disruption is massive. FAQ: No, I don't run this on my personal computer. It's an isolated proxmox VM for coding with LLMs. submitted by /u/TheQuantumPhysicist [link] [comments]

    r/LocalLLaMA·2026-05-03 19:12 UTC·discussion0.67(n 0.93 · t 0.50)
    Thumbnail for One bash permission slipped...
  2. What a time to be alive from 1tk/sec to 20-100tk/sec for huge models

    https://www.reddit.com/r/LocalLLaMA/comments/1eb6to7/llama_405b_q4_k_m_quantization_running_locally/ https://www.reddit.com/r/LocalLLaMA/comments/1ebbgkr/llama_31_405b_q5_k_m_running_on_amd_epyc_9374f

    r/LocalLLaMA·2026-05-03 17:46 UTC·discussion0.63(n 0.81 · t 0.50)
Earlier today(35)
  1. Cloudflare Builds High-Performance Infrastructure for Running LLMs

    Cloudflare has recently announced new infrastructure designed to run large AI language models across its global network. As these models rely on costly hardware and must handle large volumes of incomi

    InfoQ AI/ML/Data·2026-05-03 10:58 UTC·tutorial0.83(n 0.89 · t 0.78)
    Thumbnail for Cloudflare Builds High-Performance Infrastructure for Running LLMs
  2. Quoting Anthropic

    Simon Willison discusses Anthropic's statements without detailed analysis

    Simon Willison·2026-05-03 15:13 UTC·discussion0.73(n 0.80 · t 0.90)
  3. AI music is flooding streaming services — but who wants it?

    This is The Stepback, a weekly newsletter breaking down one essential story from the tech world. For more on how AI is changing music and the music industry, follow Terrence O'Brien. The Stepback arri

    The Verge AI·2026-05-03 12:00 UTC·news0.69(n 0.91 · t 0.68)
    Thumbnail for AI music is flooding streaming services — but who wants it?
  4. UAI Reviews disappeared [D]

    Report of missing UAI reviews on submissions

    r/MachineLearning·2026-05-03 15:48 UTC·discussion0.67(n 0.90 · t 0.55)
  5. A Qwen finetune, that feels VERY human

    Hello guys, So TL;DR, I was asked by multiple people to make an Assistant_Pepe_32B version, but the best base model contender was Qwen3-32B, a model that is very hard to tune on anything other than ST

    r/LocalLLaMA·2026-05-03 17:20 UTC·discussion0.66(n 0.91 · t 0.50)
  6. Thoughts on independent researcher affiliation? [D]

    Do you discount papers with independent researcher affiliation? I am between jobs and have completed a side research project not affiliated with my new upcoming role or my previous role so I cannot list either affiliation. Will listing independent researcher (solo author) with Gmail domain for the preprint discount the paper’s credibility? For context, I have published at A* venues and have prior solo author papers as well. submitted by /u/Pure-Ad9079 [link] [comments]

    r/MachineLearning·2026-05-03 09:48 UTC·discussion0.66(n 0.91 · t 0.55)
  7. RTX A5000 Pro Balckwell 48GB

    RTX A5000 Pro 48GB for training and inference

    r/LocalLLaMA·2026-05-03 13:31 UTC·discussion0.65(n 0.90 · t 0.50)
  8. Anyone submit ML articles to ACM journals (eg. TOPML or TIST)? [D]

    Have any of you submitted ML articles to ACM journals (eg. TOPML or TIST)? How long did the process take, and were the reviews high-quality? How does it compare to other journals (eg. TMLR) in terms of difficulty? Thanks. submitted by /u/random_sydneysider [link] [comments]

    r/MachineLearning·2026-05-03 02:18 UTC·discussion0.65(n 0.93 · t 0.55)
  9. Does the "6 months gap" still hold?

    Hi. It is quite a consensus that the "jump" in quality of agentic development happened sometime in December 2025, transforming from "nice to have", to actually performing. It was also long discussed that open source models lag the state of the art by 6 to 12 months. Now, does it mean that to get the equivalence of Dec 2025 frontier performance (Opus 4.5?) from Open source models, we should still wait a few months? What has your experiences been like? submitted by /u/ihatebeinganonymous [link] [comments]

    r/LocalLLaMA·2026-05-03 09:32 UTC·discussion0.65(n 0.92 · t 0.50)
  10. Open Weights Models Hall of Fame

    I read a lot of "whengguf" type posts. I think we should sometimes stop and be grateful. I want to say big thanks to all of the people and companies who gave us so much fun and productivity, sacrifici

    r/LocalLLaMA·2026-05-03 13:45 UTC·discussion0.64(n 0.88 · t 0.50)
  11. Karpathy's MicroGPT running at 50,000 tps on an FPGA

    Sure, it's only 4,192 parameters, but it's a start. Project write-up here: https://v2.talos.wtf/ and github repository here: https://github.com/Luthiraa/TALOS-V2 Some of the speed comes from having the weights onboard, rather than in external memory. Onboard ROM means with 16 bit weights current FPGAs max out at 20-30 million parameters, but maybe this and Taalas (https://taalas.com/ - similar names are unlikely a coincidence) will lead to more onboard ROM appearing in FPGAs or FPGAs dedicated to SLMs. submitted by /u/jawondo [link] [comments]

    r/LocalLLaMA·2026-05-03 01:54 UTC·discussion0.63(n 0.92 · t 0.50)
  12. GPT 5.5 just leaked its chain of thought to me in codex, and it looks like an idea from 5 months ago in this sub.

    https://www.reddit.com/r/LocalLLaMA/comments/1p0lnlo/make_your_ai_talk_like_a_caveman_and_decrease/ In the middle of a project I'm working on, I got this output from GPT 5.5-medium via codex: Implemented the narrower fix in Homm3ImportUnitPreviewModelHook.cs? Need absolute path. Need know cwd absolute. v:... Use markdown. final with path. Need avoid bogus path. Use Homm3ImportUnitPreviewModelHook.cs? Format requires /abs/path. Windows abs maybe v:.... Use angle. Final no too long. Need include uncommitted. Proceed. submitted by /u/Homeschooled316 [link] [comments]

    r/LocalLLaMA·2026-05-03 01:35 UTC·discussion0.63(n 0.91 · t 0.50)
  13. If you've been waiting to try local AI development, please try it

    I have snobbishly long felt that the local models were not 'up to my standards' for local development, or otherwise able to compete with GHCP, Claude Code, Cursor etc. Boy was I wrong. With the rapid

    r/LocalLLaMA·2026-05-03 11:00 UTC·discussion0.63(n 0.84 · t 0.50)
  14. Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_100 · Hugging Face

    I can't believe my luck! one of my next research steps was going to be on vector based model steering, and look at the gift that qwen gave us. You can learn about this here https://youtu.be/5L_tYKt2ENo submitted by /u/FaustAg [link] [comments]

    r/LocalLLaMA·2026-05-02 21:45 UTC·discussion0.60(n 0.84 · t 0.50)
    Thumbnail for Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_100 · Hugging Face
  15. CAISI releases evaluation report: DeepSeek V4 becomes the most powerful model in China, but still lags about 8 months behind the US frontier

    https://preview.redd.it/pz8qeln0auyg1.png?width=1400&format=png&auto=webp&s=00ee5218734cfae4783d702411d63e3a4c6bbc60 https://preview.redd.it/hem9mad5auyg1.png?width=1184&format=png&auto=webp&s=2a26fec2b49204e64b44a78b30902ab80f7df53c https://preview.redd.it/s0d8qkd6auyg1.png?width=1400&format=png&auto=webp&s=1db808f9749870c8a06854e555b21259473546a6 https://preview.redd.it/gp6zy6k7auyg1.png?width=1400&format=png&auto=webp&s=094023d03d424808e708a601b61f2ba0343feca6 https://www.nist.gov/news-events/news/2026/05/caisi-evaluation-deepseek-v4-pro submitted by /u/External_Mood4719 [link] [comments]

    r/LocalLLaMA·2026-05-03 03:10 UTC·discussion0.60(n 0.80 · t 0.50)
    Thumbnail for CAISI releases evaluation report: DeepSeek V4 becomes the most powerful model in China, but still lags about 8 months behind the US frontier
  16. Qwen3.6-27B vs Coder-Next

    Burned about 20 hours of side-by-side compute on my two RTX PRO 6000 Blackwells trying to get a definitive answer on which of these two models was clearly better. As with many things in life, after ma

    r/LocalLLaMA·2026-05-03 03:30 UTC·discussion0.57(n 0.71 · t 0.50)
    Thumbnail for Qwen3.6-27B vs Coder-Next
  17. Tinygrad Driver testing!

    Boutta Thrash some MoE speeds on a blackwell + m3 Ultra RDMA cluster. Theres a bit less than 2tb of ram here. I want to exchange ideas with you guys and make some cool experiments. what benches would you guys like to see? EDIT: Given all the interest on this post, I will be streaming this on the sub’s discord. Let me know what you guys want to do and I’ll add these to the list! Follow me on x @mlx_reaper submitted by /u/Street-Buyer-2428 [link] [comments]

    r/LocalLLaMA·2026-05-02 23:09 UTC·discussion0.35(n 0.00 · t 0.50)
    Thumbnail for Tinygrad Driver testing!
  18. Ban phrases on llama.cpp with this script.

    Check the README for setup instructions: https://github.com/BigStationW/llama-cpp-phrase-ban submitted by /u/Total-Resort-3120 [link] [comments]

    r/LocalLLaMA·2026-05-02 21:22 UTC·discussion0.35(n 0.00 · t 0.50)
    Thumbnail for Ban phrases on llama.cpp with this script.
Yesterday & older(9)
  1. Disneyland Now Uses Face Recognition on Visitors

    Plus: The NSA tests Anthropic’s Mythos Preview to find vulnerabilities, a Finnish teen is charged over the Scattered Spider hacking spree, and more.

    WIRED AI·2026-05-02 10:30 UTC·news0.39(n 0.00 · t 0.76)
    Thumbnail for Disneyland Now Uses Face Recognition on Visitors
You're caught upNext refresh follows the public schedule.

Previous editions

Same signal-first ranking, earlier dates.

Open archive