Skip to main content
morning wire

DeepSeek Discount, Coding Solved, Agent Visibility

EN·English
Multilingual audio is part of the $29/month plan. Start a 14-day free trial — no card charged during trial. 开始试用
0:000:00
在 X 上分享

文字稿

Saturday, May 23. Latest on the Edge of AI: DeepSeek is making its flagship discount permanent, a Reddit post claims coding is basically solved for 90% of tasks, and a new open-source devtool is trying to make agent projects visible. Let's get into it.

DeepSeek, the Chinese open-source lab, is making its 75 percent discount on V4-Pro permanent. At $0.435 per million input tokens, it's at least 11.5 times cheaper than GPT-5.5 and over 34 times cheaper on output tokens [1]. That pricing isn't a promotion. It's the new sticker price. For token-hungry agentic systems that chew through millions of tokens in a single refactor, this kind of gap squeezes Western providers hard. The signal: frontier pricing just became a two-tier market, and the cheaper tier is open-weight.

Different beat. A Reddit post on r/singularity claims coding is basically solved for the boring 90 percent of tasks [10 — unverified Reddit post]. The author says they mass refactored a 120-file FastAPI service: 400 steps, 2 million tokens, $3 total, zero human input. They used DeepSeek V4 and Tencent's Hunyuan Hy3 preview as cheap workers — 21 billion active parameters, roughly $0.18 per million input tokens, about 80 times cheaper than Opus. Tencent reports 99.99 percent step success across 495 production runs, and the author says that tracked for routine refactors. The one catch: the model confidently introduced a deadlock into an async event handler, which the author calls "genuinely funny." Which matters because: the cheap-open-weight frontier is now fast enough and reliable enough for production refactors, and the only remaining moat is the hard 10 percent that still needs a frontier reasoning model.

Now, on the devtools side. A new open-source tool called AgentLantern aims to expose the hidden execution graph of AI agent projects [5 — unverified Reddit post]. The problem: agent frameworks make it easy to create agents, tasks, and tools, but once a project grows beyond a few agents, the real execution graph gets buried across code, YAML files, and framework-specific abstractions. At runtime, logs rarely show which agent did what, which tool was called, or where the failure happened. AgentLantern is a devtool that visualizes that graph. The angle: as agentic workflows scale from demos to production, observability becomes the bottleneck, and this is the first serious open-source attempt to solve it.

Quietly. Elon Musk's xAI has gone all in on natural gas, while SpaceX is obsessed with orbital data centers [6 — TechCrunch]. TechCrunch reports Musk has given up on solar power on Earth — a reversal from the "solar-electric economy" he once promised. The read: xAI's energy strategy is now explicitly fossil-fuel-dependent, and the orbital data center play is a separate bet that doesn't solve for terrestrial carbon. The signal: the biggest private AI compute buildout is locking into natural gas, and that has regulatory and reputational implications that nobody in the xAI orbit is talking about yet.

Three labs, three bets. The open-weight pricing war is the one that changes the economics of production AI. The pattern is consistent: cost per token is dropping faster than capability is improving, and the cheap workers are getting fast enough to matter.

That is the edge for today.

来源

  1. https://the-decoder.com/deepseek-makes-its-75-percent-discount-permanent-pricing-output-tokens-at-least-34x-below-gpt-5-5/

话题关联

相关简报