DeepSeek permanently cuts flagship model API price by 75%

DeepSeek cut the API price of its flagship model by 75% permanently as of June 1, an aggressive move that reverberated through the market. The reduction continues DeepSeek's established playbook of using radical price-performance to win developer mindshare, and it lands amid a broader Chinese-lab price war that includes MiniMax's open-weights M3 and Alibaba's Qwen push.
The pricing shock was felt acutely by users: an r/DeepSeek thread titled "Pricing is crazy" drew 731 upvotes, and another, "65 million tokens for 7 dollars lol," captured the sheer cost collapse with 154 upvotes. A separate thread asking whether users "have to migrate to a new ai site again" reflected both the appeal and the churn anxiety that DeepSeek's pricing creates.
The strategic implication, flagged by Digitimes, is on AI infrastructure: if frontier-adjacent capability is available at a fraction of Western API prices, it pressures the unit economics that justify massive data-center and accelerator spend. That tension dovetails with the week's broader "tokenmaxxing" backlash, where executives warn against burning tokens indiscriminately — cheaper tokens cut both ways, lowering cost but encouraging over-consumption.
The move also competes directly on the dimension where MiniMax M3 is making noise — claiming GPT-5.5-class performance at ~10% of the cost. For developers, the upshot is that price is now a primary axis of competition among capable models, not just capability. Watch whether Western providers respond with cuts of their own and how the price war affects DeepSeek's path toward any public listing.