0. Claude Opus 4.6 (anthropic.com)
2334 points · 1016 comments · by HellsMaddy
Anthropic has launched Claude Opus 4.6, an upgraded model featuring a 1M token context window and industry-leading performance in agentic coding, finance, and reasoning. The update introduces "adaptive thinking" and "effort" controls, alongside new integrations for Excel and PowerPoint to enhance autonomous workplace productivity. [src]
The release of Claude Opus 4.6 has sparked debate over Anthropic's marketing strategy, with some users arguing the model's "bread and butter" remains coding despite attempts to appeal to a broader audience [8]. While the model demonstrates impressive long-context retrieval by identifying 49 out of 50 spells in the first four *Harry Potter* books [3], critics point out that its lead in benchmarks was almost immediately challenged by new competitors [6]. Discussion also focused on the economic viability of "agent teams" [5] and skepticism regarding the reliability of benchmarks given potential server-load fluctuations [7].
1. GPT-5.3-Codex (openai.com)
1523 points · 600 comments · by meetpateltech
OpenAI has introduced GPT-5.3-Codex, a faster and more capable agentic model designed to autonomously handle complex software engineering, research, and computer-use tasks. The model features state-of-the-art performance on industry benchmarks and was instrumental in its own development, debugging, and deployment. [src]
The release of GPT-5.3-Codex has highlighted a philosophical divide in AI development between "human-in-the-loop" collaborative steering and fully autonomous, agentic systems [0][6][7]. While some users remain skeptical of AI's ability to solve non-trivial, original problems [2] and distrust benchmark scores that don't reflect real-world experience [1][3], others are focused on the implications of "dogfooding," noting that this model was instrumental in its own creation [4][8]. This rapid pace of advancement has led to increased competition between labs [5] and growing anxiety among software engineers regarding job security [9].
2. Don't rent the cloud, own instead (blog.comma.ai)
1207 points · 498 comments · by Torq_boi
Comma.ai CTO Harald Schäfer details how the company saved an estimated $20 million by building a $5 million in-house data center, arguing that owning hardware offers better engineering incentives, lower costs, and greater self-reliance than renting cloud compute. [src]
The debate over cloud versus on-premise infrastructure centers on the trade-off between high operational costs and the significant capital expenditure and staffing risks of ownership [0][4][6]. While cloud providers are criticized for pushing inefficient, overcomplicated architectures and "managed services" that inflate bills [1][3], many argue that the cost of hiring specialized engineers to manage bare metal often exceeds the savings for all but the largest companies [2][4][9]. Consequently, a spectrum of hybrid options has emerged, such as rented bare metal or managed private clouds, which offer significant savings over AWS while mitigating the physical risks of hardware maintenance [0][5].
3. We tasked Opus 4.6 using agent teams to build a C Compiler (anthropic.com)
727 points · 723 comments · by modeless
Anthropic researchers successfully used "agent teams" of 16 parallel Claude instances to autonomously build a 100,000-line C compiler from scratch. Costing $20,000 in API fees, the Rust-based compiler can build the Linux kernel and run complex software like Doom across multiple hardware architectures. [src]
The successful creation of a 100,000-line C compiler capable of booting Linux is seen as a significant milestone that demonstrates the rapidly evolving capabilities of LLMs [0][1][7]. However, critics argue the "clean-room" claim is misleading, suggesting the model is essentially "decompressing" or plagiarizing existing compiler knowledge from its training data rather than innovating [2][6][9]. While the project highlights a massive leap in agentic performance, the resulting compiler remains less efficient than GCC, required $20,000 in API costs, and still relies on "cheats" like calling out to GCC for specific 16-bit tasks [0][1][4].
4. My AI Adoption Journey (mitchellh.com)
956 points · 397 comments · by anurag
Software developer Mitchell Hashimoto outlines his transition from AI skeptic to power user by moving beyond chatbots to background agents that handle research, triage, and routine coding tasks while he focuses on deep manual work. [src]
The discussion highlights a shift among experienced developers who, despite initial skepticism, are finding significant value in AI agents by treating them as tools for narrow, reviewable tasks rather than "drawing the owl" in one go [0][3][9]. While some argue that the endorsement of high-caliber developers should prompt skeptics to re-evaluate their stance [2][8], others express concern that the speed of "agentic coding" may bypass essential security and reliability guarantees provided by traditional line-by-line code reviews [1][4]. Success with these tools appears to rely on "harness-engineering"—maintaining a tight loop of small diffs and fast verification to prevent the AI from drifting away from project constraints [7][9].
5. Flock CEO calls Deflock a “terrorist organization” (2025) [video] (youtube.com)
671 points · 517 comments · by cdrnsf
Flock CEO calls Deflock a “terrorist organization” (2025) [video]: - YouTube
Auf YouTube findest du die angesagtesten Videos und Tracks [src]
The discussion centers on the Flock CEO’s characterization of the activist group DeFlock as a "terrorist organization" similar to Antifa, a comparison many commenters view as an authoritarian attempt to demonize anti-fascist sentiment [0][2][8]. Critics argue that while the CEO claims the service isn't "forced" on anyone, the company uses massive VC funding to bypass public consent through lobbying and "lawfare" [1][3][4]. While some debate whether there is a legal expectation of privacy in public spaces, others contend that automated mass surveillance is fundamentally different from individual observation and lacks the democratic mandate of a public referendum [3][5].
6. The time I didn't meet Jeffrey Epstein (scottaaronson.blog)
385 points · 579 comments · by pfdietz
Computer scientist Scott Aaronson clarifies that while his name appears in the "Epstein Files," he never met or contacted Jeffrey Epstein, having declined potential research funding in 2010 after his family warned him about Epstein’s criminal background. [src]
The discussion centers on how extreme wealth and power lead to corruption, with some arguing that systemic checks like taxation and term limits are necessary to prevent such concentration [0][6]. However, others contend that taxation merely shifts power to an authoritarian government [3] or that power inherently attracts corrupt individuals rather than just corrupting them [8].
A significant portion of the debate focuses on Bill Gates's character and legacy; critics view his philanthropic efforts as a tax-sheltered means of maintaining control [1][2][9], while defenders point to his massive charitable donations as evidence of genuine altruism [4]. Additionally, Jeffrey Epstein’s own writing is analyzed as a pseudo-intellectual "word salad," prompting anecdotes about similar individuals who use incoherent jargon to mimic intelligence [5].
7. OpenClaw is what Apple intelligence should have been (jakequist.com)
514 points · 415 comments · by jakequist
The open-source framework OpenClaw is driving a surge in Mac Mini sales by allowing users to run AI agents that automate computer workflows, highlighting a missed opportunity for Apple to dominate the agentic AI platform market. [src]
The emergence of OpenClaw has sparked debate over whether Apple missed a "killer app" opportunity by focusing on notification summaries rather than agentic automation [0][4]. While some argue Apple is wisely waiting for the industry to solve catastrophic security risks like prompt injection [1][5], others point out that users are already buying Mac Minis specifically to run these third-party agents [2][7][9]. Amidst this, there is significant frustration with the current state of Siri, which many find "borderline useless" due to restrictive permission hurdles [6].
8. It's 2026, Just Use Postgres (tigerdata.com)
522 points · 326 comments · by turtles3
PostgreSQL extensions now allow a single database to replace specialized tools like Elasticsearch, Pinecone, and Redis by offering native support for BM25 search, vectors, time-series, and message queues, significantly reducing architectural complexity and operational overhead for 99% of use cases. [src]
While many users praise PostgreSQL as a "miracle" for its performance and versatility [3], critics argue that the "just use Postgres" mantra ignores the high operational costs and expert "babysitting" required to scale it for specialized workloads [0][7]. There is a strong consensus that while it is an excellent default choice [4], purpose-built tools like Redis remain superior for specific data structures [9], and alternatives like SQLite or MySQL are often preferred for their simplicity and lower maintenance overhead [1][5][8]. Additionally, some participants expressed frustration with the linked article itself, labeling it as AI-generated content [2].
9. LinkedIn checks for 2953 browser extensions (github.com)
524 points · 236 comments · by mdp
LinkedIn silently probes for 2,953 Chrome extensions on every page load, a practice documented in a new GitHub repository that identifies the specific extensions being tracked. [src]
LinkedIn’s practice of scanning for nearly 3,000 browser extensions is primarily viewed as a defensive measure against data scraping and automation tools [0]. While some users defend a business's right to prevent abuse, others criticize the privacy implications and express little sympathy for a major data broker [1][3]. Technically, Firefox users appear immune to this detection because the browser uses randomized UUIDs for extension resources, whereas Chrome’s static IDs allow for easy fingerprinting [2][7][8].
Your daily Hacker News summary, brought to you by ALCAZAR. Protect what matters.