0. I’ve joined Anthropic (twitter.com)
1426 points · 616 comments · by dmarcos
Andrej Karpathy, a prominent AI researcher and former founding member of OpenAI, has announced that he is joining the AI safety and research company Anthropic. [src]
The discussion regarding Andrej Karpathy joining Anthropic is divided between those who see him as a top-tier talent who strengthens the pre-training team [1][9] and skeptics who view the move as a "celebrity hire" or marketing stunt intended to boost IPO value [2][3][8]. While some users criticize his frequent job-hopping, others point out that his five-year tenure at Tesla is significant for the tech industry [0][6]. There is also a broader debate about Anthropic’s trajectory, with some users praising their safety-conscious culture [1] while others fear the company is becoming an "industry tornado" that prioritizes hype over product merit [7][8].
1. Google changes its search box (blog.google)
698 points · 931 comments · by berkeleyjunk
Google is redesigning its iconic search box to integrate Gemini AI, shifting the platform from a list of links toward a conversational interface that provides direct answers and synthesized information. [src]
The integration of AI into Google Search has sparked significant concern regarding "Google Zero," a scenario where the search engine ceases to drive traffic to external websites, leading site owners to question the value of allowing crawlers at all [0][2]. Users report frequent inaccuracies and "bullshit answers" that complicate professional work and potentially endanger users seeking medical or financial advice [3][6][8]. While some have already shifted their habits toward LLMs or alternative search engines, there is a strong consensus that these AI summaries often present "random stuff" as ground truth while lacking the essential primary sources required for factual reliability [1][4][5][9].
2. Gemini 3.5 Flash (blog.google)
959 points · 655 comments · by spectraldrift
Google has introduced Gemini 3.5 Flash, a high-speed, cost-efficient AI model designed for low-latency performance and high-volume tasks. [src]
The release of Gemini 3.5 Flash has sparked significant concern over its pricing, which represents a 3x to 6x increase over previous Flash models and positions it closer to the cost of older "Pro" versions [0][7]. While some users praise Google’s focus on optimizing smaller models [6], others argue that these rising costs make AI increasingly inaccessible to individuals and suggest that serving LLMs profitably remains a major challenge [2][4][9]. Early testing shows mixed results: the model demonstrates impressive reasoning capabilities for complex SVG generation, yet it can still struggle with anatomical logic in images and carries a high per-request cost for long outputs [1][3].
3. The last six months in LLMs in five minutes (simonwillison.net)
795 points · 587 comments · by yakkomajuri
At PyCon US 2026, Simon Willison summarized the previous six months of LLM progress, highlighting a November 2025 inflection point where coding agents became reliable daily tools and the rise of "Claws"—personal AI assistants—driven by powerful new open-weight models from Google, GLM, and Alibaba. [src]
The recent "inflection point" in LLM capabilities has sparked a polarized debate between users who find agents capable of high-quality, professional-grade work and those who view them as overhyped tools prone to errors [0][1][2]. While some developers claim to have transitioned entirely to AI-driven coding for professional tasks, critics argue that the output often lacks coherence and requires significant "babysitting" to reach production standards [1][3][4]. This divide is exacerbated by disagreements over whether the technology is truly revolutionary or if the perceived progress is largely a result of effective marketing [0][2][9].
4. I’ve built a virtual museum with nearly every operating system you can think of (virtualosmuseum.org)
964 points · 223 comments · by andreww591
The Virtual OS Museum is a downloadable Linux-based virtual machine featuring over 570 pre-configured operating systems and 250 platforms, allowing users to explore the history of computing from 1948 to the present through a custom, snapshot-enabled launcher. [src]
The virtual museum sparked deep nostalgia for niche interfaces, such as the unique "pad" input system of Domain/OS [1] and the "paper folder" desktop environment found on early Compaq Windows 3.1 machines [2][3]. While users praised the collection's breadth, some noted the absence of historically significant systems like Novell Netware, the Pick operating system, and the now-lost pre-Domain/OS AEGIS [6][7][9]. Due to the massive 120GB file size and slow server speeds, several commenters are actively attempting to create a torrent to facilitate easier access for the community [4][5].
5. Apple unveils new accessibility features (apple.com)
725 points · 381 comments · by interpol_p
Apple has unveiled new accessibility updates powered by Apple Intelligence, including natural language voice control, AI-generated video subtitles, and a feature allowing Apple Vision Pro users to control power wheelchairs using eye-tracking technology. [src]
Apple’s new accessibility features are viewed as a strategic "stealth test" for agentic AI, following a pattern where the company debuts advanced tech in niche tools before a broader rollout [1][4]. While users praise Apple's leadership in accessibility, there is significant criticism regarding their lagging speech-to-text and autocorrect capabilities, which some feel have regressed over the last decade [2]. The discussion also highlights the "unimaginable" speeds at which blind users process audio, noting that sighted people often require practice just to tolerate 1.5x or 2.0x speeds [0][3][5][7].
6. Minnesota becomes first state to ban prediction markets (npr.org)
785 points · 245 comments · by ortusdux
Minnesota Governor Tim Walz signed a law making it a felony to host or advertise prediction markets, prompting a federal lawsuit from the Trump administration which argues the industry should be regulated by the Commodity Futures Trading Commission rather than individual states. [src]
Minnesota’s ban on prediction markets is seen as legally stronger than potential bans in other states because Minnesota also prohibits sports betting, avoiding contradictions regarding the morality or mechanics of gambling [0]. However, commenters debate whether federal CFTC regulations on commodities futures might preempt state law, or if prediction markets are fundamentally different from sportsbooks because they function as peer-to-peer exchanges rather than "house" models [2][3][4]. While some argue these markets are a "scourge" prone to insider trading and harmful real-world incentives, others contend they are functionally similar to the stock market or traditional trading exchanges [1][5][8].
7. Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks (github.com)
681 points · 251 comments · by zambelli
Forge is an open-source reliability layer that uses multi-layer guardrails to boost the accuracy of local 8B models on agentic tasks from 53% to 99%, allowing small models to outperform frontier APIs like Claude Sonnet in multi-step workflows. [src]
The discussion centers on how "guardrail" frameworks like Forge improve agentic performance by enforcing tool-call correctness and handling common failure modes, such as misinterpreting empty search results as tool errors [0][9]. While these harnesses allow small local models to rival frontier models on specific tasks, users noted that "effective attention" remains a bottleneck; larger models like Claude Opus still handle long-horizon tasks and massive context windows more reliably [4][6]. There is a strong consensus that managing message history—specifically through "compaction" or summarizing old tool responses—is essential for preventing context drift in extended agent sessions [0][4][6]. Additionally, developers are increasingly viewing these control layers as "LLM middleware" that treats the infrastructure around the model as a first-class priority [1][7].
8. Tesla's lithium refinery discharges 231,000 gallons of polluted wastewater a day (autonocion.com)
497 points · 243 comments · by atombender
An independent lab report found carcinogens and heavy metals in wastewater discharged from Tesla’s Texas lithium refinery, sparking a dispute with a local drainage district after the facility was marketed as an "acid-free clean process" that would only produce sand and limestone byproducts. [src]
While Tesla maintains the discharge is fully permitted and legal [1][6], critics point out that the lab report found hexavalent chromium and arsenic, neither of which are listed as allowable pollutants in the permit [9]. Some users argue the levels are negligible, noting that arsenic levels were below federal drinking water standards and chromium was barely above reporting limits [0][6]. However, others express concern over the long-term bioaccumulation of these toxins [2] and debate whether the sampling methodology—conducted downstream rather than at the outfall—accurately reflects Tesla's specific contribution to the ditch's pollution [1].
9. Show HN: Gaussian Splat of a Strawberry (superspl.at)
528 points · 200 comments · by danybittel
A developer shared a 3D Gaussian Splat reconstruction of a strawberry, providing images of the physical camera and lighting setup used to capture the scene. [src]
Gaussian splatting is praised for its high performance on mobile devices [3] and its "dreamy" aesthetic, where detail degrades into a blurry, impressionistic style rather than a hard cutoff [4]. While users are impressed by the visual fidelity of these 3D reconstructions [1][2], some note technical limitations such as missing geometry on the underside of objects [7]. A major point of debate is whether splats can eventually support dynamic animation like traditional polygon meshes [5][6], or if they will be superseded by AI-driven generation and Neural Radiance Fields (NeRFs) [0][8][9].
Brought to you by ALCAZAR. Protect what matters.