News Feed

Engineers do get promoted for writing simple code

seangoedecke.com RSS feed

seangoedecke.com RSS feed · Mar 26, 2026

The piece argues that engineers are not promoted for writing overly complex code; instead, simple, maintainable code that reliably ships features tends to advance careers because managers care about results. It explains that non-technical managers often misread complexity as difficulty, so delivering straightforward solutions builds a stronger reputation over time, while deliberately complicating work backfires. The article cautions against using perceived cleverness to justify complexity and emphasizes prioritizing first-order outcomes over second-order tricks.

datasette-files-s3 0.1a1

Simon Willison's Weblog

Simon Willison's Weblog · Mar 25, 2026

datasette-files-s3 0.1a1 introduces a backend enhancement that enables storing and retrieving files in an S3 bucket via datasette-files. The release adds a mechanism to fetch S3 configuration periodically from a URL, enabling the use of time-limited IAM credentials restricted to a bucket prefix for improved security.

Thoughts on slowing the fuck down

Simon Willison's Weblog

Simon Willison's Weblog · Mar 25, 2026

Simon Willison distills Mario Zechner's critique of agent-driven software, arguing that the urge to generate vast amounts of code with AI agents accelerates progress but erodes discipline, leading to a rapidly growing, hard-to-reason-about codebase. The piece warns that removing human bottlenecks causes tiny, seemingly harmless booboos to accumulate into cognitive debt, and it advocates slowing down by setting limits on generated code and preserving hand-written architecture and API decisions. Overall, it calls for a balance between speed and thorough, human-driven design as software systems scale.

datasette-llm 0.1a1

Simon Willison's Weblog

Simon Willison's Weblog · Mar 25, 2026

Datasette-llm 0.1a1 introduces a base plugin that exposes LLM models to other Datasette plugins and adds a register_llm_purposes() hook along with get_purposes() to list registered model purposes. This enables centralized configuration of which models handle which tasks (for example enrichment versus SQL assistance) and lets plugins request a model by purpose, while the new hook supports admin UI workflows for assigning models to purposes.

Regulation, Innovation

Federal Reserve (Speeches & Testimony)

Federal Reserve (Speeches & Testimony) · Mar 25, 2026

Randall D. Guynn's testimony outlines the Federal Reserve's approach to balancing financial-sector innovation with safety, emphasizing transparency and public feedback in supervision. He centers on AI, digital assets, and bank–fintech partnerships as the three priority areas, detailing benefits, governance needs, and risks such as explainability and privacy. The speech also describes steps to increase transparency (releasing operating principles and manuals) and notes regulatory actions on digital assets, tokenized securities, and interagency coordination to clarify rules for banks and their technology partners.

‘A List of Chain Restaurants Whose Names Contain Unusual Structures’

Daring Fireball

Daring Fireball · Mar 25, 2026

The piece reflects on a list of chain restaurant names built from unusual structural words, arguing that 'Place' is a noun rather than a true structure and thus not on the list. It also shares nostalgic anecdotes about ShowBiz Pizza Place, noting how people sometimes call it 'ShowBiz Pizza Palace' and using the memory to illustrate the quirks of naming. Overall, the article is a light, anecdotal reflection rather than data-driven analysis.

Improved Analytics in App Store Connect

Daring Fireball

Daring Fireball · Mar 25, 2026

Apple has rolled out a major update to Analytics in App Store Connect, featuring a refreshed UI focused on measuring app and game performance with a privacy-first approach and a new comprehensive support guide. The update removes the previous cross‑app aggregate view and sets three months as the default reporting period, with dashboards slated for deprecation, prompting critiques about catalog-wide visibility. Critics, including MacStories’ John Voorhees, note that cross‑app aggregation is a key loss even as Apple notes cross‑app reporting concerns and keeps shorter periods like 24 hours and seven days available, though less visible.

Why We Founded Airbase

a16z News

a16z News · Mar 25, 2026

Airbase argues that the radio frequency spectrum is a critical but finite resource that is increasingly bottlenecked by aging coordination systems and surging demand from 5G, satellites, and autonomous systems. The piece presents Airbase’s mission to transform spectrum into a software-defined, real-time, fluid asset to address constraints, underutilization, and vulnerability, noting that regulators are already using the company's tools.

LiteLLM Hack: Were You One of the 47,000?

Simon Willison's Weblog

Simon Willison's Weblog · Mar 25, 2026

The article analyzes the LiteLLM supply‑chain incident on PyPI, using the BigQuery PyPI dataset to quantify the impact. It reports 46,996 downloads of the exploited LiteLLM versions (1.82.7 and 1.82.8) during a 46‑minute window, and identifies 2,337 dependent packages, with 88% not pinning versions, indicating widespread exposure to the compromised release.

Maximize AI Infrastructure Throughput by Consolidating Underutilized GPU Workloads

NVIDIA Technical Blog

NVIDIA Technical Blog · Mar 25, 2026

In production Kubernetes environments, small AI models that require only a fraction of a GPU’s VRAM end up occupying entire GPUs due to how schedulers map models to GPUs, creating waste and lower throughput. The article highlights underutilized GPU resources when lightweight ASR or TTS models need about 10 GB of VRAM but still consume a full GPU. It argues for consolidating underutilized GPU workloads or enabling cross-model sharing to maximize AI infrastructure throughput.

How Centralized Radar Processing on NVIDIA DRIVE Enables Safer, Smarter Level 4 Autonomy

NVIDIA Technical Blog

NVIDIA Technical Blog · Mar 25, 2026

The article discusses NVIDIA DRIVE’s centralized radar processing as a path to safer, smarter Level 4 autonomy. It notes that automotive radar data is currently limited to CFAR outputs rather than raw RGB-like data and argues that existing communications and compute architectures haven’t kept pace with AI needs, advocating centralized processing as a solution.

The Truth about Venture Capitalists (in 2007)

a16z News

a16z News · Mar 25, 2026

The article revisits a 2007 essay to explain the core logic of venture capital: funds raise large pools of money to invest in high-risk startups with the aim of roughly a 10x return within a 4–6 year horizon, so only businesses with scalable leverage should seek VC funding. It argues for choosing the right partner (not just the firm), outlines the kinds of help VCs can provide beyond cash, and explains why VCs may pass on deals for reasons like lack of leverage, early timing, or team concerns. It also notes that many profitable, non-leveraged businesses may be better served by bootstrapping until they can credibly achieve high-growth exits.

Robotics Needs Fewer Roboticists*

a16z News

a16z News · Mar 25, 2026

Robotics Needs Fewer Roboticists argues that real-world robot deployment is stalled not by technology alone but by a talent and culture mismatch: the field has prioritized research prestige over reliability and customer needs, bottlenecking deployment. The author suggests we need more operators, product builders, and outsiders—people who can deploy, integrate, and operationalize robotics at scale—and fewer per-capita roboticists focused solely on R&D. He also calls for an 'application layer' that sits on top of autonomy, treats deployment as a forcing function for next-stage research, and shows how deployment-driven business models can compound model improvements.

Investing in Glimpse

a16z News

a16z News · Mar 25, 2026

Glimpse is an AI-powered back-office solution for CPG brands that automates retail deductions by ingesting claims from retailer portals, EDI, email, and PDFs into a single source of truth, then identifying disputes and executing them to recover revenue. The piece notes early, sizable impact—serving 200+ brands and helping large multinationals recover millions—while tracing the founders’ Purdue/YC origins and announcing a Series A investment from a16z. It frames deductions as a high-leverage revenue opportunity for brands.

Designing Protein Binders Using the Generative Model Proteina-Complexa

NVIDIA Technical Blog

NVIDIA Technical Blog · Mar 25, 2026

The article discusses designing protein binders using a generative model called Proteina-Complexa. It highlights the vast search space of amino acid sequences and structures, emphasizing the need to carefully optimize binder–target interactions to achieve strong, specific binding. It also points to generative modeling as a way to navigate this space and improve binder design.

Scaling Token Factory Revenue and AI Efficiency by Maximizing Performance per Watt

NVIDIA Technical Blog

NVIDIA Technical Blog · Mar 25, 2026

The article argues that power is the primary bottleneck in modern AI infrastructure and that maximizing performance per watt is essential for scaling AI factories and revenue. It frames AI data centers as token factories tightly bound to the energy ecosystem, where access to land and power drives throughput and economics. The core takeaway is to boost revenue and efficiency by optimizing energy use and intelligence per watt.

Vibe Coding XR: Accelerating AI + XR prototyping with XR Blocks and Gemini

The latest research from Google

The latest research from Google · Mar 25, 2026

Google's Vibe Coding XR is a rapid prototyping workflow that combines the XR Blocks framework with Gemini Canvas to translate natural‑language prompts into fully interactive, physics‑aware WebXR apps for Android XR, with a desktop simulated reality for quick testing. The system uses a specialized prompt and templates to generate XR experiences in under 60 seconds and supports sharing via public links, demonstrated through examples like a dandelion visualization and immersive math/chemistry tutors, with an onsite CHI 2026 demo and open access via the live demo. Preliminary VCXR60 evaluation shows an initial ~70% success rate due to XR Blocks/API issues, followed by iterative improvements across 11 releases, with Pro Mode offering the most reliable results.

Claude Can Now Take Control of Your Mac

Daring Fireball

Daring Fireball · Mar 25, 2026

Claude can now take control of your Mac, allowing the AI to perform tasks by pointing, clicking, and navigating on screen—opening files, using the browser, and running dev tools—with no setup, in the Claude Pro/Max research preview and integrated with Dispatch for mobile tasking. This marks a notable milestone, with Anthropic shipping agentic AI on macOS before Apple. The piece also critiques the Mac client as an Electron-based app and questions real-data practicality, offering a skeptical take on the feature's usefulness.

WSJ: ‘OpenAI Plans Launch of Desktop “Superapp”’

Daring Fireball

Daring Fireball · Mar 25, 2026

OpenAI plans to consolidate its ChatGPT app, the Codex coding platform, and the browser into a desktop “superapp” to simplify user experience and sharpen focus on enterprise customers. Fidji Simo will lead the product revamp with Greg Brockman assisting, as the company shifts from standalone products toward a unified platform to streamline resources and compete with Anthropic. The move represents a strategic pivot toward an integrated product ecosystem rather than a collection of separate apps.

OpenAI Is Closing Sora

Daring Fireball

Daring Fireball · Mar 25, 2026

OpenAI is shutting down the Sora app, with a note from Sora on X thanking the community and promising timelines for the app, API, and preserving users' work. The post also leaves a blunt assessment that, despite some initial fun, what was created with Sora ultimately didn’t matter and was merely an expensive lark.

iOS 26.4

Daring Fireball

Daring Fireball · Mar 25, 2026

iOS 26.4 brings a UI reorganization in the App Store: apps and purchase history are merged, with a separate App Updates section that now requires two taps to access. The MacRumors piece by Juli Clover frames the change as making updates more logical, even if the extra tap felt odd at first. The author also notes a personal preference for manually updating apps so they can read release notes and assess bug fixes and performance improvements.

Tracing Sucks

Cra.mr

Cra.mr · Mar 25, 2026

Tracing Sucks argues that distributed tracing is expensive and brittle in practice due to hard-to-propagate trace IDs, unreliable auto-instrumentation, and ballooning data costs, making full end-to-end spans impractical for many systems. The author recommends treating traces more like structured logs—relying on semantic conventions and broad trace propagation without demanding perfect caller-level accuracy to reduce instrumentation burden. Sentry’s approach is to attach trace context to all data so events can be traced via logs, effectively prioritizing log-based observability over full tracing.

Auto mode for Claude Code

Simon Willison's Weblog

Simon Willison's Weblog · Mar 24, 2026

Simon Willison reports on Claude Code’s new auto mode, a permission system where a classifier decides whether an action should run, with safeguards evaluated before execution. The piece details the default allow and soft_deny rules (e.g., read-only operations, declared dependencies, and risky Git actions) and notes the classifier runs on Claude Sonnet 4.6 with customizable filters. Willison also argues that AI-based protections remain imperfect and highlights the need for deterministic sandboxing to limit file and network access, given supply-chain and prompt-injection concerns.

Barr, Developing Communities through Public-private Partnerships

Federal Reserve (Speeches & Testimony)

Federal Reserve (Speeches & Testimony) · Mar 24, 2026

Governor Barr argues that public–private partnerships, enabled by the Community Reinvestment Act, LIHTC, and the New Markets Tax Credit, are essential for revitalizing underserved communities and expanding opportunity. He cites data showing NMTC leverages more private investment per government dollar and highlights four CRA-driven projects—Sharswood Ridge in Philadelphia, Dreambuild in the Rio Grande Valley, Appalachia Community Capital, and the Memphis Medical District Collaborative—as illustrative successes. The speech positions the Fed’s CRA implementation and broader community-development work as central to these outcomes, while also touching on a brief monetary-policy update.

Following Google’s Lead With Pixel Phones, Samsung Announces AirDrop Support With Galaxy S26 Phones

Daring Fireball

Daring Fireball · Mar 24, 2026

Samsung is rolling out AirDrop-style Quick Share on the Galaxy S26 series, starting March 23 and expanding from Korea to Europe, the Americas, and Asia. The article notes the likely use of a reverse-engineered AirDrop implementation similar to Google’s Pixel approach and discusses potential security implications, while Apple has not commented and the writer speculates on AirDrop becoming a de facto standard. The piece blends rollout logistics with light technical commentary rather than presenting new data.

Back to feed

Auto mode for Claude Code

Simon Willison's Weblog

Mar 24, 2026

3/24/2026

Claude Code Auto Mode Introduces Model-Mediated Pre-Execution Review With Classifier Safeguards

Auto mode for Claude Code · Simon Willison's Weblog

Science, Technology & Innovation · Mar 24, 2026

Claude Code’s new “auto” mode replaces explicit user permission prompts with a model-mediated pre-execution classifier (Claude Sonnet 4.6) that reviews and blocks actions that exceed task scope, touch unrecognized infrastructure, or show signs of hostile influence—creating built-in safeguards and customizable filters but shifting security reliance from deterministic policies to classifier judgment and scope/trust inference.

3/24/2026

Auto Mode Security Is Probabilistic Filtering, Not A Primary Containment

Auto mode for Claude Code · Simon Willison's Weblog

Science, Technology & Innovation · Mar 24, 2026

Auto mode uses probabilistic intent/environment classifiers that admit false negatives, so it can approve risky steps and should be treated as a filtering layer (defense-in-depth) rather than a deterministic security boundary—run agents in robust sandboxes and use model-based permissioning only as secondary control.

3/24/2026

Auto Mode Implements A Policy Taxonomy Distinguishing Allowed Project-Scoped Operations From Risky Or Destructive Actions

Auto mode for Claude Code · Simon Willison's Weblog

Science, Technology & Innovation · Mar 24, 2026

Anthropic's auto mode embeds a default, transcript-centered policy taxonomy (viewable via `claude auto-mode defaults`) that permits repo-scoped local file ops, safe read-only HTTP/API calls, and manifest-declared dependency installs, while soft-denying scope escalation (e.g., cd to ~/, /etc, other repos), irreversible/destructive actions (force-push, direct pushes to main/master, mass cloud deletions), and executing externally downloaded code—exposing a granular workflow-permissions model based on the starting-repo trust boundary that may not match enterprise trust assumptions.

3/24/2026

Auto Mode Reduces Some Dependency Risk but Leaves Broader Repository Dependency Risk, Highlighting the Need for Deterministic Dependency Controls

Auto mode for Claude Code · Simon Willison's Weblog

Science, Technology & Innovation · Mar 24, 2026

The default allow-list permits manifest-driven installs (e.g., pip install -r, npm install) when repository manifests are unchanged, blocking some agent-originated typosquatting but creating a supply-chain blind spot because it doesn't ensure dependencies are pinned, safe, or uncompromised, so evaluation of agent platforms should prioritize deterministic dependency controls (pinning, lockfile enforcement, provenance, network sandboxing) over permission classifiers alone.