Have your agent record video demos of its work with shot-scraper video · Simon Willison's Weblog
Science, Technology & Innovation · Jun 30, 2026
The release was enabled not by new application logic but by Playwright fixes: removing browser chrome and startup artifacts, adding finer-grained screencast control, and lifting an 800px width cap (landed in playwright-python 1.61.0). Upstream dependency maturity, not local engineering, dictated when browser-recorded demos became polished enough for product walkthroughs.
GPT-5.5 generated a complete demo storyboard YAML by reading a branch's source changes and a command's `--help` output. This shows that well-designed CLI help can act as embedded agent instructions, and that investing in example-rich `--help` makes tools more directly usable by agents without extra orchestration.
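To make the "help text as agent instructions" idea concrete, here is a minimal sketch of how a tool's `--help` output and a branch diff might be combined into a single prompt. The function names `capture_help` and `build_prompt` and the prompt wording are assumptions for illustration; the source does not describe how the prompt was actually assembled.

```python
import subprocess


def capture_help(command: list[str]) -> str:
    """Run `<command> --help` and return whatever text it prints."""
    result = subprocess.run(
        command + ["--help"],
        capture_output=True,
        text=True,
        check=False,  # some tools exit non-zero when printing help
    )
    # Most CLIs print help to stdout; fall back to stderr if stdout is empty.
    return result.stdout or result.stderr


def build_prompt(help_text: str, diff_text: str) -> str:
    """Combine help text and source changes into one prompt string.

    The section headings and instruction wording are hypothetical.
    """
    return (
        "Write a demo storyboard for the change below.\n\n"
        "## Tool help\n" + help_text.strip() + "\n\n"
        "## Source changes\n" + diff_text.strip() + "\n"
    )
```

A caller could pass the output of `capture_help([...])` for the relevant command straight into `build_prompt`, which keeps the agent's instructions in sync with whatever the installed version of the tool documents.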
The `shot-scraper video` command turns a YAML storyboard into a reproducible, Playwright-based browser recording pipeline, letting agents produce deterministic demo videos (MP4 or WebM) of end-to-end UI interactions. A storyboard can launch servers, inject JavaScript, simulate the clipboard, click and fill elements, wait on selectors, and validate text and URLs. Code changes can then be accompanied by an executable demo script and video (example: Datasette CSV import), which improves QA and review trust.
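As a rough illustration of what "a storyboard as an executable script" means, the sketch below walks a list of steps (the shape a parsed YAML file might take) and produces a dry-run plan without touching a browser. The action names `open`, `click`, `fill`, `wait_for`, and `assert_text` are hypothetical and are not taken from the real `shot-scraper video` schema.

```python
from typing import Any


def describe_step(step: dict[str, Any]) -> str:
    """Render one storyboard step as a human-readable line.

    Action names here are illustrative assumptions, not the real schema.
    """
    action = step.get("action")
    if action == "open":
        return f"open {step['url']}"
    if action == "click":
        return f"click {step['selector']}"
    if action == "fill":
        return f"fill {step['selector']} with {step['value']!r}"
    if action == "wait_for":
        return f"wait for {step['selector']}"
    if action == "assert_text":
        return f"assert page contains {step['text']!r}"
    raise ValueError(f"unknown action: {action!r}")


def dry_run(storyboard: list[dict[str, Any]]) -> list[str]:
    """Return the ordered plan for a storyboard, failing fast on bad steps."""
    return [describe_step(step) for step in storyboard]
```

Because the steps are plain data processed in order, the same storyboard yields the same plan every time, which is the property that makes a recorded demo reviewable alongside the code change.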
A coding agent (GPT-5.5 xhigh) generated the implementation, docs, and YAML schema for the `shot-scraper video` feature. A human then used the generated documentation to spot redundancy, inconsistency, and confusion, and to steer iterative revisions; Pydantic validation made the design easier to inspect. The episode demonstrates agents' value as rapid proposal generators that speed up non-core infrastructure work, provided their outputs are pressure-tested via docs, schemas, and validation.
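The point about validation making a design inspectable can be sketched in a few lines. The example below uses standard-library dataclasses rather than Pydantic so that it runs without extra dependencies, and the `Step` fields and allowed actions are hypothetical, not the feature's real schema.

```python
from dataclasses import dataclass
from typing import Any, Optional

# Hypothetical action names; the real schema may differ.
ALLOWED_ACTIONS = {"open", "click", "fill", "wait_for"}


@dataclass(frozen=True)
class Step:
    action: str
    target: str
    value: Optional[str] = None

    def __post_init__(self) -> None:
        if self.action not in ALLOWED_ACTIONS:
            raise ValueError(f"unknown action {self.action!r}")
        if self.action == "fill" and self.value is None:
            raise ValueError("'fill' requires a value")
        if self.action != "fill" and self.value is not None:
            raise ValueError(f"{self.action!r} does not take a value")


def validate_storyboard(raw_steps: list[dict[str, Any]]) -> list[Step]:
    """Validate every step, reporting all problems with their positions."""
    steps: list[Step] = []
    errors: list[str] = []
    for index, raw in enumerate(raw_steps):
        try:
            steps.append(Step(**raw))  # unknown keys raise TypeError
        except (TypeError, ValueError) as exc:
            errors.append(f"step {index}: {exc}")
    if errors:
        raise ValueError("; ".join(errors))
    return steps
```

Writing the rules down as a schema puts every constraint in one place, so a reviewer can spot redundant or inconsistent fields by reading the model instead of tracing the implementation.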