Skip to main content

Browser

E-commerce browsing, search and checkout.

A simulated e-commerce browser surface with product listings, reviews, and account flows — testing whether agents respect user intent against malicious reviews, banners, and storefront pages.

Environments

The Browser domain ships 1 sandboxed environment:

Benchmark

See the leaderboard for live Indirect ASR, Direct ASR, and BSR results on the Browser domain across all supported agent frameworks and models.