Web Agent
Web Agent is the entry point through which agents access data and execute actions on the open web. It turns messy, real-world webpages into environments that agents can search, extract from, clean, and operate.
Core problem
The real web is not agent-friendly:
- HTML contains noise and inconsistent structure.
- Dynamic rendering, PDFs, tables, and media are hard to process reliably.
- Search results need ranking, deduplication, summarization, and source validation.
- Browser actions need isolation and state tracking.
Web Agent makes the web consumable and executable for agents.
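As an illustration of the search-result cleanup mentioned in the list above, here is a minimal sketch of deduplication and ranking. It is not Web Agent's implementation: the `SearchHit` type, the `score` field, and the URL normalization rule are all assumptions made for this example.

```python
from dataclasses import dataclass
from urllib.parse import urlsplit, urlunsplit


@dataclass(frozen=True)
class SearchHit:
    """Hypothetical search result; not a Web Agent type."""
    url: str
    title: str
    score: float  # assumed relevance score from the upstream source; higher is better


def normalize_url(url: str) -> str:
    """Lowercase scheme and host, drop the fragment and any trailing slash."""
    parts = urlsplit(url)
    path = parts.path.rstrip("/")
    return urlunsplit((parts.scheme.lower(), parts.netloc.lower(), path, parts.query, ""))


def dedupe_and_rank(hits: list[SearchHit]) -> list[SearchHit]:
    """Keep the best-scoring hit per normalized URL, then sort by score, descending."""
    best: dict[str, SearchHit] = {}
    for hit in hits:
        key = normalize_url(hit.url)
        if key not in best or hit.score > best[key].score:
            best[key] = hit
    return sorted(best.values(), key=lambda h: h.score, reverse=True)
```

A production pipeline would likely also compare page content, since the same article often appears under different URLs; this sketch only catches URL-level duplicates.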
API families
| API family | Role |
|---|---|
| Search / SelectAll | Multi-source search, broad data discovery, ranking, and summaries |
| Textify | Converts complex webpages, PDFs, and dynamic content into LLM-friendly Markdown/text |
| Do / Track Anything | Lets agents click, type, scroll, and monitor page changes |
| Sandboxed Execution | Runs browser work in isolated environments to reduce security and contamination risks |
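To make the Textify row more concrete, the following sketch shows the simplest form of the idea: stripping non-visible markup from HTML and keeping readable text. It uses only Python's standard `html.parser` module and is an assumption-laden illustration, not the Textify API; the names `TextExtractor` and `html_to_text` are hypothetical, and it does not handle PDFs, dynamic rendering, or Markdown output.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and skips script/style/noscript content."""

    SKIPPED = {"script", "style", "noscript"}

    def __init__(self) -> None:
        super().__init__()
        self._skip_depth = 0          # > 0 while inside a skipped element
        self._chunks: list[str] = []  # one entry per non-empty text node

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            text = " ".join(data.split())  # collapse runs of whitespace
            if text:
                self._chunks.append(text)

    def text(self) -> str:
        # Each text node becomes its own line; inline elements are not merged.
        return "\n".join(self._chunks)


def html_to_text(html: str) -> str:
    """Return the visible text of an HTML document, one text node per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

Real pages need far more than this (boilerplate removal, table handling, rendered JavaScript), which is the gap the Textify family is described as covering.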
Differentiation
Web Agent is more than a crawler. It integrates with other SAK modules:
- GUM enables profile-aware query rewriting.
- GenAuth controls access to protected resources.
- Source and execution traces preserve the search, extraction, and action process.
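One way to picture the source and execution traces mentioned above is as an append-only log of steps. The sketch below is purely illustrative: the `Trace` and `TraceStep` types, their fields, and the step kinds are assumptions made for this example, not Web Agent's actual trace format.

```python
import json
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone
from typing import Optional


@dataclass
class TraceStep:
    """Hypothetical trace entry; field names are assumptions."""
    kind: str                         # e.g. "search", "extract", or "action"
    detail: dict                      # free-form data describing the step
    source_url: Optional[str] = None  # page the step read from or acted on
    at: str = field(default_factory=lambda: datetime.now(timezone.utc).isoformat())


@dataclass
class Trace:
    """Append-only record of what an agent did for one task."""
    task: str
    steps: list[TraceStep] = field(default_factory=list)

    def record(self, kind: str, detail: dict, source_url: Optional[str] = None) -> TraceStep:
        step = TraceStep(kind=kind, detail=detail, source_url=source_url)
        self.steps.append(step)
        return step

    def sources(self) -> list[str]:
        """Unique source URLs in first-seen order."""
        seen: dict[str, None] = {}
        for step in self.steps:
            if step.source_url:
                seen.setdefault(step.source_url, None)
        return list(seen)

    def to_json(self) -> str:
        return json.dumps(asdict(self), indent=2)


# Usage: log a search, an extraction, and a browser action for one task.
trace = Trace(task="compare pricing pages")
trace.record("search", {"query": "example pricing"})
trace.record("extract", {"format": "markdown"}, source_url="https://example.com/pricing")
trace.record("action", {"op": "click", "target": "#plans"}, source_url="https://example.com/pricing")
```

Keeping the source URL on every step is what lets a reviewer trace a final answer back to the pages it came from.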
Best fit
Web Agent fits teams that need live web data, deep research, vertical data extraction, dynamic webpage handling, and agentic search.