Web Agents Python Packages

agentlab

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

2K 598 121

clawbench-eval

Open-source benchmark for browser AI agents on daily tasks.

2K 453 26

langchain-tinyfish

LangChain integration for TinyFish Web Agent - AI-powered web automation

560 13 6

langchain-openterms

openterms sdk

534 0 0

uground-demo-test

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

514 315 19

doomarena-taubench

TauBench extensions for DoomArena

401 62 7

doomarena

A framework to test the security and robustness of AI agents

391 62 7

clawbench-harness

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

312 453 26

nail-clawbench

Open-source benchmark for browser AI agents on daily tasks.

296 453 26

openclawbench

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

294 453 26

claw-harness

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

286 453 26

tinyfish-adk

TinyFish Web Agent tools for Google Agent Development Kit (ADK)

275 13 6

clawbench-cli

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

252 453 26

doomarena-promptceptor

Promptceptor tool

215 62 7

claw-eval

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

157 453 26

realtask-bench

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

127 453 26

task-harness

Open-source benchmark for browser AI agents on daily tasks.

126 453 26

claw-ai

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

122 453 26

everyday-bench

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

119 453 26

nail-bench

Open-source benchmark for browser AI agents on daily tasks.

119 453 26

life-bench

Open-source benchmark for browser AI agents on daily tasks.

118 453 26

claw-agent

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

117 453 26

mcq-bench

ClawBench: Can AI Agents Complete Everyday Online Tasks? (alias of claw-bench)

115 453 26

nail-eval

Open-source benchmark for browser AI agents on daily tasks.

111 453 26