Tongyi Deep Research, the Leading Open-source Deep Research Agent
-
Updated
Nov 2, 2025 - Python
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
Run Surfer-H agents powered by Holo1 using the Surfer-H-CLI. Includes example tasks, scripts, and configurations.
Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934
Web-Navigator is an agent for web browsing and scraping websites.
[NAACL2025] LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications
The Library for LLM-based multi-agent applications
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs
Opensource benchmark evaluating web operators/agents performance
Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"
Code for 🌍 UI-Simulator: LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training
Screen recording and computer interaction capture tool that records keyboard/mouse input, screen video, DOM snapshots, and accessibility trees. Perfect for creating datasets to train and evaluate computer-use AI models.
Screen recording and computer interaction capture tool that records keyboard/mouse input, screen video, DOM snapshots, and accessibility trees. Perfect for creating datasets to train and evaluate computer-use AI models.
Python scripts for generating and categorizing web browsing tasks for benchmark datasets
Neurosim is a Python framework for building, running, and evaluating AI agent systems. It provides core primitives for agent evaluation, cloud storage integration, and an LLM-as-a-judge system for automated scoring.
Evaluation system for computer-use agents that uses LLMs to assess agent performance on web browsing and interaction tasks. This judge system reads screenshots, agent trajectories, and final results to provide detailed scoring and feedback.
AI-powered Chrome side panel assistant that understands natural language and performs real actions in your browser.
A web application that summarizes the content of any public web page using advanced AI language models.
Add a description, image, and links to the web-agent topic page so that developers can more easily learn about it.
To associate your repository with the web-agent topic, visit your repo's landing page and select "manage topics."