SeeAct
✨SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, focusing on large multimodal models (LMMs) like GPT-4V. It consists of a robust codebase for running web agents on live websites and an innovative framework that utilizes LMMs as generalist web agents.
PythonWorkflow AutomationMultimodal