DISCOVER THE FUTURE OF AI AGENTSarrow_forward

All Projects

3 projects

DeepVideoDiscovery

A video content discovery tool developed by Microsoft that uses deep learning technology to automatically identify and extract key content from videos, helping users efficiently browse and understand video information。

PythonPyTorchMultimodal

LLaVA-Plus

LLaVA-Plus is a multimodal assistant system that learns to use tools, combining large language models with visual capabilities to enable AI agents to perform general vision tasks.

PythonPyTorchMultimodal

OSWorld

OSWorld is a benchmarking platform for evaluating multimodal agents' capabilities in performing open-ended tasks within real computer environments. It supports multiple virtualization platforms including VMware, VirtualBox, Docker, and AWS, offering diverse task scenarios and comprehensive evaluation metrics.

PythonDockerMultimodal
Per page

Page 1 / 1 · 3 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.

rocket_launch