DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
✨A benchmark platform featuring 100 PhD-level research tasks across 22 distinct fields, systematically evaluating Deep Research Agents (DRAs) on report generation quality and information retrieval capabilities.
Python大语言模型Deep Learning