DeepVideoDiscovery
✨A video content discovery tool developed by Microsoft that uses deep learning technology to automatically identify and extract key content from videos, helping users efficiently browse and understand video information。
A video content discovery tool developed by Microsoft that uses deep learning technology to automatically identify and extract key content from videos, helping users efficiently browse and understand video information。
LLaVA-Plus is a multimodal assistant system that learns to use tools, combining large language models with visual capabilities to enable AI agents to perform general vision tasks.
A comprehensive platform to generate, animate and schedule AI characters with automated workflows for training LoRA models, creating images/videos, and publishing to social media.
A text-to-speech model optimized for dialogue scenarios like LLM assistants, supporting mixed Chinese and English input. It generates natural and expressive speech with fine-grained control over prosodic features like laughter and pauses.
Page 1 / 1 · 4 total
Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.