Discover the toolingbehind modern intelligence

Creative, Vision & Voice#agent#agentic-ai

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

4.1k827Python

Generative-Media-Skills

SamurAIGPT

Creative, Vision & Voice#agent-tools#ai-agents

Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.

3.4k385Shell

MiniMax-MCP

MiniMax-AI

Creative, Vision & Voice#image-generation#image-to-video

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

1.5k268Python

amical

amicalhq

Creative, Vision & Voice#ai#ai-note-taking-app

🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no keyboard needed. 🆓 Powered by open source models, works offline, fast and accurate.

1.3k116TypeScript

minutes

silverstein

Creative, Vision & Voice#agent-skills#ai

Every meeting, every idea, every voice note — searchable by your AI. Open-source, privacy-first conversation memory layer.

1.3k131Rust

douyin-mcp-server

yzfly

LLM & GenAI Engineering#agentskills#claude

提取抖音无水印视频链接，视频文案，douyin-mcp-server，mcp，claude skill，支持龙虾

1.1k223HTML

muapi-cli

SamurAIGPT

Creative, Vision & Voice#ai#cli

Official CLI for muapi.ai — generate images, videos & audio from the terminal. MCP server, 14 AI models, npm + pip installable.

99984Python

claude-video-vision

jordanrendric

Creative, Vision & Voice#claude-code#claude-code-plugin

Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction and multimodal audio analysis

78688TypeScript

MimikaStudio

BoltzmannEntropy

Creative, Vision & Voice#apple-silicon#audio-book-converter

MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic MCP Support

56478Dart

AcademicAgent

Jennyee1

ScholarMind - 面向大模型Agent领域的多模态学术 Agent | Multimodal Academic Research Agent with Knowledge Graph & Learning Path Planning

Creative, Vision & Voice

3363Python

sdk

vargHQ

Creative, Vision & Voice#ai-sdk#ai-video

AI video generation SDK — JSX for videos. One API for Kling, Flux, ElevenLabs, Veed. Built on Vercel AI SDK.

31521TypeScript

skills

elevenlabs

Creative, Vision & Voice#ai-agents#elevenlabs

Collections of skills for building with ElevenLabs

30039Python

dream-to-video-skill

mediastormDev

AGENT

AI agent skill that transforms dream descriptions into cinematic videos — auto-generates prompts, submits to Jimeng via browser automation, and downloads finished videos with post-processing effects.

Automation & Workflows

29135Python

DeLive

XimilalaXiang

Creative, Vision & Voice#agent-skill#ai

System audio capture + multi-provider ASR + local-first AI review workspace. Floating live captions, 12 ASR backends, 60+ languages, AI summary/chat/mindmap, Open API, MCP server, and Agent Skill.

2199TypeScript

yt-dlp-downloader-skill

MapleShaw

Cursor Agent Skill for downloading videos using yt-dlp

Creative, Vision & Voice

18228Shell

ai

team-telnyx

Creative, Vision & Voice#agent-skills#ai

Official one-stop shop for AI Agents and developers building with Telnyx.

1769Shell

music-genre-finder

joeseesun

🎵 Intelligent music genre search with 5947 genres from RateYourMusic - Claude Code skill for quick lookup, smart recommendations, and hierarchical exploration

LLM & GenAI Engineering

17028

editor-pro-max

Hainrixz

Creative, Vision & Voice#ai#claude-code

AI-powered video editor by @soyenriquerocha — built with Remotion + Claude Code. Describe videos in natural language, get professional MP4s. 25 components, 9 templates, 7 skills.

16774TypeScript

edumcp

aieducations

Creative, Vision & Voice#ai#ai-game

EDUMCP is a protocol that integrates the Model Context Protocol (MCP) with applications in the education field, dedicated to achieving seamless interconnection and interoperability among different AI models, educational applications, smart hardware, and teaching AGENTs.

15525Python

MiniMax-MCP-JS

MiniMax-AI