Discover the toolingbehind modern intelligence
The operating system for AI infrastructure discovery. Search a live map of MCP servers, AI agents, LLM tools, automation systems, and developer infrastructure.
The operating system for AI infrastructure discovery. Search a live map of MCP servers, AI agents, LLM tools, automation systems, and developer infrastructure.
Lowest stars first
Every indexed repo, lowest star count first — find hidden gems before they trend.
TypeScript SDK and CLI for building AI agents that autonomously join, listen, and speak in X/Twitter Spaces. Supports multiple LLM providers (OpenAI, Claude, Groq), speech-to-text (Whisper, Deepgram),
🎬 Autonomous AI agent that transforms long-form videos into viral short-form content across YouTube Shorts, TikTok, and Instagram Reels using advanced AI analysis and automation.
A high-performance Model Context Protocol (MCP) server providing local speech-to-text transcription using whisper.cpp, optimized for Apple Silicon.
Bidirectional voice MCP server for Claude Code — listen (STT) and speak (TTS) on Apple Silicon via mlx-audio
Created an AI AGENT which uses whisper stt and maya tts (no more -- i am using piper tts) locally along with ollama to provide an assistant behaviour vocally.
🔊 Text-to-Speech MCP plugin for Claude Code - hear audio feedback while coding (OpenAI TTS)
Advanced ffmpeg mcp server
Luma AI Video + Audio + Image Generation and RunwayML Video Generation from Image and Text
IA na Prática: LLM, RAG, MCP, Agents, Function Calling, Multimodal, TTS/STT e mais
A Claude skill that teaches your AI to watch videos. Use it to learn, absorb, copy, or give visual feedback like you would to a real person.
Turns any text into a video montage of movie clips featuring the specified text
🖼️ Workshop: Build a multimodal AI agent with Haystack & GPT-4o — featuring image understanding, document retrieval, conversational memory, and human-in-the-loop safety controls
YouTube/audio transcription, image, video generation, AI voice (TTS) & OCR for Claude Code, Cursor & Windsurf. Up to 20x cheaper via deAPI.
Reverse-engineered Doubao (豆包) API → OpenAI-compatible REST service. Free multimodal chat, image/video/music generation, and file hosting for AI agents.
Ultra-fast local TTS for AI agents. ~90ms to first sound.
Audio feedback plugin for Claude Code with TTS announcements, sound effects, and contextual AI messages. Supports multiple providers and languages.
基于Qwen Agent框架,融合JAKA机械臂、视觉检测、语音识别与合成、MCP数据库的多模态大模型
No description provided.
AI agent skills for video, image, speech & music generation. Works with Claude Code, Cursor, Windsurf, OpenCode, ClawHub.
Generate images with Google Whisk AI (Imagen 3.5) directly from Claude Code
AI Video Engine — Turn structured data into videos. JSON timeline → Scene components → HTML/MP4.
Flowstay is a MacOS app that allows instant transcription across all your apps with auto-paste. Stay in your flow state. 2x faster typing with your voice.
The KlicStudio MCP server is a connector based on the Model Context Protocol (MCP), designed to facilitate interactions with KlicStudio services. Acting as a bridge between large language models (LLMs
Study Guide for the AI-102: Designing and Implementing a Microsoft Azure AI Solution Exam