Prompt tools, token counters, and LLM helper utilities
69 tools
Suno
Text-to-speech model from Suno
Coqui
Deep learning toolkit for text-to-speech
Cross-lingual text-to-speech synthesis
Rhasspy
Fast local neural text-to-speech
MyShell
Instant voice cloning by MyShell
RVC Project
Retrieval-based voice conversion
neonbjb
Multi-voice text-to-speech system
yl4579
Human-level text-to-speech synthesis
OpenAI
OpenAI's image-text model
Salesforce
Salesforce's vision-language model
Microsoft
Large Language and Vision Assistant
THUDM
Visual language model from Tsinghua
Alibaba
Alibaba's vision-language model
Shanghai AI Lab
Open multimodal dialogue model
Microsoft
Microsoft's vision foundation model
Adept
Adept's multimodal model
Hugging Face
Hugging Face's open VLM
Vision-CAIR
Vision-language understanding with GPT-4
IDEA-Research
Open-set object detection with text
Meta
Meta's Segment Anything Model
Meta
Segment Anything for images and video