AI-powered search engine for Telegram Mini Apps, communities and stickers
JavaScript is required to use this website.
✔️ Higgs Audio: an open platform for training and experimenting with audio LLMs Higgs Audio by boson-ai is a repository for researchers and developers who want to quickly assemble, train, and test audio models: speech recognition, audio question-answering, multimodal voice agents, and custom experiments with embeddings. Key ideas • Unified framework: the project structure simplifies working with datasets, preprocessing, and training launch. • Flexible configs: switch models, batch sizes, augmentations, and optimization strategies via customizable YAML/JSON parameters. • Modular blocks: encoders, decoders, prompt adapters, and task heads can be combined without rewriting the core. • Quick start: ready-made scripts for data preparation and training on one or multiple GPU nodes. • Experimental playground: conveniently try fine-tuning for your domain acoustics (podcasts, calls, streams, noisy datasets). Typical use cases 1. Train a small speech recognition model on your own corpus. 2. Create a voice bot: audio input → text → LLM → audio response. 3. Fine-tune an embedding model for sound search (similar signals, music fragments, events). 4. Research zero-shot / few-shot adaptation of audio models to new languages or accents. https://github.com/boson-ai/higgs-audio
Community: Artificial Intelligence
Posted: 2025-08-19T11:20:06.000Z
This community post is not available in your browser. Please use a modern browser to view this post.