Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
VITS-based Voice Conversion focused on simplicity, quality and performance.
🚀 Framework for seamless fine-tuning of the Whisper model on multilingual datasets and deployment to production.
Drift-Lens: an Unsupervised Drift Detection Framework for Deep Learning Classifiers on Unstructured Data
Pictalk is an open-source application designed to help individuals with speech impediments communicate effectively using pictograms and pictures.
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating speech transmission quality.
Tools for handling speech data in machine learning projects.
An open-source text-to-speech (TTS) voice building tool
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
MARS5 speech model (TTS) from CAMB.AI
ModelScope: bring the notion of Model-as-a-Service to life.
An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Data manipulation and transformation for audio signal processing, powered by PyTorch
A collection of datasets for the purpose of emotion recognition/detection in speech.