Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
2
7
25
Abdel-aziz Harane
abdel-aziz-harane
Follow
burtenshaw's profile picture
PhysiQuanty's profile picture
2 followers
·
33 following
abdelazizharane
abdel-aziz-harane
AI & ML interests
Natural Language Processing (TTS, ASR, MT); Computer Vision; Machine Learning
Recent Activity
reacted
to
ylacombe
's
post
with 🔥
about 1 month ago
Yesterday, we released Parler-TTS and Data-Speech, fully open-source reproduction of work from the paper: https://huggingface.co/papers/2402.01912 Parler-TTS is a lightweight text-to-speech (TTS) model that can generate high-quality, natural sounding speech in the style of a given speaker (gender, pitch, speaking style, etc). https://huggingface.co/collections/parler-tts/parler-tts-fully-open-source-high-quality-tts-models-66164ad285ba03e8ffde214c Parler-TTS Mini v0.1, is the first iteration Parler-TTS model trained using 10k hours of narrated audiobooks. It generates high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation). To improve the prosody and naturalness of the speech further, we're scaling up the amount of training data to 50k hours of speech. The v1 release of the model will be trained on this data, as well as inference optimisations, such as flash attention and torch compile. https://huggingface.co/parler-tts/parler_tts_mini_v0.1 Data-Speech can be used for annotating speech characteristics in a large-scale setting. https://huggingface.co/collections/parler-tts/open-source-speech-datasets-annotated-using-data-speech-661648ffa0d3d76bfa23d534 This work is both scalable and easily modifiable and will hopefully help the TTS research community explore new ways of conditionning speech synthesis. All of the datasets, pre-processing, training code and weights are released publicly under permissive license, enabling the community to build on our work and develop their own powerful TTS models.
liked
a dataset
3 months ago
google/WaxalNLP
upvoted
an
article
3 months ago
Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR
View all activity
Organizations
abdel-aziz-harane
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
3 months ago
google/WaxalNLP
Viewer
•
Updated
26 days ago
•
2.56M
•
45.8k
•
227
liked
a model
8 months ago
ibm-granite/granite-docling-258M
Image-Text-to-Text
•
0.3B
•
Updated
Sep 23, 2025
•
692k
•
1.18k
liked
a model
9 months ago
openai/whisper-large-v3
Automatic Speech Recognition
•
2B
•
Updated
Aug 12, 2024
•
5.35M
•
•
5.76k
liked
a dataset
over 1 year ago
abdel-aziz-harane/Corpus-Chadian-languages_shu
Viewer
•
Updated
Feb 24, 2025
•
310
•
29
•
1
liked
6 models
over 1 year ago
google/gemma-7b
Text Generation
•
9B
•
Updated
Jun 27, 2024
•
71.5k
•
•
3.34k
deepseek-ai/DeepSeek-R1
Text Generation
•
685B
•
Updated
Mar 27, 2025
•
5.15M
•
•
13.4k
coqui/XTTS-v2
Text-to-Speech
•
Updated
Dec 11, 2023
•
9.69M
•
3.57k
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
Apr 10, 2025
•
12.5M
•
•
6.24k
meta-llama/Meta-Llama-3-70B-Instruct
Text Generation
•
71B
•
Updated
Jun 18, 2025
•
107k
•
•
1.51k
black-forest-labs/FLUX.1-schnell
Text-to-Image
•
Updated
Aug 16, 2024
•
522k
•
•
4.97k
liked
a Space
almost 2 years ago
Runtime error
Agents
45
Multilingual TTS
🌍
45
Convert text into speech in multiple languages
liked
2 models
about 2 years ago
meta-llama/Llama-2-7b-chat-hf
Text Generation
•
7B
•
Updated
Apr 17, 2024
•
302k
•
4.77k
google-bert/bert-base-uncased
Fill-Mask
•
0.1B
•
Updated
Feb 19, 2024
•
69.8M
•
•
2.67k
liked
4 models
over 2 years ago
mistralai/Mixtral-8x7B-Instruct-v0.1
47B
•
Updated
Jul 24, 2025
•
889k
•
4.69k
agomberto/trocr-base-printed-fr
Image-to-Text
•
Updated
May 9, 2023
•
80
•
2
bigscience/bloom
Text Generation
•
176B
•
Updated
Jul 28, 2023
•
5.52k
•
5.01k
iocuydi/llama-2-amharic-3784m
Updated
Mar 16, 2024
•
1
•
18
liked
a model
almost 3 years ago
tiiuae/falcon-40b
Text Generation
•
42B
•
Updated
Aug 9, 2024
•
32.9k
•
2.44k
liked
a Space
almost 3 years ago
Runtime error
Agents
Featured
164
JoJoGAN
🌍
164
liked
a Space
about 3 years ago
Paused
Featured
1.48k
IF
🔥
1.48k
Load more