H. Aldhaheri
aenawi
AI & ML interests
LLMs Agents
Organizations
None yet
DeepResearch Models
Translation-Models
-
tencent/Hunyuan-MT-7B
Translation • 8B • Updated • 4.85k • 731 -
tencent/Hunyuan-MT-Chimera-7B
Translation • 8B • Updated • 1.87k • 92 -
swiss-ai/Apertus-8B-Instruct-2509
Text Generation • 8B • Updated • 123k • • 465 -
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Paper • 2509.14008 • Published • 90
Speech-To-Text
Papers - Researches
Arabic Datasets
Embedding Models
-
WhereIsAI/UAE-Large-V1
Feature Extraction • 0.3B • Updated • 2.3M • • 237 -
intfloat/multilingual-e5-large
Feature Extraction • 0.6B • Updated • 7.83M • • 1.21k -
sentence-transformers/distiluse-base-multilingual-cased-v1
Sentence Similarity • 0.1B • Updated • 1.12M • • 131 -
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity • 0.1B • Updated • 51.8M • • 1.28k
Datasets
-
ahmedheakl/resume-atlas
Viewer • Updated • 13.4k • 209 • 10 -
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
Paper • 2506.20920 • Published • 79 - RunningAgents285
Infinite Dataset Hub
♾285Search and save datasets generated with a LLM in real time
-
IntrEx: A Dataset for Modeling Engagement in Educational Conversations
Paper • 2509.06652 • Published • 26
Train-On-Datasets
Cybersecurity Models
Animation
Text2Image LLMs
LLMs
Spaces For Demos
Models-Support-Arabic
Speech-to-Speech
Token-Classification
-
hatmimoha/arabic-ner
Token Classification • 0.1B • Updated • 1.66k • 22 -
Ammar-alhaj-ali/arabic-MARBERT-poetry-classification
Text Classification • Updated • 66 • 3 -
CAMeL-Lab/bert-base-arabic-camelbert-mix-ner
Token Classification • Updated • 33.4k • • 15 -
SinaLab/ArabicNER-Wojood
Token Classification • Updated • 44 • 10
Neo4j-Cypher
Coding
Text-To-Speech
Animation
DeepResearch Models
Text2Image LLMs
Translation-Models
-
tencent/Hunyuan-MT-7B
Translation • 8B • Updated • 4.85k • 731 -
tencent/Hunyuan-MT-Chimera-7B
Translation • 8B • Updated • 1.87k • 92 -
swiss-ai/Apertus-8B-Instruct-2509
Text Generation • 8B • Updated • 123k • • 465 -
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Paper • 2509.14008 • Published • 90
LLMs
Speech-To-Text
Spaces For Demos
Papers - Researches
Models-Support-Arabic
Arabic Datasets
Speech-to-Speech
Embedding Models
-
WhereIsAI/UAE-Large-V1
Feature Extraction • 0.3B • Updated • 2.3M • • 237 -
intfloat/multilingual-e5-large
Feature Extraction • 0.6B • Updated • 7.83M • • 1.21k -
sentence-transformers/distiluse-base-multilingual-cased-v1
Sentence Similarity • 0.1B • Updated • 1.12M • • 131 -
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity • 0.1B • Updated • 51.8M • • 1.28k
Token-Classification
-
hatmimoha/arabic-ner
Token Classification • 0.1B • Updated • 1.66k • 22 -
Ammar-alhaj-ali/arabic-MARBERT-poetry-classification
Text Classification • Updated • 66 • 3 -
CAMeL-Lab/bert-base-arabic-camelbert-mix-ner
Token Classification • Updated • 33.4k • • 15 -
SinaLab/ArabicNER-Wojood
Token Classification • Updated • 44 • 10
Datasets
-
ahmedheakl/resume-atlas
Viewer • Updated • 13.4k • 209 • 10 -
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
Paper • 2506.20920 • Published • 79 - RunningAgents285
Infinite Dataset Hub
♾285Search and save datasets generated with a LLM in real time
-
IntrEx: A Dataset for Modeling Engagement in Educational Conversations
Paper • 2509.06652 • Published • 26
Neo4j-Cypher
Train-On-Datasets
Coding
Cybersecurity Models