-
trpakov/vit-face-expression
Image Classification • 85.8M • Updated • 491k • • 85 -
mo-thecreator/vit-Facial-Expression-Recognition
Image Classification • 85.8M • Updated • 2.64k • • 25 -
dima806/pets_facial_expression_detection
Image Classification • 85.8M • Updated • 12 • 1 -
HardlyHumans/Facial-expression-detection
Image Classification • 85.8M • Updated • 276 • 1
G
smogs-wlike
AI & ML interests
None yet
Organizations
None yet
VKR
-
Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition
Paper • 2405.16701 • Published -
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Paper • 2401.05698 • Published -
Leveraging Recent Advances in Deep Learning for Audio-Visual Emotion Recognition
Paper • 2103.09154 • Published -
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild
Paper • 1901.02839 • Published
vkr_tmp
Inno: Final project
-
pyannote/speaker-diarization-3.1
Automatic Speech Recognition • Updated • 15.5M • 1.4k -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 6.5M • • 5.24k -
Running21
Leaderboard / AudioBench
🥇21Explore audio analysis tools with AudioBench Leaderboard
-
cointegrated/rut5-base-absum
Summarization • 0.2B • Updated • 643 • 30
CV kursovik
-
trpakov/vit-face-expression
Image Classification • 85.8M • Updated • 491k • • 85 -
mo-thecreator/vit-Facial-Expression-Recognition
Image Classification • 85.8M • Updated • 2.64k • • 25 -
dima806/pets_facial_expression_detection
Image Classification • 85.8M • Updated • 12 • 1 -
HardlyHumans/Facial-expression-detection
Image Classification • 85.8M • Updated • 276 • 1
vkr_tmp
VKR
-
Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition
Paper • 2405.16701 • Published -
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Paper • 2401.05698 • Published -
Leveraging Recent Advances in Deep Learning for Audio-Visual Emotion Recognition
Paper • 2103.09154 • Published -
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild
Paper • 1901.02839 • Published
Inno: Final project
-
pyannote/speaker-diarization-3.1
Automatic Speech Recognition • Updated • 15.5M • 1.4k -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 6.5M • • 5.24k -
Running21
Leaderboard / AudioBench
🥇21Explore audio analysis tools with AudioBench Leaderboard
-
cointegrated/rut5-base-absum
Summarization • 0.2B • Updated • 643 • 30