A collection of evaluation benchmarks for the Italian language.
Simone Conia
AI & ML interests
Natural Language Processing, Multilinguality, Knowledge Graphs, Semantics, Large Language Models
Recent Activity
authored a paper 1 day ago
ReTraceQA: Evaluating Reasoning Traces of Small Language Models in Commonsense Question Answering
authored a paper 1 day ago
AgREE: Agentic Reasoning for Knowledge Graph Completion on Emerging Entities
liked a model 13 days ago
principled-intelligence/gemma-4-E2B-it-text-only