Assess and benchmark time series forecasting models
Assess text similarity and accuracy scores
Measure model performance over time with automatic evaluation