Run agent evaluation and submit answers to scoring API
Generate code answers and fetch the current time in a given timezone via chat
Identify facial expressions in images
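
As a minimal sketch of the timezone lookup mentioned in the second item, the helper below resolves an IANA timezone name with Python's standard `zoneinfo` module. The function name `current_time_in` and its `now_utc` parameter are hypothetical, chosen for illustration; the source does not say how the chat tool is actually implemented.

```python
from datetime import datetime, timezone
from typing import Optional
from zoneinfo import ZoneInfo, ZoneInfoNotFoundError


def current_time_in(tz_name: str, now_utc: Optional[datetime] = None) -> str:
    """Return the time in `tz_name` as an ISO 8601 string.

    Hypothetical helper: `now_utc` lets a caller pin the instant
    (useful for testing); by default the current UTC time is used.
    """
    try:
        tz = ZoneInfo(tz_name)  # e.g. "Asia/Tokyo", "Europe/Berlin"
    except (ZoneInfoNotFoundError, ValueError):
        # Unknown or malformed timezone keys end up here.
        return f"Unknown timezone: {tz_name}"

    if now_utc is None:
        now_utc = datetime.now(timezone.utc)

    # Convert the timezone-aware UTC instant to the requested zone.
    return now_utc.astimezone(tz).isoformat(timespec="seconds")


if __name__ == "__main__":
    print(current_time_in("Asia/Tokyo"))
```

Returning a string, including for the error case, is one possible design choice: it keeps the result easy to hand back to a chat model as tool output.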