Salesforce/blip2-opt-2.7b
Image-Text-to-Text • 4B • Updated • 739k • 446
None defined yet.
Learning from Language Feedback via Variational Policy Distillation
The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation