BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Paper • 2308.09936 • Published • 1
Natural Language Processing, Bias and Fairness in NLP
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks
LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training