Alina
iblub
AI & ML interests
None yet
Recent Activity
liked a Space about 1 month ago
k-mktr/gpu-poor-llm-arena
liked a Space about 1 year ago
timm/timmAttentionViz
updated a model over 1 year ago
iblub/idefics2-8b-mwm-finetuned-qlora_8bit_10e
Organizations
None yet
LLM
- meta-llama/Llama-2-7b-hf
  Text Generation • 7B • Updated • 500k • 2.27k
- DocLLM: A layout-aware generative language model for multimodal document understanding
  Paper • 2401.00908 • Published • 189
- Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
  Paper • 2401.04398 • Published • 25
- PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models
  Paper • 2402.01118 • Published • 32
CV
models 21
iblub/idefics2-8b-mwm-finetuned-qlora_8bit_10e
Updated
iblub/idefics2-8b-mwm-finetuned-8bit
Updated
iblub/idefics2-8b-mwm-finetuned
Updated
iblub/idefics2-8b-docvqa-finetuned-tutorial
Updated
iblub/a2c-PandaReachDense-v2
Reinforcement Learning • Updated • 2
iblub/detr-finetuned-balloon
Object Detection • Updated • 1
iblub/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning • Updated
iblub/ppo-lunar-lander-week8
Reinforcement Learning • Updated
iblub/poca-SoccerTwos
Reinforcement Learning • Updated • 6
iblub/ppo-Pyramid
Reinforcement Learning • Updated • 10
datasets 0
None public yet