Repetition makes this unusable

#17

by zero1zero - opened Mar 4

Mar 4

Added variations of small to large repetition_penalty and/or presence_penalty and it continuously had problems. I stayed with the recommended greedy decoding but gave low temp a try and it still showed issues.

Seems like this must be an inherent issue with the model? Has anyone seen this work without eventually getting into a repeat loop?

This seems to happen mainly with table structure or visual elements where it wants to use "[-|_|.]" to represent something. This seems like its related to table parsing but also happens on representing visual content or spacing. I'm using the "OCR" task.

Hosted on a 4090 through vllm.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment