Models specialized in extracting structured information (JSON) from text, PDFs, scans, spreadsheets, etc.
AI & ML interests
Interactive NLP development
Recent Activity
View all activity
The best compact Zero-Shot NER models with MIT license
-
numind/NuNER_Zero
Token Classification β’ 0.4B β’ Updated β’ 14.5k β’ 96 -
numind/NuNER_Zero-span
Token Classification β’ Updated β’ 76 β’ 17 -
numind/NuNER_Zero-4k
Token Classification β’ Updated β’ 36 β’ 19 -
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data
Paper β’ 2402.15343 β’ Published β’ 15
-
NuMarkdown 8b Thinking
π37Reasoning model specialized for OCR/Markdown generation.
-
numind/NuMarkdown-8B-Thinking
Image-to-Text β’ 8B β’ Updated β’ 28.2k β’ 216 -
numind/NuMarkdown-8B-Thinking-GGUF
8B β’ Updated β’ 317 β’ 1 -
numind/NuMarkdown-8B-Thinking-mlx-8bits
Image-to-Text β’ Updated β’ 44 β’ 1
The Best Eng/Multi Token Classification foundation models with MIT license
-
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data
Paper β’ 2402.15343 β’ Published β’ 15 -
numind/NuNER-v2.0
Token Classification β’ 0.1B β’ Updated β’ 7.6k β’ 40 -
numind/NuNER-v0.1
Token Classification β’ Updated β’ 6.86k β’ 63 -
numind/NuNER-multilingual-v0.1
Token Classification β’ Updated β’ 6.6k β’ 68
Models specialized in extracting structured information (JSON) from text, PDFs, scans, spreadsheets, etc.
-
NuMarkdown 8b Thinking
π37Reasoning model specialized for OCR/Markdown generation.
-
numind/NuMarkdown-8B-Thinking
Image-to-Text β’ 8B β’ Updated β’ 28.2k β’ 216 -
numind/NuMarkdown-8B-Thinking-GGUF
8B β’ Updated β’ 317 β’ 1 -
numind/NuMarkdown-8B-Thinking-mlx-8bits
Image-to-Text β’ Updated β’ 44 β’ 1
The best compact Zero-Shot NER models with MIT license
-
numind/NuNER_Zero
Token Classification β’ 0.4B β’ Updated β’ 14.5k β’ 96 -
numind/NuNER_Zero-span
Token Classification β’ Updated β’ 76 β’ 17 -
numind/NuNER_Zero-4k
Token Classification β’ Updated β’ 36 β’ 19 -
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data
Paper β’ 2402.15343 β’ Published β’ 15
The Best Eng/Multi Token Classification foundation models with MIT license
-
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data
Paper β’ 2402.15343 β’ Published β’ 15 -
numind/NuNER-v2.0
Token Classification β’ 0.1B β’ Updated β’ 7.6k β’ 40 -
numind/NuNER-v0.1
Token Classification β’ Updated β’ 6.86k β’ 63 -
numind/NuNER-multilingual-v0.1
Token Classification β’ Updated β’ 6.6k β’ 68