---
title: README
emoji: 🐠
colorFrom: yellow
colorTo: yellow
sdk: static
pinned: false
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/63044350fc783bfc74462d5c/C1LPGkFkycvoNJS52HQjn.jpeg
---

A community organization for Wikimedians interested in creating, contributing to, using, and writing about datasets and models.

(Thumbnail from [Johnson, Kaffee, and Redi '24](https://arxiv.org/pdf/2410.08918))

# Datasets of interest
* [wikimedia/wikipedia](https://huggingface.co/datasets/wikimedia/wikipedia)
* [wikimedia/wikisource](https://huggingface.co/datasets/wikimedia/wikisource)
* [Wikimedia Commons URLs](https://github.com/ryanrudes/wikimedia) (40M, from 2022, via Ryan Rudes; newer data needed)
* The [Nomic Atlas](https://huggingface.co/datasets/wikimedia/wikipedia/discussions/48) map of words in WP (2023)
* [Wikipedia-based Image Text](https://huggingface.co/datasets/wikimedia/wit_base)

## Collections to review
[Sourced from Wikimedia](https://huggingface.co/collections/davanstrien/sourced-from-wikimedia-64f9f2ac4639c9edf83effa2), [OpenLID](https://huggingface.co/laurievb/OpenLID)

## Tools
* [Storm](https://storm.genie.stanford.edu/) (and coming: DataStorm)
* Citation checkers:
  * [Alex O](https://wiki-cite-checker.replit.app/)
  * [Citation needed](https://chromewebstore.google.com/detail/wikipedia-add-a-fact/kecnjhdipdihkibljeicopdcoinghmhj) Chrome extension
* [WikiChat](https://wikichat.genie.stanford.edu/)
* [Spinach](https://github.com/stanford-oval/spinach) - SPARQL-based study
* [WikiCrow](https://wikicrow.ai/) - via [FutureHouse Platform](https://platform.edisonscientific.com/)
* [CitationNeeded](https://meta.wikimedia.org/wiki/Future_Audiences/Experiment:Citation_Needed) experiment

# See also
* The HF organization for the [Wikimedia Foundation](https://huggingface.co/wikimedia)
* [WikiProject AI](https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Artificial_Intelligence) on English Wikipedia
* [Waikiki](https://meta.wikimedia.org/wiki/Waikiki) project on the Meta-wiki
* [WikiContradict](https://arxiv.org/abs/2406.13805)