OpenRefine
OpenRefine is a free data wrangling tool that can be used to process, manipulate and clean tabular (spreadsheet) data and connect it with knowledge bases ("spreadsheets on steroids" / "a swiss army knife for data"). It is widely used by librarians, in the cultural sector, by journalists and scientists, and is taught in many curricula and workshops around the world.
- OpenRefine is a popular tool for Wikidata batch editing since 2018.
- OpenRefine supports Wikimedia Commons batch editing and uploading, thanks to a Wikimedia grant, since 2022.
- OpenRefine can also be used to batch import and edit data items and media files in Wikibases.
OpenRefine is a community-supported open source project, licensed under the BSD license. It has a graphical user interface in more than 15 languages.
General links
- OpenRefine's website:
- Download OpenRefine:
- Download and install the Wikimedia Commons extension for OpenRefine:
- General OpenRefine documentation:
Bug reports and requests
- On GitHub (for OpenRefine in general):
- On Wikimedia Phabricator (mainly for Wikimedia Commons reconciliation):
Uploaded via OpenRefine
It is possible to track the usage of OpenRefine to upload files to Wikimedia Commons, via the Uploaded with OpenRefine category, using the Commons Uploaders in cat tool.
Staatliche Museen zu Berlin
A collection of coins from the Staatliche Museen zu Berlin.
Swedish Museum of Performing Arts
Some files from the Swedish Museum of Performing Arts.
National Library of Israel
Some files from the National Library of Israel GLAM initiative.
Talk about OpenRefine with its community and with Wikimedia users
- Ask questions on OpenRefine's community forum:
- Join the Telegram group for Wikimedians who use OpenRefine: