OpenRefine/Download
Appearance
Phabricator project: #OpenRefine
OpenRefine is a free data wrangling tool that can be used to process, manipulate and clean tabular (spreadsheet) data and connect it with knowledge bases ("spreadsheets on steroids" / "a swiss army knife for data"). It is widely used by librarians, in the cultural sector, by journalists and scientists, and is taught in many curricula and workshops around the world.
OpenRefine for Wikidata
- It is possible to edit Wikidata with OpenRefine simply by downloading the tool or using PAWS (see section below), as from the version 3.0 onwards, OpenRefine already includes the Wikidata extension.
- This page includes some guidelines on how to edit Wikidata with OpenRefine, including some text and video tutorials, as well as reference manuals.
- There are also guidelines on Wikidata documenting the different ways one can edit Wikidata with OpenRefine:
OpenRefine for Wikimedia Commons
- There is a Wikimedia Commons extension for OpenRefine.
- It is recommended to install it (download and install) for additional helpful features to work with Wikimedia Commons.
- The user is able to check thumbnail previews of Commons files, load files from Commons categories, specific GREL scripts, and other features.
- OpenRefine for Wikimedia Commons editing:
Related tools
OpenRefine edits on Wikidata and Wikimedia Commons can be undone with the EditGroups tool.
- EditGroups on Wikidata
- EditGroups on Wikimedia Commons (note: is not able to undo uploads; only edits to existing files)
Cloud version of OpenRefine (on PAWS) for Wikimedians
Is it difficult for you to run OpenRefine on your own computer?
Run OpenRefine in PAWS on Wikimedia’s Cloud Services (you need a Wikimedia account and an internet connection):
OpenRefine for Wikibase
Reconciliation
- Wikidata’s reconciliation service
- Wikimedia Commons’ reconciliation service
- Setting up reconciliation services for Wikibases
- Various (other) reconciliation services via the Reconciliation Testbench