Event:WikiCon Australia 2024/Submissions/Using OpenRefine & IRMNG to improve Australian Biodiversity
Appearance
Using OpenRefine & IRMNG to improve Australian Biodiversity
[edit]Abstract/description
[edit]We
- demonstrate how to download a Darwin core csv file from IRMNG which may represent the taxa named by a particular taxonomist. The list will not be complete as IRMNG is very incomplete with respect to Australian Faunal Directory and World Register of Marine Species taxon databases.
- import this file into openRefine and create a project.
In openRefine, we learn to
- reconcile columns... with taxon names (Accept only perfect matches NOT synonyms)
- create new columns
- by splitting a column
- by copying a column
- by using GREL functions such as substring, replace, indexOf ...
- subset for further processing (and using flags and stars)
An alternative approach
[edit]Using the following queries for APNI and AFD taxa:
- For genera with APNI ids (and no authority) plus taxon author citation
- For species with APNI ids (and no authority)
- For genera with AFD ids (and no authority) plus taxon author citation
- for AFD arachnid genera (limiting a query)
- For species with AFD ids (and no authority)
Modify these queries
[edit]- to pick a family, genus, order
and download the query result as a CSV file
The tasks thereafter closely match those discussed above and include
- forming links to the APNI and AFD pages for the taxon
- grabbing the authority and the publication from these links
to create lists of authors, taxon year of publication, publication name and page, and again, creating a schema to upload the reconciled authors and publications to wikidata.
What I am hoping to achieve
[edit]At the end of the session, participants will have learned
- how to create a project in openRefine
- why & how to facet
- how to split a column (and how to undo an action)
- how to reconcile a column with its wikidata
- some useful GREL functions
- how to create a schema for uploading data to wikidata
to ultimately create Wikidata entries like that for Illawarra wisharti.
Relationship to Wiki skills or to the theme
[edit]Learning how to use openRefine to import statements and items into Wikidata
Username/s
[edit]- MargaretRDonald (talk) 01:27, 26 September 2024 (UTC)
Session type & duration
[edit]4 x two hour online sessions