<< Back to workshops and courses index | >> To the Github repo of this page |
OpenRefine is a well-known tool for editing, enriching and manipulating data. It is widely used within the Wikimedia community to add data to Wikidata. As from version 3.7, you can also upload images to Wikimedia Commons, enriched with structured data. In this workshop you will learn step by step how to do that.
In this workshop you will learn
This workshop is suitable for people who
but who do not yet know how to use OpenRefine to add images and structured data to Wikimedia Commons.
This workshop is therefore not suitable for people who have never worked with OpenRefine and/or Wikidata.
## Required preparation
OpenRefine 3.7 SNAPSHOT: See instructions above.
These URLs are also provided in the reconciliation service test bench and in this list of Wikibase manifests.
OpenRefine schema: The OpenRefine schema that will be used to upload the images and data to Commons can be found here
If you want to build up the OpenRefine project from scratch, you can use these raw source materials
Online images: We are going to upload the 18 images from Nederlandsche havengezichten enz. to Commons. These images can be directly requested via http://resolver.kb.nl/resolve?urn=urn:gvn:KONB16:533939704&role=page&count=4&size=large (count=1, count=2, count=3… count=18)
Please note that the the domain *.kb.nl has been whitelisted, so Wikimedia Commons accepts uploads from resolver.kb.nl.
Local images: This page holds 18 individual images, which have been downloaded into the images folder in this repo. These are only relevant if you want to upload these local images, rather then from the URLs.
Excel file: All necessary data for our uploads to Commons is contained in this Excel-file. It will be used as input for creating our OpenRefine project during the workshop.
This Excel lets you choose if you want to upload the files (to Commons) from the local images folder, or from the URLs above.
Commons category: As a preparation for this workshop, a couple these images have already been uploaded to the Commons Category:Nederlandsche_havengezichten_enz.,_1780-1781_-_KONB16:533939704. These will be used for illustration and demo purposes.
Example file: One file within this category is File:De Haven van Amsterdam - Nederlandsche havengezichten enz. - KONB16-533939704 - Prent 3 van 18.jpg. We will use this example file for guidance, it holds
PDF slides: The outline, explanations, tips & tricks etc. that will be demonstrated during the workshop can be seen in this PDF-presentation in Dutch. You can also use it as guidance if you want to do this workshop by yourself.
The PDF is also available on Wikimedia Commons and Zenodo.
This workshop is given by Olaf Janssen, the Wikimedia coordinator of the Koninklijke Bibliotheek, the national library of the Netherlands. In this role he stimulates and facilitates collaboration between the collections, knowledge, open data and staff of the KB on the one hand, and the projects of the Wikimedia movement, such as Wikipedia, Wikimedia Commons, Wikidata and Wikibase, on the other. He is also active as a volunteer within the community.
Feel free to contact Olaf via olaf.janssen(at)kb.nl
This workshop was given during
All workshop materials are released into the public domain under the Creative Commons Zero v1.0 Universal and can therefore be reused freely and openly. Attribution is not required, but still appreciated.
This page was last updated on 14 December 2022.