| << Back to homepage | >> To the Github repo of this page |
Creates a datestamped HTML report and a corresponding Excel file listing all Wikipedia articles (in all languages) in which (one or more) images from a given category tree on Wikimedia Commons are used.
Collect metadata from Wikimedia Commons files or categories and write them into an Excel sheet — safely, in chunks, and with per-file JSON snapshots.
A robust, Windows-safe downloader for Wikimedia Commons files - Download Wikimedia Commons files by nested category tree or flat list, preview before downloading, slice a subset, use Windows-safe unique filenames, and log to Excel.
Reads a Wikimedia Commons FileURL column from an Excel sheet, looks up the corresponding MediaInfo entity IDs (M-IDs), and writes the results back into the same Excel workbook.
Identifies public domain (PD) or PD-like (Creative Commons) license templates in Wikimedia Commons files, alongside simplified creation/publication dates.
Code, scripts and stories about dealing with structured data for KB files on Wikimedia Commons.
More info about Structured Data on Commons efforts of the KB.
Lists all things depicted in all images in a Wikimedia Commons category
Adds Wikidata QIDs to the structured data for one or more properties (e.g., P180 – depicts, P170 – creator) on Wikimedia Commons files from an Excel sheet.
Tools to add, update, improve an manage structured data for Delpher files on Wikimedia Commons
Import structured data to Wikimedia Commons using Pywikibot. Read data from a dict.csv file and add it as structured data to Wikimedia Commons.
Code, scripts and stories about replacing outdated or non-persistent URLs of KB services in Wikipedia, Wikimedia Commons and Wikidata
Files for interaction between OpenRefine and KB Wikibases, for reconciling and uploading data to Wikibases of the KB, using Openfine
Utility scripts for working with Wikipedia data dumps
A simple tool for quickly downloading non-Wikimedia Commons files
Checks the HTTP status codes of a list of URLs, with support for retries, timeouts, and detailed logging. Works for all URLs, not necessarily Wikimedia-related URLs.
Saving URLs of webpages of the KB in bulk to the Wayback Machine of The Internet Archive. Some websites managed by the KB, national library of the Netherlands, have been or will be discontinued. To preserve the content of these sites (e.g. for sourcing Wikipedia articles or cultural heritage preservation purposes) the KB actively submits websites to the The Wayback Machine web.archive.org.
A collection of video and audio processing tools.