KBNLwikimedia



Overview of tools and scripts on GitHub related to the Wikimedia effort of the KB


Wikimedia Commons tools

GLAMorousToHTML

Creates a datestamped HTML report and a corresponding Excel file listing all Wikipedia articles (in all languages) that use one or more images from a given category tree on Wikimedia Commons.

Wikimedia Commons File Metadata Downloader

Collects metadata from Wikimedia Commons files or categories and writes it to an Excel sheet: safely, in chunks, and with per-file JSON snapshots.

Wikimedia Commons File Downloader

A robust, Windows-safe downloader for Wikimedia Commons files. Downloads files by nested category tree or flat list, previews the selection before downloading, can slice a subset, uses Windows-safe unique filenames, and logs to Excel.
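To illustrate what "Windows-safe unique filenames" can mean in practice, here is a minimal sketch. It is not taken from the downloader itself; the function name `windows_safe_name` and the `_2`, `_3` suffix scheme are assumptions made for this example. It replaces characters that Windows forbids in filenames, avoids reserved device names, and keeps names unique case-insensitively.

```python
import re

# Characters Windows does not allow in filenames, plus control characters.
_FORBIDDEN = re.compile(r'[<>:"/\\|?*\x00-\x1f]')
# Device names Windows reserves, regardless of file extension.
_RESERVED = (
    {"CON", "PRN", "AUX", "NUL"}
    | {f"COM{i}" for i in range(1, 10)}
    | {f"LPT{i}" for i in range(1, 10)}
)


def windows_safe_name(title, used):
    """Return a Windows-safe filename for `title` that is not yet in `used`.

    `used` is a set of lowercased names already handed out; it is updated
    in place. (Hypothetical helper, for illustration only.)
    """
    # Replace forbidden characters; Windows also rejects trailing dots/spaces.
    name = _FORBIDDEN.sub("_", title).rstrip(" .") or "_"
    stem, dot, ext = name.rpartition(".")
    if not dot:  # no extension present
        stem, ext = name, ""
    if stem.upper() in _RESERVED:
        stem += "_"
    suffix = f".{ext}" if dot else ""
    candidate = stem + suffix
    counter = 1
    # Windows filesystems are usually case-insensitive, so compare lowercased.
    while candidate.lower() in used:
        counter += 1
        candidate = f"{stem}_{counter}{suffix}"
    used.add(candidate.lower())
    return candidate
```

Calling the function twice with the same title returns two different names, so a second file never overwrites the first.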

Wikimedia Commons URL M-ID Excel Extractor

Reads a Wikimedia Commons FileURL column from an Excel sheet, looks up the corresponding MediaInfo entity IDs (M-IDs), and writes the results back into the same Excel workbook.
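The lookup rests on the fact that a file's M-ID is the letter M followed by the page ID of its file page. The sketch below shows one way to derive M-IDs under that rule. It is not the tool's own code, and the function names are assumptions made for this example. It extracts the file title from a Commons URL, builds parameters for a standard `action=query` call to the MediaWiki API, and reads the page IDs from a response in the default JSON layout. The network call itself is left out, so the parsing can be tried on a canned response.

```python
from urllib.parse import unquote, urlparse

# Public MediaWiki API endpoint of Wikimedia Commons.
API_URL = "https://commons.wikimedia.org/w/api.php"


def title_from_file_url(url):
    """Turn a Commons file page URL into a page title such as 'File:X.jpg'."""
    path = unquote(urlparse(url).path)
    if not path.startswith("/wiki/File:"):
        raise ValueError(f"Not a Commons file page URL: {url}")
    # Underscores in URLs correspond to spaces in page titles.
    return path[len("/wiki/"):].replace("_", " ")


def query_params(titles):
    """Parameters for one action=query request covering several titles."""
    return {"action": "query", "format": "json", "titles": "|".join(titles)}


def mids_from_response(data):
    """Map each existing page title in an API response to its M-ID."""
    result = {}
    for page in data.get("query", {}).get("pages", {}).values():
        # Missing pages carry no 'pageid' and are skipped.
        if "pageid" in page:
            result[page["title"]] = f"M{page['pageid']}"
    return result
```

The parameters can be sent to `API_URL` with any HTTP client; the resulting title-to-M-ID mapping can then be written next to the FileURL column.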

Identifies public domain (PD) or PD-like (Creative Commons) license templates in Wikimedia Commons files, alongside simplified creation/publication dates.

Wikimedia Commons structured data tools

Code, scripts and stories about dealing with structured data for KB files on Wikimedia Commons.

More info about Structured Data on Commons efforts of the KB.

- Wikimedia Commons category Depicts (P180) extractor

Lists all things depicted in all images in a Wikimedia Commons category

- WriteSDoCfromExcel

Adds Wikidata QIDs to the structured data for one or more properties (e.g., P180 – depicts, P170 – creator) on Wikimedia Commons files from an Excel sheet.

- Delpher-Commons structured data tools - TO UPDATE!!

Tools to add, update, improve and manage structured data for Delpher files on Wikimedia Commons

- dict2sdc

Import structured data to Wikimedia Commons using Pywikibot. Read data from a dict.csv file and add it as structured data to Wikimedia Commons.
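The tools in this list read or write statements such as P180 (depicts) on MediaInfo entities. As a rough illustration of the reading side, the sketch below collects the depicted Wikidata QIDs from one entity. It assumes the JSON shape that the `wbgetentities` API returns for MediaInfo entities, with statements grouped per property under a `statements` key. The function name `depicted_qids` is hypothetical and not taken from any of the repositories above.

```python
def depicted_qids(entity, prop="P180"):
    """Return the QIDs used as values of `prop` in one MediaInfo entity.

    `entity` is assumed to be a single entry from the 'entities' object
    of a wbgetentities response. (Hypothetical helper, for illustration.)
    """
    # Entities without statements may carry an empty list instead of a dict,
    # so fall back to an empty dict in that case.
    statements = entity.get("statements") or entity.get("claims") or {}
    qids = []
    for statement in statements.get(prop, []):
        snak = statement.get("mainsnak", {})
        # 'somevalue' and 'novalue' snaks have no datavalue and are skipped.
        if snak.get("snaktype") == "value":
            qids.append(snak["datavalue"]["value"]["id"])
    return qids
```

Running this over every file in a category and merging the results gives a list of everything depicted in that category.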

Other Wikimedia tools

WikimediaKBURLReplacement

Code, scripts and stories about replacing outdated or non-persistent URLs of KB services in Wikipedia, Wikimedia Commons and Wikidata

OpenRefine-Wikibase

Files for interaction between OpenRefine and KB Wikibases, for reconciling and uploading data to Wikibases of the KB using OpenRefine

wikipedia-utilities

Utility scripts for working with Wikipedia data dumps

Non-Wikimedia tools, but still handy

TO ADD: General File Downloader

A simple tool for quickly downloading non-Wikimedia Commons files

TO ADD: URL http status checker

Checks the HTTP status codes of a list of URLs, with support for retries, timeouts, and detailed logging. Works for all URLs, not necessarily Wikimedia-related URLs.
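Since this tool is still to be added, the following is only a sketch of the general idea (status check with timeout and retries), using the Python standard library. The function names, the use of HEAD requests, and the shape of the result dictionary are all assumptions made for this example, not the tool's actual design.

```python
import time
import urllib.error
import urllib.request


def fetch_status(url, timeout=10.0):
    """Return the HTTP status code for `url` using a HEAD request."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are still valid answers, so report the code.
        return err.code


def check_with_retries(url, fetch=fetch_status, retries=3, delay=1.0):
    """Try `fetch(url)` up to `retries` times and report the outcome."""
    last_error = None
    for attempt in range(1, retries + 1):
        try:
            status = fetch(url)
            return {"url": url, "status": status, "attempts": attempt, "error": None}
        except OSError as err:  # covers URLError, timeouts, connection errors
            last_error = str(err)
            if attempt < retries:
                time.sleep(delay)
    return {"url": url, "status": None, "attempts": retries, "error": last_error}
```

Passing the fetch function as a parameter keeps the retry logic testable without network access. Some servers reject HEAD requests, in which case a GET fallback may be needed.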

SaveToWaybackMachine

Saves URLs of KB webpages in bulk to the Wayback Machine of the Internet Archive. Some websites managed by the KB, the national library of the Netherlands, have been or will be discontinued. To preserve the content of these sites (e.g. for sourcing Wikipedia articles or for cultural heritage preservation), the KB actively submits them to the Wayback Machine (web.archive.org).
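As a rough sketch of the bulk-submission idea, the snippet below prepares "Save Page Now" request URLs by prefixing each page URL with `https://web.archive.org/save/`. The helper names are assumptions made for this example and are not taken from the repository. Actually sending the requests, and pausing between them to respect rate limits, is left out.

```python
# Prefix of the Wayback Machine "Save Page Now" endpoint.
SAVE_PREFIX = "https://web.archive.org/save/"


def save_request_url(page_url):
    """Return the URL that asks the Wayback Machine to archive `page_url`."""
    return SAVE_PREFIX + page_url.strip()


def build_save_requests(page_urls):
    """Build save-request URLs for a list, skipping blanks and duplicates."""
    seen = set()
    requests = []
    for raw in page_urls:
        url = raw.strip()
        if not url or url in seen:
            continue
        seen.add(url)
        requests.append(save_request_url(url))
    return requests
```

Each resulting URL can then be requested with an HTTP client of choice, ideally with a delay between calls.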

videotools

A collection of video and audio processing tools.