Wikipedia and Wikidata access with Python
Wiki
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/2709853/HNL_Wiki_Wiki_Bus.jpg)
Wikipedia
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/2712191/1000px-Wikipedia_svg_logo.svg.png)
Wikimedia
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/2712170/1024px-Wikimedia_logo_family_complete-2013.svg.png)
MediaWiki
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/1949298/MediaWiki-notext.svg.png)
Wiki CMS
Built mostly in:
- PHP
- JavaScript
Working with Docker
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/3924041/docker-whale.png)
Docker images used
We build the one above by following the README instructions, modifying the Bash scripts and the Dockerfile if desired
We can build this one as well. Alternatively, we can reuse it from:
https://hub.docker.com/r/toniher/debian-python-mediawiki/
However, you need to mount the example scripts provided in the repo above
Bots
The hidden ones
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/2714821/diderobot.png)
MediaWiki API
Accessing MediaWiki programmatically
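As a minimal sketch of what programmatic access looks like without any client library, the snippet below builds an Action API request for the current wikitext of one page. The English Wikipedia endpoint, the helper names and the User-Agent string are assumptions made for illustration.

```python
import json
import urllib.parse
import urllib.request

# Assumption: English Wikipedia; any MediaWiki exposes a similar api.php.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_query_url(title, api_url=API_URL):
    """Return an Action API URL asking for the latest wikitext of one page."""
    params = {
        "action": "query",
        "prop": "revisions",
        "rvprop": "content",
        "rvslots": "main",
        "titles": title,
        "format": "json",
        "formatversion": 2,
    }
    return api_url + "?" + urllib.parse.urlencode(params)


def fetch_wikitext(title):
    """Perform the request (needs network access) and return the wikitext."""
    request = urllib.request.Request(
        build_query_url(title),
        # Hypothetical User-Agent; replace with one identifying your tool.
        headers={"User-Agent": "wiki-slides-example/0.1 (illustration only)"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        data = json.load(response)
    page = data["query"]["pages"][0]
    revisions = page.get("revisions")
    if not revisions:  # missing page
        return None
    return revisions[0]["slots"]["main"]["content"]
```

Calling `fetch_wikitext("Wiki")` should return the page source as a string, or `None` when the page does not exist.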
MediaWiki API clients / libraries
- Python (Mwclient, Pywikibot)
- Perl (MediaWiki::API, MediaWiki::Bot)
- JavaScript (nodemw)
- etc.
Mwclient
A MediaWiki API Python client library
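A short sketch of reading a page with Mwclient could look like the following. The host, the User-Agent string and the helper names `connect` and `page_summary` are illustrative assumptions; `mwclient` is a third-party package that has to be installed separately.

```python
def connect(host="en.wikipedia.org"):
    """Open a connection to a wiki (needs network access and mwclient)."""
    import mwclient  # third-party: pip install mwclient

    # Hypothetical User-Agent; replace with one identifying your tool.
    return mwclient.Site(host, clients_useragent="wiki-slides-example/0.1")


def page_summary(site, title):
    """Collect a few basic facts about one page of the given site."""
    page = site.pages[title]
    return {
        "title": page.name,
        "exists": page.exists,
        "length": len(page.text()),  # size of the wikitext in characters
    }
```

Typical usage would be `page_summary(connect(), "Wiki")`. Keeping the connection separate from the lookup makes it easy to point the same code at a local MediaWiki, such as one running in Docker.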
RESTBase
A sophisticated Wikipedia REST API
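The REST API exposes content under readable paths rather than query parameters. The sketch below assumes the `page/summary` route of the English Wikipedia REST API; the helper names are made up for this example, and route availability may change over time.

```python
import json
import urllib.parse
import urllib.request

# Assumption: English Wikipedia REST API root.
REST_BASE = "https://en.wikipedia.org/api/rest_v1"


def summary_url(title, base=REST_BASE):
    """Return the REST URL for the summary of one page."""
    # Titles use underscores for spaces; everything else is percent-encoded.
    slug = urllib.parse.quote(title.replace(" ", "_"), safe="")
    return f"{base}/page/summary/{slug}"


def fetch_summary(title):
    """Perform the request (needs network access) and return the extract."""
    request = urllib.request.Request(
        summary_url(title),
        # Hypothetical User-Agent; replace with one identifying your tool.
        headers={"User-Agent": "wiki-slides-example/0.1 (illustration only)"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response).get("extract")
```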
Wikidata
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/2715180/I_love_Wikidata_because.jpg)
Wikidata: Namespaces
- «Wiki ones»: User, Wikidata, Category, etc.
- Wikibase ones: Item, Property (both entities)
Wikidata: Concepts
- Labels
- Descriptions
- Aliases
- Sitelinks
- Statements
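To see how these concepts appear in practice, the sketch below walks a trimmed, hand-written sample shaped like the Wikidata entity JSON. The sample values are illustrative rather than a live export, and `summarise` is a hypothetical helper that only handles statements whose value is another item.

```python
# Hand-written sample in the shape of a Wikidata entity (not a live export).
SAMPLE_ENTITY = {
    "id": "Q42",
    "labels": {"en": {"language": "en", "value": "Douglas Adams"}},
    "descriptions": {"en": {"language": "en", "value": "English writer and humorist"}},
    "aliases": {"en": [{"language": "en", "value": "Douglas Noel Adams"}]},
    "sitelinks": {"enwiki": {"site": "enwiki", "title": "Douglas Adams"}},
    "claims": {
        "P31": [  # instance of
            {
                "mainsnak": {
                    "snaktype": "value",
                    "property": "P31",
                    "datavalue": {
                        "type": "wikibase-entityid",
                        "value": {"entity-type": "item", "id": "Q5"},
                    },
                },
                "rank": "normal",
            }
        ]
    },
}


def summarise(entity, lang="en"):
    """Flatten labels, descriptions, aliases, sitelinks and item statements."""
    statements = {}
    for prop, claims in entity.get("claims", {}).items():
        targets = []
        for claim in claims:
            snak = claim["mainsnak"]
            if snak.get("snaktype") != "value":
                continue  # "somevalue" / "novalue" carry no datavalue
            datavalue = snak["datavalue"]
            if datavalue["type"] == "wikibase-entityid":
                targets.append(datavalue["value"]["id"])
        statements[prop] = targets
    return {
        "label": entity.get("labels", {}).get(lang, {}).get("value"),
        "description": entity.get("descriptions", {}).get(lang, {}).get("value"),
        "aliases": [a["value"] for a in entity.get("aliases", {}).get(lang, [])],
        "sitelinks": {s: link["title"] for s, link in entity.get("sitelinks", {}).items()},
        "statements": statements,
    }
```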
Wikidata: Concepts
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/2712331/wikidata-genewiki.jpg)
Wikidata: Statements
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/2712292/Wikidata-elements.png)
RDF
(Resource Description Framework)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/3251280/RDF_example.svg.png)
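An RDF statement is a subject, predicate, object triple. As a rough illustration, the sketch below writes one Wikidata-style triple ("Q42 is an instance of Q5") in N-Triples syntax. The helper `to_ntriples` is hypothetical and only handles IRIs, not literal values.

```python
# Namespaces used by Wikidata's RDF export.
WD = "http://www.wikidata.org/entity/"        # items
WDT = "http://www.wikidata.org/prop/direct/"  # direct ("truthy") properties


def to_ntriples(triples):
    """Serialise (subject, predicate, object) IRI triples as N-Triples."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


triples = [
    (WD + "Q42", WDT + "P31", WD + "Q5"),  # Douglas Adams, instance of, human
]
print(to_ntriples(triples))
```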
Wikidata API access
MediaWiki API access: Like any other MediaWiki.
https://wikidata.org/w/api.php
Access via JSON (or other) exports
SPARQL endpoint access
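The first two access routes can be sketched as plain URLs: one for the `wbgetentities` module of the MediaWiki API and one for the `Special:EntityData` JSON export. The `www.wikidata.org` host and the helper names are assumptions made for this example.

```python
import urllib.parse

# Assumption: canonical host for the Wikidata API.
API_URL = "https://www.wikidata.org/w/api.php"


def wbgetentities_url(ids, props=("labels", "descriptions"), languages=("en",)):
    """Return an API URL fetching selected parts of one or more entities."""
    params = {
        "action": "wbgetentities",
        "ids": "|".join(ids),          # e.g. "Q42|Q5"
        "props": "|".join(props),
        "languages": "|".join(languages),
        "format": "json",
    }
    return API_URL + "?" + urllib.parse.urlencode(params)


def entity_data_url(entity_id, fmt="json"):
    """Return the export URL of a whole entity in the given format."""
    return f"https://www.wikidata.org/wiki/Special:EntityData/{entity_id}.{fmt}"
```

The API route suits selective lookups of several entities at once, while the export route returns one complete entity per request.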
Wikidata API clients
Linked (Open) Data
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/2715215/1000px-LOD_Cloud_Diagram_as_of_September_2011.svg.png)
SPARQL: Endpoints and interfaces
Wikidata SPARQL interface
Interface to multiple endpoints
Example query:
SPARQL: API Access
Example for Wikidata: https://query.wikidata.org/sparql
SPARQL client in Python
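A dedicated client library is not strictly required: the endpoint accepts a plain HTTP GET. The sketch below uses only the standard library; the sample query (five items that are instances of house cat, Q146), the helper names and the User-Agent string are illustrative assumptions.

```python
import json
import urllib.parse
import urllib.request

ENDPOINT = "https://query.wikidata.org/sparql"

# Illustrative query: five items that are an instance of (P31) house cat (Q146).
QUERY = """
SELECT ?item ?itemLabel WHERE {
  ?item wdt:P31 wd:Q146 .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }
}
LIMIT 5
"""


def build_sparql_url(query, endpoint=ENDPOINT):
    """Return the GET URL that runs the query and asks for JSON results."""
    return endpoint + "?" + urllib.parse.urlencode({"query": query, "format": "json"})


def simplify(results):
    """Turn SPARQL JSON results into a list of {variable: value} dicts."""
    return [
        {var: cell["value"] for var, cell in row.items()}
        for row in results["results"]["bindings"]
    ]


def run_query(query):
    """Perform the request (needs network access) and return simplified rows."""
    request = urllib.request.Request(
        build_sparql_url(query),
        # Hypothetical User-Agent; replace with one identifying your tool.
        headers={"User-Agent": "wiki-slides-example/0.1 (illustration only)"},
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return simplify(json.load(response))
```

`run_query(QUERY)` should return a list of rows with `item` and `itemLabel` keys.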
Wikimedia Hackathon 2018
![](https://s3.amazonaws.com/media-p.slid.es/uploads/76302/images/3824855/Columnes_UAB.jpeg)
#wmhack
18-20 May
Wikipedia and Wikidata access with Python
By Similis.cc
These slides are a short introduction to accessing Wikipedia and Wikidata content with Python.