Data and Wikipedia

STRUCTURED DATA

http://wiki.dbpedia.org/
DBpedia semantics, linked data, RDF
Old? Difference from Wikidata?

https://en.wikipedia.org/wiki/Wikidata
Wikidata semantics, linked data

STATISTICS

Just for the record will have to compulsory report through metrics:
https://metrics.wmflabs.org Wiki metrics, statistics on Wiki edits

Cohort = group of usernames, opt-in form
Cohort -- military term, or: group of accomplices

All that can currently be measured, has to be processable through wikipedia:

*participation is only measurable in bytes/edits - remove/add
*all participation needs to be translated to text (or bytes)
*"crunch rate" - survival of new users
*Images? how do they count!
What if we would (auto)-transcribe a conversation
*
Reading vs writing: 'using' does not count, only editing

Quantity over quality 
*Change in popularity/relevance of articles?
*Metrics on notability?
*
New people joining -- 'outside' discussion
*How many people from outside the cohort were brought in

Relation between off-line and on-line contributions
*Care work? Financial support is already a way it becomes visible
*Add all admin work, emails, ... to weigh in the care work, the invisible work. This is extra-important once you want to create change.

Ref: someone helping others to edit, or make it easier to edit as newcomer
*Think about ways this could be valued

What if all presentations, notes were added to talk pages in the meetup pages
*"just for the bytes"
*hard to find, not so much a good place to publish this kind of material

Are edits on talkpages and article pages treated the same?

Would be interesting to have metrics distinguished according to namespace, to show the difference between edits/bytes added to articles and ones added to talkpages. Not conflating contributions to "thinking Wikipedia" and "making wikipedia".








*
*
*

TEXTMINING

Pattern: Mining Wikipedia