Data and Wikipedia STRUCTURED DATA http://wiki.dbpedia.org/ DBpedia semantics, linked data, RDF Old? Difference from Wikidata? https://en.wikipedia.org/wiki/Wikidata Wikidata semantics, linked data STATISTICS Just for the record will have to compulsory report through metrics: https://metrics.wmflabs.org Wiki metrics, statistics on Wiki edits Cohort = group of usernames, opt-in form Cohort -- military term, or: group of accomplices All that can currently be measured, has to be processable through wikipedia: *participation is only measurable in bytes/edits - remove/add *all participation needs to be translated to text (or bytes) *"crunch rate" - survival of new users * Images? how do they count! What if we would (auto)-transcribe a conversation * Reading vs writing: 'using' does not count, only editing Quantity over quality *Change in popularity/relevance of articles? *Metrics on notability? * New people joining -- 'outside' discussion *How many people from outside the cohort were brought in Relation between off-line and on-line contributions *Care work? Financial support is already a way it becomes visible *Add all admin work, emails, ... to weigh in the care work, the invisible work. This is extra-important once you want to create change. Ref: someone helping others to edit, or make it easier to edit as newcomer *Think about ways this could be valued What if all presentations, notes were added to talk pages in the meetup pages *"just for the bytes" *hard to find, not so much a good place to publish this kind of material Are edits on talkpages and article pages treated the same? Would be interesting to have metrics distinguished according to namespace, to show the difference between edits/bytes added to articles and ones added to talkpages. Not conflating contributions to "thinking Wikipedia" and "making wikipedia". * * * TEXTMINING Pattern: Mining Wikipedia