WDProp: Web Application to Analyse Multilingual Aspects of Wikidata Properties

OpenSym 2021, 15th-17th September, 2021

John Samuel

CPE Lyon

Creative Commons License

Outline

Outline

  1. Introduction
  2. WDProp: Wikidata Property Analysis
  3. Conclusion and Future Works

1. Introduction

1. Introduction

1. Introduction

Wikidata Properties

Wikidata Properties

1. Introduction

Wikidata Property: Labels and descriptions

Example Wikidata Property: country (P17)

1. Introduction

Wikidata Property: from Proposal to Usage

Property proposal, creation and usage on Wikidata

1. Introduction

Wikidata Property: Missing descriptions

Missing multilingual descriptions on Property P31

1. Introduction

2. WDProp: Wikidata Property Analysis

WDProp

  1. WDProp:
    • Collaborative Multilingual Multi-domain Ontology development: is it possible to achieve a truly multilingual experience?
  2. Properties:
    • Not so much influenced by bots!
  3. Goals:
    • Understanding Wikidata property proposal, creation and translation
    • Available templates and their usage
    • Providing real-time statistics to (multilingual) contributors

2. WDProp: Wikidata Property Analysis

2. WDProp: Wikidata Property Analysis

Towards 'Everything' about Wikidata properties

Information on Wikidata Properties

2. WDProp: Wikidata Property Analysis

2. WDProp: Wikidata Property Analysis

Wikidata Properties

Wikidata properties

2. WDProp: Wikidata Property Analysis

Wikidata Properties

Wikidata properties: All included deleted ones

2. WDProp: Wikidata Property: P856

Wikidata Properties

Wikidata Property: P856 (official website)

2. WDProp: Wikidata Property: P856

Wikidata Properties

Missing labels in languages: P856

2. WDProp: Wikidata Property Analysis

Wikiprojects

WikiProjects: Programming Languages

2. WDProp: Wikidata Property Analysis

Wikiprojects

WikiProjects

2. WDProp: Wikidata Property Analysis

Wikiprojects

WikiProjects: Properties

2. WDProp: Wikidata Property Analysis

Wikidata Properties: Statistcs of Multilingual Labels

Real time information on number of multilingual labels

2. WDProp: Wikidata Property Analysis

WDProp: Translation path

Wikidata properties: Translation path of three Wikidata Properties

2. WDProp: Wikidata Property Analysis

WDProp: Translation path

Wikidata properties: Possible vandalism (removal of labels)

2. WDProp: Wikidata Property Analysis

WDProp: Translation path

Wikidata properties: Possible vandalism (removal of labels, descriptions and aliases)

2. WDProp: Wikidata Property Analysis

WDProp: Translation path

Wikidata properties: Visualization of Translation

3. Conclusion and Future Works

Wikdiata to Wikipedia

Exporting data from Wikidata to multiple multilingual Wikipedia articles

3. Conclusion and Future Works

Future Works

  1. Search:
    • Improving the search experience
  2. Visualize:
    • Export as SVG
  3. Download:
    • API
    • Download as JSON, CSV

References

  1. Chamard, Thibaut, and John Samuel. Multilingual Wikidata Property Translation Flow Dataset. Zenodo, 8 July 2019. DOI.org (Datacite), doi:10.5281/ZENODO.3271357.
  2. Kaffee, L. A., Piscopo, A., Vougiouklis, P., Simperl, E., Carr, L., & Pintscher, L. (2017, August). A glimpse into Babel: an analysis of multilinguality in Wikidata. In Proceedings of the 13th International Symposium on Open Collaboration (p. 14). ACM.
  3. Müller-Birn, C., Karran, B., Lehmann, J., & Luczak-Rösch, M. (2015, August). Peer-production system or collaborative ontology engineering effort: What is Wikidata?. In Proceedings of the 11th International Symposium on Open Collaboration (p. 20). ACM.
  4. Pellissier Tanon, T., & Kaffee, L. A. (2018, April). Property label stability in Wikidata: evolution and convergence of schemas in collaborative knowledge bases. In Companion of the The Web Conference 2018 on The Web Conference 2018 (pp. 1801-1803). International World Wide Web Conferences Steering Committee.

References

  1. Samoilenko, A., Karimi, F., Kunegis, J., Edler, D., & Strohmaier, M. (2015, June). Linguistic influence patterns within the global network of Wikipedia language editions. In Proceedings of the ACM Web Science Conference (p. 54). ACM.
  2. Samuel, J. (2017) Collaborative Approach to Developing a Multilingual Ontology: A Case Study of Wikidata. In : Research Conference on Metadata and Semantics Research. Springer, Cham, 2017. p. 167-172.
  3. Samuel, J. (2018). Towards Understanding and Improving Multilingual Collaborative Ontology Development in Wikidata. In: WikiWorkshop 2018
  4. Samuel, John. Johnsamuelwrites/Wdprop: V0.12. v0.12, Zenodo, 2021. DOI.org (Datacite), doi:10.5281/ZENODO.1174371.
  5. Stefaner, M., Taraborelli, D., & Ciampaglia, G. L. (2011). Notabilia–Visualizing Deletion Discussions on Wikipedia.

Thank you

Questions?