Creating Structured Data for Online Bios
Introduction
This study aims to define an effective method and framework for describing online biographical sketches using structured data, or, in a broad sense, “metadata”: the structured, encoded data that describe characteristics of information-bearing entities. Using artists as an example, the most widely seen structured data appear in two primary places on the web: (1) the “knowledge graph” that a search engine provides when one searches for an artist, and (2) the “info-box” that appears on the right side of a Wikipedia article about an artist. However, for a knowledge graph or info-box to exist, the individual’s life has to be well documented in a Wikipedia article or in a dedicated record in the Wikidata knowledge base (to which Freebase data were migrated). This study focuses on online biographical sketches, or “bios,” which have become a very popular publishing format on the web. Recognized biographies in other media (published books, dedicated special journal issues, documentary films, dramas, biographical movies, etc.) were not considered, since they are usually already covered by Wikipedia and search engines’ knowledge graphs. The creator of an online biography can be anyone who uses webpages to record and interpret a particular human