Publications and Open Material

23May '18

Search, Filter, Fork, and Link Open Data

Authors Sebastian Neumaier, Lőrinc Thurnay, Thomas J. Lampoltshammer and Tomáš Knap

The ADEQUATe platform: data- and community-driven quality improvements

This document describes the outcome of the ADEQUATe project, the ADEQUATe platform: a framework to monitor the quality of (Governmental) Open Data catalogs, to re-publish improved and linked versions of the datasets and their respective metadata descriptions, and to include the community in the quality improvement process.

25Jul '16

Multi-level semantic labelling of numerical values

Authors Sebastian Neumaier, Jürgen Umbrich, Josiane Xavier Parreira, and Axel Polleres

Abstract: With the success of Open Data a huge amount of tabular data sources became available that could potentially be mapped and linked into the Web of (Linked) Data. Most existing approaches to “semantically label” such tabular data rely on mappings of textual information to classes, properties, or instances in RDF knowledge bases in order to link – and eventually transform – tabular data into RDF.

25Jul '16

Automated Quality Assessment of Metadata across Open Data Portals

Authors Sebastian Neumaier, Jürgen Umbrich, and Axel Polleres, Vienna University of Economics and Business, Vienna, Austria

Abstract: The Open Data movement has become a driver for publicly available data on the Web. More and more data – from governments, public institutions but also from the private sector – is made available online and is mainly published in so called Open Data portals. However, with the increasing number of published resources, there are a number of concerns with regards to the quality of the data sources and the corresponding metadata, which compromise the searchability, discoverability and usability of resources.

30Jun '16

Characteristics of Open Data CSV Files

Authors Johann Mitlöhner, Sebastian Neumaier, Jürgen Umbrich, and Axel Polleres, Vienna University of Economics and Business, Vienna, Austria
This work analyzes an Open Data corpus containing 200K tabular resources with a total file size of 413 GB from a data consumer perspective. Our study shows that ∼10 % of the resources in Open Data portals are labelled as a tabular data of which only 50 % can be considered CSV files.

08Jun '16

Best Paper Award for our Linked Data Quality Workshop Paper!

Authors Tomas Knap (Semantic Web Company)

As part of ADEQUATe project  we are working on the framework for assessing and monitoring and improving quality of open data at two open data portal - data.gv.at and opendataportal.at.

19May '16

Reflexion paper presented at CeDEM 2016 Conference: ADEQUATe - Analytics and Data Enrichment to Improve the Quality of Open Data

This reflexion paper about the overall mission of ADEQUATe was presented in a reflexion session at the CeDEM16 – International Conference for E-Democracy and Open Government, May 18-20, 2016, at the Danube University Krems, Krems, Austria.

05May '16

Paper evaluating current approaches for (semi)automatic leveraging of tabular data to Linked Data got accepted to ESWC 2016 Linked Data Quality Workshop !

Authors Tomas Knap (Semantic Web Company)
As part of ADEQUATe project, we plan to (semi)automatically leverage tabular data to Linked Data. We will present the results of the evaluation of the current approaches in that area at Linked Data Quality workshop at ESWC 2016 conference, May 30.

26Apr '16

Measures for assessing the data freshness in Open Data portals

Authors Sebastian Neumaier and Jürgen Umbrich, Vienna University of Economics and Business, Vienna, Austria

Abstract: Many applications and use cases which consume Open Data rely on up-to-date information. However, scarcely any major Open Data portal provides the users with tangible on how current the data sources are. In order to create a freshness metrics that would solve the mentioned issues, we need to i) learn the change history of a data source and ii) apply a heuristic to estimate how up-to-date a data source is. The following paper assesses which information about the change behaviour of a data source in an Open Data portal are available to derive such a change history.