The Web of Data has seen tremendous growth recently. New forms of structured data have emerged in the
form of knowledge graphs, Web markup, such as schema.org,
as well as entity-centric data in Web tables. Considering these rich, heterogeneous and evolving data sources which cover a wide variety of domains, exploitation of Web Data becomes increasingly important in the context of various applications, including dataset
search, question answering and fact verification. These applications require reliable information on dataset characteristics, including general metadata, quality features, statistical information, dynamics, licensing, and provenance. Lack of a thorough understanding
of the nature, scope and characteristics of data from particular sources limits their take-up and reuse, such that applications are often limited and focused on well-known reference datasets.
Schema.org is a set of extensible schemas that enables webmasters to embed structured data on their web pages for use by search engines and other applications.
The goal of the PROFILES’19 workshop is to bring together researchers and practitioners interested in
the development of techniques for dataset profiling and deriving quality analytics, as well as performing dataset search and dataset retrieval on the Web while taking dataset profiles into account. We are interested in approaches to analyse, characterise and
discover data sources. We aim to discuss technologies addressing data profiling and search – including semantics, information retrieval for Web Data (ranking algorithms and indexing), in particular in the context of decentralised and distributed systems, such
as the Web. We want to facilitate a discussion around data search across formats and domain-specific applications.
PROFILES offers a highly interactive forum for researchers and practitioners, bringing together experts
in the fields of the Web, Semantic Web, Web Data, Semantic Search, Databases, NLP, IR, and application domains. We envision the workshop as a forum for researchers and practitioners to come together and discuss common challenges and identify synergies for
joint initiatives.
- Dataset profile representation (vocabularies, schemas) - Profiling and assessment of novel forms of entity-centric web data - Data summarisation - Data quality analysis for search - Topic profiling of datasets - Dataset indexing and profiling approaches - Novel applications using dataset profiles - Dataset retrieval - Search results presentation for datasets - Semantic dataset search - Evaluation of dataset search tools and algorithms - Decentralised and distributed architectures and algorithms in data search - Fusing, cleaning, ranking and refining dataset search results - Approaches to personalisation in dataset search - Scalability and performance of distributed data queries - Usage of dataset profiles in search
In the PROFILES’19 edition we are also interested in application of dataset profiles in the areas of - Mobility & Logistics - Smart Cities - Machine Learning
- Workshop paper submissions due: June 28, 2019 - Workshop paper notifications sent: July 24, 2019 - Publication of workshop proceedings: August 29, 2019 - Camera-ready copies due: August 7, 2019 - PROFILES Workshop @ ISWC: October 26, 2019 or October 27, 2019 (half-day)
- Short (up to 6 pages) and full (up to 15 pages) research papers - Poster abstracts and system demonstrations should not exceed 4 pages
All submissions must be written in English and must be formatted according to the Springer LNCS proceedings
style. Each submission will be reviewed by at least 3 members of the PC. Papers will be evaluated according to their significance, originality, technical content, style, clarity and relevance to the workshop.
- Elena Demidova (L3S Research Center, Germany) - Stefan Dietze (GESIS Leibniz Institute for the Social Sciences, Germany) - John Breslin (National University of Ireland, Galway) - Simon Gottschalk (L3S Research Center, Germany)
- Charlie Abela, University of Malta - Enrico Daga, The Open University - Liubov Kovriguina, NRU ITMO - Joanna Lytra, University of Bonn - Dmitry Mouromtsev, NRU ITMO - Bernardo Pereira Nunes, PUC-Rio - Nicolas Tempelmeier, L3S Research Center - Konstantin Todorov,
Université Montpellier - Raquel Trillo-Lado, Universidad de Zaragoza - Maria Esther Vidal, Leibniz Information Centre For Science and Technology - Ran Yu, GESIS Leibniz Institute for the Social Sciences - Amrapali Zaveri, Maastricht University