ODISSEI Lunch Lecture: From PDF to Knowledge Graph: parsing CBS metadata

On Tuesday, 18 May from 12:00-13:00, Chang Sun (Maastricht University) will present her project ‘Knowledge Graph for metadata of CBS Microdata’.

We warmly invite you to attend this lecture, learn more about parsing CBS metadata, and participate in the Q&A and discussion after the lecture. Read more about the lecture below and register via the button.

Online registration has closed, but if you still wish to register, please contact us via communications@odissei-data.nl.

Currently, CBS metadata are only available in Dutch and in PDF format. This  project converts the descriptions of all CBS microdata sets into one knowledge graph with comprehensive metadata in Dutch and English using text mining and semantic web technologies. Researchers can easily query the metadata, explore the relations among multiple datasets, and find the needed variables. For example, if a researcher searches a dataset about “Age at Death” in the Health and Well-being category, all information related to this dataset will appear including keywords and variable names. “Age at Death” dataset has a keyword – “Death”. This keyword will lead to other datasets such as “Date of Death”. “Cause of Death”, “Production statistics Health and welfare” from Population, Business categories, and Health and well-being categories. This will tremendously save time and costs for the data requester but also data maintainers.

Chang Sun is a PhD student working at the Institute of Data Science at Maastricht University. She achieved her master degree in Artificial Intelligence in 2017 and then started her PhD research in the data science domain.  Currently, she is working on privacy-preserving data mining and federated/distributed machine learning technologies to solve the problem of analysing sensitive data across multiple independent data parties. She is also developing a personal data vault platform where people can take full control of their own data in order to strengthen and extend the (re-)use of personal data while maximally protecting individuals’ privacy. 


About the ODISSEI Lunch Lecture series

The ODISSEI Lunch Lectures highlight methodological issues and innovations in Social Science. This is the last Lunch Lecture of the 2020/2021 academic year. The series will continue in September 2021.

Relevant links


Image by Pexels from Pixabay