Data Science Projects (Q1-2021)


This first quarter of the 2020-2021 academic year, I supervise 6 final year projects in the Data Science Master of the UOC University, ranging from the development of a COVID-19 FAQ-based Q-A system to building knowledge graphs and performing end-to-end Natural Language Generation. Below is a list of resources and references I offer to get the students started.

Introduction to Natural Language Processing

Wikipedia and DBPedia

Building Knowledge Graphs from Texts

Information Extraction and Text Mining tasks

Language Models and Bert

Catalan Language Processing

Question Answering (Q/A) systems in general

Q/A From Knowledge Graphs

Q/A about COVID-19

Open Q/A with SQuAD

Natural Language Generation from Wikipedia Triples and texts

General

End-to-end Neural NLG

  • Neural Wikipedian: Generating Textual Summaries from Knowledge Base Triples, 2018. Code and Article.
  • Neural Text Generation from Structured Data with Application to the Biography Domain, 2016. Dataset and Article.
  • Automatic Generation of Company Descriptions, 2018. Dataset and Article.

Datasets

Evaluation