A sneak peek at the European Union Ethics Guidelines for AI

A few days ago, politico.eu published a preview of the document that the European Union will issue as guidance for ethical issues related to artificial intelligence and machine learning. The document was written by the High-level Expert Group on Artificial Intelligence, appointed by the European Commission. This advanced version of the document is available online now … Continue reading A sneak peek at the European Union Ethics Guidelines for AI

Improving Topic Modeling with Knowledge Graph Embeddings

Topic modeling techniques have been applied in many scenarios in recent years, spanning textual content, as well as many different data sources. The existing researches in this field continuously try to improve the accuracy and coherence of the results. Some recent works propose new methods that capture the semantic relations between words into the topic … Continue reading Improving Topic Modeling with Knowledge Graph Embeddings

Data Cleaning for Knowledge Extraction and Understanding on Social Media

  Social media platforms let users share their opinions through textual or multimedia content. In many settings, this becomes a valuable source of knowledge that can be exploited for specific business objectives. Brands and companies often ask to monitor social media as sources for understanding the stance, opinion, and sentiment of their customers, audience and … Continue reading Data Cleaning for Knowledge Extraction and Understanding on Social Media

Iterative knowledge extraction from social networks

Our motivation starts from the fact that knowledge in the world continuously evolves, and thus ontologies and knowledge bases are largely incomplete. We explored iterative methods, using the results as new seeds. In this paper we address the following research questions: How does the reconstructed domain knowledge evolve if the candidates of one extraction are recursively used as seeds? How does the reconstructed domain knowledge spread geographically? Can the method be used to inspect the past, present, and future of knowledge? Can the method be used to find emerging knowledge?

Myths and Challenges in Knowledge Extraction and Big Data Analysis

The knowledge we may try to extract from human-generated content, IoT and Web sources can be dispersed, informal, contradicting, unsubstantiated and ephemeral today, while already tomorrow it may be commonly accepted. The challenge is to capture and create consolidated knowledge that is new, has not been formalized yet in existing knowledge bases, and is buried inside a big, moving target (the live stream of online data). The myth is that existing tools (spanning fields like semantic web, machine learning, statistics, NLP, and so on) suffice to the objective. I explore the problem that one can face along this path.