Feature construction, encompassing both feature engineering, which involves the manual design of features by domain experts, and representation learning, which refers to the automated discovery of useful data representations during model construction, is a fundamental aspect of machine learning. Its goal is to transform raw data into a more suitable …
Automatic terminology extraction, also known as automatic term extraction (ATE), is a natural language processing (NLP) task that identifies specialized terminology from domain-specific corpora. ATE is often used for terminographic tasks (e.g., the creation of specialized dictionaries) and contributes to several complex downstream tasks (e.g., machine translation and information retrieval). …
Over the past decade, rapid advancements in natural language processing have opened up new avenues for tackling complex issues such as news bias analysis. This progress has empowered researchers to explore innovative approaches to uncovering the complex biases inherent in news production and coverage processes. News bias, a multifaceted reflection …
The thesis addresses the development of novel knowledge discovery scenarios in a modern data mining platform by utilising principles of service-oriented architecture with web services, interactive scientific workflows, knowledge discovery ontologies and automated construction of data mining workflows. We present the developed Orange4WS platform which upgrades Orange, a mature open-source …