Repository > Latest

Terminology extraction and alignment for the translation industry

Author(s): Andraž Repar (Author), Senja Pollak (Supervisor)

Year: 2025

Type: Doctoral dissertation

This PhD dissertation focuses on improving terminology extraction and alignment for applications in the translation industry. It explores three key use cases where these techniques benefit language professionals: creating client-specific terminology lists from large parallel corpora (i.e. translation memories), building domain-specific terminology resources from comparable corpora, and identifying important domain-specific …

terminology extraction translation industry corpus

View Details Download PDF

Neural approaches to automatic terminology extraction

Author(s): Hanh Thi Hong Tran (Author), Senja Pollak (Supervisor), Antoine Doucet (Co-Supervisor)

Year: 2024

Type: Doctoral dissertation

Automatic terminology extraction, also known as automatic term extraction (ATE), is a natural language processing (NLP) task that identifies specialized terminology from domain-specific corpora. ATE is often used for terminographic tasks (e.g., the creation of specialized dictionaries) and contributes to several complex downstream tasks (e.g., machine translation and information retrieval). …

automatic terminology extraction transformers token classification natural language processing

View Details Download PDF

Combining neural and symbolic representations in natural language processing

Author(s): Matej Martinc (Author), Senja Pollak (Supervisor)

Year: 2022

Type: Doctoral dissertation

The thesis addresses a novel representation learning framework, combining neural and symbolic text representations, and demonstrates its utility for tackling diverse natural language processing problems. The proposed approach, avoiding the deficiencies of purely symbolic and purely neural methods, can be applied for the generation of efficient text representations. Its usefulness …

View Details Download PDF

Automatic text parsing aided by clause splitting and intra-clausal coordination detection

Author(s): Domen Marinčič (Author), Matjaž Gams (Supervisor), Tomaž Šef (Co-Supervisor)

Year: 2008

Type: Doctoral dissertation

In language technologies, syntactic parsing represents one of the possible intermediate steps of text analysis in the applications such as machine translation, information extraction, question answering, etc. Syntactic trees are often used to demonstrate the structure of text. In the last decades, the dependency framework became a popular syntactic representation, …

View Details Download PDF

REPOSITORY > LATEST

Latest Academic Works

Terminology extraction and alignment for the translation industry

Neural approaches to automatic terminology extraction

Combining neural and symbolic representations in natural language processing

Automatic text parsing aided by clause splitting and intra-clausal coordination detection