Parameters, Interactions, and Model Selection in Distributional Semantics

Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen:
https://osnadocs.ub.uni-osnabrueck.de/handle/urn:nbn:de:gbv:700-202012223954
Open Access logo originally created by the Public Library of Science (PLoS)
Titel: Parameters, Interactions, and Model Selection in Distributional Semantics
Autor(en): Lapesa, Gabriella
ORCID des Autors: https://orcid.org/0000-0002-4418-3609
Erstgutachter: Prof. Dr. Stefan Evert
Zweitgutachter: Prof. Dr. Kai-Uwe Kühnberger
Prof. Dr. Alessandro Lenci
Apl. Prof. Dr. Sabine Schulte im Walde
Zusammenfassung: Distributional Semantic Models are one of the possible answers produced in (computational) semantics to the question of what the meaning of a word is. The distributional semantic answer to this question is a usage-based one, as distributional semantics models (henceforth, DSMs) are employed to produce semantic representations of words from co-occurrence patterns in texts or documents. DSMs have proven to be useful in many applications in the domains of Natural Language Processing. Despite this progress, however, a full understanding of the different parameters governing a DSM and their influence on model performance (which, in fact, is also important for getting a better linguistic understanding of neural word embeddings) has not been achieved yet. This is precisely the goal of this dissertation. Taken together, the experiments presented in this thesis represent (to the best of our knowledge) the largest-scope study in which window and syntax-based DSMs have been tested in all parameter settings. As a further contribution, the thesis proposes a novel methodology for the interpretation of evaluation results: we employ linear regression as a statistical tool to understand the impact of different parameters on model performance. In this way, we achieve a solid understanding of the influence of specific parameters and parameter interactions on DSM performance, which can inform the selection of DSM settings that are robust to overfitting. This thesis has a strong focus on cognitive data, that is, on DSM parameters that lend themselves to a cognitive interpretation and on evaluation tasks in which DSMs are tested in their capability of mirroring speakers’ behavior in psychological tasks (semantic priming and free associations). One of the most important contributions of this thesis is the consistent finding that neighbor rank (i.e., the rank of a word among the distributional neighbors of a target) is a better indicator of semantic similarity/relatedness than the distance in the semantic space, which is commonly used in the literature. The cognitive interpretation of this result is straightforward: neighbor rank, which is evaluated systematically for the first time in this thesis, is able to capture asymmetry in the relation between two words, while distance metrics, commonly employed in distributional semantics, are symmetric.
URL: https://osnadocs.ub.uni-osnabrueck.de/handle/urn:nbn:de:gbv:700-202012223954
Schlagworte: distributional semantics; evaluation methodology; cognitive modeling
Erscheinungsdatum: 22-Dez-2020
Lizenzbezeichnung: Attribution-NonCommercial-NoDerivs 3.0 Germany
URL der Lizenz: http://creativecommons.org/licenses/by-nc-nd/3.0/de/
Publikationstyp: Dissertation oder Habilitation [doctoralThesis]
Enthalten in den Sammlungen:FB08 - E-Dissertationen

Dateien zu dieser Ressource:
Datei Beschreibung GrößeFormat 
thesis_lapesa.pdfPräsentationsformat3,09 MBAdobe PDF
thesis_lapesa.pdf
Miniaturbild
Öffnen/Anzeigen
research_data_lapesa.zipSupplementary material16,17 MBZIP
research_data_lapesa.zip
Öffnen/Anzeigen


Diese Ressource wurde unter folgender Copyright-Bestimmung veröffentlicht: Lizenz von Creative Commons Creative Commons