Context window and polysemy interpretation : A case of Korean adverbial postposition −(u)lo - Université Paris Nanterre
Poster De Conférence Année : 2020

Context window and polysemy interpretation : A case of Korean adverbial postposition −(u)lo

Résumé

Construal of a polysemous word occurs in conjunction with a series of words, delivering various frame-semantic meanings (Goldberg,2006) and yet purporting similar interpretations (Harris,1954). In this regard, context window_a range of words surrounding a target word, affecting the determination of its characteristics_is drawing attention to the computational understanding of combinatorial properties of words. We ask how context window addresses polysemy interpretation in Korean, a language typologically different from the major Indo-European languages investigated for this task. We report computational simulations regarding how various context window sizes address polysemy of -(u)lo, which manifests polysemy due to its multiple functions mapped onto one form. We used the Sejong corpus, with semantic annotations of this postposition cross-verified by three native speakers of Korean (κ =0.95). Employing a distributional semantic model (Harris,1954), we devised an unsupervised learning algorithm by combining Singular Value Decomposition with Positive Pointwise Mutual Information. We measured model performance through accuracy rates that the model classified test sentences by the functions of -(u)lo, with manipulation of context window from one to ten. For this purpose, we used the similarity-based estimate (Dagan.et.al.,1993) by calculating cosine similarity scores between -(u)lo and its co-occurring content words. Our model achieved the highest classification accuracy rate in the window size of one, and the accuracy rates decreased as the window size increased. This trend aligns with advantages of small window sizes (Bullinaria & Levy, 2007). Considering that a narrower rangeof context window relates more to syntactic than to sematic information (Patel.et.al.,1997), our model may have employed structural, more than semantic, characteristics of tri-grams (word-target-word) for the best classification performance. Given the networks of interlinked clusters of words and symbolic units in human cognition (construct-i-con; Goldberg,2006), our findings shed light on relations between a polysemous word and an abstract schema including the word, represented as context window, in addressing word-level polysemy.

Domaines

Linguistique
Fichier non déposé

Dates et versions

hal-04113386 , version 1 (01-06-2023)

Identifiants

  • HAL Id : hal-04113386 , version 1

Citer

Seongmin Simon Mun, Gyu-Ho Shin. Context window and polysemy interpretation : A case of Korean adverbial postposition −(u)lo. IMPRS Conference 2020 Interdisciplinary Approaches in the Language Sciences, Jun 2020, Nijmegen, Netherlands. ⟨hal-04113386⟩
9 Consultations
0 Téléchargements

Partager

More