Disambiguation of newly derived nominalizations in context: A Distributional Semantics approach

Gabriella Lapesa; Lea Kawaletz; Ingo Plag; Marios Andreou; Max Kisselew; Sebastian Padó

doi:10.3366/word.2018.0131


References (67)

(2011)
svm function from the sklearn Python package
Rossella Varvara (2017)
Verbs as nouns: empirical investigations on event-denoting nominalizations
Marco Baroni, Georgiana Dinu, Germán Kruszewski (2014)
Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors
(2011)
Corpus linguistics with Google
B. Fradin (2009)
Les nominalisations et la lecture 'moyen'
M. Kubát (2017)
An Introduction to Machine Learning
Ingo Plag (1999)
Morphological Productivity
Roberto Navigli (2009)
Word sense disambiguation: A survey
ACM Comput. Surv., 41
Hinrich Schütze (1998)
Automatic Word Sense Discrimination
Comput. Linguistics, 24
M. Kisselew, Laura Rimell, Alexis Palmer, Sebastian Padó (2016)
Predicting the Direction of Derivation in English Conversion
Gemma Boleda, Sebastian Padó, Jason Utt (2012)
Regular polysemy: A distributional model
B. Fradin (2011)
Remarks on State denoting nominalizations
J. Pustejovsky (2011)
Coercion in a general theory of argument selection
, 49
M. Andreou (2017)
Stereotype negation in Frame Semantics
Glossa, 2
C. Melloni (2011)
Event and Result Nominals: A Morpho-semantic Approach
(2010)
German -ung-formation: An explanation of formation and interpretation in a root-based account
(2001)
Philosophische Untersuchungen: Kritisch
F. Rainer (2014)
Polysemy in Derivation
L. Bauer, R. Lieber, Ingo Plag (2013)
The Oxford Reference Guide to English Morphology
G. Miller, W. Charles (1991)
Contextual correlates of semantic similarity
Language and Cognitive Processes, 6
Marco Baroni, Alessandro Lenci (2010)
Distributional Memory: A General Framework for Corpus-Based Semantics
Computational Linguistics, 36
J. Firth (1957)
A Synopsis of Linguistic Theory, 1930-1955
L. Bauer (1983)
English Word-Formation: Frontmatter
B. Levin (1993)
English Verb Classes and Alternations: A Preliminary Investigation
(2013)
Understanding semantics
Gemma Boleda, Sabine Walde, Toni Badia (2012)
Modeling Regular Polysemy: A Study on the Semantic Classification of Catalan Adjectives
Computational Linguistics, 38
Ryan Cotterell, Hinrich Schütze (2017)
Joint Semantic Synthesis and Morphological Analysis of the Derived Word
Transactions of the Association for Computational Linguistics, 6
R. Lieber (2016)
English Nouns: The Ecology of Nominalization
Gabriella Lapesa, S. Evert, Sabine Walde (2014)
Contrasting Syntagmatic and Paradigmatic Relations: Insights from Distributional Semantic Models
M. Marelli, Marco Baroni (2015)
Affixation in semantic space: Modeling morpheme meanings with compositional distributional semantics
Psychological Review, 122(3)
I. Plag, M. Andreou, Lea Kawaletz (2018)
A frame-semantic approach to polysemy in affixation
Christopher Bishop (2006)
Pattern Recognition and Machine Learning (Information Science and Statistics)
C. Kay, I. Wotherspoon (2004)
The Oxford English Dictionary Online
Lit. Linguistic Comput., 19
Lea Kawaletz, I. Plag (2015)
Predicting the Semantics of English Nominalizations: A Frame-Based Analysis of -ment Suffixation
Antoinette Renouf, A. Kehoe, J. Banerjee (2007)
WebCorp: an integrated system for web text search
R. Baayen (2009)
Corpus linguistics in morphology: Morphological productivity
(2015)
Coquery: A corpus query tool
R. Baayen (1996)
The Effects of Lexical Specialization on the Growth Curve of the Vocabulary
Comput. Linguistics, 22
E. Joanis, S. Stevenson, D. James (2003)
A General Feature Space for Automatic Verb Classification
Natural Language Engineering, 14
(2012)
Sur la corrélation existant entre les suffixes -age et -ment et les distinctions sémantiques observables dans les nominalisations du français
H. Schmid (2016)
English morphology and word-formation
(2006)
Corpus linguistics and the web (Language and Computers 59)
Tillmann Pross, Antje Roßdeutscher, Sebastian Padó, Gabriella Lapesa, M. Kisselew (2017)
Integrating lexical-conceptual and distributional semantics: a case report
Z. Harris (1954)
Distributional Structure
Joakim Adilsson (2005)
Word formation in English
(2013)
Natural selection in self-organizing morphological systems
(1990)
Event nominalizations: Proposals and problems
J. Pustejovsky (1995)
The Generative Lexicon
Comput. Linguistics, 17
R. Lieber, P. Štekauer (2014)
The Oxford Handbook of Derivational Morphology
Regine Brandtner (2011)
Deverbal nominals in context: meaning variation and copredication
D. Kastovsky (1986)
English word-formation
System, 14
J. Platt (1999)
Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods
Omer Levy, Yoav Goldberg (2014)
Dependency-Based Word Embeddings
Fabian Pedregosa, G. Varoquaux, Alexandre Gramfort, V. Michel, B. Thirion, O. Grisel, Mathieu Blondel, Gilles Louppe, P. Prettenhofer, Ron Weiss, J. Vanderplas, Alexandre Passos, D. Cournapeau, M. Brucher, M. Perrot, E. Duchesnay (2011)
Scikit-learn: Machine Learning in Python
ArXiv, abs/1201.0490
C. Fellbaum (2000)
WordNet: an electronic lexical database
Language, 76
H. Marchand (1971)
Categories And Types Of Present Day English Word Formation
Magnus Sahlgren (2006)
The Word-Space Model: using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces
(2017)
Derivational morphology: An integrated perspective
Peter Turney, Patrick Pantel (2010)
From Frequency to Meaning: Vector Space Models of Semantics
J. Artif. Intell. Res., 37
K. Schuler, A. Korhonen, Neville Ryant, Martha Palmer (2008)
A large-scale classification of English verbs
Language Resources and Evaluation, 42
W. Taylor (1953)
“Cloze Procedure”: A New Tool for Measuring Readability
Journalism & Mass Communication Quarterly, 30
R. Izquierdo, Armando Suárez, German Rigau (2009)
An Empirical Study on Class-Based Word Sense Disambiguation
Antje Roßdeutscher, Hans Kamp (2007)
Syntactic and semantic constraints on the formation and interpretation of -ung-nouns
(2008)
The Corpus of Contemporary American English: 400+ million words, 1990-present
M. Uth (2011)
Französische Ereignisnominalisierungen: Abstrakte Bedeutung und regelhafte Wortbildung
B. Szymanek (2013)
Structural ambiguity in English word-formation
Nicholas Asher (2011)
Lexical Meaning in Context: CODA

Publisher: Edinburgh University Press
Copyright: Copyright © Edinburgh University Press
ISSN: 1750-1245
eISSN: 1755-2036
DOI: 10.3366/word.2018.0131

Abstract

One of the central problems in the semantics of derived words is polysemy (see, for example, the recent contributions by Lieber 2016 and Plag et al. 2018). In this paper, we tackle the problem of disambiguating newly derived words in context by applying Distributional Semantics (Firth 1957) to deverbal -ment nominalizations (e.g. bedragglement, emplacement).

We collected a dataset containing contexts of low frequency deverbal -ment nominalizations (55 types, 406 tokens, see Appendix B) extracted from large corpora such as the Corpus of Contemporary American English. We chose low frequency derivatives because high frequency formations are often lexicalized and thus tend to not exhibit the kind of polysemous readings we are interested in. Furthermore, disambiguating low-frequency words presents an especially difficult task because there is little to no prior knowledge about these words from which their semantic properties can be extrapolated.

The data was manually annotated according to eventive vs. non-eventive interpretations, allowing also an ambiguous label in those cases where the context did not disambiguate. Our question then was to what extent, and under which conditions, context-derived representations such as those of Distributional Semantics can be successfully employed in the disambiguation of low-frequency derivatives.

Our results show that, first, our models are able to distinguish between eventive and non-eventive readings with some success. Second, very small context windows are sufficient to find the intended interpretation in the majority of cases. Third, ambiguous instances tend to be classified as events. Fourth, the performance of the classifier differed for different subcategories of nouns, with non-eventive derivatives being harder to classify correctly. We present indirect evidence that this is due to the semantic similarity of abstract non-eventive nouns to eventive nouns.

Overall, this paper demonstrates that distributional semantic models can be fruitfully employed for the disambiguation of low frequency words in spite of the scarcity of available contextual information.
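
To make the general idea of context-window disambiguation concrete, the following is a minimal sketch and not the authors' pipeline. The abstract does not specify the vector representations or the classifier, so the sketch swaps in the simplest possible stand-ins: raw bag-of-words counts over a small window around the target noun, and a nearest-centroid decision by cosine similarity. The function names, the window size of 2, and all example sentences are invented for illustration and are not taken from the paper's dataset.

```python
from collections import Counter
import math


def context_window(tokens, target_index, size):
    """Return the tokens up to `size` positions left and right of the target."""
    left = tokens[max(0, target_index - size):target_index]
    right = tokens[target_index + 1:target_index + 1 + size]
    return left + right


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def train(examples, size):
    """Sum the context vectors of each label into one centroid per label.

    `examples` is a list of (tokens, target_index, label) triples.
    """
    centroids = {}
    for tokens, target_index, label in examples:
        window = Counter(context_window(tokens, target_index, size))
        centroids.setdefault(label, Counter()).update(window)
    return centroids


def classify(centroids, tokens, target_index, size):
    """Assign the label whose centroid is most similar to the context."""
    window = Counter(context_window(tokens, target_index, size))
    return max(centroids, key=lambda label: cosine(window, centroids[label]))


# Invented toy sentences (assumption: NOT from the paper's annotated data).
TRAIN = [
    ("the emplacement of the guns took two days".split(), 1, "eventive"),
    ("during the emplacement of the sensors".split(), 2, "eventive"),
    ("soldiers hid behind the concrete emplacement".split(), 5, "non-eventive"),
    ("a concrete emplacement stood behind the hill".split(), 2, "non-eventive"),
]

WINDOW_SIZE = 2  # assumption: chosen only to illustrate a "very small" window
MODEL = train(TRAIN, WINDOW_SIZE)

if __name__ == "__main__":
    test_tokens = "after the emplacement of the barrier".split()
    print(classify(MODEL, test_tokens, 2, WINDOW_SIZE))
```

A realistic setup would replace the raw counts with vectors from a distributional model trained on a large corpus and would use a trained classifier, but the shape of the task stays the same: each token of the target noun is represented only by the words in its window and labelled accordingly.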

Journal

Word Structure – Edinburgh University Press

Published: Nov 1, 2018
