Abstract
Semantic features are central to many influential theories of word meaning and semantic memory, but new methods of quantifying the information embedded in feature production norms are needed to advance our understanding of semantic processing and language acquisition. This paper capitalized on databases of semantic feature production norms and age-of-acquisition ratings, and on megastudies including the English Lexicon Project and the Calgary Semantic Decision Project, to examine the influence of feature distinctiveness on language acquisition, visual lexical decision, and semantic decision. A feature network of English words was constructed such that edges in the network represented feature distance, or dissimilarity, between words (i.e., Jaccard and Manhattan distances between the probability distributions of features elicited for each pair of words), making it possible to quantify the feature distinctiveness of each word relative to the other words in the network. Words with greater feature distinctiveness tended to be acquired earlier. Regression analyses of megastudy data revealed that Manhattan feature distinctiveness inhibited performance on the visual lexical decision task, facilitated semantic decision performance for concrete concepts, and inhibited semantic decision performance for abstract concepts. These results demonstrate the importance of considering the structural properties of words embedded in a semantic feature space in order to increase our understanding of semantic processing and language acquisition.
Notes
Using a logistic mixed-effects model to analyze accuracy at the trial level led to model convergence issues and/or to degenerate models due to model complexity. Hence, accuracy for the semantic decision task was analyzed at the item level using linear regression.
It is important to note that this additional analysis of interaction effects was post hoc in nature and not an a priori research question. The analysis was conducted in response to a reviewer who felt that it was particularly important to test whether feature distinctiveness effects were consistent across concrete and abstract concepts.
Acknowledgements
The author thanks Nichol Castro and Li Ying for providing useful comments on earlier drafts of this manuscript. Data and analysis scripts are available on the Open Science Framework: https://osf.io/x87tr/
Funding
No funding to declare.
Ethics declarations
Conflict of interest
The author declares that she has no conflict of interest.
Ethical approval
This article does not contain any studies with human participants performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Handling editor: Barry Devereux (Queen’s University Belfast); Reviewers: Blair Armstrong (University of Toronto), Gabriel Recchia (University of Cambridge).
This article is part of the special topic ‘Eliciting Semantic Properties: Methods and Applications’ guest-edited by Enrico Canessa, Sergio Chaigneau, Barry Devereux, and Alessandro Lenci.
Appendices
Appendix A
Correlations between predictors and outcome variables in the regression models.
Appendix B
Computation of Jaccard and Manhattan distance measures
Two different distance measures were computed to quantify the dissimilarity of any given pair of probability distributions: Jaccard distance and Manhattan distance. The first measure, Jaccard distance, is an example of a measure from the inner-product family of distance measures, which emphasizes shared information. Jaccard distance computes the distance between two words as 1 minus the ratio of the intersection of the two words' feature distributions (i.e., vectors that represent the proportions of participants reporting each feature for a given word) to their union. The second measure, Manhattan distance, is an example of a measure from the Minkowski family of distance measures, which generalizes the familiar straight-line (Euclidean) distance between two points; Manhattan distance is the member of this family that sums differences along each dimension. It computes the distance between two words as the sum of absolute differences between the proportions of participants reporting each feature. Mathematically, these measures were computed as follows:
Jaccard distance, $d_j$:

$$d_j(P, Q) = 1 - \frac{\sum_{i=1}^{n} P_i Q_i}{\sum_{i=1}^{n} P_i^2 + \sum_{i=1}^{n} Q_i^2 - \sum_{i=1}^{n} P_i Q_i}$$

Manhattan distance, $d_m$:

$$d_m(P, Q) = \sum_{i=1}^{n} \left| P_i - Q_i \right|$$

where $P = [P_1, P_2, \ldots, P_n]$ is a vector representing the proportions of participants reporting each feature for word 1, $Q = [Q_1, Q_2, \ldots, Q_n]$ is the corresponding vector for word 2, and $n$ is the number of (unique) features in the feature production norms.
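The two measures described in this appendix can be illustrated with a minimal Python sketch. It assumes the inner-product form of Jaccard distance, in which the intersection is the dot product of the two vectors and the union is the sum of their squared norms minus that dot product; this reading is an assumption based on the verbal definition above and may differ in detail from the original analysis scripts. The function names and the toy feature vectors are hypothetical and are not taken from any norms database.

```python
from typing import Sequence


def _check_lengths(p: Sequence[float], q: Sequence[float]) -> None:
    # Both words must be described over the same set of features.
    if len(p) != len(q):
        raise ValueError("feature vectors must have the same length")


def manhattan_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """Sum of absolute differences between two feature-proportion vectors."""
    _check_lengths(p, q)
    return sum(abs(a - b) for a, b in zip(p, q))


def jaccard_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """1 minus intersection over union, in the assumed inner-product form."""
    _check_lengths(p, q)
    dot = sum(a * b for a, b in zip(p, q))            # "intersection"
    union = sum(a * a for a in p) + sum(b * b for b in q) - dot
    if union == 0:
        # Both vectors are all zeros; treating them as identical
        # is a choice made for this sketch only.
        return 0.0
    return 1.0 - dot / union


# Hypothetical proportions of participants listing each of four features.
dog = [0.8, 0.6, 0.0, 0.2]
cat = [0.7, 0.0, 0.5, 0.2]

d_m = manhattan_distance(dog, cat)
d_j = jaccard_distance(dog, cat)
print(f"Manhattan: {d_m:.3f}  Jaccard: {d_j:.3f}")
```

Under these definitions, two words with identical feature distributions have a distance of 0 on both measures, while two words that share no features have a Jaccard distance of 1 and a Manhattan distance equal to the sum of all their feature proportions.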
Cite this article
Siew, C.S.Q. Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies. Cogn Process 21, 669–685 (2020). https://doi.org/10.1007/s10339-019-00947-6
DOI: https://doi.org/10.1007/s10339-019-00947-6