Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies

Siew, Cynthia S. Q.

doi:10.1007/s10339-019-00947-6

Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies

Research Article
Published: 23 January 2020

Volume 21, pages 669–685, (2020)
Cite this article

Cognitive Processing Aims and scope Submit manuscript

Cynthia S. Q. Siew ORCID: orcid.org/0000-0003-3384-7374¹

435 Accesses
7 Citations
3 Altmetric
Explore all metrics

Abstract

Semantic features are central to many influential theories of word meaning and semantic memory, but new methods of quantifying the information embedded in feature production norms are needed to advance our understanding of semantic processing and language acquisition. This paper capitalized on databases of semantic feature production norms and age-of-acquisition ratings, and megastudies including the English Lexicon Project and the Calgary Semantic Decision Project, to examine the influence of feature distinctiveness on language acquisition, visual lexical decision, and semantic decision. A feature network of English words was constructed such that edges in the network represented feature distance, or dissimilarity, between words (i.e., Jaccard and Manhattan distances of probability distributions of features elicited for each pair of words), enabling us to quantify the relative feature distinctiveness of individual words relative to other words in the network. Words with greater feature distinctiveness tended to be acquired earlier. Regression analyses of megastudy data revealed that Manhattan feature distinctiveness inhibited performance on the visual lexical decision task, facilitated semantic decision performance for concrete concepts, and inhibited semantic decision performance for abstract concepts. These results demonstrate the importance of considering the structural properties of words embedded in a semantic feature space in order to increase our understanding of semantic processing and language acquisition.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

English semantic feature production norms: An extended database of 4436 concepts

Article 01 May 2019

Feats: A database of semantic features for early produced noun concepts

Article 26 December 2023

Diachronic semantic change in language is constrained by how people use and learn language

Article Open access 29 June 2022

Notes

Using a logistic mixed-effects model for analyzing accuracy at the trial level led to model convergence issues and/or to degenerate models due to model complexity. Hence, accuracy for the semantic decision task was analyzed at the item-level using linear regression.
It is important to note that this additional analysis of interaction effects was post hoc in nature and not an a priori research question. The analysis was conducted in response to a reviewer who felt that it was particularly important to test whether feature distinctiveness effects were consistent across concrete and abstract concepts.

References

Adelman JS, Brown GD, Quesada JF (2006) Contextual diversity, not word frequency, determines word-naming and lexical decision times. Psychol Sci 17(9):814–823
PubMed Google Scholar
Baayen RH, Davidson DJ, Bates DM (2008) Mixed-effects modeling with crossed random effects for subjects and items. J Mem Lang 59(4):390–412
Google Scholar
Balota DA, Yap MJ, Hutchison KA, Cortese MJ, Kessler B, Loftis B et al (2007) The English lexicon project. Behav Res Methods 39(3):445–459
PubMed Google Scholar
Brysbaert M, New B (2009) Moving beyond Kučera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behav Res Methods 41(4):977–990
PubMed Google Scholar
Brysbaert M, Warriner AB, Kuperman V (2014) Concreteness ratings for 40 thousand generally known English word lemmas. Behav Res Methods 46(3):904–911. https://doi.org/10.3758/s13428-013-0403-5
Article PubMed Google Scholar
Buchanan EM, Holmes JL, Teasley ML, Hutchison KA (2013) English semantic word-pair norms and a searchable Web portal for experimental stimulus creation. Behav Res Methods 45(3):746–757. https://doi.org/10.3758/s13428-012-0284-z
Article PubMed Google Scholar
Buchanan EM, Valentine KD, Maxwell NP (2019) English semantic feature production norms: an extended database of 4436 concepts. Behav Res Methods. https://doi.org/10.3758/s13428-019-01243-z
Article PubMed Google Scholar
Bullinaria JA, Levy JP (2007) Extracting semantic representations from word co-occurrence statistics: a computational study. Behav Res Methods 39(3):510–526
PubMed Google Scholar
Castro N, Stella M (2019) The multiplex structure of the mental lexicon influences picture naming in people with aphasia. J Complex Netw. https://doi.org/10.1093/comnet/cnz012
Article PubMed PubMed Central Google Scholar
Collins AM, Loftus EF (1975) A spreading-activation theory of semantic processing. Psychol Rev 82(6):407
Google Scholar
Cree GS, McRae K (2003) Analyzing the factors underlying the structure and computation of the meaning of chipmunk, cherry, chisel, cheese, and cello (and many other such concrete nouns). J Exp Psychol Gen 132(2):163
PubMed Google Scholar
Cree GS, McNorgan C, McRae K (2006) Distinctive features hold a privileged status in the computation of word meaning: implications for theories of semantic memory. J Exp Psychol Learn Mem Cogn 32(4):643
PubMed PubMed Central Google Scholar
De Deyne S, Storms G (2008) Word associations: network and semantic properties. Behav Res Methods 40(1):213–231
PubMed Google Scholar
De Deyne S, Navarro DJ, Perfors A, Brysbaert M, Storms G (2019) The “small world of words” english word association norms for over 12,000 cue words. Behav Res Methods 51(3):987–1006
PubMed Google Scholar
Devereux BJ, Tyler LK, Geertzen J, Randall B (2014) The centre for speech, language and the brain (CSLB) concept property norms. Behav Res Methods 46(4):1119–1127
PubMed Google Scholar
Engelthaler T, Hills TT (2017) Feature biases in early word learning: network distinctiveness predicts age of acquisition. Cognit Sci 41:120–140
Google Scholar
Firth J (1957) A synopsis of linguistic theory, 1930–1955. In: Studies in linguistic analysis, philological society, Oxford; reprinted in Palmer F (ed) 1968 Selected Papers of Firth JR, Longman, Harlow
Garrard P, Lambon Ralph MA, Hodges JR, Patterson K (2001) Prototypicality, distinctiveness, and intercorrelation: analyses of the semantic attributes of living and nonliving concepts. Cognit Neuropsychol 18(2):125–174
CAS Google Scholar
Goldstein R, Vitevitch MS (2014) The influence of clustering coefficient on word-learning: how groups of similar sounding words facilitate acquisition. Front Psychol. https://doi.org/10.3389/fpsyg.2014.01307
Article PubMed PubMed Central Google Scholar
Hills TT, Maouene M, Maouene J, Sheya A, Smith L (2009) Longitudinal analysis of early semantic networks. Psychol Sci 20(6):729–739. https://doi.org/10.1111/j.1467-9280.2009.02365.x
Article PubMed PubMed Central Google Scholar
Hsiao Y, Nation K (2018) Semantic diversity, frequency and the development of lexical quality in children’s word reading. J Mem Lang 103:114–126
Google Scholar
Jaswal VK, Hansen MB (2006) Learning words: children disregard some pragmatic information that conflicts with mutual exclusivity. Dev Sci 9(2):158–165
PubMed Google Scholar
Kuperman V, Stadthagen-Gonzalez H, Brysbaert M (2012) Age-of-acquisition ratings for 30,000 English words. Behav Res Methods 44(4):978–990
PubMed Google Scholar
Kuznetsova A, Brockhoff PB, Christensen RHB (2017) lmerTest package: tests in linear mixed effects models. J Stat Softw 82(13):1548–7660
Google Scholar
Louwerse MM, Jeuniaux P (2010) The linguistic and embodied nature of conceptual processing. Cognition 114(1):96–104
PubMed Google Scholar
Love BC, Medin DL, Gureckis TM (2004) SUSTAIN: a network model of category learning. Psychol Rev 111(2):309–332. https://doi.org/10.1037/0033-295X.111.2.309
Article PubMed Google Scholar
Lund K, Burgess C (1996) Producing high-dimensional semantic spaces from lexical co-occurrence. Behav Res Methods Instrum Comput 28(2):203–208
Google Scholar
Markman EM, Wachtel GF (1988) Children’s use of mutual exclusivity to constrain the meanings of words. Cogn Psychol 20(2):121–157
CAS PubMed Google Scholar
Marques JF (2005) Naming from definition: the role of feature type and feature distinctiveness. Q J Exp Psychol Sect A 58(4):603–611. https://doi.org/10.1080/02724980443000106
Article Google Scholar
McRae K, Cree GS, Seidenberg MS, McNorgan C (2005) Semantic feature production norms for a large set of living and nonliving things. Behav Res Methods 37(4):547–559
PubMed Google Scholar
Minda JP, Smith JD (2002) Comparing prototype-based and exemplar-based accounts of category learning and attentional allocation. J Exp Psychol Learn Mem Cogn 28(2):275
PubMed Google Scholar
Montefinese M, Zannino GD, Ambrosini E (2015) Semantic similarity between old and new items produces false alarms in recognition memory. Psychol Res 79(5):785–794
PubMed Google Scholar
Montefinese M, Vinson D, Ambrosini E (2018) Recognition memory and featural similarity between concepts: the pupil’s point of view. Biol Psychol 135:159–169
PubMed Google Scholar
Moss HE, Tyler LK, Jennings F (1997) When leopards lose their spots: knowledge of visual properties in category-specific deficits for living things. Cognit Neuropsychol 14(6):901–950
Google Scholar
Moss HE, Tyler LK, Taylor KI (2007) Conceptual structure. In: Gaskell MG (ed) The Oxford handbook of psycholinguistics. Oxford University Press, Oxford, UK, pp 217–234
Google Scholar
Nelson DL, McEvoy CL, Schreiber TA (2004) The University of South Florida free association, rhyme, and word fragment norms. Behav Res Methods Instrum Comput 36(3):402–407
PubMed Google Scholar
Peters R, Borovsky A (2019) Modeling early lexico-semantic network development: perceptual features matter most. J Exp Psychol Gen 148(4):763
PubMed PubMed Central Google Scholar
Pexman PM, Lupker SJ, Hino Y (2002) The impact of feedback semantics in visual word recognition: number-of-features effects in lexical decision and naming tasks. Psychon Bull Rev 9(3):542–549. https://doi.org/10.3758/BF03196311
Article PubMed Google Scholar
Pexman PM, Holyk GG, Monfils M-H (2003) Number-of-features effects and semantic processing. Mem Cognit 31(6):842–855
PubMed Google Scholar
Pexman PM, Hargreaves IS, Siakaluk PD, Bodner GE, Pope J (2008) There are many ways to be rich: effects of three measures of semantic richness on visual word recognition. Psychon Bull Rev 15(1):161–167
PubMed Google Scholar
Pexman PM, Heard A, Lloyd E, Yap MJ (2017) The Calgary semantic decision project: concrete/abstract decision data for 10,000 English words. Behav Res Methods 49(2):407–417. https://doi.org/10.3758/s13428-016-0720-6
Article PubMed Google Scholar
Plaut DC, Shallice T (1993) Deep dyslexia: a case study of connectionist neuropsychology. Cognit Neuropsychol 10(5):377–500. https://doi.org/10.1080/02643299308253469
Article Google Scholar
Randall B, Moss HE, Rodd JM, Greer M, Tyler LK (2004) Distinctiveness and correlation in conceptual structure: behavioral and computational studies. J Exp Psychol Learn Mem Cogn 30(2):393
PubMed Google Scholar
Ratcliff R, Gomez P, McKoon G (2004) A diffusion model account of the lexical decision task. Psychol Rev 111(1):159–182. https://doi.org/10.1037/0033-295X.111.1.159
Article PubMed PubMed Central Google Scholar
Recchia G, Jones M (2012) The semantic richness of abstract concepts. Front Hum Neurosci 6:315
PubMed PubMed Central Google Scholar
Rips LJ, Shoben EJ, Smith EE (1973) Semantic distance and the verification of semantic relations. J Verbal Learn Verbal Behav 12(1):1–20
Google Scholar
Rodd J, Gaskell G, Marslen-Wilson W (2002) Making sense of semantic ambiguity: semantic competition in lexical access. J Mem Lang 46(2):245–266. https://doi.org/10.1006/jmla.2001.2810
Article Google Scholar
Siew CSQ (2019) spreadr: An R package to simulate spreading activation in a network. Behav Res Methods 51(2):910–929
PubMed PubMed Central Google Scholar
Siew CSQ, Vitevitch MS (2019) The phonographic language network: using network science to investigate the phonological and orthographic similarity structure of language. J Exp Psychol Gen 148(3):475–500. https://doi.org/10.1037/xge0000575
Article PubMed Google Scholar
Siew CSQ, Wulff DU, Beckage NM, Kenett YN (2019) Cognitive network science: a review of research on cognition through the lens of network representations, processes, and dynamics. Complexity, 2019
Sizemore AE, Karuza EA, Giusti C, Bassett DS (2018) Knowledge gaps in the early growth of semantic feature networks. Nat Hum Behav 2(9):682
PubMed PubMed Central Google Scholar
Sloutsky VM, Fisher AV (2004) Induction and categorization in young children: a similarity-based model. J Exp Psychol Gen 133(2):166–188. https://doi.org/10.1037/0096-3445.133.2.166
Article PubMed Google Scholar
Smith LB, Jones SS, Landau B, Gershkoff-Stowe L, Samuelson L (2002) Object name learning provides on-the-job training for attention. Psychol Sci 13(1):13–19
PubMed Google Scholar
Steyvers M, Tenenbaum JB (2005) The large-scale structure of semantic networks: statistical analyses and a model of semantic growth. Cognit Sci 29(1):41–78. https://doi.org/10.1207/s15516709cog2901_3
Article Google Scholar
Taylor KI, Salamoura A, Randall B, Moss H, Tyler LK (2008) Clarifying the nature of the distinctiveness by domain interaction in conceptual structure: comment on Cree, McNorgan, and McRae (2006). J Exp Psychol Learn Mem Cogn 34:719–725
PubMed Google Scholar
Taylor KI, Devereux BJ, Tyler LK (2011) Conceptual structure: towards an integrated neurocognitive account. Lang Cognit Process 26(9):1368–1401
CAS Google Scholar
Taylor KI, Devereux BJ, Acres K, Randall B, Tyler LK (2012) Contrasting effects of feature-based statistics on the categorisation and basic-level identification of visual objects. Cognition 122(3):363–374
PubMed Google Scholar
Tousignant C, Pexman PM (2012) Flexible recruitment of semantic richness: context modulates body-object interaction effects in lexical-semantic processing. Front Hum Neurosci 6:53
PubMed PubMed Central Google Scholar
Tyler LK, Moss HE (2001) Towards a distributed account of conceptual knowledge. Trends in Cognitive Sciences 5(6):244–252
PubMed Google Scholar
Tyler LK, Moss HE, Durrant-Peatfield MR, Levy JP (2000) Conceptual structure and the structure of concepts: a distributed account of category-specific deficits. Brain Lang 75(2):195–231
CAS PubMed Google Scholar
Vinson DP, Vigliocco G (2008) Semantic feature production norms for a large set of objects and events. Behav Res Methods 40(1):183–190
PubMed Google Scholar
Yap MJ, Tan SE, Pexman PM, Hargreaves IS (2011) Is more always better? Effects of semantic richness on lexical decision, speeded pronunciation, and semantic classification. Psychon Bull Rev 18(4):742–750
PubMed Google Scholar
Zdrazilova L, Pexman PM (2013) Grasping the invisible: semantic processing of abstract words. Psychon Bull Rev 20(6):1312–1318
PubMed Google Scholar

Download references

Acknowledgements

The author thanks Nichol Castro and Li Ying for providing useful comments on earlier drafts of this manuscript. Data and analysis scripts are available on the Open Science Framework: https://osf.io/x87tr/

Funding

No funding to declare.

Author information

Authors and Affiliations

Department of Psychology, National University of Singapore, 9 Arts Link, Block AS4 #02-23, Singapore, 117570, Singapore
Cynthia S. Q. Siew

Authors

Cynthia S. Q. Siew
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Cynthia S. Q. Siew.

Ethics declarations

Conflict of interest

The author declares that she has no conflict of interest.

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Handling editor: Barry Devereux (Queen’s University Belfast); Reviewers: Blair Armstrong (University of Toronto), Gabriel Recchia (University of Cambridge).

This article is part of the special topic ‘Eliciting Semantic Properties: Methods and Applications’ guest-edited by Enrico Canessa, Sergio Chaigneau, Barry Devereux, and Alessandro Lenci.

Appendices

Appendix A

Correlations between predictors and outcome variables in the regression models.

See Tables 6, 7, and 8.

Table 6 Age-of-acquisition norms (N = 4013)

Full size table

Table 7 Visual lexical decision (English Lexicon Project) (N = 4013)

Full size table

Table 8 Semantic decision (Calgary Semantic Decision Project) (N = 1336)

Full size table

Appendix B

Computation of Jaccard and Manhattan distance measures

Two different distance measures were computed to quantify the dissimilarity of any given pair of probability distributions: Jaccard distance and Manhattan distance. The first measure, Jaccard distance, is an example of a measure from the inner-product family of distance measures that emphasizes shared information. Jaccard distance computes the distance between two words as subtracting the intersection of the two words’ feature set distributions (i.e., a vector that represents the proportions of participants reporting each feature for a given word) over their union from 1. The second measure, Manhattan distance, is an example of a measure from the Minkowski family of distance measures that generally treats distance as the straight line between two points in Euclidean space. Manhattan distance computes the distance between two words as the sum of absolute differences between the proportions of participants reporting each feature. Mathematically, these measures were computed as follows:

Jaccard distance, d_j:

$$d_{\text{j}} = 1 - \frac{{\sum \left( {P_{i} \times Q_{i} } \right)}}{{\sum P_{i}^{2} + \sum Q_{i}^{2} - \sum \left( {P_{i} \times Q_{i} } \right)}}$$

Manhattan distance, d_m:

$$d_{\text{m}} = \sum \left| {P_{i} - Q_{i} } \right|$$

where P_i = [x₁, x₂, …, x_i] is a vector representing the proportions of participants reporting each feature for a word 1 and Q_i = [x₁, x₂, …, x_i] is a vector representing the proportions of participants reporting each feature for a word 2. i = the number of (unique) features in the feature production norms.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Siew, C.S.Q. Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies. Cogn Process 21, 669–685 (2020). https://doi.org/10.1007/s10339-019-00947-6

Download citation

Received: 23 May 2019
Accepted: 18 December 2019
Published: 23 January 2020
Issue Date: November 2020
DOI: https://doi.org/10.1007/s10339-019-00947-6

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

English semantic feature production norms: An extended database of 4436 concepts

Feats: A database of semantic features for early produced noun concepts

Diachronic semantic change in language is constrained by how people use and learn language

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Appendices

Appendix A

Appendix B

Computation of Jaccard and Manhattan distance measures

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

English semantic feature production norms: An extended database of 4436 concepts

Feats: A database of semantic features for early produced noun concepts

Diachronic semantic change in language is constrained by how people use and learn language

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Appendices

Appendix A

Appendix B

Computation of Jaccard and Manhattan distance measures

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation