Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies

  • Research Article
  • Published:
Cognitive Processing Aims and scope Submit manuscript

Abstract

Semantic features are central to many influential theories of word meaning and semantic memory, but new methods of quantifying the information embedded in feature production norms are needed to advance our understanding of semantic processing and language acquisition. This paper capitalized on databases of semantic feature production norms and age-of-acquisition ratings, and megastudies including the English Lexicon Project and the Calgary Semantic Decision Project, to examine the influence of feature distinctiveness on language acquisition, visual lexical decision, and semantic decision. A feature network of English words was constructed such that edges in the network represented feature distance, or dissimilarity, between words (i.e., Jaccard and Manhattan distances of probability distributions of features elicited for each pair of words), enabling us to quantify the relative feature distinctiveness of individual words relative to other words in the network. Words with greater feature distinctiveness tended to be acquired earlier. Regression analyses of megastudy data revealed that Manhattan feature distinctiveness inhibited performance on the visual lexical decision task, facilitated semantic decision performance for concrete concepts, and inhibited semantic decision performance for abstract concepts. These results demonstrate the importance of considering the structural properties of words embedded in a semantic feature space in order to increase our understanding of semantic processing and language acquisition.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. Using a logistic mixed-effects model for analyzing accuracy at the trial level led to model convergence issues and/or to degenerate models due to model complexity. Hence, accuracy for the semantic decision task was analyzed at the item-level using linear regression.

  2. It is important to note that this additional analysis of interaction effects was post hoc in nature and not an a priori research question. The analysis was conducted in response to a reviewer who felt that it was particularly important to test whether feature distinctiveness effects were consistent across concrete and abstract concepts.

References

  • Adelman JS, Brown GD, Quesada JF (2006) Contextual diversity, not word frequency, determines word-naming and lexical decision times. Psychol Sci 17(9):814–823

    PubMed  Google Scholar 

  • Baayen RH, Davidson DJ, Bates DM (2008) Mixed-effects modeling with crossed random effects for subjects and items. J Mem Lang 59(4):390–412

    Google Scholar 

  • Balota DA, Yap MJ, Hutchison KA, Cortese MJ, Kessler B, Loftis B et al (2007) The English lexicon project. Behav Res Methods 39(3):445–459

    PubMed  Google Scholar 

  • Brysbaert M, New B (2009) Moving beyond Kučera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behav Res Methods 41(4):977–990

    PubMed  Google Scholar 

  • Brysbaert M, Warriner AB, Kuperman V (2014) Concreteness ratings for 40 thousand generally known English word lemmas. Behav Res Methods 46(3):904–911. https://doi.org/10.3758/s13428-013-0403-5

    Article  PubMed  Google Scholar 

  • Buchanan EM, Holmes JL, Teasley ML, Hutchison KA (2013) English semantic word-pair norms and a searchable Web portal for experimental stimulus creation. Behav Res Methods 45(3):746–757. https://doi.org/10.3758/s13428-012-0284-z

    Article  PubMed  Google Scholar 

  • Buchanan EM, Valentine KD, Maxwell NP (2019) English semantic feature production norms: an extended database of 4436 concepts. Behav Res Methods. https://doi.org/10.3758/s13428-019-01243-z

    Article  PubMed  Google Scholar 

  • Bullinaria JA, Levy JP (2007) Extracting semantic representations from word co-occurrence statistics: a computational study. Behav Res Methods 39(3):510–526

    PubMed  Google Scholar 

  • Castro N, Stella M (2019) The multiplex structure of the mental lexicon influences picture naming in people with aphasia. J Complex Netw. https://doi.org/10.1093/comnet/cnz012

    Article  PubMed  PubMed Central  Google Scholar 

  • Collins AM, Loftus EF (1975) A spreading-activation theory of semantic processing. Psychol Rev 82(6):407

    Google Scholar 

  • Cree GS, McRae K (2003) Analyzing the factors underlying the structure and computation of the meaning of chipmunk, cherry, chisel, cheese, and cello (and many other such concrete nouns). J Exp Psychol Gen 132(2):163

    PubMed  Google Scholar 

  • Cree GS, McNorgan C, McRae K (2006) Distinctive features hold a privileged status in the computation of word meaning: implications for theories of semantic memory. J Exp Psychol Learn Mem Cogn 32(4):643

    PubMed  PubMed Central  Google Scholar 

  • De Deyne S, Storms G (2008) Word associations: network and semantic properties. Behav Res Methods 40(1):213–231

    PubMed  Google Scholar 

  • De Deyne S, Navarro DJ, Perfors A, Brysbaert M, Storms G (2019) The “small world of words” english word association norms for over 12,000 cue words. Behav Res Methods 51(3):987–1006

    PubMed  Google Scholar 

  • Devereux BJ, Tyler LK, Geertzen J, Randall B (2014) The centre for speech, language and the brain (CSLB) concept property norms. Behav Res Methods 46(4):1119–1127

    PubMed  Google Scholar 

  • Engelthaler T, Hills TT (2017) Feature biases in early word learning: network distinctiveness predicts age of acquisition. Cognit Sci 41:120–140

    Google Scholar 

  • Firth J (1957) A synopsis of linguistic theory, 1930–1955. In: Studies in linguistic analysis, philological society, Oxford; reprinted in Palmer F (ed) 1968 Selected Papers of Firth JR, Longman, Harlow

  • Garrard P, Lambon Ralph MA, Hodges JR, Patterson K (2001) Prototypicality, distinctiveness, and intercorrelation: analyses of the semantic attributes of living and nonliving concepts. Cognit Neuropsychol 18(2):125–174

    CAS  Google Scholar 

  • Goldstein R, Vitevitch MS (2014) The influence of clustering coefficient on word-learning: how groups of similar sounding words facilitate acquisition. Front Psychol. https://doi.org/10.3389/fpsyg.2014.01307

    Article  PubMed  PubMed Central  Google Scholar 

  • Hills TT, Maouene M, Maouene J, Sheya A, Smith L (2009) Longitudinal analysis of early semantic networks. Psychol Sci 20(6):729–739. https://doi.org/10.1111/j.1467-9280.2009.02365.x

    Article  PubMed  PubMed Central  Google Scholar 

  • Hsiao Y, Nation K (2018) Semantic diversity, frequency and the development of lexical quality in children’s word reading. J Mem Lang 103:114–126

    Google Scholar 

  • Jaswal VK, Hansen MB (2006) Learning words: children disregard some pragmatic information that conflicts with mutual exclusivity. Dev Sci 9(2):158–165

    PubMed  Google Scholar 

  • Kuperman V, Stadthagen-Gonzalez H, Brysbaert M (2012) Age-of-acquisition ratings for 30,000 English words. Behav Res Methods 44(4):978–990

    PubMed  Google Scholar 

  • Kuznetsova A, Brockhoff PB, Christensen RHB (2017) lmerTest package: tests in linear mixed effects models. J Stat Softw 82(13):1548–7660

    Google Scholar 

  • Louwerse MM, Jeuniaux P (2010) The linguistic and embodied nature of conceptual processing. Cognition 114(1):96–104

    PubMed  Google Scholar 

  • Love BC, Medin DL, Gureckis TM (2004) SUSTAIN: a network model of category learning. Psychol Rev 111(2):309–332. https://doi.org/10.1037/0033-295X.111.2.309

    Article  PubMed  Google Scholar 

  • Lund K, Burgess C (1996) Producing high-dimensional semantic spaces from lexical co-occurrence. Behav Res Methods Instrum Comput 28(2):203–208

    Google Scholar 

  • Markman EM, Wachtel GF (1988) Children’s use of mutual exclusivity to constrain the meanings of words. Cogn Psychol 20(2):121–157

    CAS  PubMed  Google Scholar 

  • Marques JF (2005) Naming from definition: the role of feature type and feature distinctiveness. Q J Exp Psychol Sect A 58(4):603–611. https://doi.org/10.1080/02724980443000106

    Article  Google Scholar 

  • McRae K, Cree GS, Seidenberg MS, McNorgan C (2005) Semantic feature production norms for a large set of living and nonliving things. Behav Res Methods 37(4):547–559

    PubMed  Google Scholar 

  • Minda JP, Smith JD (2002) Comparing prototype-based and exemplar-based accounts of category learning and attentional allocation. J Exp Psychol Learn Mem Cogn 28(2):275

    PubMed  Google Scholar 

  • Montefinese M, Zannino GD, Ambrosini E (2015) Semantic similarity between old and new items produces false alarms in recognition memory. Psychol Res 79(5):785–794

    PubMed  Google Scholar 

  • Montefinese M, Vinson D, Ambrosini E (2018) Recognition memory and featural similarity between concepts: the pupil’s point of view. Biol Psychol 135:159–169

    PubMed  Google Scholar 

  • Moss HE, Tyler LK, Jennings F (1997) When leopards lose their spots: knowledge of visual properties in category-specific deficits for living things. Cognit Neuropsychol 14(6):901–950

    Google Scholar 

  • Moss HE, Tyler LK, Taylor KI (2007) Conceptual structure. In: Gaskell MG (ed) The Oxford handbook of psycholinguistics. Oxford University Press, Oxford, UK, pp 217–234

    Google Scholar 

  • Nelson DL, McEvoy CL, Schreiber TA (2004) The University of South Florida free association, rhyme, and word fragment norms. Behav Res Methods Instrum Comput 36(3):402–407

    PubMed  Google Scholar 

  • Peters R, Borovsky A (2019) Modeling early lexico-semantic network development: perceptual features matter most. J Exp Psychol Gen 148(4):763

    PubMed  PubMed Central  Google Scholar 

  • Pexman PM, Lupker SJ, Hino Y (2002) The impact of feedback semantics in visual word recognition: number-of-features effects in lexical decision and naming tasks. Psychon Bull Rev 9(3):542–549. https://doi.org/10.3758/BF03196311

    Article  PubMed  Google Scholar 

  • Pexman PM, Holyk GG, Monfils M-H (2003) Number-of-features effects and semantic processing. Mem Cognit 31(6):842–855

    PubMed  Google Scholar 

  • Pexman PM, Hargreaves IS, Siakaluk PD, Bodner GE, Pope J (2008) There are many ways to be rich: effects of three measures of semantic richness on visual word recognition. Psychon Bull Rev 15(1):161–167

    PubMed  Google Scholar 

  • Pexman PM, Heard A, Lloyd E, Yap MJ (2017) The Calgary semantic decision project: concrete/abstract decision data for 10,000 English words. Behav Res Methods 49(2):407–417. https://doi.org/10.3758/s13428-016-0720-6

    Article  PubMed  Google Scholar 

  • Plaut DC, Shallice T (1993) Deep dyslexia: a case study of connectionist neuropsychology. Cognit Neuropsychol 10(5):377–500. https://doi.org/10.1080/02643299308253469

    Article  Google Scholar 

  • Randall B, Moss HE, Rodd JM, Greer M, Tyler LK (2004) Distinctiveness and correlation in conceptual structure: behavioral and computational studies. J Exp Psychol Learn Mem Cogn 30(2):393

    PubMed  Google Scholar 

  • Ratcliff R, Gomez P, McKoon G (2004) A diffusion model account of the lexical decision task. Psychol Rev 111(1):159–182. https://doi.org/10.1037/0033-295X.111.1.159

    Article  PubMed  PubMed Central  Google Scholar 

  • Recchia G, Jones M (2012) The semantic richness of abstract concepts. Front Hum Neurosci 6:315

    PubMed  PubMed Central  Google Scholar 

  • Rips LJ, Shoben EJ, Smith EE (1973) Semantic distance and the verification of semantic relations. J Verbal Learn Verbal Behav 12(1):1–20

    Google Scholar 

  • Rodd J, Gaskell G, Marslen-Wilson W (2002) Making sense of semantic ambiguity: semantic competition in lexical access. J Mem Lang 46(2):245–266. https://doi.org/10.1006/jmla.2001.2810

    Article  Google Scholar 

  • Siew CSQ (2019) spreadr: An R package to simulate spreading activation in a network. Behav Res Methods 51(2):910–929

    PubMed  PubMed Central  Google Scholar 

  • Siew CSQ, Vitevitch MS (2019) The phonographic language network: using network science to investigate the phonological and orthographic similarity structure of language. J Exp Psychol Gen 148(3):475–500. https://doi.org/10.1037/xge0000575

    Article  PubMed  Google Scholar 

  • Siew CSQ, Wulff DU, Beckage NM, Kenett YN (2019) Cognitive network science: a review of research on cognition through the lens of network representations, processes, and dynamics. Complexity, 2019

  • Sizemore AE, Karuza EA, Giusti C, Bassett DS (2018) Knowledge gaps in the early growth of semantic feature networks. Nat Hum Behav 2(9):682

    PubMed  PubMed Central  Google Scholar 

  • Sloutsky VM, Fisher AV (2004) Induction and categorization in young children: a similarity-based model. J Exp Psychol Gen 133(2):166–188. https://doi.org/10.1037/0096-3445.133.2.166

    Article  PubMed  Google Scholar 

  • Smith LB, Jones SS, Landau B, Gershkoff-Stowe L, Samuelson L (2002) Object name learning provides on-the-job training for attention. Psychol Sci 13(1):13–19

    PubMed  Google Scholar 

  • Steyvers M, Tenenbaum JB (2005) The large-scale structure of semantic networks: statistical analyses and a model of semantic growth. Cognit Sci 29(1):41–78. https://doi.org/10.1207/s15516709cog2901_3

    Article  Google Scholar 

  • Taylor KI, Salamoura A, Randall B, Moss H, Tyler LK (2008) Clarifying the nature of the distinctiveness by domain interaction in conceptual structure: comment on Cree, McNorgan, and McRae (2006). J Exp Psychol Learn Mem Cogn 34:719–725

    PubMed  Google Scholar 

  • Taylor KI, Devereux BJ, Tyler LK (2011) Conceptual structure: towards an integrated neurocognitive account. Lang Cognit Process 26(9):1368–1401

    CAS  Google Scholar 

  • Taylor KI, Devereux BJ, Acres K, Randall B, Tyler LK (2012) Contrasting effects of feature-based statistics on the categorisation and basic-level identification of visual objects. Cognition 122(3):363–374

    PubMed  Google Scholar 

  • Tousignant C, Pexman PM (2012) Flexible recruitment of semantic richness: context modulates body-object interaction effects in lexical-semantic processing. Front Hum Neurosci 6:53

    PubMed  PubMed Central  Google Scholar 

  • Tyler LK, Moss HE (2001) Towards a distributed account of conceptual knowledge. Trends in Cognitive Sciences 5(6):244–252

    PubMed  Google Scholar 

  • Tyler LK, Moss HE, Durrant-Peatfield MR, Levy JP (2000) Conceptual structure and the structure of concepts: a distributed account of category-specific deficits. Brain Lang 75(2):195–231

    CAS  PubMed  Google Scholar 

  • Vinson DP, Vigliocco G (2008) Semantic feature production norms for a large set of objects and events. Behav Res Methods 40(1):183–190

    PubMed  Google Scholar 

  • Yap MJ, Tan SE, Pexman PM, Hargreaves IS (2011) Is more always better? Effects of semantic richness on lexical decision, speeded pronunciation, and semantic classification. Psychon Bull Rev 18(4):742–750

    PubMed  Google Scholar 

  • Zdrazilova L, Pexman PM (2013) Grasping the invisible: semantic processing of abstract words. Psychon Bull Rev 20(6):1312–1318

    PubMed  Google Scholar 

Download references

Acknowledgements

The author thanks Nichol Castro and Li Ying for providing useful comments on earlier drafts of this manuscript. Data and analysis scripts are available on the Open Science Framework: https://osf.io/x87tr/

Funding

No funding to declare.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cynthia S. Q. Siew.

Ethics declarations

Conflict of interest

The author declares that she has no conflict of interest.

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Handling editor: Barry Devereux (Queen’s University Belfast); Reviewers: Blair Armstrong (University of Toronto), Gabriel Recchia (University of Cambridge).

This article is part of the special topic ‘Eliciting Semantic Properties: Methods and Applications’ guest-edited by Enrico Canessa, Sergio Chaigneau, Barry Devereux, and Alessandro Lenci.

Appendices

Appendix A

Correlations between predictors and outcome variables in the regression models.

See Tables 6, 7, and 8.

Table 6 Age-of-acquisition norms (N = 4013)
Table 7 Visual lexical decision (English Lexicon Project) (N = 4013)
Table 8 Semantic decision (Calgary Semantic Decision Project) (N = 1336)

Appendix B

Computation of Jaccard and Manhattan distance measures

Two different distance measures were computed to quantify the dissimilarity of any given pair of probability distributions: Jaccard distance and Manhattan distance. The first measure, Jaccard distance, is an example of a measure from the inner-product family of distance measures that emphasizes shared information. Jaccard distance computes the distance between two words as subtracting the intersection of the two words’ feature set distributions (i.e., a vector that represents the proportions of participants reporting each feature for a given word) over their union from 1. The second measure, Manhattan distance, is an example of a measure from the Minkowski family of distance measures that generally treats distance as the straight line between two points in Euclidean space. Manhattan distance computes the distance between two words as the sum of absolute differences between the proportions of participants reporting each feature. Mathematically, these measures were computed as follows:

Jaccard distance, dj:

$$d_{\text{j}} = 1 - \frac{{\sum \left( {P_{i} \times Q_{i} } \right)}}{{\sum P_{i}^{2} + \sum Q_{i}^{2} - \sum \left( {P_{i} \times Q_{i} } \right)}}$$

Manhattan distance, dm:

$$d_{\text{m}} = \sum \left| {P_{i} - Q_{i} } \right|$$

where Pi = [x1, x2, …, xi] is a vector representing the proportions of participants reporting each feature for a word 1 and Qi = [x1, x2, …, xi] is a vector representing the proportions of participants reporting each feature for a word 2. i = the number of (unique) features in the feature production norms.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Siew, C.S.Q. Feature distinctiveness effects in language acquisition and lexical processing: Insights from megastudies. Cogn Process 21, 669–685 (2020). https://doi.org/10.1007/s10339-019-00947-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10339-019-00947-6

Keywords