Quantifying the dynamics of failure across science, startups and security

Yin, Yian; Wang, Yang; Evans, James A.; Wang, Dashun

doi:10.1038/s41586-019-1725-y

Article
Published: 30 October 2019

Quantifying the dynamics of failure across science, startups and security

Yian Yin^1,2,3,
Yang Wang^1,2,4,
James A. Evans^5,6 &
â¦
Dashun Wang^1,2,3,4Â

Nature volumeÂ 575,Â pages 190â194 (2019)Cite this article

29k Accesses
43 Citations
940 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 05 June 2020

This article has been updated

Abstract

Human achievements are often preceded by repeated attempts that fail, but little is known about the mechanisms that govern the dynamics of failure. Here, building on previous research relating to innovation^{1,2,3,4,5,6,7}, human dynamics^8,9,10,11 and learning^{12,13,14,15,16,17}, we develop a simple one-parameter model that mimics how successful future attempts build on past efforts. Solving this model analytically suggests that a phase transition separates the dynamics of failure into regions of progression or stagnation and predicts that, near the critical threshold, agents who share similar characteristics and learning strategies may experience fundamentally different outcomes following failures. Above the critical point, agents exploit incremental refinements to systematically advance towards success, whereas below it, they explore disjoint opportunities without a pattern of improvement. The model makes several empirically testable predictions, demonstrating that those who eventually succeed and those who do not may initially appear similar, but can be characterized by fundamentally distinct failure dynamics in terms of the efficiency and quality associated with each subsequent attempt. We collected large-scale data from three disparate domains and traced repeated attempts by investigators to obtain National Institutes of Health (NIH) grants to fund their research, innovators to successfully exit their startup ventures, and terrorist organizations to claim casualties in violent attacks. We find broadly consistent empirical support across all three domains, which systematically verifies each prediction of our model. Together, our findings unveil detectable yet previously unknown early signals that enable us to identify failure dynamics that will lead to ultimateÂ success or failure. Given the ubiquitous nature of failure and the paucity of quantitative approaches to understand it, these results represent an initial step towards the deeper understanding of the complex dynamics underlying failure.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Mechanisms of chance and learning.**

Competition for priority harms the reliability of science, but reforms can help

Article 28 January 2021

Papers and patents are becoming less disruptive over time

Article 04 January 2023

The natural selection of good science

Article 17 May 2021

Data availability

This paper makes use of restricted access data from the National Institutes of Health (NIH), protected by the Privacy Act of 1974 as amended (5 U.S.C. 552a). Deidentified data necessary to reproduce all plots and statistical analyses are freely available at https://yian-yin.github.io/quantifyFailure. Those wishing to access the raw data can apply for access following the procedures outlined in the NIH Data Access Policy document (http://report.nih.gov/pdf/DataAccessPolicy.pdf). The VentureXpert database is available from Thomson Reuters. The Global Terrorism Database is publicly available at https://www.start.umd.edu/gtd/.

Code availability

Code is available at https://yian-yin.github.io/quantifyFailure.

Change history

05 June 2020
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Fortunato, S. et al. Science of science. Science 359, eaao0185 (2018).
ArticleÂ PubMedÂ PubMed CentralÂ CASÂ Google ScholarÂ
Harford, T. Adapt: Why Success Always Starts with Failure (Farrar, Straus and Giroux, 2011).
Wuchty, S., Jones, B. F. & Uzzi, B. The increasing dominance of teams in production of knowledge. Science 316, 1036â1039 (2007).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
Jones, B. F. The burden of knowledge and the âdeath of the renaissance manâ: is innovation getting harder? Rev. Econ. Stud. 76, 283â317 (2009).
ArticleÂ MATHÂ Google ScholarÂ
Sinatra, R., Wang, D., Deville, P., Song, C. & BarabÃ¡si, A.-L. Quantifying the evolution of individual scientific impact. Science 354, aaf5239 (2016).
ArticleÂ PubMedÂ CASÂ Google ScholarÂ
Liu, L. et al. Hot streaks in artistic, cultural, and scientific careers. Nature 559, 396â399 (2018).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
Hu, Y., Havlin, S. & Makse, H. A. Conditions for viral influence spreading through multiplex correlated social networks. Phys. Rev. X 4, 021031 (2014).
Google ScholarÂ
BarabÃ¡si, A.-L. The origin of bursts and heavy tails in human dynamics. Nature 435, 207â211 (2005).
ArticleÂ ADSÂ PubMedÂ CASÂ Google ScholarÂ
GonzÃ¡lez, M. C., Hidalgo, C. A. & BarabÃ¡si, A.-L. Understanding individual human mobility patterns. Nature 453, 779â782 (2008).
ArticleÂ ADSÂ PubMedÂ CASÂ Google ScholarÂ
Castellano, C., Fortunato, S. & Loreto, V. Statistical physics of social dynamics. Rev. Mod. Phys. 81, 591â646 (2009).
ArticleÂ ADSÂ Google ScholarÂ
Malmgren, R. D., Stouffer, D. B., Campanharo, A. S. & Amaral, L. A. N. On universality in human correspondence activity. Science 325, 1696â1700 (2009).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
Argote, L. Organizational Learning: Creating, Retaining and Transferring Knowledge (Springer Science & Business Media, 2012).
Sitkin, S. B. Learning through failure: the strategy of small losses. Res. Organ. Behav. 14, 231â266 (1992).
Google ScholarÂ
Yelle, L. E. The learning curve: historical review and comprehensive survey. Decis. Sci. 10, 302â328 (1979).
ArticleÂ Google ScholarÂ
Dutton, J. M. & Thomas, A. Treating progress functions as a managerial opportunity. Acad. Manage. Rev. 9, 235â247 (1984).
ArticleÂ Google ScholarÂ
Huber, G. P. Organizational learning: the contributing processes and the literatures. Organ. Sci. 2, 88â115 (1991).
ArticleÂ Google ScholarÂ
Cannon, M. D. & Edmondson, A. C. Failing to learn and learning to fail (intelligently): how great organizations put failure to work to innovate and improve. Long Range Plann. 38, 299â319 (2005).
ArticleÂ Google ScholarÂ
Kaplan, S. N. & Lerner, J. in Measuring Entrepreneurial Businesses: Current Knowledge and Challenges (Univ. Chicago Press, 2016).
Eggers, J. P. & Song, L. Dealing with failure: serial entrepreneurs and the costs of changing industries between ventures. Acad. Manage. J. 58, 1785â1803 (2015).
ArticleÂ Google ScholarÂ
National Consortium for the Study of Terrorism and Responses to Terrorism. Global Terrorism Database (GTD) https://www.start.umd.edu/research-projects/global-terrorism-database-gtd (2018).
Clauset, A. & Gleditsch, K. S. The developmental dynamics of terrorist organizations. PLoS ONE 7, e48633 (2012).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Johnson, N. et al. Pattern in escalations in insurgent and terrorist activity. Science 333, 81â84 (2011).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
Newell, A. & Rosenbloom, P. S. in Cognitive Skills and their Acquisition 1 (ed. Anderson, J. R.) 1â55 (Erlbaum, 1981).
Anderson, J. R. Acquisition of cognitive skill. Psychol. Rev. 89, 369â406 (1982).
ArticleÂ Google ScholarÂ
Muth, J. F. Search theory and the manufacturing progress function. Manage. Sci. 32, 948â962 (1986).
ArticleÂ Google ScholarÂ
Wright, T. P. Factors affecting the cost of airplanes. J. Aeronaut. Sci. 3, 122â128 (1936).
ArticleÂ Google ScholarÂ
March, J. G. Exploration and exploitation in organizational learning. Organ. Sci. 2, 71â87 (1991).
ArticleÂ ADSÂ Google ScholarÂ
Foster, J. G., Rzhetsky, A. & Evans, J. A. Tradition and innovation in scientistsâ research strategies. Am. Sociol. Rev. 80, 875â908 (2015).
ArticleÂ Google ScholarÂ
Arbesman, S. The Half-life of Facts: Why Everything We Know Has an Expiration Date (Penguin, 2013).
Madsen, P. M. & Desai, V. Failing to learn? The effects of failure and success on organizational learning in the global orbital launch vehicle industry. Acad. Manage. J. 53, 451â476 (2010).
ArticleÂ Google ScholarÂ
Argote, L., Beckman, S. L. & Epple, D. The persistence and transfer of learning in industrial settings. Manage. Sci. 36, 140â154 (1990).
ArticleÂ Google ScholarÂ
Kuhn, T. S. The Structure of Scientific Revolutions (Chicago Univ. Press, 2012).
Merton, R. K. Singletons and multiples in scientific discovery: a chapter in the sociology of science. Proc. Am. Phil. Soc. 105, 470â486 (1961).
Google ScholarÂ
Gompers, P., Kovner, A., Lerner, J. & Scharfstein, D. Performance persistence in entrepreneurship. J. Financ. Econ. 96, 18â32 (2010).
ArticleÂ Google ScholarÂ
de Holan, P. M. & Phillips, N. Remembrance of things past? the dynamics of organizational forgetting. Manage. Sci. 50, 1603â1613 (2004).
ArticleÂ Google ScholarÂ
Schelling, T. C. Micromotives and Macrobehavior (WW Norton & Company, 2006).
Watts, D. J. A simple model of global cascades on random networks. Proc. Natl Acad. Sci. USA 99, 5766â5771 (2002).
ArticleÂ ADSÂ MathSciNetÂ CASÂ PubMedÂ MATHÂ Google ScholarÂ
Holme, P. & Newman, M. E. Nonequilibrium phase transition in the coevolution of networks and opinions. Phys. Rev. E 74, 056108 (2006).
ArticleÂ ADSÂ CASÂ Google ScholarÂ
Ginther, D. K. et al. Race, ethnicity, and NIH research awards. Science 333, 1015â1019 (2011).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Boudreau, K. J., Guinan, E. C., Lakhani, K. R. & Riedl, C. Looking across and looking beyond the knowledge frontier: intellectual distance, novelty, and resource allocation in science. Manage. Sci. 62, 2765â2783 (2016).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Bromham, L., Dinnage, R. & Hua, X. Interdisciplinary research has consistently lower funding success. Nature 534, 684â687 (2016).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
Banal-Estanol, A., Macho-Stadler, I. & PÃ©rez Castrillo, D. Key Success Drivers in Public Research Grants: Funding the Seeds of Radical Innovation in Academia? CESifo Working Paper Series 5852 (CESifo, 2016).
Ma, A., MondragÃ³n, R. J. & Latora, V. Anatomy of funded research in science. Proc. Natl Acad. Sci. USA 112, 14760â14765 (2015).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
Levitt, B. & March, J. G. Organizational learning. Annu. Rev. Sociol. 14, 319â338 (1988).
ArticleÂ Google ScholarÂ
Argote, L. & Epple, D. Learning curves in manufacturing. Science 247, 920â924 (1990).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
Merton, R. K. et al. The Matthew effect in science. Science 159, 56â63 (1968).
ArticleÂ ADSÂ PubMedÂ CASÂ Google ScholarÂ
Huang, J., Ertekin, S. & Giles, C. L. Efficient name disambiguation for large-scale databases. In European Conference on Principles of Data Mining and Knowledge Discovery 536â544 (Springer, 2006).
Shen, H. Inequality quantified: Mind the gender gap. Nature 495, 22â24 (2013).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
LariviÃ¨re, V., Ni, C., Gingras, Y., Cronin, B. & Sugimoto, C. R. Bibliometrics: global gender disparities in science. Nature 504, 211â213 (2013).
ArticleÂ PubMedÂ Google ScholarÂ
Yang, T. & Aldrich, H. E. Whoâs the boss? Explaining gender inequality in entrepreneurial teams. Am. Sociol. Rev. 79, 303â327 (2014).
ArticleÂ Google ScholarÂ
Argote, L., Insko, C. A., Yovetich, N. & Romero, A. A. Group learning curves: the effects of turnover and task complexity on group performance. J. Appl. Soc. Psychol. 25, 512â529 (1995).
ArticleÂ Google ScholarÂ
Bailey, C. D. Forgetting and the learning curve: a laboratory study. Manage. Sci. 35, 340â352 (1989).
ArticleÂ Google ScholarÂ

Download references

Acknowledgements

We thank C. Song, A. Clauset, B. Uzzi, B. Jones, E. Finkel, J. Van Mieghem, A. Bassamboo and Y. Xie for helpful discussions, and H. Sauermann and S. Havlin for suggesting extensions of the model, leading us to discover the kâÎ± and kâÎ±Â âÎ´ models. This work is supported by the Air Force Office of Scientific Research under award number FA9550-15-1-0162, FA9550-17-1-0089 and FA9550-19-1-0354, National Science Foundation grant SBE 1829344, the Alfred P. Sloan Foundation G-2019-12485, and Northwestern University Data Science Initiative. This work does not reflect the position of NIH.

Author information

Authors and Affiliations

Center for Science of Science and Innovation, Northwestern University, Evanston, IL, USA
Yian Yin,Â Yang WangÂ &Â Dashun Wang
Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
Yian Yin,Â Yang WangÂ &Â Dashun Wang
McCormick School of Engineering, Northwestern University, Evanston, IL, USA
Yian YinÂ &Â Dashun Wang
Kellogg School of Management, Northwestern University, Evanston, IL, USA
Yang WangÂ &Â Dashun Wang
Department of Sociology, University of Chicago, Chicago, IL, USA
James A. Evans
Santa Fe Institute, Santa Fe, NM, USA
James A. Evans

Authors

Yian Yin
View author publications
You can also search for this author in PubMedÂ Google Scholar
Yang Wang
View author publications
You can also search for this author in PubMedÂ Google Scholar
James A. Evans
View author publications
You can also search for this author in PubMedÂ Google Scholar
Dashun Wang
View author publications
You can also search for this author in PubMedÂ Google Scholar

Contributions

D.W. conceived the project and designed the experiments; Y.Y. and Y.W. collected data and performed empirical analyses with help from D.W. and J.A.E.; Y.Y. and D.W. carried out theoretical calculations; all authors collaboratively designed the modelÂ and interpreted results; D.W. and Y.Y. wrote the manuscript; all authors edited the manuscript.

Corresponding author

Correspondence to Dashun Wang.

Ethics declarations

Competing interests

Y.W. and D.W. serve as special volunteers (unpaid) to the NIH. The remaining authors declare no competing interests.

Additional information

Peer review information Nature thanks Shlomo Havlin and Henry Sauermann for their contribution to the peer review of this work.

Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 The k model.

aâf, Simulation results from the model (Î±Â =Â 0.6) for the cases of kÂ =Â 0 (a, d) and kÂ âÂ â (b, e) in terms of the average quality (aâc) and efficiency (dâf) of each attempt. kÂ =Â 0 recovers the chance model, predicting a constant quality (c) and efficiency (f). kÂ âÂ â predicts temporal scaling that characterizes the dynamics of failure (e) with improved quality (b), recovering predictions from learning curves and Wrightâs law. gâj, Illustration of mapping between failure dynamics (g, h) and canonical ensembles (i, j). The canonical system is characterized by three different states a, b, c with corresponding energy densities E_a(h), E_b(h), E_c(h). Here we assume E_a(h)Â =Â (2ÎµhÂ âÂ 1)², E_b(h)Â =Â (2hÂ âÂ 1)² and E_c(h)Â =Â [2Îµ(1Â âÂ h)Â âÂ 1]² where ÎµÂ âÂ 0⁺. The introduction of Îµ is to distinguish state a from state c, both of which can be approximated in the limiting condition E_a(h)Â =Â E_c(h)Â =Â 0. We map fÂ âÂ (2ÎÂ âÂ 1)², NÂ âÂ ln[n], hÂ âÂ K and E_i(h)Â =Â [2Î_i(K)Â âÂ 1]². In this case, the two transition points k* and k*Â +Â 1 correspond to hÂ =Â 0 and 1 in the canonical ensemble systems.

Extended Data Fig. 2 Predicting temporal dynamics in science, entrepreneurship and security.

aâc, We compare the goodness of fit for three different models in temporal dynamics in NIH grants (a, nÂ =Â 10345), startups (b, nÂ =Â 275) and terrorist attacks (c, nÂ =Â 136). For each individual sample, we take all but the last inter-event time for model fitting (nÂ =Â 1, â¦, NÂ âÂ 1), comparing model predictions for the last inter-event time. The tested functional forms are power law, t_nÂ =Â an^b; exponential, t_nÂ =Â ab^ân; and linear, t_nÂ =Â aÂ +Â bn. We then calculate the frequency that each model reaches minimum error, defined as \(|\,\log ({t}_{N})-\,\log ({\hat{t}}_{N})|\), among all three forms. The power-law model offers consistently better predictions. dâf, As in aâc, but using \(|{t}_{N}-{\hat{t}}_{N}|\) as the loss function.

Extended Data Fig. 3 Predicting ultimate success in science, entrepreneurship and security.

aâc, Area under the receiver operating characteristic curveÂ (AUC) of the prediction task. We apply two logistic regression models (Supplementary InformationÂ 6.1) to predict ultimate success in NIH grants (a), startups (b) and terrorist attacks (c). The centres and error bars of AUC scores denote the meanÂ Â±Â s.e.m. calculated from tenfold cross-validation over 50 randomized iterations (green, model 1; red, model 2). d, e, As in a but predicting ultimate success in NIH grants for male (d) and female (e) investigators.

Extended Data Fig. 4 Model validations.

a, b, An illustration of the component dynamics. We extract all MeSH terms associated with the nth attempt, S_n, and calculate the number of new terms m_n, defined as \(|{S}_{n}-({S}_{n-1}\cup \cdots \cup {S}_{n-k})|\). b, Testing component dynamics in NIH grant applications. We calculate the dynamics of M_nÂ =Â ãm_nã/ãm₁ã using different k and compare it with T_n. The centres and error bars of M_n show the meanÂ Â±Â s.e.m. (nÂ =Â 5,899) for different k. The shaded area shows meanÂ Â±Â s.e.m. of T_n (log scale) measured on the same subset. All kÂ >Â 3 lead to similar trends between M_n and T_n. câe, Length of failure streak after randomization in science (c), entrepreneurship (d) and security (e). We take the samples used in Fig. 1 and shuffle the success/failure label from each attempt. This operation keeps both the overall success rate and the total number of attempts for each individual constant. fâh, Temporal scaling patterns within the successful group in science (f), entrepreneurship (g) and security (h). We separated the successful group into two subgroups (narrow winners and clear winners) based on eventual performance (0.9 in evaluation score for D₁, 0.5 in investment amount for D₂ and 1 in wounded individuals for D₃). The shaded area shows meanÂ Â±Â s.e.m. of T_n (log scale).

Extended Data Fig. 5 Robustness check on definition of unsuccessful group.

aâl, Robustness check as we change the threshold of inactivity to 3Â years. aâc, Failure streak in science (a), entrepreneurship (b) and security (c). Blue circles represent real data from the successful group and dashed lines represent fitted Weibull distributions. dâf, Temporal scaling patterns in science (d), entrepreneurship (e) and security (f). The shaded area shows meanÂ Â±Â s.e.m. of T_n (log scale). gâi, Performance dynamics in science (g, nÂ =Â 641, 231, 578, 190, from left to right), entrepreneurship (h, nÂ =Â 248, 1,332, 237, 1,312 from left to right) and security (i, nÂ =Â 238, 198, 236, 199, from left to right). The successful and unsuccessful groups that experienced a large number of consecutive failures before the last attempt (at least 5 for D₁, 3 for D₂ and 2 for D₃) appear indistinguishable for first failures (two-sided Welchâs t-test; PÂ =Â 0.566, 0.671 and 0.349), but quickly diverge for second failures (two-sided Welchâs t-test; PÂ =Â 2.09Â ÃÂ 10^â2, 4.95Â ÃÂ 10^â3 and 7.77Â ÃÂ 10^â2). The successful group also shows significant improvement in performance (one-sided Welchâs t-test; PÂ =Â 7.03Â ÃÂ 10^â2, 2.37Â ÃÂ 10^â2 and 2.32Â ÃÂ 10^â2), which is absent for the unsuccessful group (one-sided Welchâs t-test; PÂ =Â 0.717, 0.176 and 0.786). Data are meanÂ Â±Â s.e.m. jâl, AUC score of predicting ultimate success in science (j), entrepreneurship (k) and security (l). The centres and error bars of AUC scores denote the meanÂ Â±Â s.e.m calculated from tenfold cross-validation over 50 randomized iterations. mâx, As in aâl but using 7Â years as the threshold of inactivity. Sample sizes are s: nÂ =Â 620, 101, 559, 76; t: nÂ =Â 248, 977, 237, 989; u: nÂ =Â 216, 152, 214, 153. P values in sâu (from bottom to top) are PÂ =Â 0.883 (s), 0.671 (t), 0.456 (u); PÂ =Â 2.25Â ÃÂ 10^â2 (s), 1.38Â ÃÂ 10^â3 (t), 8.34Â ÃÂ 10^â2 (u); PÂ =Â 4.59Â ÃÂ 10^â2 (s), 2.37Â ÃÂ 10^â2 (t), 3.33Â ÃÂ 10^â2 (u); PÂ =Â 0.838 (s), 0.446 (t), 0.775 (u). *PÂ <Â 0.1, **PÂ <Â 0.05, ***PÂ <Â 0.01, NS, not significant (PÂ â¥Â 0.1).

Extended Data Fig. 6 Robustness check on D₁.

aâc, Failure streak as we change the score threshold to 55 (a), exclude revisions as successes (b) and only focus on new principal investigators without previous R01 grants (c). Blue circles represent real data from successful groups and dashed lines represent fitted Weibull distributions. dâf, Temporal scaling patterns as we change the score threshold to 55 (d), exclude revisions as successes (e) and only focus on new principal investigators without previous R01 grants (f). The shaded area shows meanÂ Â±Â s.e.m. of T_n (log scale). gâi, Performance dynamics as we change the score threshold to 55 (g, nÂ =Â 768, 189, 686, 170, from left to right), exclude revisions as successes (h, nÂ =Â 252, 145, 216, 123, from left to right) and only focus on new principal investigators without previous R01 grants (i, nÂ =Â 1,164, 308, 1,530, 334, from left to right). The successful and unsuccessful groups that experienced a large number of consecutive failures before their last attempt (at least 5 for g and h, and 3 for i) appearÂ indistinguishable for first failures (two-sided Welchâs t-test; PÂ =Â 0.242, 0.819, 0.289) but quickly diverge for second failures (two-sided Welchâs t-test; PÂ =Â 3.40Â ÃÂ 10^â4, 3.40Â ÃÂ 10^â2, 9.70Â ÃÂ 10^â7). The successful group also shows a significant improvement in performance (one-sided Welchâs t-test; PÂ =Â 4.23Â ÃÂ 10^â2, 3.04Â ÃÂ 10^â2, 1.92Â ÃÂ 10^â4), which is absent for the unsuccessful group (one-sided Welchâs t-test; PÂ =Â 0.863, 0.754, 0.997). Data are meanÂ Â±Â s.e.m. jâl, AUC score of predicting ultimate success as we change the score threshold to 55 (j), exclude revisions as successes (k) and only focus on new principal investigators without previous R01 grants (l). The centres and error bars of AUC scores denote the meanÂ Â±Â s.e.m calculated from tenfold cross-validation over 50 randomized iterations. *PÂ <Â 0.1, **PÂ <Â 0.05, ***PÂ <Â 0.01, NS, PÂ â¥Â 0.1.

Extended Data Fig. 7 Robustness check on D₂.

aâc, Failure streak as we change the threshold of high-value mergers and acquisitions (M&A) to 5% (a), exclude M&As as successes (b) and classify unicorns as successes (c). Blue circles represent real data from successful groups and dashed lines represent fitted Weibull distributions. dâf, Temporal scaling patterns as we change the threshold of high-value M&A to 5% (d), exclude M&As as successes (e) and include unicorns as successes (f). The shaded area shows meanÂ Â±Â s.e.m. of T_n (log scale). gâi, Performance dynamics as we change the threshold of high-value M&A to 5% (g, nÂ =Â 251, 1,304, 243, 1,284, from left to right), exclude M&As as successes (h, nÂ =Â 248, 1,335, 237, 1,315, from left to right) and include unicorns as successes (i, nÂ =Â 257, 1,330, 244, 1,311, from left to right). The successful and unsuccessful groups that experienced a large number of consecutive failures before their last attempt (at least 3) appear indistinguishable for first failures (two-sided Welchâs t-test; PÂ =Â 0.937, 0.647, 0.620) but quickly diverge for second failures (two-sided Welchâs t-test; PÂ =Â 9.92Â ÃÂ 10^â3, 4.94Â ÃÂ 10^â3, 6.33Â ÃÂ 10^â3). The successful group also shows a significant improvement in performance (one-sided Welchâs t-test; PÂ =Â 2.16Â ÃÂ 10^â2, 2.37Â ÃÂ 10^â2, 2.77Â ÃÂ 10^â2), which is absent for the unsuccessful group (one-sided Welchâs t-test; PÂ =Â 0.224, 0.158, 0.167). Data are meanÂ Â±Â s.e.m. jâl, AUC score forÂ predicting ultimate success as we change threshold of high-value M&A to 5% (j), exclude M&As as successes (k) and include unicorns as successes (l). The centres and error bars of AUC scores denote the meanÂ Â±Â s.e.m calculated from tenfold cross-validation over 50 randomized iterations. *PÂ <Â 0.1, **PÂ <Â 0.05, ***PÂ <Â 0.01, NS, PÂ â¥Â 0.1.

Extended Data Fig. 8 Robustness check on D₃.

aâc, Failure streak as we focus on all samples (a), samples of human-targeted attacks (b) and include vague data on fatalities (c). Blue circles represent real data from successful groups and dashed lines represent fitted Weibull distributions. dâf, Temporal scaling patterns as we focus on all samples (d), samples of human-targeted attacks (e) and include vague data on fatalities (f). The shaded area shows meanÂ Â±Â s.e.m. of T_n (log scale). gâi, Performance dynamics as we focus on all samples (g, nÂ =Â 231, 231, 229, 232, from left to right), samples of human-targeted attacks (h, nÂ =Â 176, 173, 173, 174, from left to right) and include vague data on fatalities (i, nÂ =Â 227, 147, 225, 148, from left to right). The successful and unsuccessful groups that experienced a large number of consecutive failures before their last attempt (at least 2) appearÂ indistinguishable for first failures (two-sided Welchâs t-test; PÂ =Â 0.400, 0.859, 0.395), but quickly diverge for second failures (two-sided Welchâs t-test; PÂ =Â 2.08Â ÃÂ 10^â3, 6.70Â ÃÂ 10^â3, 3.76Â ÃÂ 10^â3). The successful group also shows a significant improvement in performance (one-sided Welchâs t-test; PÂ =Â 2.55Â ÃÂ 10^â2, 5.65Â ÃÂ 10^â2, 3.77Â ÃÂ 10^â2), which is absent for the unsuccessful group (one-sided Welchâs t-test; PÂ =Â 0.970, 0.901, 0.967). Data are meanÂ Â±Â s.e.m. jâl, AUC score of predicting ultimate success as we focus on all samples (j), samples of human-targeted attacks (k) and include vague data on fatalities (l). The centres and error bars of AUC scores denote the meanÂ Â±Â s.e.m calculated from tenfold cross-validation over 50 randomized iterations. mâo, Temporal scaling patterns as we change the threshold for the successful group to fatal attacks that killed at least 5 (m), 10 (n) and 100 (o) people. *PÂ <Â 0.1, **PÂ <Â 0.05, ***PÂ <Â 0.01, NS, PÂ â¥Â 0.1.

Extended Data Fig. 9 Additional robustness checks.

aâi, Robustness check as we control for temporal variation. aâc, Failure streak in science (a), entrepreneurship (b) and security (c). Blue circles represent real data of successful groups and dashed lines represent fitted Weibull distributions. dâf, Temporal scaling patterns in science (d), entrepreneurship (e) and security (f). The shaded area shows meanÂ Â±Â s.e.m. of T_n (log scale). gâi, Performance dynamics in science (g, nÂ =Â 628, 145, 571, 123, from left to right), entrepreneurship (h, nÂ =Â 248, 1,332, 237, 1,312, from left to right) and security (i, nÂ =Â 231, 173, 229, 174, from left to right). The successful and unsuccessful groups that experienced a large number of consecutive failures before their last attempt (at least 5 for D₁, 3 for D₂ and 2 for D₃) appear indistinguishable for first failures (two-sided weighted Welchâs t-test; PÂ =Â 0.814, 0.728, 0.330) but quickly diverge for second failures (two-sided weighted Welchâs t-test; PÂ =Â 1.80Â ÃÂ 10^â2, 3.10Â ÃÂ 10^â2, 4.56Â ÃÂ 10^â2). The successful group also shows significant improvement in performance (one-sided weighted Welchâs t-test; PÂ =Â 2.10Â ÃÂ 10^â2, 1.92Â ÃÂ 10^â2, 4.53Â ÃÂ 10^â2), which is absent for the unsuccessful group (one-sided weighted Welchâs t-test; PÂ =Â 0.755, 0.175, 0.903). Data are meanÂ Â±Â s.e.m. jâl, Performance dynamics as we compare first and halfway attempts in science (j, nÂ =Â 628, 145, 582, 111, from left to right), entrepreneurship (k, nÂ =Â 248, 1,332, 240, 1,294, from left to right) and security (l, nÂ =Â 231, 173, 228, 175, from left to right). The successful and unsuccessful groups that experienced a large number of consecutive failures before their last attempt (at least 5 for D₁, 3 for D₂ and 2 for D₃) appear indistinguishable for first failures (two-sided Welchâs t-test; PÂ =Â 0.898, 0.671, 0.289) but diverge for halfway failures (two-sided Welchâs t-test; PÂ =Â 2.18Â ÃÂ 10^â5, 1.34Â ÃÂ 10^â2, 1.34Â ÃÂ 10^â2). The successful group also shows significant improvement in performance (one-sided Welchâs t-test; PÂ =Â 2.35Â ÃÂ 10^â2, 4.54Â ÃÂ 10^â2, 3.69Â ÃÂ 10^â2), which is absent for the unsuccessful group (one-sided Welchâs t-test; PÂ =Â 0.992, 0.252, 0.955). Data are meanÂ Â±Â s.e.m. mâo, Performance dynamics as we compare the first and penultimate attempts in science (m, nÂ =Â 628, 145, 896, 87, from left to right), entrepreneurship (n, nÂ =Â 248, 1,332, 227, 1,199, from left to right) and security (o, nÂ =Â 231, 173, 230, 173, from left to right). The successful and unsuccessful groups that experienced a large number of consecutive failures before the last attempt (at least 5 for D₁, 3 for D₂ and 2 for D₃) appear indistinguishable for first failures (two-sided Welchâs t-test, PÂ =Â 0.898, 0.671, 0.289) but diverge for penultimate failures (two-sided Welchâs t-test; PÂ =Â 8.50Â ÃÂ 10^â8, 3.12Â ÃÂ 10^â2, 1.13Â ÃÂ 10^â2). The successful group also shows a significant improvement in performance (one-sided Welchâs t-test; PÂ =Â 5.79Â ÃÂ 10^â9, 4.30Â ÃÂ 10^â2, 1.33Â ÃÂ 10^â2), which is absent for the unsuccessful group (one-sided Welchâs t-test; PÂ =Â 0.980, 0.138, 0.923). Data are meanÂ Â±Â s.e.m. pâr, The correlation between length of failure streak and initial performance (samples with repeated failures) in science (p, nÂ =Â 12,171), entrepreneurship (q, nÂ =Â 2,086) and security (r, nÂ =Â 441). Correlation is weak across all three datasets (Pearson correlation; rÂ =Â â0.051, â0.011, â0.107 for p, q, r, respectively). sâu, Length of failure streak still follow fat-tailed distributions conditional on bottom 10% initial performance samples in science (s, nÂ =Â 6,339), entrepreneurship (t, nÂ =Â 2,438) and security (u, nÂ =Â 1,092). Two-sided KolmogorovâSmirnov test between sample and exponential distributions rejects theÂ hypothesis that the two distributions are identical with PÂ <Â 0.01. *PÂ <Â 0.1, **PÂ <Â 0.05, ***PÂ <Â 0.01, NS, PÂ â¥Â 0.1.

Extended Data Fig. 10 Generalization of the k model.

a, The Î± parameter connects the potential to improve (1Â âÂ x) with the likelihood of creating new versions p through pÂ =Â (1Â âÂ x)^Î±. b, Phase diagram of the kâÎ± model. The two-dimensional parameter space is separated into three regimes, with boundaries at kÎ±Â =Â 1 and (kÂ âÂ 1)Î±Â =Â 1. c, The impact of Î´ parameter on scaling exponent Î³ for given kÂ =Â 1, 2, 3 and Î±Â =Â 0.4, 0.8, 1.2. We find that Î´ may affect the temporal scaling parameter when it is small, but has no further effect beyond a certain point Î´*Â =Â min(Î±,Â 1/(kÂ âÂ 1)). d, Phase diagram of the kâÎ±âÎ´ model for kÂ =Â 3, with boundaries at Î±Â =Â Î´, (kÂ âÂ 1)Î´Â =Â 1, (kÂ âÂ 1)Î´Â +Â Î±Â =Â 1, kÎ±Â =Â 1 and (kâ1)Î±Â =Â 1, respectively.

Supplementary information

Supplementary Information

This file contains the following sections: 1 Data description; 2 Related work and models; 3 Modeling failure dynamics; 4 Generalized models; 5 Empirical measurements; 6 Prediction task; 7 Robustness checks; and Supplementary Tables 1-4 and additional references.

Reporting summary

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yin, Y., Wang, Y., Evans, J.A. et al. Quantifying the dynamics of failure across science, startups and security. Nature 575, 190â194 (2019). https://doi.org/10.1038/s41586-019-1725-y

Download citation

Received: 15 February 2019
Accepted: 27 September 2019
Published: 30 October 2019
Issue Date: 07 November 2019
DOI: https://doi.org/10.1038/s41586-019-1725-y

This article is cited by

Unveiling the dynamics of team age structure and its impact on scientific innovation
- Alex J. Yang
- Huimin Xu
- Meijun Liu
Scientometrics (2024)
Data, measurement and empirical methods in the science of science
- Lu Liu
- Benjamin F. Jones
- Dashun Wang
Nature Human Behaviour (2023)
Predicting annus mirabilis with machine learning: Turkish movie industry
- Kamil Topal
- Ali Can GÃ¼nhan
- G. Baris Bagci
Multimedia Tools and Applications (2023)
The effect of structural holes on producing novel and disruptive research in physics
- Yue Wang
- Ning Li
- Yang Wang
Scientometrics (2023)
Beijingâs central role in global artificial intelligence research
- Bedoor AlShebli
- Enshu Cheng
- Talal Rahwan
Scientific Reports (2022)