Showing 1–23 of 23 results for author: Erdil, E

  1. arXiv:2405.10494  [pdf, other]

    econ.GN

    Estimating Idea Production: A Methodological Survey

    Authors: Ege Erdil, Tamay Besiroglu, Anson Ho

    Abstract: Accurately modeling the production of new ideas is crucial for innovation theory and endogenous growth models. This paper provides a comprehensive methodological survey of strategies for estimating idea production functions. We explore various methods, including naive approaches, linear regression, maximum likelihood estimation, and Bayesian inference, each suited to different data availability se…

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2404.10102  [pdf, other]

    cs.AI cs.CL

    Chinchilla Scaling: A replication attempt

    Authors: Tamay Besiroglu, Ege Erdil, Matthew Barnett, Josh You

    Abstract: Hoffmann et al. (2022) propose three methods for estimating a compute-optimal scaling law. We attempt to replicate their third estimation procedure, which involves fitting a parametric loss function to a reconstruction of data from their plots. We find that the reported estimates are inconsistent with their first two estimation methods, fail at fitting the extracted data, and report implausibly na…

    Submitted 14 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  3. arXiv:2403.05812  [pdf, other]

    cs.CL cs.AI

    Algorithmic progress in language models

    Authors: Anson Ho, Tamay Besiroglu, Ege Erdil, David Owen, Robi Rahman, Zifan Carl Guo, David Atkinson, Neil Thompson, Jaime Sevilla

    Abstract: We investigate the rate at which algorithms for pre-training language models have improved since the advent of deep learning. Using a dataset of over 200 language model evaluations on Wikitext and Penn Treebank spanning 2012-2023, we find that the compute required to reach a set performance threshold has halved approximately every 8 months, with a 95% confidence interval of around 5 to 14 months,…

    Submitted 9 March, 2024; originally announced March 2024.

  4. arXiv:2312.08595  [pdf, other]

    cs.ET

    Limits to the Energy Efficiency of CMOS Microprocessors

    Authors: Anson Ho, Ege Erdil, Tamay Besiroglu

    Abstract: CMOS microprocessors have achieved massive energy efficiency gains but may reach limits soon. This paper presents an approach to estimating the limits on the maximum floating point operations per Joule (FLOP/J) for CMOS microprocessors. We analyze the three primary sources of energy dissipation: transistor switching, interconnect capacitances and leakage power. Using first-principles calculations…

    Submitted 13 December, 2023; originally announced December 2023.

  5. arXiv:2309.11690  [pdf, other]

    econ.GN

    Explosive growth from AI automation: A review of the arguments

    Authors: Ege Erdil, Tamay Besiroglu

    Abstract: We examine whether substantial AI automation could accelerate global economic growth by about an order of magnitude, akin to the economic growth effects of the Industrial Revolution. We identify three primary drivers for such growth: 1) the scalability of an AI "labor force" restoring a regime of increasing returns to scale, 2) the rapid expansion of an AI labor force, and 3) a massive increase in…

    Submitted 15 July, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  6. arXiv:2308.05035  [pdf, other]

    cs.AI cs.HC

    Expert load matters: operating networks at high accuracy and low manual effort

    Authors: Sara Sangalli, Ertunc Erdil, Ender Konukoglu

    Abstract: In human-AI collaboration systems for critical applications, in order to ensure minimal error, users should set an operating point based on model confidence to determine when the decision should be delegated to human experts. Samples for which model confidence is lower than the operating point would be manually analysed by experts to avoid mistakes. Such systems can become truly useful only if the…

    Submitted 11 October, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted to the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  7. arXiv:2304.10004  [pdf, other]

    cs.LG cs.AI stat.AP

    Power Law Trends in Speedrunning and Machine Learning

    Authors: Ege Erdil, Jaime Sevilla

    Abstract: We find that improvements in speedrunning world records follow a power law pattern. Using this observation, we answer an outstanding question from previous work: How do we improve on the baseline of predicting no improvement when forecasting speedrunning world records out to some time horizon, such as one month? Using a random effects model, we improve on this baseline for relative mean square err…

    Submitted 19 April, 2023; originally announced April 2023.

  8. arXiv:2304.05939  [pdf, other]

    cs.CV cs.LG eess.IV

    Explicitly Minimizing the Blur Error of Variational Autoencoders

    Authors: Gustav Bredell, Kyriakos Flouris, Krishna Chaitanya, Ertunc Erdil, Ender Konukoglu

    Abstract: Variational autoencoders (VAEs) are powerful generative modelling methods, however they suffer from blurry generated samples and reconstructions compared to the images they have been trained on. Significant research effort has been spent to increase the generative capabilities by creating more flexible models but often flexibility comes at the cost of higher complexity and computational cost. Seve…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to ICLR 2023

  9. arXiv:2212.05153  [pdf, other]

    cs.CV cs.LG

    Algorithmic progress in computer vision

    Authors: Ege Erdil, Tamay Besiroglu

    Abstract: We investigate algorithmic progress in image classification on ImageNet, perhaps the most well-known test bed for computer vision. We estimate a model, informed by work on neural scaling laws, and infer a decomposition of progress into the scaling of compute, data, and algorithms. Using Shapley values to attribute performance improvements, we find that algorithmic improvements have been roughly as…

    Submitted 24 August, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

  10. arXiv:2202.05271  [pdf, other]

    cs.CV eess.IV stat.ML

    A Field of Experts Prior for Adapting Neural Networks at Test Time

    Authors: Neerav Karani, Georg Brunner, Ertunc Erdil, Simin Fei, Kerem Tezcan, Krishna Chaitanya, Ender Konukoglu

    Abstract: Performance of convolutional neural networks (CNNs) in image analysis tasks is often marred in the presence of acquisition-related distribution shifts between training and test images. Recently, it has been proposed to tackle this problem by fine-tuning trained CNNs for each test image. Such test-time-adaptation (TTA) is a promising and practical strategy for improving robustness to distribution s…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Manuscript under review

  11. arXiv:2112.10271  [pdf, other]

    cs.CV

    Wiener Guided DIP for Unsupervised Blind Image Deconvolution

    Authors: Gustav Bredell, Ertunc Erdil, Bruno Weber, Ender Konukoglu

    Abstract: Blind deconvolution is an ill-posed problem arising in various fields ranging from microscopy to astronomy. The ill-posed nature of the problem requires adequate priors to arrive to a desirable solution. Recently, it has been shown that deep learning architectures can serve as an image generation prior during unsupervised blind deconvolution optimization, however often exhibiting a performance flu…

    Submitted 19 December, 2021; originally announced December 2021.

  12. arXiv:2112.09645  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation

    Authors: Krishna Chaitanya, Ertunc Erdil, Neerav Karani, Ender Konukoglu

    Abstract: Supervised deep learning-based methods yield accurate results for medical image segmentation. However, they require large labeled datasets for this, and obtaining them is a laborious task that requires clinical expertise. Semi/self-supervised learning-based approaches address this limitation by exploiting unlabeled data along with limited annotated data. Recent self-supervised learning methods use…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 13 pages, 4 figures, 7 tables. This article is under review at a Journal

  13. arXiv:2102.12894  [pdf, other]

    cs.LG cs.AI

    Constrained Optimization to Train Neural Networks on Critical and Under-Represented Classes

    Authors: Sara Sangalli, Ertunc Erdil, Andreas Hoetker, Olivio Donati, Ender Konukoglu

    Abstract: Deep neural networks (DNNs) are notorious for making more mistakes for the classes that have substantially fewer samples than the others during training. Such class imbalance is ubiquitous in clinical applications and very crucial to handle because the classes with fewer samples most often correspond to critical cases (e.g., cancer) where misclassifications can have severe consequences. Not to mis…

    Submitted 5 January, 2022; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: Accepted to the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  14. arXiv:2008.06999  [pdf, other]

    eess.IV cs.CV

    RevPHiSeg: A Memory-Efficient Neural Network for Uncertainty Quantification in Medical Image Segmentation

    Authors: Marc Gantenbein, Ertunc Erdil, Ender Konukoglu

    Abstract: Quantifying segmentation uncertainty has become an important issue in medical image analysis due to the inherent ambiguity of anatomical structures and its pathologies. Recently, neural network-based uncertainty quantification methods have been successfully applied to various problems. One of the main limitations of the existing techniques is the high memory requirement during training; which limi…

    Submitted 18 August, 2020; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: Accepted to UNSURE by MICCAI 2020

  15. arXiv:2007.05363  [pdf, other]

    eess.IV cs.CV cs.LG

    Semi-supervised Task-driven Data Augmentation for Medical Image Segmentation

    Authors: Krishna Chaitanya, Neerav Karani, Christian F. Baumgartner, Ertunc Erdil, Anton Becker, Olivio Donati, Ender Konukoglu

    Abstract: Supervised learning-based segmentation methods typically require a large number of annotated training data to generalize well at test time. In medical applications, curating such datasets is not a favourable option because acquiring a large number of annotated samples from experts is time-consuming and expensive. Consequently, numerous methods have been proposed in the literature for learning with…

    Submitted 19 November, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: 15 pages, 11 Figures, 3 tables. Accepted at Medical Image Analysis, 2020

  16. arXiv:2007.04780  [pdf, other]

    eess.IV cs.CV

    Modelling the Distribution of 3D Brain MRI using a 2D Slice VAE

    Authors: Anna Volokitin, Ertunc Erdil, Neerav Karani, Kerem Can Tezcan, Xiaoran Chen, Luc Van Gool, Ender Konukoglu

    Abstract: Probabilistic modelling has been an essential tool in medical image analysis, especially for analyzing brain Magnetic Resonance Images (MRI). Recent deep learning techniques for estimating high-dimensional distributions, in particular Variational Autoencoders (VAEs), opened up new avenues for probabilistic modeling. Modelling of volumetric data has remained a challenge, however, because constraint…

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: accepted for publication at MICCAI 2020. Code available https://github.com/voanna/slices-to-3d-brain-vae/

  17. arXiv:2006.10712  [pdf, other]

    cs.CV cs.LG

    Task-agnostic Out-of-Distribution Detection Using Kernel Density Estimation

    Authors: Ertunc Erdil, Krishna Chaitanya, Neerav Karani, Ender Konukoglu

    Abstract: In the recent years, researchers proposed a number of successful methods to perform out-of-distribution (OOD) detection in deep neural networks (DNNs). So far the scope of the highly accurate methods has been limited to image level classification tasks. However, attempts for generally applicable methods beyond classification did not attain similar performance. In this paper, we address this limita…

    Submitted 30 March, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

  18. arXiv:2006.10511  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    Contrastive learning of global and local features for medical image segmentation with limited annotations

    Authors: Krishna Chaitanya, Ertunc Erdil, Neerav Karani, Ender Konukoglu

    Abstract: A key requirement for the success of supervised deep learning is a large labeled dataset - a condition that is difficult to meet in medical image analysis. Self-supervised learning (SSL) can help in this regard by providing a strategy to pre-train a neural network with unlabeled data, followed by fine-tuning for a downstream task with limited annotations. Contrastive learning, a particular variant…

    Submitted 30 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 18 pages, 2 figures, 10 tables. This article has been accepted as Oral Presentation at NeurIPS 2020 (34th Conference on Neural Information Processing Systems)

  19. arXiv:2004.04668  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Test-Time Adaptable Neural Networks for Robust Medical Image Segmentation

    Authors: Neerav Karani, Ertunc Erdil, Krishna Chaitanya, Ender Konukoglu

    Abstract: Convolutional Neural Networks (CNNs) work very well for supervised learning problems when the training dataset is representative of the variations expected to be encountered at test time. In medical image segmentation, this premise is violated when there is a mismatch between training and test images in terms of their acquisition details, such as the scanner model or the protocol. Remarkable perfo…

    Submitted 23 January, 2021; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: Published in Medical Image Analysis journal: https://doi.org/10.1016/j.media.2020.101907

    Journal ref: Medical Image Analysis, Volume 68, 2021, 101907, ISSN 1361-8415. http://www.sciencedirect.com/science/article/pii/S1361841520302711

  20. arXiv:1901.02513  [pdf, ps, other]

    eess.IV cs.CV cs.LG stat.ML

    Combining nonparametric spatial context priors with nonparametric shape priors for dendritic spine segmentation in 2-photon microscopy images

    Authors: Ertunc Erdil, Ali Ozgur Argunsah, Tolga Tasdizen, Devrim Unay, Mujdat Cetin

    Abstract: Data driven segmentation is an important initial step of shape prior-based segmentation methods since it is assumed that the data term brings a curve to a plausible level so that shape and data terms can then work together to produce better segmentations. When purely data driven segmentation produces poor results, the final segmentation is generally affected adversely. One challenge faced by many…

    Submitted 17 February, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

    Comments: IEEE International Symposium on Biomedical Imaging

  21. arXiv:1809.00488  [pdf, ps, other]

    cs.CV

    Image Segmentation with Pseudo-marginal MCMC Sampling and Nonparametric Shape Priors

    Authors: Ertunc Erdil, Sinan Yildirim, Tolga Tasdizen, Mujdat Cetin

    Abstract: In this paper, we propose an efficient pseudo-marginal Markov chain Monte Carlo (MCMC) sampling approach to draw samples from posterior shape distributions for image segmentation. The computation time of the proposed approach is independent from the size of the training set used to learn the shape prior distribution nonparametrically. Therefore, it scales well for very large data sets. Our approac…

    Submitted 3 September, 2018; originally announced September 2018.

  22. arXiv:1611.03749  [pdf, ps, other]

    cs.CV

    MCMC Shape Sampling for Image Segmentation with Nonparametric Shape Priors

    Authors: Ertunc Erdil, Sinan Yıldırım, Müjdat Çetin, Tolga Taşdizen

    Abstract: Segmenting images of low quality or with missing data is a challenging problem. Integrating statistical prior information about the shapes to be segmented can improve the segmentation results significantly. Most shape-based segmentation algorithms optimize an energy functional and find a point estimate for the object to be segmented. This does not provide a measure of the degree of confidence in t…

    Submitted 11 November, 2016; originally announced November 2016.

    Comments: Computer Vision and Pattern Recognition conference, 2016

  23. arXiv:1607.05523  [pdf, other]

    cs.CV

    Dendritic Spine Shape Analysis: A Clustering Perspective

    Authors: Muhammad Usman Ghani, Ertunc Erdil, Sumeyra Demir Kanik, Ali Ozgur Argunsah, Anna Felicity Hobbiss, Inbal Israely, Devrim Unay, Tolga Tasdizen, Mujdat Cetin

    Abstract: Functional properties of neurons are strongly coupled with their morphology. Changes in neuronal activity alter morphological characteristics of dendritic spines. First step towards understanding the structure-function relationship is to group spines into main spine classes reported in the literature. Shape analysis of dendritic spines can help neuroscientists understand the underlying relationshi…

    Submitted 19 July, 2016; originally announced July 2016.

    Comments: Accepted for BioImageComputing workshop at ECCV 2016