Accelerating stochastic gradient descent using predictive variance reduction
R Johnson, T Zhang - Advances in neural information …, 2013 - proceedings.neurips.cc
Stochastic gradient descent is popular for large scale optimization but has slow convergence
asymptotically due to the inherent variance. To remedy this problem, we introduce an …
asymptotically due to the inherent variance. To remedy this problem, we introduce an …
[PDF][PDF] A framework for learning predictive structures from multiple tasks and unlabeled data.
RK Ando, T Zhang, P Bartlett - Journal of machine learning research, 2005 - jmlr.org
One of the most important issues in machine learning is whether one can improve the
performance of a supervised learning algorithm by including unlabeled data. Methods that use …
performance of a supervised learning algorithm by including unlabeled data. Methods that use …
A proximal stochastic gradient method with progressive variance reduction
We consider the problem of minimizing the sum of two convex functions: one is the average
of a large number of smooth component functions, and the other is a general convex function …
of a large number of smooth component functions, and the other is a general convex function …
Local and global energy release rates for an electrically yielded crack in a piezoelectric ceramic
Structural reliability concerns of various electromechanical devices call for a better
understanding of the mechanisms of fracture in piezoelectric ceramics subjected to combined …
understanding of the mechanisms of fracture in piezoelectric ceramics subjected to combined …
Superconductivity in one-atomic-layer metal films grown on Si (111)
The two-dimensional (2D) superconducting state is a fragile state of matter susceptible to
quantum phase fluctuations. Although superconductivity has been observed in ultrathin metal …
quantum phase fluctuations. Although superconductivity has been observed in ultrathin metal …
Spatial–temporal recurrent neural network for emotion recognition
In this paper, we propose a novel deep learning framework, called spatial–temporal recurrent
neural network (STRNN), to integrate the feature learning from both spatial and temporal …
neural network (STRNN), to integrate the feature learning from both spatial and temporal …
A measure of spatial stratified heterogeneity
Spatial stratified heterogeneity, referring to the within-strata variance less than the between
strata-variance, is ubiquitous in ecological phenomena, such as ecological zones and many …
strata-variance, is ubiquitous in ecological phenomena, such as ecological zones and many …
Involution: Inverting the inherence of convolution for visual recognition
Convolution has been the core ingredient of modern neural networks, triggering the surge
of deep learning in vision. In this work, we rethink the inherent principles of standard …
of deep learning in vision. In this work, we rethink the inherent principles of standard …
Solving large scale linear prediction problems using stochastic gradient descent algorithms
T Zhang - Proceedings of the twenty-first international conference …, 2004 - dl.acm.org
Linear prediction methods, such as least squares for regression, logistic regression and support
vector machines for classification, have been extensively used in statistics and machine …
vector machines for classification, have been extensively used in statistics and machine …
[HTML][HTML] COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas
…, C Yin, D Yu, K Yu, J Yuan, B Zhang, P Zhang, T Zhang… - Cell, 2021 - cell.com
A dysfunctional immune response in coronavirus disease 2019 (COVID-19) patients is a
recurrent theme impacting symptoms and mortality, yet a detailed understanding of pertinent …
recurrent theme impacting symptoms and mortality, yet a detailed understanding of pertinent …