Cited By
View all- Bao YShehu ALiu MOh ANaumann TGloberson ASaenko KHardt MLevine S(2023)Global convergence analysis of local SGD for two-layer neural network without overparameterizationProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667192(24610-24660)Online publication date: 10-Dec-2023
- Arjevani YField MKoyejo SMohamed SAgarwal ABelgrave DCho KOh A(2022)Annihilation of spurious minima in two-layer ReLU networksProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3602989(37510-37523)Online publication date: 28-Nov-2022
- Wen ZLi YKoyejo SMohamed SAgarwal ABelgrave DCho KOh A(2022)The mechanism of prediction head in non-contrastive self-supervised learningProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3602068(24794-24809)Online publication date: 28-Nov-2022
- Show More Cited By