- [R] Full-batch GD generalizes better than SGD https://arxiv.org/abs/2204.12446 62 comments machinelearning
Paper title: [2204.12446] Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GD