New logarithmic step size for stochastic gradient descent

Last updated: April 23, 2024 2:02 am

By Celebrity Gig

Credit: M. Soheil Shamaee, S. Fathi Hafshejani, Z. Saeidian

The step size, often called the learning rate, plays a pivotal role in the efficiency of the stochastic gradient descent (SGD) algorithm. Several step size strategies have recently been proposed to improve SGD performance. A significant challenge with these step sizes, however, concerns the probability distribution they induce over the iterations, $\eta_t / \sum_{t=1}^{T} \eta_t$, where $\eta_t$ is the step size at iteration $t$ and $T$ is the total number of iterations.
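The distribution above can be computed directly from any list of step sizes. The following is a minimal Python sketch, not code from the paper: the function names are hypothetical, and the sampling helper assumes the usual convention in which the iterate returned by SGD is drawn with probability proportional to its step size.

```python
import random

def step_size_distribution(step_sizes):
    """Normalize step sizes eta_1..eta_T into probabilities eta_t / sum(eta)."""
    total = sum(step_sizes)
    if total <= 0:
        raise ValueError("step sizes must have a positive sum")
    return [eta / total for eta in step_sizes]

def sample_output_iteration(step_sizes, rng=random):
    """Pick a 1-based iteration index t with probability eta_t / sum(eta).

    Assumption: the returned iterate is sampled in proportion to its step size.
    """
    indices = range(1, len(step_sizes) + 1)
    return rng.choices(indices, weights=step_sizes, k=1)[0]

# Hypothetical example: a constant step size yields a uniform distribution.
probs = step_size_distribution([0.1] * 4)
```

With a constant step size every iteration is equally likely to be selected; a decaying schedule shifts probability away from the later iterations.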

Ideally, this distribution should avoid assigning exceedingly small values to the final iterations. The widely used cosine step size, for instance, is effective in practice but runs into this issue: it assigns very low probability to the last iterations.
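To make the issue concrete, the sketch below measures how much of the distribution falls on the last 10% of iterations under a cosine schedule. It assumes the standard cosine form without restarts, eta_t = (eta_0 / 2)(1 + cos(pi t / T)); the values eta_0 = 0.1 and T = 1000 and the helper names are illustrative choices, not settings taken from the paper.

```python
import math

def cosine_step_sizes(eta0, T):
    """Cosine schedule eta_t = (eta0 / 2) * (1 + cos(pi * t / T)) for t = 1..T."""
    return [0.5 * eta0 * (1.0 + math.cos(math.pi * t / T)) for t in range(1, T + 1)]

def tail_probability(step_sizes, fraction):
    """Total probability eta_t / sum(eta) carried by the last `fraction` of iterations."""
    T = len(step_sizes)
    tail_len = max(1, int(round(fraction * T)))
    return sum(step_sizes[-tail_len:]) / sum(step_sizes)

# Illustrative settings (assumptions, not taken from the paper).
cosine = cosine_step_sizes(eta0=0.1, T=1000)
cosine_tail = tail_probability(cosine, 0.10)          # mass on the last 10% of iterations
uniform_tail = tail_probability([0.1] * 1000, 0.10)   # constant step size, for reference
```

Under these settings the last 10% of iterations carry well under 1% of the total probability with the cosine schedule, compared with 10% for a constant step size.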

To address this challenge, a research team led by M. Soheil Shamaee published their research in Frontiers of Computer Science.

The team introduces a new logarithmic step size for the SGD approach. This new step size has proven to be particularly effective during the final iterations, where it enjoys a significantly higher probability of selection compared to the conventional cosine step size.
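The comparison can be sketched numerically. This article does not state the formula for the new step size, so the code below assumes a logarithmic form eta_t = eta_0 (1 - ln t / ln T), set against the standard cosine form eta_t = (eta_0 / 2)(1 + cos(pi t / T)). Both formulas, the function names, and the values eta_0 = 0.1 and T = 1000 are assumptions made for illustration and should be checked against the paper.

```python
import math

def cosine_step_sizes(eta0, T):
    """Cosine schedule eta_t = (eta0 / 2) * (1 + cos(pi * t / T)) for t = 1..T."""
    return [0.5 * eta0 * (1.0 + math.cos(math.pi * t / T)) for t in range(1, T + 1)]

def logarithmic_step_sizes(eta0, T):
    """ASSUMED logarithmic schedule eta_t = eta0 * (1 - ln(t) / ln(T)) for t = 1..T."""
    if T < 2:
        raise ValueError("T must be at least 2 so that ln(T) is nonzero")
    return [eta0 * (1.0 - math.log(t) / math.log(T)) for t in range(1, T + 1)]

def selection_probability(step_sizes, t):
    """Probability eta_t / sum(eta) of selecting the 1-based iteration t."""
    return step_sizes[t - 1] / sum(step_sizes)

# Illustrative settings (assumptions, not taken from the paper).
T = 1000
late_iteration = T - 10
log_p = selection_probability(logarithmic_step_sizes(0.1, T), late_iteration)
cos_p = selection_probability(cosine_step_sizes(0.1, T), late_iteration)
print(f"t={late_iteration}: logarithmic={log_p:.2e}, cosine={cos_p:.2e}")
```

Under these assumptions, the logarithmic schedule gives a late iteration such as t = 990 a higher selection probability than the cosine schedule does, which matches the behavior described above.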

As a result, the new step size outperforms the cosine step size in these critical concluding iterations, which benefit from a higher likelihood of being chosen as the selected solution. The numerical results demonstrate the efficiency of the proposed step size, particularly on the FashionMNIST, CIFAR10, and CIFAR100 datasets.

Additionally, the new logarithmic step size has shown remarkable improvements in test accuracy, achieving a 0.9% increase for the CIFAR100 dataset when utilized with a convolutional neural network (CNN) model.

More information:
New logarithmic step size for stochastic gradient descent, Frontiers of Computer Science (2024). DOI: 10.1007/s11704-023-3245-z. journal.hep.com.cn/fcs/EN/10.1 … 07/s11704-023-3245-z

Provided by
Higher Education Press

Citation: New logarithmic step size for stochastic gradient descent (2024, April 22), retrieved 22 April 2024

