The WuC-Adam algorithm based on joint improvement of Warmup and cosine annealing algorithms.

Researchers

Can Zhang Haijing Sun Le Zhang Lei Xing Qian Zhao Yichuan Shao

Journal

Mathematical biosciences and engineering : MBE

Modalities

Models

Adam Optimization Algorithm Cosine Annealing Technique Warmup Technique

Abstract

The Adam algorithm is a common choice for optimizing neural network models. However, its application often brings challenges, such as susceptibility to local optima, overfitting and convergence problems caused by unstable learning rate behavior. In this article, we introduce an enhanced Adam optimization algorithm that integrates Warmup and cosine annealing techniques to alleviate these challenges. By integrating preheating technology into traditional Adam algorithms, we systematically improved the learning rate during the initial training phase, effectively avoiding instability issues. In addition, we adopt a dynamic cosine annealing strategy to adaptively adjust the learning rate, improve local optimization problems and enhance the model’s generalization ability. To validate the effectiveness of our proposed method, extensive experiments were conducted on various standard datasets and compared with traditional Adam and other optimization methods. Multiple comparative experiments were conducted using multiple optimization algorithms and the improved algorithm proposed in this paper on multiple datasets. On the MNIST, CIFAR10 and CIFAR100 datasets, the improved algorithm proposed in this paper achieved accuracies of 98.87%, 87.67% and 58.88%, respectively, with significant improvements compared to other algorithms. The experimental results clearly indicate that our joint enhancement of the Adam algorithm has resulted in significant improvements in model convergence speed and generalization performance. These promising results emphasize the potential of our enhanced Adam algorithm in a wide range of deep learning tasks.

Show Full Text

The WuC-Adam algorithm based on joint improvement of Warmup and cosine annealing algorithms.

Researchers

Journal

Modalities

Models

Abstract

Effect of tokenization on transformers for biological sequences.

Radiomics model and deep learning model based on T1WI image for acute lymphoblastic leukemia identification.

Super-resolution technology to simultaneously improve optical & digital resolution of optical coherence tomography via deep learning.

Development of a Real-time Indoor Location System using Bluetooth Low Energy Technology and Deep Learning to Facilitate Clinical Applications.

Motor Imagery Classification for Brain Computer Interface Using Deep Convolutional Neural Networks and Mixup Augmentation.

Application of Artificial Intelligence to Cardiovascular Computed Tomography.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply