Auto-Spikformer: Spikformer architecture search.

Researchers

Jun Niu Kaiwei Che Li Yuan Shuaijie Shen Wei Fang Yanqi Chen Yonghong Tian Zhaokun Zhou Zhengyu Ma

Journal

Modalities

Models

Spiking Neural Networks Transformer architecture

Abstract

The integration of self-attention mechanisms into Spiking Neural Networks (SNNs) has garnered considerable interest in the realm of advanced deep learning, primarily due to their biological properties. Recent advancements in SNN architecture, such as Spikformer, have demonstrated promising outcomes. However, we observe that Spikformer may exhibit excessive energy consumption, potentially attributable to redundant channels and blocks.To mitigate this issue, we propose a one-shot Spiking Transformer Architecture Search method, namely Auto-Spikformer. Auto-Spikformer extends the search space to include both transformer architecture and SNN inner parameters. We train and search the supernet based on weight entanglement, evolutionary search, and the proposed Discrete Spiking Parameters Search (DSPS) methods. Benefiting from these methods, the performance of subnets with weights inherited from the supernet without even retraining is comparable to the original Spikformer. Moreover, we propose a new fitness function aiming to find a Pareto optimal combination balancing energy consumption and accuracy.Our experimental results demonstrate the effectiveness of Auto-Spikformer, which outperforms the original Spikformer and most CNN or ViT models with even fewer parameters and lower energy consumption.Copyright © 2024 Che, Zhou, Niu, Ma, Fang, Chen, Shen, Yuan and Tian.

Show Full Text

Auto-Spikformer: Spikformer architecture search.

Researchers

Journal

Modalities

Models

Abstract

Beehive Smart Detector Device for the Detection of Critical Conditions That Utilize Edge Device Computations and Deep Learning Inferences.

DeepPuff: Utilizing Deep Learning for Smoking Behavior Identification in Free-living Environment.

Reaction time coupling in a joint stimulus-response task: A matter of functional actions or likable agents?

Comparative Study of Cooperative Platoon Merging Control Based on Reinforcement Learning.

Letter perception emerges from unsupervised deep learning and recycling of natural image features.

Using deep learning to study emotional behavior in rodent models.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply