中国半导体行业协会封装分会会刊

中国电子学会电子制造与封装技术分会会刊

导航

电子与封装

• 电路与系统 •    下一篇

扩散模型神经网络加速策略综述

邹子涵1,闫鑫明1,郑鹏1,张顺1,蔡浩1,2,刘波1,2   

  1. 1. 东南大学集成电路学院,南京  210096;2. 国家集成电路设计自动化技术创新中心,南京  210031
  • 收稿日期:2025-04-29 修回日期:2025-07-17 出版日期:2025-12-10 发布日期:2025-12-10
  • 通讯作者: 刘波
  • 基金资助:
    国家重点研发计划(2023YFB4403103)

Review of Neural Network Acceleration Strategies for Diffusion Models

ZOU Zihan1, YAN Xinming1, ZHENG Peng1, ZHANG Shun1, CAI Hao1,2, LIU Bo1,2   

  1. 1. School of Integrated Circuits, Southeast University, Nanjing 210096, China; 2. National Center of Technology Innovation for EDA, Nanjing 210031, China
  • Received:2025-04-29 Revised:2025-07-17 Online:2025-12-10 Published:2025-12-10

摘要: 随着神经网络的发展和应用,扩散模型通过其独特的扩散机制在图像生成任务当中取得了非常大的成就。然而,为了实现优异的任务性能,其引入了大量的计算和复杂的网络结构,限制了其广泛应用,尤其是在资源受限的边缘端设备上。针对这个瓶颈,高效的模型加速算法和加速器软硬件协同框架成为了有效的解决方案。基于多种扩散模型加速和高效部署策略,从适用于通用计算平台的高效算法设计到软硬件框架协同设计,介绍了当前最先进的扩散模型加速策略。

关键词: 扩散模型, 模型加速, 边缘部署, 软硬件协同设计, 高效推理

Abstract: With the development of neural networks, Diffusion models have achieved remarkable success in image generation tasks due to their unique generative process. However, this performance comes at the cost of substantial computational overheads and complex network structures, posing significant challenges for efficient deployment, particularly on edge devices with limited resources. To address these bottlenecks, researchers have proposed a range of acceleration algorithms and co-designed software-hardware frameworks. This survey provides a comprehensive overview of state-of-the-art acceleration techniques for diffusion models, covering both algorithmic optimizations suitable for general-purpose computing platforms and hardware-aware designs tailored for efficient deployment.

Key words: diffusion models, model acceleration, edge deployment, hardware-software co-design, efficient inference