← 返回
储能系统技术 储能系统 SiC器件 工商业光伏 强化学习 ★ 5.0

多目标集成电路物理布局优化的分层深度强化学习及拥塞感知奖励塑造

Hierarchical Deep Reinforcement Learning for Multi-Objective Integrated Circuit Physical Layout Optimization With Congestion-Aware Reward Shaping

作者 Haijian Zhang · Yao Ge · Xiuyuan Zhao · Jiyuan Wang
期刊 IEEE Access
出版日期 2025年1月
技术分类 储能系统技术
技术标签 储能系统 SiC器件 工商业光伏 强化学习
相关度评分 ★★★★★ 5.0 / 5.0
关键词 集成电路布局优化 分层深度强化学习 拥塞感知 多目标优化 性能提升
语言:

中文摘要

随着半导体技术向先进节点演进,集成电路物理布局优化面临关键挑战,传统EDA工具难以同时优化布线拥塞、功耗和时序等多个冲突目标。本文提出一种新型分层深度强化学习框架,采用拥塞感知奖励塑造机制动态平衡探索与利用。

English Abstract

Physical layout optimization in integrated circuit (IC) design remains a critical challenge as semiconductor technology scales toward advanced nodes, where traditional electronic design automation (EDA) tools struggle to simultaneously optimize multiple conflicting objectives including routing congestion, power consumption, and timing performance. This paper presents a novel hierarchical deep reinforcement learning (HDRL) framework for multi-objective IC layout optimization that addresses the exponential complexity of modern chip design spaces. Our approach introduces a congestion-aware reward shaping mechanism that dynamically balances exploration and exploitation while incorporating domain-specific knowledge through a hierarchical policy decomposition. The framework employs a dual-level architecture where a high-level policy manages global placement strategies and a low-level policy handles detailed placement refinements, enabling efficient optimization across different granularities. We integrate adaptive action space pruning based on physical design rules and propose a novel state representation that captures both local congestion patterns and global routing accessibility. Extensive experiments on OpenROAD benchmarks demonstrate that our method achieves an average 23.7% reduction in routing congestion, 18.4% improvement in power efficiency, and 15.2% enhancement in timing closure compared to state-of-the-art commercial tools and recent learning-based approaches. The framework maintains computational efficiency with 40% faster convergence than baseline deep reinforcement learning methods while providing interpretable placement decisions through attention mechanism visualization. Our contributions include the hierarchical policy architecture, congestion-aware reward formulation, and comprehensive evaluation framework that advances the integration of artificial intelligence in physical design automation.
S

SunView 深度解读

该深度强化学习优化技术可直接应用于阳光电源SiC功率模块的芯片布局设计。通过多目标优化框架同时优化功率密度、热分布和可靠性,为ST系列储能变流器的新一代高功率密度芯片设计提供AI辅助工具。