Enhance Hardware Domain Specific Large Language Model with Reinforcement Learning for Resilience

Authors: Fu, Weimin and Zhao, Yifang and Jin, Yier and Guo, Xiaolong

Abstract:

To enhance the performance of large language models (LLMs) on hardware design tasks, we focus on training with reinforcement learning (RL) to improve LLMs' syntax synthesis and functional verification performance. We observed significant gains in power, performance, and area (PPA) metrics by applying RL. Specifically, DeepSeek Code saw a 23.6% performance increase, while RTLCoder improved by 7.86%. Our findings demonstrate the effectiveness of RL in refining LLMs for more accurate hardware generation that accounts for power and area consumption. This approach offers a promising direction for generating hardware resilient to side-channel attacks in computer systems.

Link: Read Paper

Labels: code generation, program synthesis
