School of Mathematics, Harbin Institute of Technology, Harbin, China

Correspondence: lijun.zhang@brgroup.com

Abstract

Activation steering provides parameter-efficient control over large language models (LLMs) at inference time, but many methods rely on off-distribution supervision and discrete masking, leading to brittle interventions. We propose ROAST (Rollout-based On-distribution Activation Steering Technique), which estimates steering directions from the model's own on-distribution rollouts via ROC and avoids hard sparsification via Continuous Soft Scaling (CSS) and Grouped Mean Normalization. Our empirical analysis reveals that while activation magnitude correlates moderately with directional consistency, the variance in magnitude is significant and often disproportionate to semantic quality. This suggests that high-magnitude activations risk dominating the global steering direction if not properly normalized. To address this, ROAST employs grouped normalization to balance contributions across samples, ensuring a more robust estimation of the consensus steering direction. Across models (0.
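The grouped normalization idea in the abstract, balancing per-sample contributions so high-magnitude activations do not dominate the consensus direction, can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function name, the grouping scheme, and the use of unit-norm scaling within groups are all assumptions made for exposition.

```python
import numpy as np

def grouped_mean_steering_direction(diffs, group_ids):
    """Hypothetical sketch of grouped mean normalization.

    diffs:     (n_samples, d) array of per-rollout activation
               differences between target and baseline behavior.
    group_ids: length-n array assigning each sample to a group.

    Each sample is rescaled to unit norm (a continuous scaling,
    no hard masking), averaged within its group, and the group
    means are then averaged so every group contributes equally.
    """
    diffs = np.asarray(diffs, dtype=float)
    group_ids = np.asarray(group_ids)
    group_means = []
    for g in np.unique(group_ids):
        members = diffs[group_ids == g]
        # Normalize each member so magnitude outliers cannot
        # dominate the group's mean direction.
        norms = np.linalg.norm(members, axis=1, keepdims=True)
        unit = members / np.maximum(norms, 1e-8)
        group_means.append(unit.mean(axis=0))
    # Equal-weight average over groups, then renormalize.
    direction = np.mean(group_means, axis=0)
    return direction / max(np.linalg.norm(direction), 1e-8)
```

With this scheme, a single sample of magnitude 100 in one group pulls the final direction no harder than a unit-magnitude sample, which is the robustness property the abstract motivates.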