University of Shanghai for Science and Technology

\tilde{F}_{i}^{T}=\tilde{F}_{i-1}^{T}+\textit{MAP}(M_{i},F_{i}^{I}), \quad (3)

where MAP denotes masked average pooling. By cascading this process from deep to shallow layers, CSP progressively expands object-consistent activations while suppressing background noise. Guided by the strong prior from SIA, the refinement jointly optimizes visual consistency and scoring reliability, yielding more precise and robust localization. Fig. 3 illustrates the effectiveness of this iterative process. To achieve an optimal balance between accuracy and efficiency, we set the number of iterations to 3.

Table 1: Comparison with OVD models, MLLMs, and RPNs. The best results are highlighted in bold. AR100/300/
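The masked average pooling and the cascaded update of Eq. (3) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names, array shapes, and the deep-to-shallow ordering of the inputs are assumptions for clarity.

```python
import numpy as np

def masked_average_pooling(mask, features):
    """MAP(M_i, F_i^I): average feature vectors over masked spatial positions.

    mask:     (H, W) binary or soft foreground mask M_i
    features: (H, W, C) image feature map F_i^I
    Returns a (C,) pooled feature vector.
    """
    weights = mask[..., None]                     # (H, W, 1) broadcastable weights
    total = weights.sum()
    if total == 0:                                # empty mask: nothing to pool
        return np.zeros(features.shape[-1])
    return (features * weights).sum(axis=(0, 1)) / total

def cascade(prototype, masks, feature_maps):
    """Eq. (3): additively refine the text prototype layer by layer.

    masks / feature_maps are assumed ordered from deep to shallow layers.
    """
    for m, f in zip(masks, feature_maps):
        prototype = prototype + masked_average_pooling(m, f)
    return prototype
```

Because each iteration only adds a pooled residual, the prototype accumulates object-consistent evidence from progressively shallower, higher-resolution features.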