Jinhao Duan

SeClaw shows that existing benchmarks fail to capture the complexity of agent behavior, and it enables a more nuanced evaluation of security risks in autonomous systems.

Changtao Miao, Tianle Song, Erjia Xiao +8

Eval Frameworks & Benchmarks | Red-Teaming & Adversarial Robustness | Tool Use & Agents

Apr 16, 2026

Haozhi Fan +2, Apr 16, 2026

IUQ: Interrogative Uncertainty Quantification for Long-Form Large Language Model Generation

LLMs can now tell you how unsure they are about their long-form answers, thanks to a new interrogation-based uncertainty metric that actually works.

Haozhi Fan, Jinhao Duan, Kaidi Xu

Eval Frameworks & Benchmarks | Natural Language Processing

Feb 23, 2026

A Replicate-and-Quantize Strategy for Plug-and-Play Load Balancing of Sparse Mixture-of-Experts LLMs

Solve SMoE load balancing at inference time without retraining by replicating heavily used experts and quantizing underutilized ones, achieving up to 1.4x imbalance reduction.

Jie Peng, Jinhao Duan, Zirui Liu +1

Architecture Design (Transformers, SSMs, MoE) | Distributed Systems & Hardware | Inference & Quantization


Papers (4)