Enterprise LLM agents leak sensitive information in up to 50% of interactions, and surprisingly, performing better at tasks makes the problem *worse*.
Token-level attribution struggles to pinpoint the causes of LLM failures in realistic settings, suggesting current interpretability tools may not be up to the task of debugging complex model behaviors.
Forget hand-crafted templates: DUET learns to generate user and item profiles jointly, boosting recommendation accuracy by better aligning textual representations.
Stop obsessing over state prediction accuracy in text-based world models: aligning them with *behavior* yields better long-term planning and evaluation.
Autonomous web agents get a serious upgrade with WebXSkill, which lets them learn and execute skills with both code-level precision and human-readable guidance.
Knowing the *perfect* API to use or *exact* location to edit could drastically improve SWE agent performance, but knowing the perfect regression test result? Not so much.
LLMs don't learn fundamentally new reasoning representations during training; they just get faster at converging to the right answer.
Automated building and testing of software repositories across languages and platforms is now possible, unlocking scalable benchmarking and training for coding agents.
World models can now effectively simulate complex desktop software environments like Microsoft Office, enabling agents to reason about actions before execution and significantly improving performance.