Saravan Rajmohan

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (8)Code Generation & Program Synthesis (5)Eval Frameworks & Benchmarks (4)World Models & Planning (2)

Frequent co-authors

Qingwei Lin (6)Chetan Bansal (5)Dongmei Zhang (5)Chaoyun Zhang (3)

Papers (10)

Jul 20, 2026

Shenghao Yang +86d ago

DepRepair: LLM-Based Source-Code Repair for Dependency Breaking Changes

Structured evidence from upstream sources boosts LLM repair accuracy by up to 23 percentage points, revolutionizing how we adapt to breaking changes in software dependencies.

Shenghao Yang, Bo Lu, Yaochen Liu +6

Code Generation & Program Synthesis

Jul 13, 2026

1w ago·also Microsoft Research

ToolAtlas: Learning Once, Reusing Everywhere with Tool-Side Memory

ToolAtlas achieves up to 21.61% better performance in tool utilization by shifting memory management from agents to tool providers, revolutionizing how LLMs interact with external tools.

Yue Fang, Xiaoting Qin, Liqun Li +3

Tool Use & Agents

Jul 7, 2026

Tsinghua AI2w ago

AgentTether: Graph-Guided Diagnosis and Runtime Intervention for Reliable LLM Agent Operation

AgentTether repairs over 65% of failures in complex LLM tasks without modifying the agent, revolutionizing how we ensure reliability in AI deployments.

Chenyu Zhao, Shenglin Zhang, Wenwei Gu +5

Code Generation & Program Synthesis Scalable Oversight & Alignment Theory Tool Use & Agents

Apr 15, 2026

Apr 15, 2026·also Microsoft Research, MBZUAI, PKU

Beyond State Consistency: Behavior Consistency in Text-Based World Models

Stop obsessing over state prediction accuracy in text-based world models: aligning them with *behavior* yields better long-term planning and evaluation.

Youling Huang, Guanqiao Chen, Junchi Yao +5

Eval Frameworks & Benchmarks Tool Use & Agents World Models & Planning

Apr 14, 2026

Microsoft ResearchApr 14, 2026·also Virginia Tech

WebXSkill: Skill Learning for Autonomous Web Agents

Autonomous web agents get a serious upgrade with WebXSkill, which lets them learn and execute skills with both code-level precision and human-readable guidance.

Zhaoyang Wang, Qianhui Wu, Xuchao Zhang +12

Code Generation & Program Synthesis Natural Language Processing Tool Use & Agents

Apr 9, 2026

Microsoft ResearchApr 9, 2026·also Georgia Tech, Virginia Tech

ORACLE-SWE: Quantifying the Contribution of Oracle Information Signals on SWE Agents

Knowing the *perfect* API to use or *exact* location to edit could drastically improve SWE agent performance, but knowing the perfect regression test result? Not so much.

Kenan Li, Qirui Jin, Liao Zhu +15

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Tool Use & Agents

Mar 9, 2026

Sidharth Sinha +5Mar 9, 2026

AutoAdapt: An Automated Domain Adaptation Framework for LLMs

Stop wasting time on manual LLM domain adaptation: AutoAdapt automates the process and boosts accuracy by 25% over existing AutoML methods.

Sidharth Sinha, Anson Bastos, Xuchao Zhang +3

Data Curation & Synthetic Data Natural Language Processing Training Efficiency & Optimization

Mar 5, 2026

Mar 5, 2026·also Microsoft Research, KU, RIKEN, Shanghai AI Lab +1

RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

Automating software repository build and testing across languages and platforms is now possible, unlocking scalable benchmarking and training for coding agents.

Kenan Li, Rongzhi Li, Qirui Jin +14

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Tool Use & Agents

Feb 19, 2026

Microsoft ResearchFeb 19, 2026·also Northeastern

Computer-Using World Model

World models can now effectively simulate complex desktop software environments like Microsoft Office, enabling agents to reason about actions before execution and significantly improving performance.

Rui Yu, John Zhang, John Zhang +18

Tool Use & Agents World Models & Planning

Nov 13, 2025

Nov 13, 2025·also OpenAI

Continuous Benchmark Generation for Evaluating Enterprise-scale LLM Agents

Forget hand-crafted benchmarks: this paper shows how LLMs can continuously generate relevant evaluation datasets for enterprise AI agents from just a few semi-structured documents.

Divyanshu Saxena, Rishikesh Maurya, Xiaoxuan Ou +7

Eval Frameworks & Benchmarks Tool Use & Agents

Search

Saravan Rajmohan

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (10)