Guoxin Chen

Papers on Lattice

Research focus

Code Generation & Program Synthesis (3), Scalable Oversight & Alignment Theory (2), Tool Use & Agents (2), Reasoning & Chain-of-Thought (1)

Frequent co-authors

Jiale Zhao (2), Fanzhe Meng (2), Ji-rong Wen (2), Kai Jia (2)

Papers (4)

Jun 9, 2026 · Jiale Zhao +5 · also RUC

DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch

Fine-tuning on DeNovoSWE catapults LLM performance in generating entire software repositories, achieving nearly an 8x improvement on a challenging benchmark.

Jiale Zhao, Guoxin Chen, Fanzhe Meng +3

Code Generation & Program Synthesis, Scalable Oversight & Alignment Theory

Apr 28, 2026 · Xinjie Chen +5 · also SYSU, Xiamen University

JURY-RL: Votes Propose, Proofs Dispose for Label-Free RLVR

Ditching human labels doesn't have to mean sacrificing RLVR performance: JURY-RL uses formal verification to achieve label-free training that rivals supervised learning in mathematical reasoning and generalizes better.

Xinjie Chen, Biao Fu, Jing Wu +3

Reasoning & Chain-of-Thought, RLHF & Preference Learning, Scalable Oversight & Alignment Theory

Apr 14, 2026 · Guoxin Chen +8

Toward Autonomous Long-Horizon Engineering for ML Research

Autonomous ML research agents achieve significantly better long-horizon performance by maintaining durable state through a shared workspace, suggesting that orchestration and memory are more critical than raw reasoning power.

Guoxin Chen, Jiale Zhao, Jiale Zhao +6

Code Generation & Program Synthesis, Scientific Discovery & Drug Design, Tool Use & Agents

Mar 3, 2026 · also Gaoling AI

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Today's code-generating AI falls apart when faced with real-world software engineering tasks that demand cross-repository reasoning and external knowledge, achieving less than 45% success on the new BeyondSWE benchmark.

Guoxin Chen, Fanzhe Meng, Jiale Zhao +9

Code Generation & Program Synthesis, Eval Frameworks & Benchmarks, Tool Use & Agents
