RL agents can learn far more efficiently by dynamically distilling and leveraging past experiences that co-evolve with the agent's growing capabilities.
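A minimal sketch of the idea of a co-evolving experience memory: distilled lessons from past episodes are kept only while they still improve returns, and the most useful ones are surfaced to the agent. All class and method names here are illustrative assumptions, not the paper's actual interface.

```python
from dataclasses import dataclass, field


@dataclass
class ExperienceMemory:
    """Toy memory of distilled lessons that co-evolves with the agent:
    lessons whose measured advantage decays are pruned, and new lessons
    are distilled from recent episodes."""
    lessons: dict = field(default_factory=dict)  # lesson text -> running advantage

    def distill(self, lesson: str, advantage: float) -> None:
        # Running average of how much applying this lesson helped.
        old = self.lessons.get(lesson, 0.0)
        self.lessons[lesson] = 0.5 * old + 0.5 * advantage

    def prune(self, threshold: float = 0.0) -> None:
        # Drop lessons that no longer pay off as the agent improves.
        self.lessons = {k: v for k, v in self.lessons.items() if v > threshold}

    def context(self, k: int = 3) -> list:
        # Surface the k most useful lessons for the next rollout's prompt.
        return sorted(self.lessons, key=self.lessons.get, reverse=True)[:k]
```

The key property is that the memory is not static: as the agent's policy improves, previously helpful lessons lose their advantage and are pruned, so the retrieved context tracks the agent's current frontier.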
A 3B model can match the performance of models more than twice its size in mobile GUI automation by distilling visual history into concise natural language summaries.
Get better image captions without more data: reinforcement learning can train vision-language models to focus on image details by maximizing the similarity between images retrieved using the generated captions.
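A hedged sketch of a retrieval-based reward of this kind: score a generated caption by how confidently it retrieves its own image from a pool, using softmax over embedding similarities. The embedding vectors here are toy stand-ins for a real image/text encoder, and the exact reward shape is an assumption.

```python
import numpy as np


def cosine(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def retrieval_reward(caption_emb: np.ndarray,
                     image_embs: list,
                     target_idx: int) -> float:
    """Reward = softmax probability that the caption retrieves its own
    image from the pool. A caption that captures distinguishing details
    separates the target from distractors and earns a higher reward."""
    sims = np.array([cosine(caption_emb, e) for e in image_embs])
    probs = np.exp(sims - sims.max())
    probs /= probs.sum()
    return float(probs[target_idx])
```

Because the reward depends only on ranking the target image against distractors, it pushes the captioner toward discriminative detail without needing any new labeled data.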
Multimodal LLMs (MLLMs) can be significantly boosted by curriculum learning that focuses on reward design rather than data selection, dynamically weighting generalized rubrics according to the model's evolving competence.
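One way such competence-based weighting could look, as a sketch: each rubric carries a difficulty estimate, and rubrics near the model's current competence level receive the most weight, so the reward emphasizes skills at the learning frontier. The Gaussian kernel and all parameter names are assumptions for illustration, not the paper's exact scheme.

```python
import math


def rubric_weights(difficulties: list, competence: float,
                   bandwidth: float = 0.2) -> list:
    """Weight each rubric by a Gaussian kernel centered on the model's
    current competence (both in [0, 1]); weights are normalized to sum to 1."""
    raw = [math.exp(-((d - competence) ** 2) / (2 * bandwidth ** 2))
           for d in difficulties]
    total = sum(raw)
    return [w / total for w in raw]


def curriculum_reward(rubric_scores: list, difficulties: list,
                      competence: float) -> float:
    """Combine per-rubric scores into one scalar reward, with the mix
    shifting toward harder rubrics as competence grows."""
    weights = rubric_weights(difficulties, competence)
    return sum(w * s for w, s in zip(weights, rubric_scores))
```

As `competence` rises over training, the same rubric scores produce a different reward signal, which is what makes the curriculum dynamic without changing the training data.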
An open-source ecosystem for agentic learning, shipping with a trained agent and a novel policy-optimization method, promises to accelerate research by providing a standardized, scalable platform.