Peng Jiang

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Recommendation & Information Retrieval (4)Inference & Quantization (3)Computer Vision (2)Architecture Design (Transformers, SSMs, MoE) (2)

Frequent co-authors

Lixiang Wang (2)Yongzhi Li (2)Jiaju Chen (1)Chongming Gao (1)

Papers (6)

Apr 30, 2026

Zhongguancun AcademyApr 30, 2026·also USTC

Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation

LLMs can generate recommendations up to 3.1x faster by explicitly modeling token position within items and speculation depth during speculative decoding.

Jiaju Chen, Chongming Gao, Chenxiao Fan +4

Inference & Quantization Natural Language Processing Recommendation & Information Retrieval

Apr 27, 2026

Shaunak Kolhe +13Apr 27, 2026

Pushing Radar Odometry Beyond the Pavement: Current Capabilities and Challenges

Radar odometry, typically confined to urban settings, can be pushed off-road with simple adaptations like IMU preintegration, but still faces significant challenges in unstructured environments.

Shaunak Kolhe, Shaunak Kolhe, Peng Jiang +11

Computer Vision Robotics & Embodied AI

Apr 21, 2026

CS3: Efficient Online Capability Synergy for Two-Tower Recommendation

Two-tower recommendation models can get a major online performance boost without latency penalties, thanks to a new capability synergy framework.

Lixiang Wang, Shaoyun Shi, Peng Wang +2

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Recommendation & Information Retrieval+1

Mar 30, 2026

Milton Zhou +3Mar 30, 2026

AutoCut: End-to-end advertisement video editing based on multimodal discretization and controllable generation

Forget disjointed workflows: AutoCut's unified token space for video, audio, and text slashes ad production costs while boosting consistency.

Milton Zhou, Sizhong Qin, Yongzhi Li +1

Computer Vision Multimodal Models Speech & Audio

Feb 26, 2026

Feb 26, 2026·also DAMO, Tsinghua AI, Fudan, Jinan University +1

Generative Recommendation for Large-Scale Advertising

Generative recommendation can beat DLRM in large-scale advertising, driving a 4.2% revenue lift in Kuaishou's production system via innovations in tokenization, decoding, optimization, and serving.

Ben Xue, Ben Xue, Dan Liu +35

Architecture Design (Transformers, SSMs, MoE)Recommendation & Information Retrieval Training Efficiency & Optimization

Feb 26, 2026

Sequential Regression for Continuous Value Prediction using Residual Quantization

Ditch rigid distribution assumptions: a novel residual quantization approach predicts continuous values by recursively refining quantization codes, outperforming SOTA in recommendation tasks.

Runpeng Cui, Runpeng Cui, Zhipeng Sun +4

Inference & Quantization Recommendation & Information Retrieval

Search

Peng Jiang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (6)