Lattice AI Research

Papers (7)

Apr 30, 2026

ZipCCL: Efficient Lossless Data Compression of Communication Collectives for Accelerating LLM Training

LLM training bottlenecks? ZipCCL achieves up to 1.18x end-to-end speedups by losslessly compressing communication collectives, without sacrificing model quality.

Wenxiang Lin, Xinglin Pan, Ruibo Fan +3

Distributed Systems & Hardware Inference & Quantization Training Efficiency & Optimization

Apr 21, 2026

Apr 21, 2026·also Tsinghua AI, PKU, SYSU

Evaluation-driven Scaling for Scientific Discovery

LLMs can leapfrog state-of-the-art scientific algorithms and human-designed solutions, but only if you scale the evaluation loop, not just the model.

Haotian Ye, Haowei Lin, Jingyi Tang +17

Eval Frameworks & Benchmarks Scientific Discovery & Drug Design Tool Use & Agents

Mar 18, 2026

Mar 18, 2026·also Rutgers

Is Your LLM-as-a-Recommender Agent Trustable? LLMs'Recommendation is Easily Hacked by Biases (Preferences)

LLM-powered recommendation agents, despite their reasoning prowess, are easily manipulated by contextual biases in high-stakes scenarios like paper review and job recruitment.

Zichen Tang, Ziru Zhang, Zirui Zhang +2

Eval Frameworks & Benchmarks Recommendation & Information Retrieval Red-Teaming & Adversarial Robustness

Mar 18, 2026·also HIT

ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression

Lossless compression can actually *speed up* LLM inference on GPUs, not just shrink model size, thanks to ZipServ's hardware-aware design.

Ruibo Fan, Xiangrui Yu, Xinglin Pan +5

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Mar 16, 2026

Zhenheng Tang +2Mar 16, 2026

Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph

LLM alignment is fundamentally challenged by the dynamic and inconsistent nature of their internal "priority graphs," which adversaries can exploit through context manipulation.

Zhenheng Tang, Eunsol Choi, Xiaowen Chu

Constitutional AI & AI Ethics RLHF & Preference Learning Scalable Oversight & Alignment Theory

Feb 23, 2026

Feb 23, 2026·also HKUST

Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling

Achieve up to 102% Sharpe Ratio improvement and 17.5% directional accuracy gain by unifying event-centric data construction and decision-oriented fine-tuning with a hierarchical gated reward model.

Zikai Wei, Zikai Wei, Yiyan Qi +7

Data Curation & Synthetic Data Natural Language Processing RLHF & Preference Learning

Oct 21, 2025

Oct 21, 2025·also Tsinghua AI, HKUST

Reasoning Language Model Inference Serving Unveiled: An Empirical Study

Naive application of LLM inference optimizations can *hurt* the performance of smaller reasoning models, highlighting the need for RLLM-specific serving strategies.

Qi Li, Junpan Wu, Xiang Liu +6

Distributed Systems & Hardware Inference & Quantization Reasoning & Chain-of-Thought

Xiaowen Chu

Research focus

Frequent co-authors

Papers (7)

Search

Xiaowen Chu

Research focus

Frequent co-authors

Papers (7)