Ilija Bogunovic

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

RLHF & Preference Learning (3)Natural Language Processing (3)Code Generation & Program Synthesis (1)Tool Use & Agents (1)

Frequent co-authors

Seongho Son (2)Jiayin Lin (1)Nam Phuong Tran (1)Long Tran-Thanh (1)

Papers (5)

Jul 14, 2026

Jiayin Lin +51w ago

Meta-Learning Preferences for Multilingual LLM Alignment

Achieving a 28% improvement in alignment performance with just 100 preference samples highlights the potential of meta-learning to bridge the data gap in multilingual LLMs.

Jiayin Lin, Seongho Son, Nam Phuong Tran +3

RLHF & Preference Learning

Jun 30, 2026

Sangwoong Yoon +43w ago

SWE-Router: Routing in Multi-turn Agentic Software Engineering Tasks

Routing decisions based on exploratory trajectories can significantly boost cost efficiency in software engineering tasks without losing the performance edge of stronger models.

Sangwoong Yoon, Jiahua Tang, Shuhan Wang +2

Code Generation & Program Synthesis Tool Use & Agents

Jun 10, 2026

Jun 10, 2026·also Basel, Bosch AI, JHU

Re-evaluating Confidence Remasking in Masked Diffusion Language Models

Confidence-based remasking in dLLMs may not deliver the expected improvements and can actually worsen diversity issues in certain decoding settings.

Stipe Frkovic, Christian A. Naesseth, Ilija Bogunovic

Natural Language Processing

May 28, 2026

Xiaohang Tang +6May 28, 2026

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models

Ditch the ELBO: bypassing biased likelihood approximations in RL fine-tuning of diffusion LMs unlocks more stable and effective policy optimization, yielding nearly 20% accuracy gains on challenging tasks.

Xiaohang Tang, Keyue Jiang, Che Liu +4

Inference & Quantization Natural Language Processing RLHF & Preference Learning

Feb 24, 2026

Yu Fu +2Feb 24, 2026

Overton Pluralistic Reinforcement Learning for Large Language Models

A 3B model, guided by a novel RL framework, can outperform a 20B model in capturing diverse human perspectives, challenging the assumption that larger models inherently possess better alignment.

Yu Fu, Seongho Son, Ilija Bogunovic

Constitutional AI & AI Ethics Natural Language Processing RLHF & Preference Learning

Search

Ilija Bogunovic

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (5)