Nemotron 3 Super shows you can match the accuracy of existing 120B models with significantly higher inference throughput by combining Mamba, attention, and Mixture-of-Experts layers.
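A rough sketch of why the hybrid pays off at decode time: an attention layer must re-read a KV cache that grows with context length, while a Mamba-style SSM layer carries a fixed-size recurrent state. The dimensions below are illustrative assumptions, not Nemotron 3 Super's actual configuration.

```python
# Illustrative per-layer memory traffic per decoded token.
# All sizes are assumptions, not the model's real dimensions.

def attn_kv_bytes(seq_len, n_kv_heads=8, head_dim=128, dtype_bytes=2):
    """Bytes of K/V cache an attention layer reads per decoded token."""
    return 2 * seq_len * n_kv_heads * head_dim * dtype_bytes

def ssm_state_bytes(d_inner=8192, d_state=128, dtype_bytes=2):
    """Bytes of recurrent state a Mamba-style layer reads per token."""
    return d_inner * d_state * dtype_bytes

for seq_len in (1_000, 32_000, 128_000):
    print(f"{seq_len:>7} tokens: attention reads {attn_kv_bytes(seq_len):>12,} B, "
          f"SSM reads {ssm_state_bytes():>10,} B")
```

The attention cost grows linearly with context while the SSM cost stays flat, which is why replacing most attention layers with Mamba layers raises long-context throughput.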
You can slash LLM inference costs without sacrificing quality by strategically pruning experts, quantizing, and swapping full attention for windowed attention, as demonstrated on gpt-oss-120B.
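As a concrete illustration of the attention swap, here is a minimal sliding-window attention mask in PyTorch; the window size is an arbitrary example, not the setting used for gpt-oss-120B.

```python
import torch

def sliding_window_mask(seq_len: int, window: int) -> torch.Tensor:
    """Causal mask where each query attends to at most `window` recent keys."""
    i = torch.arange(seq_len).unsqueeze(1)  # query positions
    j = torch.arange(seq_len).unsqueeze(0)  # key positions
    return (j <= i) & (j > i - window)

# Each row (query) sees only itself and the previous two positions.
print(sliding_window_mask(6, window=3).int())
```

Bounding the lookback caps per-token KV-cache reads at `window` entries regardless of context length, and that saving compounds with expert pruning and quantization.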
LLMs can significantly boost factual accuracy in long-form generation by strategically "toning down" uncertain details, rather than simply omitting them.
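A toy sketch of the idea, using an invented interface of (claim, confidence) pairs and an arbitrary threshold (the paper's actual method is not shown here): low-confidence details get hedged wording instead of being deleted.

```python
# Hypothetical interface: (claim_text, confidence) pairs.
# Illustrates hedging vs. omission only; threshold is arbitrary.
def hedge_claims(claims, threshold=0.7):
    kept = []
    for text, confidence in claims:
        if confidence >= threshold:
            kept.append(text)
        else:
            # Tone the claim down rather than dropping the detail.
            kept.append("Reportedly, " + text[0].lower() + text[1:])
    return kept

print(hedge_claims([
    ("The bridge opened in 1932.", 0.95),
    ("Its main span is 503 meters long.", 0.40),
]))
```

Keeping the detail in softened form preserves informativeness while reducing the rate of flatly asserted errors.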
Flipping just *two* sign bits in a large neural network can obliterate its performance, revealing a surprising fragility in even state-of-the-art models.
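A minimal PyTorch sketch of the perturbation itself (the paper's target weights are not identified here; the tensor and index are placeholders). Flipping a float32 sign bit is exactly equivalent to negating the weight.

```python
import torch

def flip_sign_bit(weights: torch.Tensor, flat_index: int) -> None:
    """Flip the IEEE-754 sign bit of one float32 weight in place."""
    flat = weights.view(-1)
    bits = flat[flat_index].view(torch.int32)           # reinterpret the 4 bytes
    sign = torch.tensor(-(1 << 31), dtype=torch.int32)  # 0x80000000 as int32
    flat[flat_index] = (bits ^ sign).view(torch.float32)

w = torch.randn(4, 4)
before = w[1, 1].item()
flip_sign_bit(w, flat_index=5)          # flat index 5 == row 1, col 1
assert w[1, 1].item() == -before        # sign-bit flip == negation
```

Because every sign bit sits at a fixed, predictable position in memory, a single memory fault (or a targeted attack) only needs to hit two high-impact weights to wreck the model.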