19 papers from Meta AI (FAIR) on Architecture Design (Transformers, SSMs, MoE)
Elastic-Sketch's performance hinges on stream characteristics and eviction thresholds, but this work derives closed-form expressions for its limiting behavior under stationary random streams, enabling near-optimal configuration.
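For intuition, here is a minimal Python sketch of the heavy-part insertion whose limiting behavior those expressions characterize; the bucket layout and eviction rule follow the original ElasticSketch design, `lam` is the threshold being tuned, and the light part is stubbed as a plain dict:

```python
class ElasticSketchHeavyPart:
    """Toy heavy part of ElasticSketch; `lam` is the eviction threshold."""
    def __init__(self, num_buckets: int, lam: float = 8.0):
        self.lam = lam                       # evict when vote- / vote+ >= lam
        self.buckets = [None] * num_buckets  # each bucket: [flow, vote+, vote-]
        self.light = {}                      # stand-in for the CM-sketch light part

    def insert(self, key) -> None:
        i = hash(key) % len(self.buckets)
        b = self.buckets[i]
        if b is None:
            self.buckets[i] = [key, 1, 0]    # empty bucket: claim it
        elif b[0] == key:
            b[1] += 1                        # same flow: positive vote
        else:
            b[2] += 1                        # other flow: negative vote
            if b[2] / b[1] >= self.lam:
                # Evict the incumbent's count to the light part, install newcomer.
                self.light[b[0]] = self.light.get(b[0], 0) + b[1]
                self.buckets[i] = [key, 1, 1]
```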
Forget exotic attention mechanisms – MobileLLM-Flash achieves up to 1.8x faster LLM prefill on mobile CPUs by smartly pruning and adapting existing architectures for on-device use.
Current AI's dependence on curated data may be eased by a new architecture inspired by human cognition that flexibly switches between observation, active behavior, and meta-control.
Self-supervised video models can now learn dense features rivaling supervised methods, unlocking a 20-point jump in robot grasping success.
Forget imbalanced LoRA usage: ReMix leverages reinforcement learning to route effectively among LoRAs, boosting performance in parameter-efficient fine-tuning.
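A toy REINFORCE-style router over adapters makes the idea concrete; the policy architecture, reward, and update rule here are illustrative assumptions, not ReMix's actual algorithm:

```python
import torch
import torch.nn as nn

class LoRARouter(nn.Module):
    """Samples one of K LoRA adapters per input; trained with REINFORCE."""
    def __init__(self, hidden_dim: int, num_adapters: int):
        super().__init__()
        self.policy = nn.Linear(hidden_dim, num_adapters)

    def forward(self, pooled_prompt: torch.Tensor):
        dist = torch.distributions.Categorical(logits=self.policy(pooled_prompt))
        choice = dist.sample()                 # which adapter to apply
        return choice, dist.log_prob(choice)

router = LoRARouter(hidden_dim=768, num_adapters=4)
opt = torch.optim.Adam(router.parameters(), lr=1e-4)

x = torch.randn(8, 768)          # pooled prompt representations (placeholder)
choice, logp = router(x)
reward = torch.rand(8)           # stand-in for a per-example task metric
loss = -(logp * (reward - reward.mean())).mean()  # baseline-subtracted REINFORCE
opt.zero_grad()
loss.backward()
opt.step()
```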
A surprisingly simple change to the motion latent space—representing each body joint with its own token—dramatically improves text-to-motion generation quality, outperforming monolithic latent vector approaches.
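The change itself is essentially a re-tokenization of the latent, roughly like this (shapes are illustrative, not the paper's):

```python
import torch

T, J, D = 60, 22, 6                 # frames, joints, per-joint features (assumed)
motion = torch.randn(T, J, D)

# Monolithic latent: one vector per frame -> T tokens of width J*D.
monolithic = motion.reshape(T, J * D)

# Per-joint tokens (the change described): each joint keeps its own token,
# giving the motion model T*J finer-grained tokens of width D.
per_joint = motion.reshape(T * J, D)
```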
Pre-normalization in Transformers is the culprit behind the mysterious link between massive activation outliers and attention sinks, but decoupling them reveals their distinct functions: global parameterization vs. local attention modulation.
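A quick demo of why pre-norm lets massive activations persist: normalization sits on the branch, not on the residual stream, so an outlier rides the skip connection untouched (a minimal illustration, not the paper's experiment):

```python
import torch
import torch.nn as nn

d = 16
ln, ff = nn.LayerNorm(d), nn.Linear(d, d)

x = torch.zeros(1, d)
x[0, 0] = 1e4                    # inject a massive activation outlier

pre = x + ff(ln(x))              # pre-norm: LN only sees the branch
print(pre[0, 0])                 # outlier survives, still ~1e4

post = ln(x + ff(x))             # post-norm: LN sees the whole stream
print(post[0, 0])                # outlier squashed to O(1)
```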
Forget scaling compute – the future of AI hinges on a 1000x leap in energy efficiency via tight AI+Hardware co-design over the next decade.
Forget same-family constraints: you can compress prompts for LLaMA with a Qwen draft model and still get 90-100% of the original performance.
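A minimal surprisal-based compressor in that spirit: score prompt tokens with a small draft model from another family, keep the most informative fraction, and hand the shortened prompt to the target LLM. The model choice and scoring rule below are assumptions, not the paper's exact method:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

draft_name = "Qwen/Qwen2-0.5B"   # illustrative small draft model
tok = AutoTokenizer.from_pretrained(draft_name)
draft = AutoModelForCausalLM.from_pretrained(draft_name).eval()

def compress(prompt: str, keep_ratio: float = 0.5) -> str:
    ids = tok(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = draft(ids).logits
    # Per-token surprisal under the draft model (higher = harder to predict).
    logp = torch.log_softmax(logits[:, :-1], dim=-1)
    surprisal = -logp.gather(-1, ids[:, 1:, None]).squeeze(-1)[0]
    k = max(1, int(keep_ratio * surprisal.numel()))
    keep = surprisal.topk(k).indices.sort().values + 1  # preserve token order
    return tok.decode(ids[0, keep])
```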
Vision models are far more data-hungry than language models, but Mixture-of-Experts can reconcile this asymmetry, enabling truly unified multimodal models.
Instruction-following in large reasoning models gets a serious upgrade with RAIN-Merging, a gradient-free technique that merges in instruction-tuned capabilities without wrecking the model's ability to think step-by-step.
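Gradient-free merging of this sort can be as simple as task arithmetic; a hedged sketch (RAIN-Merging's actual recipe may weight or mask parameters differently to protect reasoning behavior):

```python
def merge_instruction_delta(reasoning_sd, instruct_sd, base_sd, alpha=0.5):
    """Add the instruction-tuning delta (instruct - base) into the reasoning
    model's state dict, scaled by alpha: no gradients, no retraining."""
    return {name: w + alpha * (instruct_sd[name] - base_sd[name])
            for name, w in reasoning_sd.items()}
```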
Forget quadratic complexity: ULTRA-HSTU achieves 21x faster inference and 4-8% better engagement in large-scale recommendation by co-designing input sequences, sparse attention, and model topology.
Achieve zero-collision embedding tables in production recommenders without sacrificing training speed, unlocking better personalization via fresher and higher-quality item embeddings.
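The core idea, sketched: replace `hash(id) % table_size` with an exact id-to-row map so distinct items never share (and corrupt) a row. The capacity handling below is a placeholder assumption, not the paper's eviction policy:

```python
import torch
import torch.nn as nn

class ZeroCollisionEmbedding(nn.Module):
    def __init__(self, capacity: int, dim: int):
        super().__init__()
        self.table = nn.Embedding(capacity, dim)
        self.id_to_row = {}                    # exact mapping: item id -> row

    def forward(self, item_ids: torch.Tensor) -> torch.Tensor:
        rows = []
        for i in item_ids.tolist():
            if i not in self.id_to_row:
                if len(self.id_to_row) >= self.table.num_embeddings:
                    raise RuntimeError("table full: evict stale ids here")
                self.id_to_row[i] = len(self.id_to_row)  # fresh, private row
            rows.append(self.id_to_row[i])
        return self.table(torch.tensor(rows, device=self.table.weight.device))
```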
Ditch ANN search altogether: MFLI learns a hierarchical index alongside item embeddings, boosting recall by up to 11.8% and cold-content delivery by 57.29% in large-scale recommender systems.
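Retrieval then becomes beam search down the learned tree instead of an ANN lookup; a toy version (the tree layout and inner-product scoring are assumptions about the general approach, not MFLI's specifics):

```python
import torch

def beam_retrieve(query, node_embs, children, root=0, beam=4, depth=3):
    """Walk a learned tree index: at each level, score the current beam's
    children against the query and keep the top `beam`."""
    frontier = [root]
    for _ in range(depth):
        cands = [c for n in frontier for c in children[n]]
        scores = node_embs[cands] @ query      # inner-product relevance
        top = scores.topk(min(beam, len(cands))).indices
        frontier = [cands[i] for i in top.tolist()]
    return frontier                            # leaves = retrieved items
```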
Finally, a streaming ASR model matches Whisper's offline transcription quality while maintaining sub-second latency.
Achieve state-of-the-art UAV detection by swapping transformers for Mamba, yielding a faster and more accurate multimodal detector.
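The speed win comes from Mamba's linear-time recurrence in place of quadratic attention; a stripped-down, non-selective scan for intuition (real Mamba makes A, B, C input-dependent):

```python
import torch

def ssm_scan(x, A, B, C):
    """h_t = A * h_{t-1} + B * x_t ;  y_t = <C, h_t>.  Diagonal A, shared
    parameters; purely illustrative of the O(seq) recurrence."""
    batch, seq, d = x.shape
    h = torch.zeros(batch, d)
    ys = []
    for t in range(seq):
        h = A * h + B * x[:, t]      # O(d) per step -> O(seq) overall
        ys.append((C * h).sum(-1))
    return torch.stack(ys, dim=1)    # (batch, seq)
```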
Achieve up to 39.6% FLOP reduction in LLM inference without retraining or architectural changes using QuickSilver's dynamic token-level optimizations.
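One such optimization is token halting: stop spending compute on tokens whose hidden state has converged. A toy version of the criterion follows; QuickSilver's actual mechanisms are not shown here, and real FLOP savings require kernels that genuinely skip frozen tokens rather than masking them:

```python
import torch

def forward_with_token_halting(layers, x, eps=1e-3):
    """Freeze tokens whose hidden state moves less than `eps` between layers."""
    active = torch.ones(x.shape[:2], dtype=torch.bool)    # (batch, seq)
    for layer in layers:
        new_x = layer(x)
        delta = (new_x - x).norm(dim=-1)                  # per-token change
        x = torch.where(active.unsqueeze(-1), new_x, x)   # frozen tokens keep state
        active = active & (delta > eps)                   # halt converged tokens
    return x
```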
Ditch the pre-trained models: PAST directly learns speech tokens from phonetic data, outperforming existing methods in representation and reconstruction.
Edit the bassline, drums, or other instruments of any song with this new open-source multi-stem music generation model.