22 papers published.
Forget comparing models with benchmarks – mapping them by prompt-response likelihoods reveals hidden relationships between architecture, training data, and even how prompts compose.
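A minimal sketch of the mapping idea, with made-up numbers (this is not the paper's code): represent each model by the vector of log-likelihoods it assigns to a shared set of prompt-response pairs, then embed those vectors in 2D to get a "model map".

```python
# Toy likelihood-based model map (illustrative only; not the paper's code).
# Each model = the vector of log-likelihoods it assigns to a fixed set of
# prompt-response pairs; models are compared in that space, not by benchmarks.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical log-likelihoods: rows = 6 models, columns = 40 prompt-response pairs.
loglik = rng.normal(loc=-50.0, scale=5.0, size=(6, 40))

# Center per pair so the map reflects relative preferences, not pair difficulty.
centered = loglik - loglik.mean(axis=0, keepdims=True)

# Project to 2D with an SVD (a hand-rolled PCA) to visualize the model map.
U, S, Vt = np.linalg.svd(centered, full_matrices=False)
coords = U[:, :2] * S[:2]

for i, (x, y) in enumerate(coords):
    print(f"model_{i}: ({x:+.2f}, {y:+.2f})")
```

Models trained on similar data or sharing an architecture land near each other in this space, which is what makes the hidden relationships visible.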
Open-source LLMs, when carefully prompted with representative examples, can rival or even surpass smaller commercial models like GPT-3.5-nano in resume screening tasks, offering a privacy-preserving alternative.
Multilingual embeddings just got a whole lot smaller and faster, with F2LLM-v2 models outperforming larger counterparts while supporting over 200 languages.
Democratizing social robotics research, M offers a low-cost, open-source platform that's easy to reproduce, modify, and deploy in real-world settings.
Navigating the maze of differentially private graph release methods just got easier: a new framework helps practitioners choose the right approach, avoid common pitfalls, and make sound evaluations.
Stealing just the right neurons from another LLM lets you patch safety holes or remove biases in your own, with almost no performance hit.
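A hedged sketch of what transplanting neurons between two same-architecture layers can look like; the paper's criterion for picking which neurons to copy is not shown, and all names here are placeholders.

```python
# Neuron transplantation sketch (illustrative; the selection method that
# identifies which neurons encode the target behaviour is assumed given).
import torch
import torch.nn as nn

donor = nn.Linear(16, 32)      # layer holding the behaviour we want
recipient = nn.Linear(16, 32)  # same-shape layer in the model being patched

# Hypothetical neuron indices identified as encoding the target behaviour.
neuron_ids = torch.tensor([3, 7, 19])

with torch.no_grad():
    # Each output neuron is one row of the weight matrix plus its bias entry,
    # so patching a handful of rows leaves the rest of the model untouched.
    recipient.weight[neuron_ids] = donor.weight[neuron_ids]
    recipient.bias[neuron_ids] = donor.bias[neuron_ids]
```

Because only a few rows change, the bulk of the recipient's weights (and hence its performance) is left intact, which matches the "almost no performance hit" claim.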
Training speculative decoding models just got an order of magnitude faster, unlocking real-world deployment with a new open-source framework and a suite of production-ready draft models.
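For readers new to the underlying technique, here is a minimal greedy speculative-decoding loop. This is the general textbook idea, not the framework's API: a cheap draft model proposes k tokens, the target verifies them, and decoding falls back at the first mismatch.

```python
# Greedy speculative decoding sketch (general technique, not the framework).
def speculative_step(draft_next, target_next, context, k=4):
    """draft_next/target_next: callables mapping a token list to the next token."""
    proposal, ctx = [], list(context)
    for _ in range(k):
        tok = draft_next(ctx)        # cheap draft model proposes ahead
        proposal.append(tok)
        ctx.append(tok)

    accepted, ctx = [], list(context)
    for tok in proposal:
        if target_next(ctx) == tok:  # in practice: one batched target forward pass
            accepted.append(tok)
            ctx.append(tok)
        else:
            break                    # reject the rest of the draft
    # Always emit at least one target token so decoding makes progress.
    accepted.append(target_next(ctx))
    return accepted

# Toy demo with a perfect draft: all k proposals accepted plus one bonus token.
target = lambda ctx: sum(ctx) % 7
print(speculative_step(target, target, context=[1, 2]))
```

The better the draft model, the more tokens survive each verification pass, which is why cheap, fast draft-model training matters for deployment.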
LLM watermarks can now survive fine-tuning, quantization, and distillation thanks to a new method that embeds them in a stable functional subspace.
Achieve controllable and scalable speech generation with MOSS-TTS, enabling zero-shot voice cloning and long-form synthesis.
Unlock the power of your favorite classifier for ordinal data: Classifier Pooling consistently beats standard methods, especially when data is scarce or categories are numerous.
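The teaser doesn't spell out the method, so here is a sketch of a classic related idea, the Frank and Hall cumulative-binary decomposition: reduce a K-category ordinal problem to K-1 binary "greater than threshold" classifiers and pool their outputs into per-class probabilities. This is illustrative only and may differ from the paper's Classifier Pooling.

```python
# Frank-Hall style ordinal decomposition (a classic related technique, not
# necessarily the paper's Classifier Pooling). Any base classifier plugs in.
import numpy as np
from sklearn.linear_model import LogisticRegression

def fit_ordinal(X, y, n_classes):
    """Fit one P(y > k) classifier per threshold k = 0..n_classes-2."""
    return [LogisticRegression().fit(X, (y > k).astype(int))
            for k in range(n_classes - 1)]

def predict_proba(models, X, n_classes):
    # Stack P(y > k) per threshold, pad with P(y > -1)=1 and P(y > K-1)=0,
    # then differences give P(y = k). (Monotonicity is not enforced here.)
    gt = np.column_stack([m.predict_proba(X)[:, 1] for m in models])
    gt = np.hstack([np.ones((len(X), 1)), gt, np.zeros((len(X), 1))])
    return gt[:, :-1] - gt[:, 1:]

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = np.clip((X @ np.array([1.0, -0.5, 0.3]) + 1.5).round(), 0, 3).astype(int)

models = fit_ordinal(X, y, n_classes=4)
print(predict_proba(models, X[:2], n_classes=4))
```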
YouTube's platform defenses are a house of cards: circumventing one control often triggers a cascade of failures, demanding constant architectural adaptation for large-scale content replication.
LLMs can get a massive multilingual boost, especially in low-resource languages, by offloading translation to specialized models and carefully aligning their representations.
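A hedged sketch of the pipeline pattern the teaser describes: route low-resource input through a dedicated MT model, let the LLM reason in a high-resource pivot language, and translate the answer back. The `translate` and `generate` helpers are placeholders, and the paper's representation-alignment step is not shown.

```python
# Translation-offloading pipeline sketch (helper names are placeholders).
def answer_low_resource(question_lr, translate, generate, src="sw", pivot="en"):
    """translate(text, src, tgt) wraps a specialised MT model;
    generate(prompt) wraps a general-purpose LLM."""
    question_en = translate(question_lr, src=src, tgt=pivot)  # offload to MT model
    answer_en = generate(question_en)                         # reason in pivot language
    return translate(answer_en, src=pivot, tgt=src)           # translate back

# Toy usage with stub models:
echo_mt = lambda text, src, tgt: f"[{src}->{tgt}] {text}"
stub_llm = lambda prompt: f"answer({prompt})"
print(answer_low_resource("Habari gani?", echo_mt, stub_llm))
```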
LLMs encode hierarchical semantic relations asymmetrically, with hypernymy being far more robust and redundantly represented than hyponymy.
Ruyi2.5 achieves comparable performance to Qwen3-VL on general multimodal benchmarks while significantly outperforming it in privacy-constrained surveillance, demonstrating the effectiveness of its edge-cloud architecture.
Current CRL benchmarks often fail to provide a holistic view of model performance, hindering progress, but a new aggregate metric could change that.
ManiDreams lets robots handle real-world uncertainty in manipulation tasks without retraining, outperforming standard RL baselines under various perturbations.
Tackle previously intractable open quantum systems simulations with TENSO, a new open-source package that efficiently handles complex environments via tree tensor networks.
LLMs can be drastically compressed without retraining because the relative ordering of weights matters far more than their exact values, opening the door to efficient, training-free compression techniques.
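A toy illustration of the ordering claim, not the paper's algorithm: quantize a weight matrix by rank, replacing each weight with the mean of its quantile bucket. The relative ordering of weights is preserved exactly while their precise values are discarded.

```python
# Rank-preserving quantization toy (illustrative, not the paper's method):
# each weight stores only its quantile bucket, i.e. log2(n_buckets) bits.
import numpy as np

def rank_quantize(w, n_buckets=16):
    flat = w.ravel()
    order = np.argsort(flat)                    # the ranks carry the information
    buckets = np.array_split(order, n_buckets)  # equal-size quantile buckets
    out = np.empty_like(flat)
    for idx in buckets:
        out[idx] = flat[idx].mean()             # one shared value per bucket
    return out.reshape(w.shape)

rng = np.random.default_rng(0)
W = rng.normal(size=(64, 64))
Wq = rank_quantize(W)

print("distinct values:", np.unique(Wq).size)   # 16 instead of 4096
order = np.argsort(W.ravel())
print("ordering preserved:", bool(np.all(np.diff(Wq.ravel()[order]) >= 0)))
```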
LLMs can mimic human lexical patterns, but larger models act like stereotypical humans, sacrificing diversity for typicality in word associations, a trade-off tunable by temperature.
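The knob mentioned here is ordinary sampling temperature; a minimal sketch of its effect on a toy next-word distribution follows (vocabulary and scores are made up). Low temperature concentrates mass on the stereotypical association, high temperature restores diversity.

```python
# Temperature reshaping a toy next-word distribution (made-up numbers).
import numpy as np

words = ["dog", "cat", "leash", "loyal", "quantum"]
logits = np.array([3.0, 2.5, 1.0, 0.5, -2.0])  # hypothetical association scores

def softmax_t(logits, T):
    z = logits / T
    z -= z.max()                                # numerical stability
    p = np.exp(z)
    return p / p.sum()

for T in (0.3, 1.0, 2.0):
    probs = softmax_t(logits, T)
    print(f"T={T}: " + ", ".join(f"{w}={p:.2f}" for w, p in zip(words, probs)))
```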
A 4B parameter model can nearly match the privilege escalation performance of a state-of-the-art closed LLM like Claude Opus, while being fully local and 100x cheaper to run.
Standardized, modular GenAI teaching units in GUIDE offer a practical path to integrating cutting-edge AI tools into digital design education.
Security patch detectors trained on standard vulnerability databases are practically useless in the real world, losing up to 90% of their F1-score when deployed on in-the-wild data.