May 1 – May 8, 2026

Open-Source Models & Weights - Weekly Roundup

26 papers published across 4 labs.

Top Papers

May 5, 2026

Alan L. McCann · 2w ago

Cryptographic Registry Provenance: Structural Defense Against Dependency Confusion in AI Package Ecosystems

Current package managers are surprisingly vulnerable: a single misconfiguration can silently let attackers inject malicious dependencies. This paper addresses the problem with a cryptographically enforced provenance system.

Alan L. McCann

Code Generation & Program Synthesis Open-Source Models & Weights Red-Teaming & Adversarial Robustness

May 6, 2026

Moshe Eliasof +4 · 2w ago

Bridging Input Feature Spaces Towards Graph Foundation Models

Graph models can now generalize to entirely new datasets with different input features, thanks to a simple projection into a shared random space.

Moshe Eliasof, Krishna Sri Ipsit Mantri, Beatrice Bevilacqua +2

Architecture Design (Transformers, SSMs, MoE) Open-Source Models & Weights

Independent Researcher · 2w ago

PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

Synthetic data augmentation and per-language threshold tuning can significantly boost the performance of LLMs on multilingual tasks, outperforming alternative architectures that showed promise on the development set.

Srikar Kashyap Pulipaka

Data Curation & Synthetic Data Natural Language Processing Open-Source Models & Weights

Álvaro Becerra +2 · 2w ago · also School of Engineering

AICoFe: Implementation and Deployment of an AI-Based Collaborative Feedback System for Higher Education

Teachers can now scalably provide high-quality, personalized feedback to students by leveraging a multi-LLM system that synthesizes rubric data and qualitative observations, while retaining control through a teacher-in-the-loop workflow.

Álvaro Becerra, A. Palma, Ruth Cobos

Natural Language Processing Open-Source Models & Weights Tool Use & Agents

Ivan Bondarenko +5 · 2w ago

RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation

A judge-orchestrated ensemble of diverse LLMs trounces single models in multi-turn response generation, suggesting that strategic model selection beats brute-force scaling.

Ivan Bondarenko, Roman Derunets, Oleg Sedukhin +3

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

All Papers (26)

May 6, 2026

Independent Researcher · 2w ago

PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

Srikar Kashyap Pulipaka

Data Curation & Synthetic Data Natural Language Processing Open-Source Models & Weights

Moshe Eliasof +4 · 2w ago

Bridging Input Feature Spaces Towards Graph Foundation Models

Graph models can now generalize to entirely new datasets with different input features, thanks to a simple projection into a shared random space.

Moshe Eliasof, Krishna Sri Ipsit Mantri, Beatrice Bevilacqua +2

Architecture Design (Transformers, SSMs, MoE) Open-Source Models & Weights

Álvaro Becerra +2 · 2w ago · also School of Engineering

AICoFe: Implementation and Deployment of an AI-Based Collaborative Feedback System for Higher Education

Álvaro Becerra, A. Palma, Ruth Cobos

Natural Language Processing Open-Source Models & Weights Tool Use & Agents

Ivan Bondarenko +5 · 2w ago

RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation

A judge-orchestrated ensemble of diverse LLMs trounces single models in multi-turn response generation, suggesting that strategic model selection beats brute-force scaling.

Ivan Bondarenko, Roman Derunets, Oleg Sedukhin +3

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

Wei Liu +2 · 2w ago

Open-Source Image Editing Models Are Zero-Shot Vision Learners

Open-source image editing models can match or beat fine-tuned models on visual understanding tasks *without any task-specific training*.

Wei Liu, Jiaxin Lin, Rui Chen

Computer Vision Multimodal Models Open-Source Models & Weights

CMU ML · 2w ago · also SKKU, UBC

Harnessing Linguistic Dissimilarity for Language Generalization on Unseen Low-Resource Varieties

Dissimilarity, not just similarity, unlocks better language generalization for low-resource varieties.

Jinju Kim, Haeji Jung, Youjeong Roh +2

Data Curation & Synthetic Data Natural Language Processing Open-Source Models & Weights

M. Arabov · 2w ago

TajikNLP: An Open-Source Toolkit for Comprehensive Text Processing of Tajik (Cyrillic Script)

Unlock Tajik NLP: a new open-source toolkit delivers a comprehensive pipeline for processing Cyrillic-script Tajik text, complete with datasets and pre-trained embeddings.

M. Arabov

Data Curation & Synthetic Data Natural Language Processing Open-Source Models & Weights

University of Pavia · 2w ago · also Radboud

You Snooze, You Lose: Automatic Safety Alignment Restoration through Neural Weight Translation

Forget retraining: NeWTral instantly restores safety to your LLM after adding a risky LoRA, slashing attack success rates from 70% to 13% without sacrificing expertise.

Marco Arazzi, Vignesh Kumar Kembu, Antonino Nocera +2

Constitutional AI & AI Ethics Open-Source Models & Weights Red-Teaming & Adversarial Robustness

2w ago

Not All Faults Are Equal: Transient-Fault Sensitivity Characterization of an Open-Source RISC-V Vector Cluster

Exponent bits are the Achilles' heel of floating-point arithmetic, as corrupting them in RISC-V vector processors leads to the most severe silent data corruption.

M. Cai, Amirhossein Kiamarzi, Davide Rossi +1

Architecture Design (Transformers, SSMs, MoE) Distributed Systems & Hardware Open-Source Models & Weights

Gaolin Ge +5 · 2w ago

3D Printing of Passively Actuated Self-Folding Robots with Integrated Functional Modules

Forget complex assembly: this 3D printing technique lets you pop out functional, self-folding robots with integrated sensors and actuators directly from a flat sheet.

Gaolin Ge, Qifeng Yang, Haoran Lu +3

Architecture Design (Transformers, SSMs, MoE) Open-Source Models & Weights Robotics & Embodied AI

2w ago

Low-Rank Adaptation of Geospatial Foundation Models for Wildfire Mapping Using Sentinel-2 Data

Forget full fine-tuning: LoRA lets you adapt Geospatial Foundation Models for wildfire mapping with comparable accuracy while only tweaking 1% of the parameters.

Ali Shibli, Andrea Nascetti, Yifang Ban

Computer Vision Open-Source Models & Weights Training Efficiency & Optimization

May 5, 2026

2w ago

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

Forget resource-intensive pipelines: a purely academic team achieves SOTA search agent performance with just 10.6k SFT data points, outperforming models trained with CPT+SFT+RL.

Yuwen Du, Rui Ye, Shuo Tang +4

Eval Frameworks & Benchmarks Open-Source Models & Weights Tool Use & Agents

Yao-Shun Chuang +8 · 2w ago

Self-Prompting Small Language Models for Privacy-Sensitive Clinical Information Extraction

Forget massive models: small, locally deployable language models can achieve surprisingly strong performance on privacy-sensitive clinical information extraction tasks with self-prompting and preference-based optimization.

Yao-Shun Chuang, Tushti Mody, Uday Pratap Singh +6

Inference & Quantization Natural Language Processing Open-Source Models & Weights

Stephen E. Moore +15 · 2w ago

Nsanku: Evaluating Zero-Shot Translation Performance of LLMs for Ghanaian Languages

Despite impressive multilingual capabilities, today's LLMs still can't reliably translate between English and Ghanaian languages at scale.

Stephen E. Moore, M. Owusu, Akwasi Asare +13

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

Hoffmann Muki +1 · 2w ago

Are LLMs Ready for Conflict Monitoring? Empirical Evidence from West Africa

LLMs exhibit a surprising "False Illegitimation bias," systematically misclassifying legitimate battles as violence against civilians, highlighting a critical flaw for conflict monitoring applications.

Hoffmann Muki, Olukunle P. Owolabi

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

D. Gringras +1 · 2w ago

Frontier Lag: A Bibliometric Audit of Capability Misrepresentation in Academic AI Evaluation

LLM benchmarks are increasingly measuring the capabilities of yesterday's models, not today's frontier, creating a widening gap that misrepresents the state of AI.

D. Gringras, Misha Salahshoor

Eval Frameworks & Benchmarks Open-Source Models & Weights

2w ago · also NTU, ZJU

ZK-Value: A Practical Zero-Knowledge System for Verifiable Data Valuation

Finally, a zero-knowledge data valuation system that scales: ZK-Value proves Shapley values in seconds to minutes, beating specialized ZK baselines by over an order of magnitude.

Zhaoyu Wang, Pingchuan Ma, Zhantong Xue +10

Data Curation & Synthetic Data Open-Source Models & Weights

Alan L. McCann · 2w ago

Cryptographic Registry Provenance: Structural Defense Against Dependency Confusion in AI Package Ecosystems

Alan L. McCann

Code Generation & Program Synthesis Open-Source Models & Weights Red-Teaming & Adversarial Robustness

Sinan Bank +1 · 2w ago

OPENJ: A Conceptual Framework for Open-Source Digital Human Modeling and Ergonomic Assessment in a CAD Environment

An open-source alternative to expensive, proprietary digital human modeling software could democratize ergonomic analysis and workplace design.

Sinan Bank, Casey E. Eaton

Computer Vision Open-Source Models & Weights Robotics & Embodied AI

Daniel C. Elton +1 · 2w ago

Benchmarking open-source tools for in silico antiviral drug discovery

Public antiviral drug discovery datasets are riddled with errors that can be fixed with careful polyprotein splitting, unlocking significant performance gains in binding affinity prediction.

Daniel C. Elton, Preston W. Estep

Eval Frameworks & Benchmarks Open-Source Models & Weights Scientific Discovery & Drug Design

Jing Gong · 2w ago

MiniMind-O Technical Report: An Open Small-Scale Speech-Native Omni Model

Open-sourcing a 0.1B-scale speech-native omni model lets you directly inspect the complete interaction loop and reveals critical design choices for building effective small multimodal models.

Jing Gong

Multimodal Models Open-Source Models & Weights Speech & Audio

University of Tennessee · 2w ago · also ORNL

Exploring Sustainability in Scientific Software through Code Quality & Test Coverage Metrics

Sustainable scientific software isn't just about the code; it's about consistent testing and clear links between code quality and tests, a pattern often missing in unsustainable projects.

Sheikh Md. Mushfiqur Rahman, Gregory R. Watson, Nasir U. Eisty

Code Generation & Program Synthesis Open-Source Models & Weights Scientific Discovery & Drug Design

May 4, 2026

AI2 · 2w ago · also NUS, UW, JHU, UMich +1

MolmoAct2: Action Reasoning Models for Real-world Deployment

Open-sourcing a VLA model that beats closed-source giants on embodied reasoning tasks could finally make real-world robot deployment practical.

Haoquan Fang, Jiafei Duan, Donovan Clay +26

Multimodal Models Open-Source Models & Weights Robotics & Embodied AI

2w ago

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Autonomous agents can produce plausible-sounding research that's subtly wrong, so ARIS uses adversarial collaboration between different LLMs to catch these errors.

Ruofeng Yang, Yongcan Li, Shuai Li

Eval Frameworks & Benchmarks Open-Source Models & Weights Tool Use & Agents

Venkata Pushpak Teja Menta · 2w ago

The TTS-STT Flywheel: Synthetic Entity-Dense Audio Closes the Indic ASR Gap Where Commercial and Open-Source Systems Fail

Synthetic data closes the Indic ASR gap where commercial and open-source systems fail, boosting entity recognition by up to 22x.

Venkata Pushpak Teja Menta

Data Curation & Synthetic Data Open-Source Models & Weights Speech & Audio

May 1, 2026

Daniel Song +23 · 3w ago

Code World Model Preparedness Report

Meta's risk assessment of its Code World Model (CWM) gives it a clean bill of health, concluding it poses no *new* catastrophic risks beyond those already present in the AI landscape.

Daniel Song, Peter Ney, Cristina Menghini +21

Code Generation & Program Synthesis Open-Source Models & Weights Red-Teaming & Adversarial Robustness