April 24 – May 1, 2026

Open-Source Models & Weights - Weekly Roundup

48 papers published across 5 labs.

Selected Labs publishing this week

Top Papers

Apr 28, 2026

Venkata Pushpak Teja Menta3w ago

Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost

Achieve near-native Indic TTS from a non-Indic base model at zero commercial-training-data cost by cleverly combining phoneme space unification, LoRA adaptation, and voice-prompt recovery.

Venkata Pushpak Teja Menta

Natural Language Processing Open-Source Models & Weights Speech & Audio

Apr 27, 2026

Rakshit Soni +93w ago

OpenPodcar2: a robust, ROS2 vehicle for self-driving research

Democratizing self-driving research, OpenPodcar2 offers a robust, low-cost (≈$7k new, $2k used), open-source autonomous vehicle platform ready for ROS2 integration and real-world deployment.

Rakshit Soni, Rakshit Soni, Chris Waltham +7

Distributed Systems & Hardware Open-Source Models & Weights Robotics & Embodied AI

May 1, 2026

Daniel Song +233w ago

Code World Model Preparedness Report

Meta's risk assessment of its Code World Model (CWM) gives it a clean bill of health, concluding it poses no *new* catastrophic risks beyond those already present in the AI landscape.

Daniel Song, Peter Ney, Cristina Menghini +21

Code Generation & Program Synthesis Open-Source Models & Weights Red-Teaming & Adversarial Robustness

Apr 30, 2026

Binghao Huang +23w ago

FlexiTac: A Low-Cost, Open-Source, Scalable Tactile Sensing Solution for Robotic Systems

Unlock advanced robotic manipulation with FlexiTac, a tactile sensing solution so cheap and easy to integrate, you'll wonder why you were using anything else.

Binghao Huang, Yunzhu Li, Yunzhu Li

Open-Source Models & Weights Robotics & Embodied AI

Sof'ia P'erez Casulo +73w ago

A Unified Framework of Hyperbolic Graph Representation Learning Methods

Hyperbolic embeddings are powerful, but a fragmented ecosystem makes them hard to use—this framework finally puts them all in one place.

Sof'ia P'erez Casulo, Sofía Pérez Casulo, Marcelo Fiori +5

Architecture Design (Transformers, SSMs, MoE)Eval Frameworks & Benchmarks Open-Source Models & Weights

All Papers (48)

May 1, 2026

Daniel Song +233w ago

Code World Model Preparedness Report

Meta's risk assessment of its Code World Model (CWM) gives it a clean bill of health, concluding it poses no *new* catastrophic risks beyond those already present in the AI landscape.

Daniel Song, Peter Ney, Cristina Menghini +21

Code Generation & Program Synthesis Open-Source Models & Weights Red-Teaming & Adversarial Robustness

Apr 30, 2026

Binghao Huang +23w ago

FlexiTac: A Low-Cost, Open-Source, Scalable Tactile Sensing Solution for Robotic Systems

Unlock advanced robotic manipulation with FlexiTac, a tactile sensing solution so cheap and easy to integrate, you'll wonder why you were using anything else.

Binghao Huang, Yunzhu Li, Yunzhu Li

Open-Source Models & Weights Robotics & Embodied AI

Sof'ia P'erez Casulo +73w ago

A Unified Framework of Hyperbolic Graph Representation Learning Methods

Hyperbolic embeddings are powerful, but a fragmented ecosystem makes them hard to use—this framework finally puts them all in one place.

Sof'ia P'erez Casulo, Sofía Pérez Casulo, Marcelo Fiori +5

Architecture Design (Transformers, SSMs, MoE)Eval Frameworks & Benchmarks Open-Source Models & Weights

R. A. Gouvêa +33w ago·also Institute of Condensed Matter and Nanosciences, Louvain-la-Neuve, Université Catholique de Louvain, WEL Research Institute

VibroML: an automated toolkit for high-throughput vibrational analysis and dynamic instability remediation of crystalline materials using machine-learned potentials

Forget computationally verifying stability – VibroML automatically *fixes* dynamically unstable crystal structures, opening the door to exploring previously inaccessible materials.

R. A. Gouvêa, Rogério Almeida Gouvêa, Gian-Marco Rignanese +1

Open-Source Models & Weights Scientific Discovery & Drug Design

Dawid Wisniewski +13w ago

Beyond Semantics: Measuring Fine-Grained Emotion Preservation in Small Language Model-Based Machine Translation

Even with emotion-aware prompting, today's best small language models still struggle to preserve subtle emotional nuances when translating between languages.

Dawid Wisniewski, Igor Czudy

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

Tsinghua AI3w ago·also MiniCPM-o Team, Tencent AI

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Forget turn-based interactions: MiniCPM-o 4.5 lets you build AI that sees, hears, speaks, and *reacts* in real-time, all on a device with only 12GB of RAM.

Junbo Cui, Bokai Xu, Chongyi Wang +36

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Open-Source Models & Weights

3w ago·also Osaka, Wakayama University

A Longitudinal Analysis of Good First Issue Practices and Newcomer Pull Requests in Popular OSS Projects

Newcomers beware: the odds of your "good first issue" pull request getting merged have plummeted nearly 20% in the last year.

Hirotatsu Hoshikawa, Hidetake Tanaka, Kazumasa Shimari +3

Code Generation & Program Synthesis Natural Language Processing Open-Source Models & Weights

A. Sadallah +93w ago·also Zayed University of Artificial

Instruction-Guided Poetry Generation in Arabic and Its Dialects

Forget Shakespeare, LLMs can now sling verses in Arabic dialects, thanks to a new dataset for instruction-guided poetry generation.

A. Sadallah, Abdelrahman Sadallah, Ka-reem Elozeiri +7

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

3w ago

Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs

LLMs exhibit surprisingly human-like biases and overconfidence in math, revealed by a new dataset mapping their mathematical reasoning across diverse personas.

Naomi Esposito, Anthony Tricarico, A. Tricarico +5

Eval Frameworks & Benchmarks Open-Source Models & Weights Reasoning & Chain-of-Thought

Jullajak Karnjanaekarin +73w ago

JaiTTS: A Thai Voice Cloning Model

Thai voice cloning just leapfrogged human performance on short-duration speech, thanks to a new model that directly handles code-switching and numerals.

Jullajak Karnjanaekarin, Pontakorn Trakuekul, Narongkorn Panitsrisit +5

Natural Language Processing Open-Source Models & Weights Speech & Audio

3w ago·also BAIR, Mila, Toronto Metropolitan University, UofT

A Reproducibility Study of LLM-Based Query Reformulation

LLM-powered query reformulation, a hot topic in IR, often fails to translate gains from lexical to neural retrieval, and bigger models don't always help.

Amin Bigdeli, Radin Hamidi Rad, Hai Son Le +4

Eval Frameworks & Benchmarks Open-Source Models & Weights Recommendation & Information Retrieval

Jon-Paul Cacioli3w ago

Beyond the Mean: Within-Model Reliable Change Detection for LLM Evaluation

LLM upgrades are a chaotic mix of progress and decay: despite overall gains, up to 47% of questions get *worse* after an update, and single-shot evals miss almost half of these critical regressions.

Jon-Paul Cacioli

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

China Telecom Research Institute3w ago

How Code Representation Shapes False-Positive Dynamics in Cross-Language LLM Vulnerability Detection

LLMs trained on raw code text learn surface-level cues that trigger false positives when detecting vulnerabilities in other languages, but simply feeding them ASTs at inference time can dramatically reduce these errors.

Maofei Chen, Laifu Wang, Yue Qin +5

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Open-Source Models & Weights

Zi Li +63w ago

Secret Stealing Attacks on Local LLM Fine-Tuning through Supply-Chain Model Code Backdoors

You can steal secrets from locally fine-tuned LLMs by backdooring their model code, even bypassing common defenses like differential privacy and code audits.

Zi Li, Tian Zhou, Tianyang Zhou +4

Code Generation & Program Synthesis Open-Source Models & Weights Red-Teaming & Adversarial Robustness

Rochester Institute of Technology3w ago

Unsafe and Unused? A History of Utility Code in Mature Open Source Projects

"Utility" code, intended to be broadly useful and reusable, is actually 2.75x more likely to be involved in a vulnerability than other code.

Brandon N. Keller, Brandon Keller, Kaitlin Yandik +5

Code Generation & Program Synthesis Open-Source Models & Weights

IIIT Allahabad3w ago·also IIIT Hyderabad, IIIT Manipur

Multifaceted Hero Developers and Bug-Fixing Outcomes Across Severity

Defining "hero developers" in open-source projects is more nuanced than previously thought: technical prowess doesn't guarantee social engagement, and vice versa, impacting bug-fixing success in surprising ways.

Amit Kumar, Mahen Gandhi, Meher Bhardwaj +2

Code Generation & Program Synthesis Natural Language Processing Open-Source Models & Weights

Apr 29, 2026

University of Hildesheim3w ago

Reproducible Automated Program Repair Is Hard -- Experiences With the Defects4J Dataset

Reproducibility issues plague over 20% of Defects4J, a widely used benchmark for automated program repair, casting doubt on the validity of many APR evaluations.

Adam Krafczyk, Klaus Schmid

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Open-Source Models & Weights

Ericsson AB3w ago·also KTH

Where did we fail? -- Reproducing build failures in embedded open source software

Replaying CI failures in embedded systems is now possible at scale: PhantomRun reconstructs over 90% of failing builds, opening the door to systematic debugging and failure analysis.

Han Fu, Andreas Ermedahl, Sigrid Eldh +3

Code Generation & Program Synthesis Distributed Systems & Hardware Open-Source Models & Weights

QUT3w ago·also Edith Cowan University, Research Graduate School

eDySec: A Deep Learning-based Explainable Dynamic Analysis Framework for Detecting Malicious Packages in PyPI Ecosystem

You can slash false positives in PyPI malware detection by 82% while simultaneously reducing feature dimensionality by 50% using a carefully tuned deep learning approach.

Sk Tanzir Mehedi, Raja Jurdak, Chadni Islam +2

Code Generation & Program Synthesis Open-Source Models & Weights

3w ago·also KCL, Universidad del Atlantico Medio

Hot Fixing in the Wild

AI agents and humans exhibit over 10 distinct repair behaviors when performing urgent hot fixes, suggesting opportunities for targeted human-automation collaboration.

Carol Hanna, Karine Even-Mendoza, W. B. Langdon +3

Code Generation & Program Synthesis Open-Source Models & Weights

3w ago

Revealing NVIDIA Closed-Source Driver Command Streams for CPU-GPU Runtime Behavior Insight

NVIDIA's closed-source driver secrets are out: researchers can now see the exact hardware commands triggered by CUDA code.

Yuang Yan, Ian Karlin, Ryan Grant

Distributed Systems & Hardware Open-Source Models & Weights

Jon-Paul Cacioli3w ago

Instruction Complexity Induces Positional Collapse in Adversarial LLM Evaluation

Complex, multi-step instructions can cause LLMs to completely ignore question content and instead rely on positional shortcuts when asked to underperform, revealing a critical vulnerability in adversarial evaluation.

Jon-Paul Cacioli

Eval Frameworks & Benchmarks Open-Source Models & Weights Red-Teaming & Adversarial Robustness

3w ago·also Children's National Hospital, HKU

Domain-Adapted Small Language Models for Reliable Clinical Triage

Forget giant LLMs: fine-tuned small language models can actually *beat* GPT-4o on critical clinical tasks like emergency triage.

Manar Aljohani, Brandon Ho, Kenneth McKinley +2

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

3w ago

Hypencoder Revisited: Reproducibility and Analysis of Non-Linear Scoring for First-Stage Retrieval

Non-linear scoring with Hypencoders boosts retrieval performance, but don't expect it to fix your speed or adversarial robustness problems.

Arne Eichholtz, Yongkang Li, Jutte Vijverberg +2

Natural Language Processing Open-Source Models & Weights Recommendation & Information Retrieval

3w ago

Taking a Bite Out of the Forbidden Fruit: Characterizing Third-Party Iranian iOS App Stores

Sanctions and censorship breed a shadow economy: Iranian third-party iOS app stores are rife with cracked apps, unauthorized monetization, and privacy-invading trackers.

Amirhossein Khanlari, Amir Rahmati

Natural Language Processing Open-Source Models & Weights

3w ago

Enhancing Linux Privilege Escalation Attack Capabilities of Local LLM Agents

Local LLMs can now rival cloud-based giants like GPT-4o in Linux privilege escalation tasks, thanks to targeted system-level and prompting interventions.

Benjamin Probst, Andreas Happe, Jürgen Cito

Open-Source Models & Weights Red-Teaming & Adversarial Robustness Tool Use & Agents

University of Alabama at Birmingham3w ago

OpenSOC-AI: Democratizing Security Operations with Parameter Efficient LLM Log Analysis

SMBs drowning in security logs can now achieve enterprise-grade threat detection with a lightweight, open-source framework fine-tuned on a tiny LLM.

Chaitanya Vilas Garware, Sharif Noor Zisad

Natural Language Processing Open-Source Models & Weights Training Efficiency & Optimization

3w ago·also IMT School for Advanced Studies Lucca, Napoli

What Makes Software Bugs Escape Testing? Evidence from a Large-Scale Empirical Study

Post-release software bugs aren't just about code complexity; they're a symptom of code age, frequent modification, and high churn, demanding a shift in testing focus.

Domenico Cotroneo, Giuseppe De Rosa, Cristina Improta +1

Code Generation & Program Synthesis Open-Source Models & Weights

Apr 28, 2026

Amir M. Saeidi +73w ago·also ASU, Cisco Research

FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments

Open-source LLM agents can get a 27% performance boost in tool use by strategically injecting context tailored to address their most common failure modes.

Amir M. Saeidi, Amir Saeidi, Venkatesh Mishra +5

Eval Frameworks & Benchmarks Open-Source Models & Weights Tool Use & Agents

Venkata Pushpak Teja Menta3w ago

Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost

Achieve near-native Indic TTS from a non-Indic base model at zero commercial-training-data cost by cleverly combining phoneme space unification, LoRA adaptation, and voice-prompt recovery.

Venkata Pushpak Teja Menta

Natural Language Processing Open-Source Models & Weights Speech & Audio

DAMO3w ago

Marco-MoE: Open Multilingual Mixture-of-Expert Language Models with Efficient Upcycling

Multilingual MoEs can achieve best-in-class performance-to-compute ratios, even with extreme sparsity, by strategically upcycling from dense models and exhibiting structured expert activation patterns across languages.

Fan Jiang, Yu Zhao, Chenyang Lyu +5

Architecture Design (Transformers, SSMs, MoE)Open-Source Models & Weights Training Efficiency & Optimization

3w ago

OxyGent: Making Multi-Agent Systems Modular, Observable, and Evolvable via Oxy Abstraction

Plug-and-play multi-agent systems are now a reality: OxyGent's "Lego-like" abstraction lets you compose agents, tools, and LLMs into scalable systems with unprecedented observability and evolvability.

Junxing Hu, Tianlong Li, Lei Yu +1

Open-Source Models & Weights Reasoning & Chain-of-Thought Tool Use & Agents

Verdict Security3w ago·also Ain Shams University

Prime-Field PINI: Machine-Checked Composition Theorems for Post-Quantum NTT Masking

Fresh masking between pipeline stages in NTT-based post-quantum crypto isn't just good practice, it's provably necessary to erase vulnerabilities arising from prior stages, as demonstrated with a machine-checked proof and a real-world hardware flaw.

Ray Iskander, Khaled Kirah

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Open-Source Models & Weights

3w ago·also Central South University, Cyprus University of Technology, ERATOSTHENES Centre of Excellence, Federal University of Santa Catarina +4

EOS-Bench: A Comprehensive Benchmark for Earth Observation Satellite Scheduling

EOS-Bench reveals that the complexity of satellite scheduling can be systematically quantified, unlocking new insights into algorithm performance across thousands of scenarios.

Qiannan Yin, Qian Yin, Jiaxing Li +29

Eval Frameworks & Benchmarks Open-Source Models & Weights Scientific Discovery & Drug Design

Wenzhi Bai +53w ago·also Manchester

SlicerRoboTMS: An Open-Source 3D Slicer Extension for Robot-Assisted Transcranial Magnetic Stimulation

SlicerRoboTMS revolutionizes Robo-TMS research by providing a versatile, open-source platform that simplifies integration and enhances reproducibility.

Wenzhi Bai, Yi Guo, Yituo Guo +3

Computer Vision Open-Source Models & Weights Robotics & Embodied AI

Apr 27, 2026

Zhongjie Duan +23w ago

Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion

Finally, a plugin framework that lets you mix-and-match KV-Cache, LoRA, and other controls to steer diffusion models without being locked into a specific backbone.

Zhongjie Duan, Hong Zhang, Yingda Chen

Architecture Design (Transformers, SSMs, MoE)Computer Vision Open-Source Models & Weights

NVIDIA3w ago·also Texas Tech University

CiteRadar: A Citation Intelligence Platform for Researcher Profiling and Geographic Visualization

See where your citations are coming from with a single command, thanks to CiteRadar's open-source platform that automatically generates interactive maps and detailed researcher profiles from your Google Scholar ID.

Chenxu Niu, Yiming Sun

Natural Language Processing Open-Source Models & Weights Recommendation & Information Retrieval

William Oliveira3w ago

Less Is More: Engineering Challenges of On-Device Small Language Model Integration in a Mobile Application

On-device SLMs in mobile apps demand a radical shift: the less the LLM does, the more reliable it becomes.

William Oliveira

Inference & Quantization Natural Language Processing Open-Source Models & Weights

3w ago

Evaluating Cryptographic API Misuse Detectors for Go

Go's security-critical infrastructure is riddled with thousands of cryptographic API misuses, and your favorite static analysis tool might be missing them.

Vivi Andersson, Martin Monperrus

Code Generation & Program Synthesis Open-Source Models & Weights

Abdallah Abou Hasna +23w ago

From Spoofing to Trust: Emergency Alerts Spoofing Testbed and Cross-Cell Verification

5G emergency alert systems are surprisingly vulnerable to spoofing attacks that can do more than just display fake warnings.

Abdallah Abou Hasna, N. Chendeb, A. Falou

Open-Source Models & Weights Red-Teaming & Adversarial Robustness

Lahore University of Management Sciences3w ago

On the Footprints of Reviewer Bots Feedback on Agentic Pull Requests in OSS GitHub Repositories

More reviewer bot comments on agentic pull requests actually *increase* resolution time, suggesting that quality trumps quantity in automated code review.

Syeda Kaneez Fatima, Yousuf Abrar, Abdul Rehman Tahir +3

Code Generation & Program Synthesis Open-Source Models & Weights Tool Use & Agents

Department of Computer and Software3w ago·also School of Computer Science

Putting a Face to the Issue: Fostering User Empathy of Open Source Software Developers With PersonaFlow

OSS developers who saw automatically generated user personas responded to issues with more empathy and tailored explanations, suggesting a simple UI intervention can bridge the user-developer gap.

Boniface Bahati Tadjuidje, Jin L. C. Guo, Jinghui Cheng

Code Generation & Program Synthesis Natural Language Processing Open-Source Models & Weights

Liyou Chen +53w ago·also Shaanxi Normal University

Vulnerability Identification by Harnessing Inter-connected Multi-Source Information

Open-source library vulnerabilities are easier to spot when you connect the dots between bug reports, code changes, and commit messages.

Liyou Chen, Hailong Sun, Xiang Gao +3

Code Generation & Program Synthesis Natural Language Processing Open-Source Models & Weights

Jorge L. A. Lima +13w ago

Aycromo: An Open-Source Platform for Automatic Chromosome Detection in Metaphase Images Based on Deep Learning

Cytogeneticists can now slash chromosome analysis time from days to seconds with Aycromo, an open-source platform that democratizes access to high-performance deep learning models.

Jorge L. A. Lima, Filipe R. Cordeiro

Computer Vision Open-Source Models & Weights Scientific Discovery & Drug Design

Zhongzheng Zhang +73w ago

TEACar: An Open-Source Autonomous Driving Platform

An open-source autonomous driving platform offers researchers a modular, scalable, and cost-effective alternative to complex and restrictive hardware validation setups.

Zhongzheng Zhang, Maxwell Ruyle, A. Kappes +5

Computer Vision Open-Source Models & Weights Robotics & Embodied AI

Rakshit Soni +93w ago

OpenPodcar2: a robust, ROS2 vehicle for self-driving research

Democratizing self-driving research, OpenPodcar2 offers a robust, low-cost (≈$7k new, $2k used), open-source autonomous vehicle platform ready for ROS2 integration and real-world deployment.

Rakshit Soni, Rakshit Soni, Chris Waltham +7

Distributed Systems & Hardware Open-Source Models & Weights Robotics & Embodied AI

3w ago·also IIT Delhi, Indraprastha Institute of Information, Jaypee Institute of Information

Learning Illumination Control in Diffusion Models

Open-source diffusion models can now achieve state-of-the-art illumination control rivaling closed-source alternatives, thanks to a novel training pipeline and dataset.

Nishit Anand, Manan Suri, Christopher Metzler +2

Computer Vision Data Curation & Synthetic Data Open-Source Models & Weights

3w ago·also McGill, School of Computer Science

What If We Work Together? Fostering Reflections on Designer Inclusion in Open Source Software Through Speculative Design

Speculative design can effectively catalyze critical reflection and generate actionable insights for fostering designer inclusion within the often developer-centric world of Open Source Software.

Rozhan Hozhabri Nezhad, Rozhan Hozhabri Nezhad, Jin L. C. Guo +2

Natural Language Processing Open-Source Models & Weights Tool Use & Agents

Search

Open-Source Models & Weights - Weekly Roundup

Selected Labs publishing this week

Top Papers

All Papers (48)