67 papers published by 1 lab.
Training LLMs on temporally partitioned data offers a practical method for mitigating lookahead bias, enabling more reliable financial forecasting.
Achieve controllable and scalable speech generation with MOSS-TTS, enabling zero-shot voice cloning and long-form synthesis.
Unlock the power of your favorite classifier for ordinal data: Classifier Pooling consistently beats standard methods, especially when data is scarce or categories are numerous.
YouTube's platform defenses are a house of cards: circumventing one control often triggers a cascade of failures, demanding constant architectural adaptation for large-scale content replication.
LLMs can get a massive multilingual boost, especially in low-resource languages, by offloading translation to specialized models and carefully aligning their representations.
LLMs encode hierarchical semantic relations asymmetrically, with hypernymy being far more robust and redundantly represented than hyponymy.
Ruyi2.5 achieves comparable performance to Qwen3-VL on general multimodal benchmarks while significantly outperforming it in privacy-constrained surveillance, demonstrating the effectiveness of its edge-cloud architecture.
Current CRL benchmarks often fail to provide a holistic view of model performance, hindering progress, but a new aggregate metric could change that.
ManiDreams lets robots handle real-world uncertainty in manipulation tasks without retraining, outperforming standard RL baselines under various perturbations.
Tackle previously intractable open quantum systems simulations with TENSO, a new open-source package that efficiently handles complex environments via tree tensor networks.
LLMs can be drastically compressed without retraining because the relative ordering of weights matters far more than their exact values, opening the door to efficient, training-free compression techniques.
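A minimal sketch of the underlying intuition, not the paper's actual algorithm: if only the relative ordering of weights carries the signal, a weight matrix can be re-coded by rank against a tiny shared codebook, shrinking storage while keeping the ordering intact.

```python
import numpy as np

# Toy rank-preserving re-coding of a weight matrix. Each weight is replaced
# by one of n_levels codebook values chosen by its rank, so the ordering is
# (coarsely) preserved while storage drops to log2(n_levels) bits per weight.
# Illustrative assumption only -- not the paper's compression technique.

def rank_recode(w: np.ndarray, n_levels: int = 16) -> np.ndarray:
    flat = w.ravel()
    ranks = np.empty(flat.size, dtype=np.int64)
    ranks[flat.argsort()] = np.arange(flat.size)        # rank of each weight
    buckets = (ranks * n_levels) // flat.size           # bucket in [0, n_levels)
    codebook = np.quantile(flat, (np.arange(n_levels) + 0.5) / n_levels)
    return codebook[buckets].reshape(w.shape)

w = np.random.randn(64, 64).astype(np.float32)
w_hat = rank_recode(w)                                  # ~4 bits/weight + codebook
print(np.corrcoef(w.ravel(), w_hat.ravel())[0, 1])      # ordering largely intact
```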
LLMs can mimic human lexical patterns, but larger models act like stereotypical humans, sacrificing diversity for typicality in word associations, a trade-off tunable by temperature.
A 4B parameter model can nearly match the privilege escalation performance of a state-of-the-art closed LLM like Claude Opus, while being fully local and 100x cheaper to run.
Standardized, modular GenAI teaching units in GUIDE offer a practical path to integrating cutting-edge AI tools into digital design education.
Security patch detectors trained on standard vulnerability databases are practically useless in the real world, losing up to 90% F1-score when deployed on in-the-wild data.
This Italian LLM punches way above its weight, matching the performance of models trained on 6-10x more data while using only 3B active parameters during inference.
A small, synthetically generated dataset can dramatically improve LLM performance on low-resource languages, even when the data is noisy and imperfect.
Early-career researchers in experimental physics report significant gaps in training for software and machine learning tools crucial to their work, highlighting a critical need for improved educational resources.
Achieve sub-microsecond decoding-feedback latency in a scalable, open-source QEC system, bringing fault-tolerant quantum computation closer to reality.
A new 1.25B-word Pashto corpus boosts NER performance by 10% and slashes training variance nearly 7x, highlighting the disproportionate value of Wikipedia data.
CRAG's retrieval evaluator surprisingly relies on named entity alignment, not semantic similarity, to judge document quality.
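A toy probe in the spirit of that finding (the spaCy pipeline name is an assumption; any NER tagger works): score query-document pairs purely by named-entity overlap and check how well that alone tracks the evaluator's verdicts.

```python
import spacy

# Score a (query, document) pair by named-entity overlap alone -- a crude
# proxy for the alignment signal the evaluator appears to rely on instead
# of semantic similarity. Pipeline name is an assumption.
nlp = spacy.load("en_core_web_sm")

def entity_overlap(query: str, doc: str) -> float:
    q_ents = {e.text.lower() for e in nlp(query).ents}
    d_ents = {e.text.lower() for e in nlp(doc).ents}
    return len(q_ents & d_ents) / max(len(q_ents), 1)

print(entity_overlap("Who founded SpaceX?",
                     "SpaceX was founded in 2002 by Elon Musk."))
```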
Current time series foundation models struggle with millisecond-resolution 5G network data, revealing a critical gap in their ability to generalize to high-frequency real-world applications.
Code LLMs can achieve SOTA performance in agentic tasks by explicitly modeling the dynamic evolution of software logic across different training stages.
Open-source LLMs can grade UML diagrams with near-human accuracy on individual criteria, paving the way for AI-assisted teaching without relying on proprietary models.
Get competitive multilingual ASR performance with 6x smaller models and 200x less training cost by using balanced fine-tuning and implicit language learning.
A new 32B code LLM trained specifically for industrial tasks crushes existing models on specialized domains like chip design and GPU kernel optimization, while remaining competitive on general coding benchmarks.
A new dataset of 2.56 million verses of Arabic lyrics and poetry opens the door for large-scale computational analysis of Arabic language evolution, cultural trends, and artistic expression.
Identity-based software signing may reduce key management burdens, but it relocates complexity to verification, configuration, and deployment, creating new usability challenges.
A graph neural network can learn accurate force field parameters from scratch, rivaling manually developed force fields and opening avenues for automated force field discovery.
LoRA fine-tuning beats prompting and RAG for adapting smaller language models to domain-specific code generation tasks, offering a path to higher accuracy and domain alignment.
Say goodbye to ad-hoc scripts: this automated workflow slashes manual intervention in NEB calculations, ensuring reproducible reaction path optimization across platforms.
TinyML for agriculture is trending towards localized inference on microcontrollers, but inconsistent resource reporting is slowing down real-world deployment.
A fine-tuned RoBERTa model with only 125M parameters can match the CVE-to-CWE classification accuracy of models 64x larger, proving that strategic fine-tuning and data curation can close the gap between small and large language models.
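The setup itself is plain sequence classification; the curation is the real ingredient. A minimal sketch under assumed details (checkpoint, label inventory, and example text are placeholders, not the paper's exact configuration):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Treat CVE description -> CWE ID as ordinary sequence classification with a
# small encoder. num_labels is an assumed label inventory; the paper's data
# curation and fine-tuning recipe are what close the gap to larger models.
tok = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForSequenceClassification.from_pretrained(
    "roberta-base", num_labels=400)

batch = tok(["Buffer overflow in ... allows remote attackers ..."],  # placeholder CVE text
            return_tensors="pt", truncation=True)
pred = model(**batch).logits.argmax(dim=-1)   # predicted CWE class index
```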
Stockfish's chess heuristics stumble in the 3D world of Dragonchess, but evolutionary adaptation can bridge the gap, opening new avenues for transferring AI knowledge across structurally different domains.
LM Arena's model anonymity is more vulnerable than previously thought: a new attack, INTERPOL, leverages interpolated preference learning to expose deep stylistic patterns and manipulate rankings.
Latvian NLP gets a boost: a new 111M parameter model outperforms larger multilingual baselines, proving that targeted pretraining still matters for low-resource languages.
Stylometric features, combined with modern multilingual language models, significantly boost the performance of machine-generated text detection, often surpassing language-specific models.
Engineering design research lacks benchmark datasets, but this framework and prototype promise to change that by mapping the data landscape and revealing critical gaps.
Control LLM personality on a continuous spectrum, not just discrete categories, by dynamically fusing LoRA adapters with a reinforcement learning policy.
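A minimal sketch of continuous adapter fusion (illustrative shapes and names, not the paper's architecture): blend the weight deltas of two LoRA adapters trained for opposite poles of a trait with a scalar alpha in [0, 1], which a learned policy could output.

```python
import torch

# Interpolate between two LoRA adapters trained for opposite poles of a
# personality trait (e.g. introvert vs. extravert). alpha picks a point on
# the continuous trait axis; an RL policy could choose it per request.
# Purely illustrative assumptions, not the paper's exact method.

def fused_lora_delta(A_lo, B_lo, A_hi, B_hi, alpha: float) -> torch.Tensor:
    delta_lo = B_lo @ A_lo          # low-pole rank-r update, (d_out, d_in)
    delta_hi = B_hi @ A_hi          # high-pole rank-r update
    return (1.0 - alpha) * delta_lo + alpha * delta_hi

d_in, d_out, r = 512, 512, 8
A_lo, A_hi = torch.randn(r, d_in), torch.randn(r, d_in)
B_lo, B_hi = torch.randn(d_out, r), torch.randn(d_out, r)

W = torch.randn(d_out, d_in)        # frozen base weight stays untouched
W_adapted = W + fused_lora_delta(A_lo, B_lo, A_hi, B_hi, alpha=0.3)
```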
NLLB-200 can be effectively fine-tuned for low-resource languages like Efik, even with a relatively small, community-curated dataset, achieving surprisingly strong translation performance.
OpenSeeker proves that frontier-level search agents can be achieved with surprisingly little data, outperforming even heavily optimized industrial systems.
Unlock robust feature importance analysis with `xplainfi`, an R package that fills critical gaps by offering conditional importance methods and statistical inference for diverse ML models.
The RIGHT framework offers a new lens for evaluating the validity of human-facing research software, moving beyond just reliability and FAIR principles.
Ditch the tokenizer: this new LLM architecture processes text at the byte level, offering better compression, spelling robustness, and multilingual performance.
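The core idea is easiest to see at the input layer: instead of a learned subword vocabulary, every UTF-8 byte is an ID in a fixed 256-entry vocabulary (plus a few specials). A minimal sketch, with the special-token IDs assumed:

```python
# Byte-level "tokenization" sketch: the vocabulary is just the 256 possible
# byte values plus a few special IDs. No tokenizer training, no OOV, and any
# language or misspelling maps into the same tiny vocabulary.
# (Illustrative only; the paper's model architecture is not shown.)

PAD, BOS, EOS = 256, 257, 258   # assumed special-token IDs

def bytes_encode(text: str) -> list[int]:
    return [BOS] + list(text.encode("utf-8")) + [EOS]

def bytes_decode(ids: list[int]) -> str:
    payload = bytes(i for i in ids if i < 256)
    return payload.decode("utf-8", errors="replace")

ids = bytes_encode("naïve café ☕")       # non-ASCII just becomes more bytes
assert bytes_decode(ids) == "naïve café ☕"
print(len(ids))                           # longer sequences than subwords
```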
Despite high static quality scores, YARA rules in the wild suffer from significant noise, low recall, and a bias towards legacy threats, exposing a "double penalty" for defenders.
ITKIT offers a streamlined CT image analysis pipeline that democratizes access to deep learning-based segmentation, even for researchers with limited computational resources.
Training SLMs for low-resource Indic languages just got easier: a new synthetic dataset of children's stories offers a large, localized, and simple corpus.
The pursuit of "open search" risks being co-opted by powerful corporations unless it shifts focus from technical openness to the actual capabilities afforded to users.
Stop silent capability escalation: this framework uses cryptographic binding and reproducibility verification to ensure AI agents only do what they're authorized to do.
LLMs can classify biomedical articles surprisingly well, rivaling traditional methods like Naive Bayes and Random Forests, especially when using output token probabilities.
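One common way to read class probabilities out of a causal LLM, consistent in spirit with (though not necessarily identical to) the paper's setup: compare the next-token logits of the candidate label words. Model name, labels, and prompt below are placeholders.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Score an article against candidate labels via the model's next-token
# logits for each label word, renormalised over the label set.
model_name = "gpt2"  # placeholder; substitute any causal LM
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

labels = [" cancer", " cardiology", " neurology"]  # leading space matters for BPE
prompt = "Abstract: ...\nThe topic of this article is"  # "..." = article text

with torch.no_grad():
    logits = model(**tok(prompt, return_tensors="pt")).logits[0, -1]

label_ids = [tok.encode(l)[0] for l in labels]     # first token of each label
probs = torch.softmax(logits[label_ids], dim=0)
for label, p in zip(labels, probs):
    print(f"{label.strip()}: {p:.3f}")
```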
RISC-V's memory model can tank SD card performance by 6x, but clever driver tweaks can recover it.
Pinpoint exactly which client leaked your federated model with a black-box watermark that's robust to fine-tuning, pruning, and quantization.
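A minimal trigger-set sketch of black-box attribution (assumed mechanics, not the paper's embedding scheme): each client holds a secret set of out-of-distribution inputs whose labels encode its identity, and a leaked model is attributed by querying it on each client's trigger set.

```python
import torch

# Toy black-box watermark check: query a suspect model on a client's secret
# trigger inputs and measure how often it reproduces that client's
# ID-encoding labels. Purely illustrative of the verification interface.

def verify_client(model, trigger_x, trigger_y, threshold: float = 0.9) -> bool:
    with torch.no_grad():
        preds = model(trigger_x).argmax(dim=-1)
    return (preds == trigger_y).float().mean().item() >= threshold

# Hypothetical usage, with trigger_sets mapping client IDs to (inputs, labels):
# for cid, (tx, ty) in trigger_sets.items():
#     if verify_client(suspect_model, tx, ty):
#         print(f"leak attributed to client {cid}")
```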
Achieve a 50% inference speedup on a large language model for European languages by compressing it to 7.35B parameters, while retaining 90% of the original 11B parameter model's performance.
Polish language understanding gets a long-context boost: a new encoder model handles sequences up to 8192 tokens, outperforming existing models on long documents while remaining competitive on shorter texts.
Forget scaling laws: this humanoid robot model crushes benchmarks using 10x less data by cleverly pre-training on human videos and then fine-tuning on robot-specific movements.
Open-source LLMs can help write Japanese pathology reports, but pathologists strongly disagree on which model provides the best explanations.
LLMs can gain 40% in knowledge transfer efficiency by mining skills from open-source agent repositories, without needing retraining.
RAG with small language models (<8B parameters) can be a net negative, as they often ignore retrieved context and even "forget" existing knowledge.
Forget brute-force scaling: Tiny Aya proves a 3B parameter model can achieve state-of-the-art multilingual performance with clever training and region-aware specialization.
Ditch the video: InSpatio-WorldFM achieves real-time spatial intelligence by generating frames independently, offering a low-latency alternative to video-based world models.
Turn your Jupyter notebooks into one-click installable desktop apps with LabConstrictor, democratizing access to computational methods for researchers without DevOps expertise.
Despite their general prowess, open-source LLMs still lag behind proprietary models in the nuanced task of dating texts, even after fine-tuning.
Can a dedicated research program keep a smaller, local LLM competitive against global giants in the rapidly evolving AI landscape?
An AI-integrated agile education platform accelerates practice-relevant AI research by closing the theory-practice gap in software development.
Single-domain watermarks are fundamentally insufficient against modern adversarial toolsets, as spatial and latent watermarks exhibit orthogonal vulnerabilities to generative and geometric attacks, respectively.
A fully open-source speech understanding model, OSUM-Pangu, proves that competitive performance is achievable on non-CUDA hardware, challenging the dominance of GPU-centric ecosystems.
Speech-aware LLMs are surprisingly bad at speaker verification, but a simple embedding injection trick closes the gap with dedicated systems while preserving the LLM's language abilities.
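A minimal sketch of the injection idea (shapes, dimensions, and the single linear projection are assumptions): map a speaker embedding from a dedicated verification model into the LLM's token-embedding space and prepend it to the input sequence, leaving the LLM's own weights untouched.

```python
import torch
import torch.nn as nn

# Prepend a projected speaker embedding (e.g. from an ECAPA-style verifier)
# to the LLM's input embedding sequence. Only the projection is trained, so
# the LLM's language abilities are preserved. Illustrative assumptions only.

class SpeakerInjector(nn.Module):
    def __init__(self, spk_dim: int = 192, llm_dim: int = 4096):
        super().__init__()
        self.proj = nn.Linear(spk_dim, llm_dim)  # the only new parameters

    def forward(self, spk_emb: torch.Tensor, token_embs: torch.Tensor):
        # spk_emb: (batch, spk_dim); token_embs: (batch, seq, llm_dim)
        injected = self.proj(spk_emb).unsqueeze(1)       # (batch, 1, llm_dim)
        return torch.cat([injected, token_embs], dim=1)  # one extra position

inj = SpeakerInjector()
fused = inj(torch.randn(2, 192), torch.randn(2, 10, 4096))
print(fused.shape)  # torch.Size([2, 11, 4096])
```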