Feb 17, 2026arXiv:2602.15814

Avey-B

Devang Acharya, Devang Acharya, Mohammad Hammoud, Mohammad Hammoud

AI Summary

This paper reformulates the autoregressive, attention-free Avey architecture into an encoder-only model suitable for resource-constrained NLP applications. They introduce decoupled static/dynamic parameterizations, stability-oriented normalization, and neural compression techniques to improve Avey's performance. The resulting Avey-B model outperforms Transformer-based encoders on token classification and information retrieval tasks, demonstrating improved efficiency and scalability to longer contexts.

Key Contribution

Attention-free Avey-B matches or exceeds the performance of BERT-style encoders on token classification and information retrieval, offering a more efficient alternative for resource-constrained NLP applications.

Abstract

Compact pretrained bidirectional encoders remain the backbone of industrial NLP under tight compute and memory budgets. Their effectiveness stems from self-attention's ability to deliver high-quality bidirectional contextualization with sequence-level parallelism, as popularized by BERT-style architectures. Recently, Avey was introduced as an autoregressive, attention-free alternative that naturally admits an encoder-only adaptation. In this paper, we reformulate Avey for the encoder-only paradigm and propose several innovations to its architecture, including decoupled static and dynamic parameterizations, stability-oriented normalization, and neural compression. Results show that this reformulated architecture compares favorably to four widely used Transformer-based encoders, consistently outperforming them on standard token-classification and information-retrieval benchmarks while scaling more efficiently to long contexts.

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References73

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Avey-B

Related Papers