Search papers, labs, and topics across Lattice.
This paper introduces a categorical framework for deep learning architectures, formalizing broadcasting and compositionality using novel "axis-stride" and "array-broadcasted" categories. It enables precise mathematical expression and manipulation of model architectures, addressing the limitations of ad-hoc notations. The framework is implemented in both Python (pyncd) and TypeScript (tsncd), supporting algebraic construction, graph conversion, PyTorch compilation, and diagram rendering.
Finally, a rigorous mathematical framework lets you treat deep learning architectures as composable algebraic objects, opening the door to formal verification and automated design.
Despite deep learning models running well-defined mathematical functions, we lack a formal mathematical framework for describing model architectures. Ad-hoc notation, diagrams, and pseudocode poorly handle nonlinear broadcasting and the relationship between individual components and composed models. This paper introduces a categorical framework for deep learning models that formalizes broadcasting through the novel axis-stride and array-broadcasted categories. This allows the mathematical function underlying architectures to be precisely expressed and manipulated in a compositional manner. These mathematical definitions are translated into human manageable diagrams and machine manageable data structures. We provide a mirrored implementation in Python (pyncd) and TypeScript (tsncd) to show the universal aspect of our framework, along with features including algebraic construction, graph conversion, PyTorch compilation and diagram rendering. This lays the foundation for a systematic, formal approach to deep learning model design and analysis.