The paper identifies a "semantic crisis" in modern interconnect technologies (NVLink, UALink, Ultra Ethernet, classical RDMA, and others): implicit assumptions about time, ordering, and failure have produced vendor-specific divergence. It traces the root cause to the "Forward-In-Time-Only (FITO)" category mistake, a failure to define explicit, testable link semantics from APIs down to bits on the wire. The paper describes Open Atomic Ethernet (OAE) as a potential solution built on bilateral transaction primitives with explicit ordering, completion, and failure visibility, and it draws on Helland's analysis of scalable OLTP isolation to argue that the crisis pervades the entire stack.
Today's high-performance interconnects are built on shaky semantic ground, potentially sacrificing concurrency for reliability through hidden serialization.
For fifty years, networking has fragmented whenever new workloads exposed hidden assumptions about time, ordering, failure, and trust. This paper argues that the current interconnect landscape -- NVLink, UALink, Ultra Ethernet, AELink/Aethernet, TTPoE, and classical RDMA -- suffers from a semantic crisis: vendor-specific divergence disguised as optimization. We trace this crisis to the Forward-In-Time-Only (FITO) category mistake embedded in every major fabric stack, and show how each pathology -- aspirational RDMA completion, fire-and-forget GPU semantics, opaque proprietary stacks, incompatible multi-cloud ordering, universal fencing -- arises from the same failure to define explicit, testable link semantics from APIs to bits on the wire. We conjecture that RDMA achieves reliability through universal fencing that collapses concurrency into serialized checkpoints, and that precise minimal semantics can maintain correctness without global barriers, much as superscalar architectures separated execution from retirement. We describe how Open Atomic Ethernet (OAE) under the Open Compute Project addresses the crisis through bilateral transaction primitives with explicit ordering, completion, and failure visibility. Drawing on Helland's analysis of scalable OLTP isolation (the "BIG DEAL"), we show the crisis pervades the entire stack. We assess whether convergence on a single open standard is still possible or whether fragmentation is now structural.