The paper introduces SubQuad, a pipeline for comparative analysis of adaptive immune repertoires that tackles two problems: the near-quadratic computational cost of pairwise affinity evaluations, and dataset imbalances that obscure rare clonotypes. SubQuad uses MinHash prefiltering for subquadratic retrieval, a differentiable gating module for adaptive multimodal fusion, and fairness-constrained clustering to ensure proportional representation of rare subgroups. Experiments on viral and tumor repertoires show that SubQuad improves throughput and peak memory usage while preserving or improving recall@k, cluster purity, and subgroup equity.
SubQuad tackles the computational bottleneck of adaptive immune repertoire analysis by avoiding near-quadratic pairwise comparison while also correcting for dataset imbalances, enabling more equitable and scalable biomarker discovery.
Comparative analysis of adaptive immune repertoires at population scale is hampered by two practical bottlenecks: the near-quadratic cost of pairwise affinity evaluations and dataset imbalances that obscure clinically important minority clonotypes. We introduce SubQuad, an end-to-end pipeline that addresses these challenges by combining antigen-aware, near-subquadratic retrieval with GPU-accelerated affinity kernels, learned multimodal fusion, and fairness-constrained clustering. The system employs compact MinHash prefiltering to sharply reduce candidate comparisons, a differentiable gating module that adaptively weights complementary alignment and embedding channels on a per-pair basis, and an automated calibration routine that enforces proportional representation of rare antigen-specific subgroups. On large viral and tumor repertoires, SubQuad achieves measured gains in throughput and peak memory usage while preserving or improving recall@k, cluster purity, and subgroup equity. By co-designing indexing, similarity fusion, and equity-aware objectives, SubQuad offers a scalable, bias-aware platform for repertoire mining and downstream translational tasks such as vaccine target prioritization and biomarker discovery.
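
To make the prefiltering idea concrete, the following is a minimal sketch of MinHash signatures combined with LSH banding, which is one common way to avoid scoring every pair of sequences. It is not SubQuad's implementation: the abstract does not specify the tokenization, hash family, or banding scheme, so the k-mer shingling, the BLAKE2-based hashes, and every name and parameter below (`kmers`, `minhash_signature`, `candidate_pairs`, `num_hashes`, `bands`) are assumptions chosen for illustration.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def kmers(seq, k=3):
    """Overlapping k-mers of a sequence (assumed tokenization, not from the paper)."""
    toks = {seq[i:i + k] for i in range(len(seq) - k + 1)}
    return toks or {seq}  # fall back to the whole string if it is shorter than k


def _hash(token, seed):
    """Seeded 64-bit hash built on BLAKE2b; each seed acts as one hash function."""
    digest = hashlib.blake2b(f"{seed}:{token}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(seq, num_hashes=32, k=3):
    """MinHash signature: the minimum hash value of the k-mer set under each seed."""
    toks = kmers(seq, k)
    return tuple(min(_hash(t, s) for t in toks) for s in range(num_hashes))


def estimated_jaccard(sig_a, sig_b):
    """Fraction of matching signature slots, an estimate of k-mer Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def candidate_pairs(seqs, num_hashes=32, bands=16, k=3):
    """Return index pairs (i, j), i < j, that share at least one LSH band bucket."""
    rows = num_hashes // bands
    buckets = defaultdict(list)
    for idx, seq in enumerate(seqs):
        sig = minhash_signature(seq, num_hashes, k)
        for b in range(bands):
            buckets[(b, sig[b * rows:(b + 1) * rows])].append(idx)
    pairs = set()
    for members in buckets.values():
        pairs.update(combinations(members, 2))
    return pairs


if __name__ == "__main__":
    # Hypothetical CDR3-like strings, used only to exercise the functions.
    seqs = ["CASSLGQAYEQYF", "CASSLGQAYEQYF", "CAWSVGGTDTQYF"]
    print(sorted(candidate_pairs(seqs)))
```

Only the pairs returned by `candidate_pairs` would then be passed to the more expensive affinity scoring. In this sketch, raising `bands` (fewer rows per band) makes the filter more permissive, while lowering it makes the filter stricter, so the setting trades recall against the number of comparisons that remain.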