Adel N. Toosi

DisNet Lab

Papers on Lattice

Total citations

Topics

Research focus

Distributed Systems & Hardware (6)Inference & Quantization (3)Computer Vision (1)Training Efficiency & Optimization (1)Architecture Design (Transformers, SSMs, MoE) (1)

Frequent co-authors

A. N. Toosi (3)A. Toosi (2)Xu Bai (1)Muhammed Tawfiqul Islam (1)

Papers (6)

Apr 14, 2026

DisNet LabApr 14, 2026·also IBM Research

PipeLive: Efficient Live In-place Pipeline Parallelism Reconfiguration for Dynamic LLM Serving

Achieve near-instantaneous LLM pipeline parallelism reconfiguration – going from seconds of downtime to under 10ms – by borrowing techniques from live virtual machine migration.

Xu Bai, Muhammed Tawfiqul Islam, Adel N. Toosi +1

Distributed Systems & Hardware Inference & Quantization

Apr 13, 2026

Hossein Hosseini Kasnavieh +3Apr 13, 2026·also DisNet Lab

RouterWise: Joint Resource Allocation and Routing for Latency-Aware Multi-Model LLM Serving

Resource allocation is the unsung hero of multi-model LLM routing: get it wrong, and you could be leaving up to 87% of your output quality on the table.

Hossein Hosseini Kasnavieh, Christopher Leckie, A. N. Toosi +1

Distributed Systems & Hardware Inference & Quantization

Mar 16, 2026

Mar 16, 2026·also DisNet Lab

Multi-Objective Load Balancing for Heterogeneous Edge-Based Object Detection Systems

Achieve up to 50% energy savings and 80% latency reduction in edge-based object detection by intelligently balancing load across heterogeneous devices, even with a minor accuracy trade-off.

Daghash K. Alqahtani, Maria A. Rodriguez, Muhammad Aamir Cheema +2

Computer Vision Distributed Systems & Hardware

Feb 18, 2026

Feb 18, 2026·also DisNet Lab, Monash

LLM-Driven Intent-Based Privacy-Aware Orchestration Across the Cloud-Edge Continuum

Achieve near-instant (<50ms) service downtime when dynamically reconfiguring LLM inference pipelines across heterogeneous GPUs in serverless environments.

Zijie Su, Muhammed Tawfiqul Islam, Mohammad Goudarzi +2

Distributed Systems & Hardware Inference & Quantization

RediMinds IncFeb 18, 2026·also DisNet Lab, Melbourne, UT Austin

DistributedEstimator: Distributed Training of Quantum Neural Networks via Circuit Cutting

Circuit cutting introduces substantial end-to-end overheads in quantum neural network training, with reconstruction dominating per-query time, but surprisingly, test accuracy and robustness can be preserved or even improved.

Prabhjot Singh, Adel N. Toosi, A. Toosi +2

Distributed Systems & Hardware Training Efficiency & Optimization

Feb 17, 2026

Feb 17, 2026·also DisNet Lab, UPF

Service Orchestration in the Computing Continuum: Structural Challenges and Vision

Current service orchestration solutions fall short of achieving autonomous, resilient, and scalable performance in the Computing Continuum, highlighting the urgent need for standardized evaluation environments.

Víctor Casamayor Pujol, Ildefons Magrans de Abril, Praveen Kumar Donta +2

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Tool Use & Agents

Search

Adel N. Toosi

Research focus

Frequent co-authors

Papers (6)