Search papers, labs, and topics across Lattice.
100 papers published across 3 labs.
Democratizing self-driving research, OpenPodcar2 offers a robust, low-cost (≈$7k new, $2k used), open-source autonomous vehicle platform ready for ROS2 integration and real-world deployment.
Knowing *when* to listen to *which* sensor lets robotic fruit pickers predict failures before they happen, boosting accuracy to 90% even with minimal sensor sets.
Decomposing robotic manipulation into coarse and fine-grained actions isn't just conceptually cleaner—it actually unlocks a sweet spot where learning difficulty is balanced, boosting performance.
LLMs can now generate driving rules from traffic laws with significantly improved accuracy by grounding their reasoning in structured traffic scenarios.
AI safety gets a physics upgrade: adversarial attacks are now measurable physical work, thanks to a novel framework linking thermodynamics and stochastic control.
Knowing *when* to listen to *which* sensor lets robotic fruit pickers predict failures before they happen, boosting accuracy to 90% even with minimal sensor sets.
Decomposing robotic manipulation into coarse and fine-grained actions isn't just conceptually cleaner—it actually unlocks a sweet spot where learning difficulty is balanced, boosting performance.
LLMs can now generate driving rules from traffic laws with significantly improved accuracy by grounding their reasoning in structured traffic scenarios.
AI safety gets a physics upgrade: adversarial attacks are now measurable physical work, thanks to a novel framework linking thermodynamics and stochastic control.
Road crack detection gets a boost by having the infrastructure tell the car where to look.
Safe visuomotor control from high-resolution images is now practical at scale, thanks to a learned visual abstraction coupled with an efficient SLS solver.
Unlock species-agnostic 3D tracking from standard drone footage with WildLIFT, turning 2D video into structured, viewpoint-aware representations for richer wildlife analysis.
Robots can now understand human intentions with near-human accuracy thanks to a new video-language model that reasons about goals like a human.
Radar odometry, typically confined to urban settings, can be pushed off-road with simple adaptations like IMU preintegration, but still faces significant challenges in unstructured environments.
Encoding vehicle trajectory directionality via HSV rasterization unlocks accurate lane-level HD map generation from crowdsourced data using a DETR architecture.
An open-source autonomous driving platform offers researchers a modular, scalable, and cost-effective alternative to complex and restrictive hardware validation setups.
Robots can now "see" and understand doorways, enabling more robust navigation in complex indoor environments.
Low-cost stereo vision can rival LiDAR for real-time windrow detection, paving the way for more accessible autonomous farming solutions.
AI can now pilot jet trainers to avoid ground collisions, even with limited visibility.
Robots can now leverage human intuition for manipulation tasks, learning from a massive video dataset to improve motion plausibility and robustness, even when conditions change.
Imagine buildings that adapt to the materials available, not the other way around: this framework uses robots to make it a reality.
Simulate once, deploy anywhere: SPLIT lets you train tactile perception models on synthetic data and transfer them across different sensors without retraining.
Discrete diffusion policies, typically used for image generation, turn out to be surprisingly effective and efficient asynchronous executors for robots acting in dynamic environments, outperforming traditional continuous control methods.
Zero-shot Sim2Real transfer for a humanoid ballbot is now possible thanks to a friction-aware RL framework and high-fidelity simulation that models omni-wheel mechanics.
Forget clunky animation pipelines – MotionBricks lets you assemble real-time, high-quality character motions like LEGOs, even controlling robots.
Successfully backing up a trailer without jackknifing or hitting anything just got easier thanks to a new path-planning algorithm that respects the physics of articulated vehicles.
Separating geometry from logic with fuzzy path constraints yields motion planning specifications that are both more intuitive for humans and more amenable to learning from demonstrations.
Unlock accurate friction estimation for any material pairing with just a handful of proxy material measurements, slashing experimental costs.
Network jitter in cloud-based robot control can be overcome by converting temporal lag into spatial pose offsets, restoring the VLA's original geometric intent without fine-tuning.
Current event-based SLAM algorithms falter when faced with the full complexity of high-speed, 6-DoF maneuvers, highlighting a gap between current capabilities and the promise of event cameras.
Achieve robust trajectory tracking and moving obstacle avoidance for diverse mobile robots, including Ackermann-steered vehicles, by combining sliding mode control with a novel collision cone control barrier function.
Frequency domain analysis unlocks 1.59x speedups in Vision-Language-Navigation by enabling optimal token caching, a feat previously limited by visual domain approaches.
6G-enabled Internet of Everything promises a unified intelligent ecosystem, but faces critical scalability, security, and privacy challenges that demand innovative research.
Score-based diffusion models can now generate robust guiding vector fields for robotic path following, even when traditional methods stumble on unordered, branching, or probabilistically-generated paths.
Autonomous vehicles can learn to navigate pedestrian interactions more efficiently by subtly threatening collisions, as humans do, without compromising safety.
Autonomous vehicles can drive more efficiently by using a new metric that links real-time acceleration decisions to overall travel time.
Edge NPUs can outperform flagship GPUs in cost and energy efficiency for on-robot VLA model deployment, but only with hardware-aware optimizations that tackle the models' distinct compute and memory-bound phases.
Democratizing self-driving research, OpenPodcar2 offers a robust, low-cost (≈$7k new, $2k used), open-source autonomous vehicle platform ready for ROS2 integration and real-world deployment.
Multi-robot motion planning can be accelerated by over 850X, enabling solutions in milliseconds, by exploiting SIMD parallelism with vector-accelerated primitives.
Forget end-to-end fine-tuning: $M^2$-VLA unlocks the power of generalized VLMs for robotic manipulation by intelligently mixing layers and incorporating meta-skills.
Ditch expensive robot trials: a novel "betting" framework lets you accurately predict real-world robot performance using only cheap simulations.
Robots can strengthen family bonds, but only if designers carefully consider the robot's initiative and communication timing, as families experience tensions around privacy and control.
A social robot can successfully integrate into family life to support family-school partnerships, but parental facilitation styles significantly impact its effectiveness.
Visual RL agents can recover near-perfect performance even under severe, dynamically changing visual corruptions by learning to disentangle task-relevant foreground from perturbation artifacts.
Achieve real-time learning-based control of complex robotic systems by exploiting differential flatness for dramatic speedups in MPC computation.
Forget slow, multi-step action generation: CF-VLA's coarse-to-fine approach slashes latency by 75% while boosting real-robot success rates to a new high of 83%.
Ditch the heuristics: MILP delivers up to 30% better latency, energy, and reliability for IoT workflow scheduling in edge-hub-cloud systems.
VLA models introduce a fundamentally new risk landscape compared to LLMs or robotics alone, demanding a unified safety perspective that considers irreversible physical consequences and multimodal attack surfaces.
Current 3D anomaly detection struggles with real-world complexity, but this new approach directly models inlier feature distributions, achieving state-of-the-art results and offering a more robust solution.
Neurosymbolic grounding of LLMs in telemetry and knowledge graphs slashes expert-rated overclaims in industrial maintenance explanations by 93%, making AI assistants far more trustworthy in safety-critical settings.
Forget slow, expensive real-world trials: dWorldEval's discrete diffusion world model lets you evaluate robot policies across thousands of environments and tasks with unprecedented speed and accuracy.
Achieve 30x improvement in attitude control precision by fusing classical control with RL, enabling reliable space robotics.
Unlock the secrets of historical keyboard performance with PHOTON, a non-invasive optical tracking system that reveals the subtle interplay between performer input and instrument mechanics.
Transforming human motion into structured language allows LLMs to achieve unprecedented accuracy in motion understanding without the constraints of traditional encoding methods.
Multi-task RL agents solving related navigation tasks underwater rely on a surprisingly small fraction of their weights (1.5%) to differentiate between tasks.
Controller design can be effectively framed as inference, enabling efficient trajectory and policy optimization via tempered sampling.
RL policies don't have to be temporally incoherent messes: shaping action probabilities with dynamical priors unlocks structured, interpretable decision-making.
Channel-free HAR is now possible: a single model can perform activity recognition across diverse IoT sensor setups without needing fixed channel arrangements, thanks to metadata-conditioned fusion.
Autonomous vehicles can now plan trajectories 10x faster without sacrificing performance, thanks to a novel architecture that learns complex driving behaviors in latent space during training.
Imagine reconstructing detailed human motion and scene layouts using just your smartwatch and earbuds – no cameras needed.
Despite the complexity of ROS2 robotics software architectures, LLMs can achieve near-perfect accuracy in answering questions about them, hinting at a powerful new tool for roboticists.
MLLMs often *hallucinate* the referent of a pointing gesture, latching onto nearby or salient objects instead of truly understanding spatial semantics.
Achieve millimeter-level accuracy in 3D human body fitting from multi-modal inputs, even with scale distortion common in AI-generated assets.
Frozen LLMs, when fused with spatial scene encodings, can effectively reason about vehicle trajectories, opening new avenues for integrating language-based reasoning into autonomous driving systems.
A single pneumatic input and clever use of magneto-elastic hysteresis can drive a surprisingly simple and effective peristaltic pump.
DINOv3 representations and diffusion-based planning enable a visual tracker that's both robust to occlusions and discriminative enough to avoid visually similar distractors.
Compressing expansive contexts like a convex mirror allows deep learning models to achieve robust ground filtering across diverse landscapes, even in complex urban scenes.
Small decreases in swarm size leave human operators with elevated workload, even when performance is unaffected, suggesting a "workload residue" effect that designers must address.
Guaranteeing swarm drone recovery from faults is now possible with a hybrid discrete-event system that merges high-level supervision with low-level control.
Point-VLMs can learn to see the world as it really is: targeted reward assignment and cross-modal verification nearly close the reality gap in 3D reasoning.
Optimality guarantees are now possible when jointly optimizing robot design, fleet composition, and task planning for heterogeneous multi-robot systems.
Forget brittle visual-history buffers: LoHo-Manip uses a VLM task manager with visual trace prompts to achieve robust long-horizon robotic manipulation through implicit closed-loop replanning.
Achieve robust robot manipulation across diverse viewpoints without camera calibration by synthesizing novel views with a geometry-aware video diffusion model.
Forget expensive real-world robot training: Hi-WM lets humans directly edit a robot's simulated reality, turning world models into powerful, reusable playgrounds for failure recovery.
A robot that can skate, climb stairs, and deliver packages shows how hybrid locomotion can unlock new levels of versatility.
Forget dry training manuals: a challenge-based, LLM-powered humanoid robot can spark real employee excitement and understanding of robotics in the workplace.
Swarms of long, articulated vehicles face surprising deadlock challenges, with up to 31% of vehicles immobilized in dense scenarios despite collision avoidance guarantees.
Optimal robot exploration can be achieved by framing SLAM as a POMDP with a geometry-aware exploration cost, enabling near-optimal policy learning.
Expert knowledge, encoded in a Bayesian network, can dramatically improve the accuracy of autonomous robotic triage systems operating in chaotic, data-scarce environments.
Real-world robots can now navigate complex environments with human-level instructions, thanks to a new system that combines efficient perception with high-level reasoning, all while running in real-time on limited hardware.
Robot hands get a serious upgrade: embedding cameras in fingertips unlocks robust manipulation in cluttered environments where traditional wrist-mounted cameras fail.
Stop reimplementing localization pipelines: Ufil offers a unified, open-source framework for infrastructure-based localization that lets you swap in new components without rewriting everything.
Humanoid robots can now adapt to diverse environments without task-specific tuning by selectively "relaxing" joints, mimicking how humans exploit weightlessness for stability.
Current VLA benchmarks may be overstating real-world readiness, as models succeeding by standard metrics often exhibit unsafe behaviors and poor robustness.
Humanoid robots can now seamlessly transition between fighting skills thanks to a novel policy gating approach that ensures stability and smoothness.
Guaranteeing full-body collision avoidance for robot manipulators in dynamic environments is now computationally tractable thanks to a novel application of 3D Poisson Safety Functions.
Fine-tuning VLMs with action-aligned language supervision and terrain-aware preference optimization unlocks more robust off-road autonomous driving, outperforming prior approaches on key traversability metrics.
Explicitly constraining action generation with predicted spatial "corridors" boosts VLA model performance by up to 12.4% on challenging robotic manipulation tasks.
Autonomous vehicles can be fooled by coordinated camera and LiDAR attacks that create "phantom" objects, even when using multi-sensor fusion designed for redundancy.
Ditch the bounding boxes: PLAS-Net's pixel-perfect segmentation of beach litter reveals that fishing gear, though numerically scarce, dominates the total pollution area.
By spectrally decoupling robot control into intent and dynamics, ResVLA offers a more efficient and robust approach to generative VLA policies.
Reshooting video from arbitrary viewpoints just got a whole lot better thanks to a 4D point cloud representation that maintains temporal consistency and precise camera control.
World-model-based planning enables reliable robotic manipulation in complex industrial settings where reactive policies crumble.
Vision-based tactile signals in the VTOUCH dataset significantly enhance bimanual manipulation capabilities, paving the way for more effective robotic interactions.
Ditch sparse contact cues: LEXIS-Flow uses a learned manifold of interaction signatures to capture dense, continuous proximity between humans and objects, leading to more realistic 3D HOI reconstructions.
Stop guessing and start knowing: this framework accurately predicts *your* EV's energy consumption by learning your driving style and integrating it with detailed map data.
Layer-selective rehearsal and rapid recovery strategies can boost model performance in federated learning by over 30% in real-world applications.
Individual prosumers can now effectively coordinate in electricity markets, boosting overall market performance through a novel hierarchical MARL framework.
Reinforcement learning's Temporal Difference value estimation offers a surprisingly effective and theoretically grounded approach to calibrating uncertainty in vision-language-action models for robotics.
Voice-commanded surface finishing is now a reality, thanks to a new framework that lets non-experts adapt robot skills through touch, language, and a drag-and-drop interface.
Achieving robust Quality-Diversity in RL without the computational burden of target networks could revolutionize how we approach skill discovery in complex environments.
Stop guessing what feels good: this system learns personalized vibration preferences from just 40 pairwise comparisons.
Forget retraining: LEVER lets you snap together pre-trained RL policies at inference time, matching or beating from-scratch performance in some cases.
World models can navigate blood vessels autonomously with higher success rates than standard RL, paving the way for safer robotic stroke treatments.
Shielding can ensure safety in autonomous systems even when perception is uncertain, potentially transforming how we manage risks in AI decision-making.