Search papers, labs, and topics across Lattice.
2
0
3
Fine-tuning vision-language models on a richly described dataset boosts their accuracy in interpreting subtle driver actions by 10 points, revealing a critical gap in existing training methods.
Faulty negatives and unreliable positives can be effectively managed in multi-modal learning, leading to significant improvements in driver distraction detection accuracy.