Antoni B. Chan

Department of Computer Science, City University of Hong Kong

Research focus

Multimodal Models (3); Computer Vision (3); Training Efficiency & Optimization (2); Architecture Design (Transformers, SSMs, MoE) (2)

Frequent co-authors

Chenyang Zhao (1); Wei Lin (1); J. H. Hsiao (1); Jixuan Chen (1)

Papers (5)

Jul 15, 2026 · also Hong Kong University of Science & Technology

Fine-grained CLIP fine-tuning with self-annotated region alignment

SFF-CLIP improves CLIP's fine-grained feature representation without requiring cumbersome region annotations, yielding significant performance gains.

Chenyang Zhao, Wei Lin, Antoni B. Chan +1

Multimodal Models; Training Efficiency & Optimization

Apr 21, 2026 · also HKU, HKUST, Shenzhen University

Multi-view Crowd Tracking Transformer with View-Ground Interactions Under Large Real-World Scenes

Transformer-based architectures can now outperform CNNs in multi-view crowd tracking, especially in large, complex real-world scenes, thanks to a novel view-ground interaction mechanism.

Jixuan Chen, Kaiyi Zhang, Xinquan Yu +1

Architecture Design (Transformers, SSMs, MoE); Computer Vision; Multimodal Models

Apr 2, 2026 · National Clinical Research Center for Hematologic · also HIT, HKU, PKU

Dense Point-to-Mask Optimization with Reinforced Point Selection for Crowd Instance Segmentation

Combining SAM with point supervision and reinforcement learning to select optimal points for mask generation achieves state-of-the-art crowd instance segmentation.

Hongru Chen, Hongru Chen, Jiyang Huang +4

Mar 18, 2026 · also HKU

M2P: Improving Visual Foundation Models with Mask-to-Point Weakly-Supervised Learning for Dense Point Tracking

By cleverly using readily available video segmentation masks, this method boosts DINOv2's point tracking performance by over 14% – a surprisingly effective way to inject temporal awareness into static image-pretrained models.

Qiangqiang Wu, Matias Di Martino, Guillermo Sapiro +1

Computer Vision; Multimodal Models; Training Efficiency & Optimization

Mar 17, 2026 · also HKU, Princeton

Exclusivity-Guided Mask Learning for Semi-Supervised Crowd Instance Segmentation and Counting

Achieve state-of-the-art semi-supervised crowd instance segmentation and counting by generating high-quality mask supervision from sparse annotations, effectively bridging the gap between these two tasks.

Hongru Cheng, Antoni B. Chan

Architecture Design (Transformers, SSMs, MoE); Computer Vision; Data Curation & Synthetic Data
