Jie Feng

Xidian University

Publication activity (papers/week, last 8 weeks)

Research focus

Computer Vision (5), Multimodal Models (3), Robotics & Embodied AI (2), Tool Use & Agents (1)

Frequent co-authors

Guanbin Li (4), Ronghua Shang (3), Junpeng Zhang (3), Di Li (2)

Papers (5)

Apr 2, 2026

3d ago · also Qinghai Normal University, SYSU

A3R: Agentic Affordance Reasoning via Cross-Dimensional Evidence in 3D Gaussian Scenes

Stop guessing affordances from static scenes: A3R's agentic approach leverages cross-dimensional evidence acquisition to significantly outperform one-shot methods in complex 3D environments.

Di Li, Jie Feng, Guanbin Li +4

Computer Vision, Robotics & Embodied AI, Tool Use & Agents

3d ago · also SYSU

SDesc3D: Towards Layout-Aware 3D Indoor Scene Generation from Short Descriptions

Generate detailed 3D indoor scenes from short text descriptions with SDesc3D, a framework that leverages multi-view structural priors and regional functionality to overcome the limitations of explicit semantic cues.

Jie Feng, Jiawei Shen, Junjia Huang +4

Computer Vision, Multimodal Models, Natural Language Processing

3d ago · also Jimei University, SYSU

Resonance4D: Frequency-Domain Motion Supervision for Preset-Free Physical Parameter Learning in 4D Dynamic Physical Scene Simulation

Forget expensive video diffusion: Resonance4D unlocks high-fidelity 4D dynamic simulations by cleverly supervising motion in the frequency domain.

Changshe Zhang, Jie Feng, Siyu Chen +3

Computer Vision, Robotics & Embodied AI, World Models & Planning

3d ago · also Jimei University, SCUT

Decouple and Rectify: Semantics-Preserving Structural Enhancement for Open-Vocabulary Remote Sensing Segmentation

By recognizing that CLIP features aren't monolithic, DR-Seg unlocks targeted structural enhancements that dramatically improve open-vocabulary remote sensing segmentation.

Jie Feng, Fengze Li, Feng Li +5

Computer Vision, Multimodal Models

3d ago · also Qinghai Normal University, SYSU

Are VLMs Lost Between Sky and Space? LinkS$^2$Bench for UAV-Satellite Dynamic Cross-View Spatial Intelligence

VLMs struggle to connect the dots between dynamic drone footage and satellite imagery, highlighting a critical gap in their spatial reasoning abilities.

Dian Liu, Jie Feng, Di Li +5

Computer Vision, Eval Frameworks & Benchmarks, Multimodal Models
