This paper introduces a two-stage approach for entity search that uses LLMs to construct category-aware attribute graphs from unstructured product data, then uses these graphs for LLM-based ranking. By reasoning over structured attribute graphs instead of raw text, the method reduces per-product token usage by 57% and improves average precision by over 5% in zero-shot settings. The approach generalizes robustly across diverse product categories, making it suitable for real-world e-commerce applications.
LLMs can achieve better zero-shot product ranking with 57% less token usage by reasoning over structured attribute graphs instead of raw text.
Entity search, i.e., finding the entities most similar to a query entity, faces unique challenges in e-commerce, where product similarity varies across categories and contexts. Traditional embedding-based approaches often struggle to capture nuanced, context-specific attribute relevance. In this paper, we present a two-stage approach combining Large Language Model (LLM)-driven attribute graph construction with graph-aware LLM ranking. In the offline stage, we extract structured product attributes from unstructured text and construct a reusable attribute graph with category-aware schemas. In the online stage, we rank retrieved candidates by reasoning over this structured representation rather than raw text, reducing per-product token usage by 57% while improving ranking precision. Experiments show that our approach outperforms multiple baselines in zero-shot scenarios, achieving an improvement of over 5% in average precision without requiring training data, generalizes robustly across diverse product categories, and shows strong potential for real-world deployment.
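
The two stages described above can be illustrated with a minimal sketch. It is not the authors' implementation: the schema contents, the `AttributeGraph` class, and `build_ranking_prompt` are hypothetical names introduced here, and the `raw_attributes` dictionaries stand in for the output of the LLM extraction step, which is not shown. The point is only to show how a category-aware schema could filter attributes offline, and how a compact structured serialization could replace raw product text in the online ranking prompt.

```python
from dataclasses import dataclass, field

# Hypothetical category-aware schemas: the attributes assumed to matter
# for similarity within each category (illustrative values only).
CATEGORY_SCHEMAS = {
    "laptop": ["brand", "screen_size", "ram"],
    "shoe": ["brand", "size", "material"],
}


@dataclass
class AttributeGraph:
    """Toy attribute graph: product -> {attribute: value} edges."""
    edges: dict = field(default_factory=dict)
    categories: dict = field(default_factory=dict)

    def add_product(self, product_id, category, raw_attributes):
        # Offline stage (sketch): keep only attributes in the category schema.
        # raw_attributes is assumed to come from an LLM extraction step.
        schema = CATEGORY_SCHEMAS.get(category, [])
        self.categories[product_id] = category
        self.edges[product_id] = {
            key: raw_attributes[key] for key in schema if key in raw_attributes
        }

    def serialize(self, product_id):
        # Compact structured form used in place of the raw product text.
        attrs = self.edges[product_id]
        body = "; ".join(f"{key}={value}" for key, value in attrs.items())
        return f"[{self.categories[product_id]}] {body}"


def build_ranking_prompt(graph, query_id, candidate_ids):
    # Online stage (sketch): assemble the prompt an LLM ranker would receive.
    lines = [f"Query: {graph.serialize(query_id)}", "Candidates:"]
    for index, candidate_id in enumerate(candidate_ids, start=1):
        lines.append(f"{index}. {graph.serialize(candidate_id)}")
    lines.append("Rank the candidates by similarity to the query.")
    return "\n".join(lines)


graph = AttributeGraph()
graph.add_product("q1", "laptop", {"brand": "Acme", "ram": "16GB", "color": "red"})
graph.add_product("c1", "laptop", {"brand": "Acme", "screen_size": "14in"})
graph.add_product("c2", "laptop", {"brand": "Zeta", "ram": "8GB"})
prompt = build_ranking_prompt(graph, "q1", ["c1", "c2"])
```

In this sketch the `color` attribute is dropped for laptops because it is not in the assumed schema, which is one way a category-aware schema can shorten the text passed to the ranker. The actual token savings depend on the schemas and the product data.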