Search papers, labs, and topics across Lattice.
This paper tackles the WikiKG90Mv2 knowledge graph embedding task by employing a retrieve-then-re-rank pipeline. They introduce a priority infilling retrieval model to efficiently identify structurally and semantically similar candidate entities. Furthermore, they use an ensemble-based re-ranking model with neighbor-enhanced representations to refine link predictions. Their method achieves a significant improvement in MRR on the validation set, increasing it from 0.2342 to 0.2839.
A surprisingly simple retrieve-then-re-rank pipeline, enhanced with priority infilling and neighbor-aware re-ranking, achieves state-of-the-art results on the massive WikiKG90Mv2 knowledge graph.
WikiKG90Mv2 in NeurIPS 2022 is a large encyclopedic knowledge graph. Embedding knowledge graphs into continuous vector spaces is important for many practical applications, such as knowledge acquisition, question answering, and recommendation systems. Compared to existing knowledge graphs, WikiKG90Mv2 is a large scale knowledge graph, which is composed of more than 90 millions of entities. Both efficiency and accuracy should be considered when building graph embedding models for knowledge graph at scale. To this end, we follow the retrieve then re-rank pipeline, and make novel modifications in both retrieval and re-ranking stage. Specifically, we propose a priority infilling retrieval model to obtain candidates that are structurally and semantically similar. Then we propose an ensemble based re-ranking model with neighbor enhanced representations to produce final link prediction results among retrieved candidates. Experimental results show that our proposed method outperforms existing baseline methods and improves MRR of validation set from 0.2342 to 0.2839.