Search papers, labs, and topics across Lattice.
School of Computer Science and Technology, Xinjiang University, Urumqi, 830017, China, Xinjiang Key Laboratory of Signal Detection and Processing, Urumqi, 830017, China
1
1
0
5
A novel visual encoder is designed that aims to obtain summary-oriented visual features to help generate higher-quality summaries and introduces a minimum margin loss to suppress the overconfidence problem of the model when generating text during reasoning.