Search papers, labs, and topics across Lattice.
Institute of Science Tokyo, NII LLMC
3
0
6
4
Training VLMs on Jagle, the largest Japanese multimodal dataset, not only crushes existing models on Japanese tasks, but *also* boosts English performance when combined with English data.
Japanese VQA benchmarks are riddled with issues that lead to misleading model comparisons, but JAMMEval fixes this with a rigorous, two-stage refinement process.
LLMs can harbor hidden biases in their reasoning processes, even when reaching unbiased conclusions, and a new Japanese benchmark exposes these subtle cultural biases.