Search papers, labs, and topics across Lattice.
This paper analyzes the limitations of LaTeX in the context of LLM-assisted scientific writing, focusing on compilation efficiency, semantic generation, error localization, and tool ecosystem. It introduces Mogan STEM, a WYSIWYG structured editor, as an alternative that addresses these limitations through an efficient data structure, fast rendering, and on-demand plugin loading. Experiments demonstrate that Mogan outperforms LaTeX in compilation/rendering time, LLM task performance, and LLM fine-tuning efficiency due to its lower information entropy.
LaTeX's reign as the king of scientific documents may be threatened: a new WYSIWYG editor, Mogan STEM, significantly speeds up compilation and even improves LLM fine-tuning compared to TeX.
As large language models (LLMs) increasingly assist scientific writing, limitations and the significant token cost of TeX become more and more visible. This paper analyzes TeX's fundamental defects in compilation and user experience design to illustrate its limitations on compilation efficiency, generated semantics, error localization, and tool ecosystem in the era of LLMs. As an alternative, Mogan STEM, a WYSIWYG structured editor, is introduced. Mogan outperforms TeX in the above aspects by its efficient data structure, fast rendering, and on-demand plugin loading. Extensive experiments are conducted to verify the benefits on compilation/rendering time and performance in LLM tasks. What's more, we show that due to Mogan's lower information entropy, it is more efficient to use .tmu (the document format of Mogan) to fine-tune LLMs than TeX. Therefore, we launch an appeal for larger experiments on LLM training using the .tmu format.