Search papers, labs, and topics across Lattice.
The paper introduces LoRA-MME, a multi-model ensemble architecture for code comment classification that leverages parameter-efficient fine-tuning (PEFT) via LoRA. Four transformer encoders (UniXcoder, CodeBERT, GraphCodeBERT, CodeBERTa) are independently fine-tuned and their predictions are aggregated using a learned weighted ensemble. The approach achieved an F1 Weighted score of 0.7906 and a Macro F1 of 0.6867, demonstrating improved classification performance compared to single models.
LoRA-MME's ensemble of LoRA-tuned code encoders achieves state-of-the-art code comment classification, but at the cost of inference efficiency.
Code comment classification is a critical task for automated software documentation and analysis. In the context of the NLBSE'26 Tool Competition, we present \textbf{LoRA-MME}, a Multi-Model Ensemble architecture utilizing Parameter-Efficient Fine-Tuning (PEFT). Our approach addresses the multi-label classification challenge across Java, Python, and Pharo by combining the strengths of four distinct transformer encoders: UniXcoder, CodeBERT, GraphCodeBERT, and CodeBERTa. By independently fine-tuning these models using Low-Rank Adaptation(LoRA) and aggregating their predictions via a learned weighted ensemble strategy, we maximize classification performance without the memory overhead of full model fine-tuning. Our tool achieved an \textbf{F1 Weighted score of 0.7906} and a \textbf{Macro F1 of 0.6867} on the test set. However, the computational cost of the ensemble resulted in a final submission score of 41.20\%, highlighting the trade-off between semantic accuracy and inference efficiency.