All publications
| 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | 151-175 | 176-200 | 201-225 | 226-250 | 251-275 | 276-300 | 301-325 | 326-350 | 351-375 | 376-400 | 401-425 | 426-450 | 451-475 | 476-500 | 501-525 | 526-550 | 551-575 | 576-582 |
2025
| , and , Back Attention: Understanding and Enhancing Multi-Hop Reasoning in Large Language Models, in: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 11257–11272, 2025 |
[DOI] [URL] |
| , and , Break the Checkbox: Challenging Closed-Style Evaluations of Cultural Alignment in LLMs, in: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 24–51, 2025 |
[DOI] [URL] |
| , , and , ConspEmoLLM-v2: A robust and stable model to detect sentiment-transformed conspiracy theories, in: Proceedings of the 14th Conference on Prestigious Applications of Intelligent Systems (PAIS-2025), pages 5311 - 5318, 2025 |
[URL] |
| , and , Disentangled VAD Representations via a Variational Framework for Political Stance Detection, arXiv, 2025 |
[URL] |
| , , , , , , , , and , DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation, arXiv, 2025 |
[DOI] [URL] |
| , , , , , , , , and , ELAINE-medLLM: Lightweight English Japanese Chinese Trilingual Large Language Model for Bio-medical Domain, in: Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025), pages 4670–4688, 2025 |
[URL] |
| , , and , EMPEC: A Comprehensive Benchmark for Evaluating Large Language Models Across Diverse Healthcare Professions, in: Findings of the Association for Computational Linguistics: ACL 2025, pages 9945–9958, 2025 |
[DOI] [URL] |
| and , Enhancing Stress Detection on Social Media Through Multi-Modal Fusion of Text and Synthesized Visuals, in: Proceedings of the 24th Workshop on Biomedical Language Processing (BioNLP), pages 34–43, 2025 |
[DOI] [URL] |
| , , , , , , and , Exploring Safety Alignment Evaluation of LLMs in Chinese Mental Health Dialogues via LLM-as-Judge, arXiv, 2025 |
[DOI] [URL] |
| , , , , , , , , , , , and , FinAudio: A Benchmark for Audio Large Language Models in Financial Applications, arXiv, 2025 |
[URL] |
| , , , , , , , , , , , , , , , , , , , , , , , and , FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning, arXiv, 2025 |
[DOI] [URL] |
| , , , , , , , , , , , , and , FinNLP-FNP-LLMFinLegal-2025 Shared Task: Financial Misinformation Detection Challenge Task, in: Proceedings of the Joint Workshop of the 9th Financial Technology and Natural Language Processing (FinNLP), the 6th Financial Narrative Processing (FNP), and the 1st Workshop on Large Language Models for Finance and Legal (LLMFinLegal), pages 271–276, 2025 |
[URL] |
| , , , , , , , , , , , and , FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading, in: Findings of the Association for Computational Linguistics: ACL 2025, pages 13921–13934, 2025 |
[DOI] [URL] |
| , , , , and , FMDLlama: Financial Misinformation Detection Based on Large Language Models, in: Proceedings of the ACM on Web Conference 2025, pages 1153 - 1157, 2025 |
[DOI] [URL] |
| , and , From n-gram to Attention: How Model Architectures Learn and Propagate Bias in Language Modelin, in: Findings of the Association for Computational Linguistics: EMNLP 2025, pages 18478–18498, 2025 |
[DOI] [URL] |
| , , , , , , , , , and , From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models, arXiv, 2025 |
[DOI] [URL] |
| , and , IRIS: Rapid Curation Framework for Iterative Improvement of Noisy Named Entity Annotations, in: Proceedings of the International Conference on Applications of Natural Language to Information Systems, pages 58-69, 2025 |
[DOI] [URL] |
| , , , , , , , , , , and , Large Language Models in Mental Health Care: A Scoping Review (2025), in: Current Treatment Options in Psychiatry, 12(27) |
[DOI] [URL] |
| and , Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs, in: Findings of the Association for Computational Linguistics: EMNLP 2024, pages 7065–7078, 2025 |
[DOI] [URL] |
| , , , , and , MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs, arXiv, 2025 |
[URL] |
| , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation, arXiv, 2025 |
[URL] |
| , , , , , , and , Natural Language Processing for Cardiology: A Narrative Review, arXiv, 2025 |
[DOI] [URL] |
| , , , , , , , , and , Overview of the BioLaySumm 2025 Shared Task on Lay Summarization of Biomedical Research Articles and Radiology Reports, in: Proceedings of the 24th Workshop on Biomedical Language Processing, pages 365–377, 2025 |
[DOI] [URL] |
| , , , , , , and , Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs, arXiv, 2025 |
[DOI] [URL] |
| , , , , , , , , and , Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance, in: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 30176–30202, 2025 |
[DOI] [URL] |
| 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | 151-175 | 176-200 | 201-225 | 226-250 | 251-275 | 276-300 | 301-325 | 326-350 | 351-375 | 376-400 | 401-425 | 426-450 | 451-475 | 476-500 | 501-525 | 526-550 | 551-575 | 576-582 |
