Selective Preference Optimization via Token-Level Reward Function Estimation - NaCTeM Publications

Publicatietype:	In proceedings
Citatie:	yang:2025
Publication status:	Accepted
Boektitel:	Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Jaar:	2025
Pagina's:	7032–7056
URL:	https://aclanthology.org/2025....
DOI:	10.18653/v1/2025.emnlp-main.359
Trefwoorden:
Auteurs	Yang, K Liu, Z. Xie, Q. Huang, J. Min, E. Ananiadou, S.
Toegevoegd door:	[PRT]
Totaalscore:	0
Bestanden

Aantekeningen

Onderwerpen

Tijd voor verwerking: 0.0404 seconden.