Selective Preference Optimization via Token-Level Reward Function Estimation - NaCTeM Publications

Type of publication:	Inproceedings
Citation:	yang:2025
Publication status:	Accepted
Booktitle:	Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Year:	2025
Pages:	7032–7056
URL:	https://aclanthology.org/2025....
DOI:	10.18653/v1/2025.emnlp-main.359
Authors:	Yang, K.; Liu, Z.; Xie, Q.; Huang, J.; Min, E.; Ananiadou, S.