REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning
Published in arXiv preprint arXiv:2505.11718, 2025
This paper proposes a multi-objective reward function that quantifies the quality of peer review comments, then trains an LLM with reinforcement learning on that reward to generate better peer review comments.
Recommended citation: Taechoyotin, P., & Acuna, D. (2025). REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning. arXiv preprint arXiv:2505.11718.
