REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning

Published in arXiv preprint arXiv:2505.11718, 2025

This paper proposes a multi-objective reward function to quantify the quality of peer review comments and train a LLM via Reinforcement Learning based-on the reward function for better peer review comments.

Recommended citation: Taechoyotin, P., & Acuna, D. (2025). REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning. arXiv preprint arXiv:2505.11718.

Recommended citation: Taechoyotin, P., & Acuna, D. (2025). REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning. arXiv preprint arXiv:2505.11718.
Download Paper

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)

Pawin Taechoyotin

Share on