109989

: It achieves a high success rate because LLMs are highly likely to follow instructions appearing at the very beginning of a prompt.

: The primary limitation is that it requires indirect prompt injection (placing hidden text in the source PDF), meaning it only works if the reviewer uploads the specific document to an AI tool. Detecting LLM-Generated Peer Reviews - arXiv 109989

: It has proven effective even against common "reviewer defenses," such as light editing or rephrasing. : It achieves a high success rate because

The topic originates from a 2025 study on Detecting LLM-Generated Peer Reviews . Researchers developed a watermarking system that uses fabricated citations to flag reviews created by AI instead of human experts. 109989