OpenAI’s new “CriticGPT” model is trained to criticize GPT-4 outputs

OpenAI’s new “CriticGPT” model is trained to criticize GPT-4 outputs

6 months ago
Anonymous $genLyrxdTY

https://arstechnica.com/information-technology/2024/06/openais-criticgpt-outperforms-humans-in-catching-ai-generated-code-bugs/

On Thursday, OpenAI researchers unveiled CriticGPT, a new AI model designed to identify mistakes in code generated by ChatGPT. It aims to enhance the process of making AI systems behave in ways humans want (called "alignment") through Reinforcement Learning from Human Feedback (RLHF), which helps human reviewers make large language model (LLM) outputs more accurate.