With Evals, OpenAI hopes to crowdsource AI model testing

a year ago
Anonymous $Gb26S9Emwz

https://techcrunch.com/2023/03/14/with-evals-openai-hopes-to-crowdsource-ai-model-testing/

Alongside GPT-4, OpenAI has open-sourced a software framework to evaluate the performance of its AI models. OpenAI says the framework, called Evals, will allow anyone to report shortcomings in its models to help guide improvements.

It’s a sort of crowdsourcing approach to model testing, OpenAI explains in a blog post.