IBM hopes to fight bias in facial recognition with new diverse dataset

6 years ago
Anonymous $cyhBy-qkd5

https://www.theverge.com/2018/6/27/17509400/facial-recognition-bias-ibm-data-training

Bias is a big problem in facial recognition, with studies showing that commercial systems are more accurate if you’re white and male. Part of the reason for this is a lack of diversity in the training data, with people of color appearing less frequently than their peers. IBM is one of the companies trying to combat this problem, and today announced two new public datasets that anyone can use to train facial recognition systems, one of which has been curated specifically to help remove bias.

The first dataset contains 1 million images and will help train systems that can spot specific attributes, like hair color, eye color, and facial hair. Each face is annotated with these characteristics, making it easier for programmers to hone their systems to better distinguish between, say, a goatee and a soul patch. It’s not the largest public dataset for training facial recognition systems, but IBM says it’s the biggest to include such tags.