In this blog post we discuss a tool we build to ensure data used for training is balanced and has minimal labelling bias.