We make it easy to run and share evaluations of multimodal models. Most tools focus on language models (LLMs); multimodal models deserve their own evaluation tools. We believe it should be easy for everyone to compare models & publically share evaluations.