We make it easy to run and share evaluations of multimodal models. Most tools focus on language models (LLMs); multimodal models deserve their own evaluation tools. We believe it should be easy for everyone to compare models & publically share evaluations.
We make it easy to run and share evaluations of multimodal models.
Most tools focus on language models (LLMs); multimodal models deserve their own evaluation tools.
We believe it should be easy for everyone to compare models & publically share evaluations.