This repository contains the Winogrande dataset together with result datasets from LLMs served by TogetherAI, each evaluated on the Winogrande test set to measure model performance. Use Oxen's Compare feature to see which prompts a model answers correctly.
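
For readers who want to check results outside of Oxen, the same idea can be sketched in a few lines of Python. This is only an illustration, not how Oxen's Compare feature works internally. It assumes each result row is a dict with an `answer` field (the gold label) and a hypothetical `prediction` field (the model's output); the actual column names in this repository's files may differ.

```python
from typing import Dict, List, Tuple

Row = Dict[str, str]


def split_by_correctness(rows: List[Row]) -> Tuple[List[Row], List[Row]]:
    """Split rows into (correct, incorrect) by comparing prediction to answer.

    Assumes hypothetical "prediction" and "answer" keys on every row.
    """
    correct: List[Row] = []
    incorrect: List[Row] = []
    for row in rows:
        # Compare after stripping whitespace so "1 " and "1" match.
        if row["prediction"].strip() == row["answer"].strip():
            correct.append(row)
        else:
            incorrect.append(row)
    return correct, incorrect


def accuracy(rows: List[Row]) -> float:
    """Fraction of rows whose prediction matches the answer (0.0 if empty)."""
    if not rows:
        return 0.0
    correct, _ = split_by_correctness(rows)
    return len(correct) / len(rows)
```

Rows could be loaded with `csv.DictReader` from an exported results file, then passed to `accuracy` for an overall score or to `split_by_correctness` to inspect the individual prompts a model missed.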