Contains HellaSwag Validation dataset with Labels as well as datasets of LLMs from TogetherAI evaluating on HellaSwag. Use Compare to find the differences in the model's response and the true label.