This repository is an effort to recreate the "Self-Rewarding Language Models" paper by the team at [Meta.ai](http://meta.ai/), using a smaller model that the community can fine-tune. Paper: https://arxiv.org/abs/2401.10020