Microsoft Releases Orca 2: Teaching Small Language Models How to Reason

As large language models continue to advance AI capabilities, there is also tremendous value in developing smaller models that retain strong reasoning abilities while using far fewer computational resources. Microsoft’s latest Orca 2 models demonstrate how smaller neural networks can acquire significant reasoning skills through careful training methodology. By leveraging the knowledge within large language models to create tailored training data, Orca 2 matches or even exceeds the performance of models 5-10 times its size on complex reasoning tasks.

The two Orca 2 variants, weighing in at 7 billion and 13 billion parameters, showcase clever techniques for instilling strong logical reasoning in compact model architectures. Building on the success of the original Orca release earlier this year, Orca 2 represents the next milestone in Microsoft’s mission to democratize access to capable AI systems. Its state-of-the-art results provide a blueprint for developing and deploying reasoning-focused models that do not require massive compute budgets. By open-sourcing Orca 2, Microsoft enables the broader research community to further advance this important work on efficient and aligned language models.

More Efficient Reasoning

Orca 2 demonstrates that strong reasoning skills can be attained without the massive computational resources required by frontier LLMs. Through improved training signals and methods, smaller language models can achieve enhanced reasoning abilities: on complex tasks that test advanced reasoning in zero-shot settings, Orca 2 significantly surpasses models of similar size and matches or exceeds models 5-10 times larger. The key insight behind Orca 2 is that different tasks benefit from different solution strategies, and the strategy employed by a large model may not be the best choice for a smaller one. Orca 2 is therefore trained on an expanded, highly tailored synthetic dataset that teaches it various reasoning techniques and how to choose an appropriate solution strategy for each task.
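The article does not spell out the data pipeline, but the Orca 2 paper describes training the student on teacher demonstrations while withholding the detailed strategy instruction (a technique the paper calls prompt erasing). The sketch below is a minimal, hypothetical illustration of that idea, not Microsoft's actual pipeline; `STRATEGY_PROMPTS`, `GENERIC_PROMPT`, `call_teacher_model`, and `build_training_example` are invented names, and the strategy wordings are assumptions.

```python
# Illustrative sketch only: NOT Microsoft's actual data pipeline.
# It shows the idea of pairing each task type with a tailored reasoning
# strategy, collecting a teacher demonstration, then storing the example
# under a generic prompt ("prompt erasing") so the student must learn
# *when* to apply each strategy rather than being told explicitly.

STRATEGY_PROMPTS = {
    "math_word_problem": "Solve step by step, showing every calculation.",
    "reading_comprehension": "First quote the relevant passage, then answer.",
    "commonsense_qa": "Answer directly and concisely.",
}

GENERIC_PROMPT = "You are a helpful assistant."


def call_teacher_model(system: str, user: str) -> str:
    # Hypothetical stand-in for a strong teacher LLM; a real pipeline
    # would call an actual model API here.
    return f"[teacher demonstration following: {system!r}]"


def build_training_example(task_type: str, question: str) -> dict:
    """Build one (system, user, response) triple for student fine-tuning."""
    strategy = STRATEGY_PROMPTS[task_type]
    # The teacher sees the detailed, strategy-specific instruction...
    teacher_answer = call_teacher_model(system=strategy, user=question)
    # ...but the stored example shows the student only a generic prompt,
    # so it must infer the right strategy from the task itself.
    return {"system": GENERIC_PROMPT, "user": question, "response": teacher_answer}


if __name__ == "__main__":
    example = build_training_example(
        "math_word_problem",
        "A train travels 60 miles in 45 minutes. What is its speed in mph?",
    )
    print(example)
```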

Democratizing Capable AI

Microsoft has open-sourced both the 7 billion and 13 billion parameter variants of Orca 2 to encourage further research on the development, evaluation, and alignment of smaller language models. Access to a high-performing compact model lets more researchers study how reasoning abilities can be built into efficient architectures, and to build on Orca 2’s results in zero-shot reasoning tasks, where it performs comparably to or better than models 5-10 times its size. Its use of diverse reasoning techniques, and its selection of different solution strategies for different tasks, offer valuable starting points for anyone working to enhance the reasoning abilities of compact, efficient models.
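Both checkpoints are published on Hugging Face as microsoft/Orca-2-7b and microsoft/Orca-2-13b. Below is a minimal inference sketch using the transformers library; the ChatML-style prompt format and the slow-tokenizer recommendation follow the model card, though the system message here is abbreviated and should be treated as an assumption.

```python
# Minimal usage sketch, assuming the Hugging Face checkpoint IDs from
# Microsoft's release and the ChatML-style prompt format from the model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Orca-2-7b"  # or "microsoft/Orca-2-13b"
# The model card recommends the slow tokenizer for Orca 2.
tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=False)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

system = "You are Orca, an AI language model created by Microsoft."
user = "A train travels 60 miles in 45 minutes. What is its speed in mph?"
prompt = (
    f"<|im_start|>system\n{system}<|im_end|>\n"
    f"<|im_start|>user\n{user}<|im_end|>\n"
    f"<|im_start|>assistant\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
answer = tokenizer.decode(
    output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(answer)
```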
