Researchers from CMU and Microsoft Introduce TinyGSM: A Synthetic Dataset Containing GSM8K-Style Math Word Problems Paired with Python Solutions

December 19, 2023

In natural language processing, the spotlight is shifting toward the untapped potential of small language models (SLMs). While their larger counterparts have dominated the landscape, the question lingers: just how critical is model size for effective problem-solving? The study explores this question, examining the advantages of SLMs and introducing TinyGSM.

Researchers from Carnegie Mellon University and Microsoft Research introduce TinyGSM, a synthetic dataset comprising 12.3 million grade school math problems paired with Python solutions generated by GPT-3.5. It serves as a tool for studying small language models (SLMs) in mathematical reasoning. The approach combines this high-quality dataset with a verifier to boost performance, surpassing larger models in accuracy.
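
To make the data format concrete, here is an illustrative pairing of a grade school word problem with a Python solution. The problem text, the function name `solve`, and the numbers are invented for this sketch and are not taken from TinyGSM.

```python
# Hypothetical example of a word problem paired with a Python solution.
# The problem text and the function below are illustrative only;
# they are not drawn from the TinyGSM dataset.

PROBLEM = (
    "A bakery sells muffins for 3 dollars each. "
    "Maria buys 4 muffins and pays with a 20 dollar bill. "
    "How much change does she get?"
)


def solve() -> int:
    """Work through the problem step by step and return the numeric answer."""
    price_per_muffin = 3
    muffins_bought = 4
    amount_paid = 20
    total_cost = price_per_muffin * muffins_bought
    change = amount_paid - total_cost
    return change


if __name__ == "__main__":
    print(PROBLEM)
    print("Answer:", solve())
```

Expressing the solution as code means the arithmetic is carried out by the Python interpreter rather than by the language model itself.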

The study weighs the efficacy of data utilization against conventional scaling laws in model improvement, emphasizing the significance of synthetic data generation in data-scarce scenarios. It notes that increasing dataset size can compensate for smaller model sizes. It also highlights that using verifiers to select the best response from multiple candidates has proven successful in prior work.

The study addresses the under-explored potential of SLMs in mathematical reasoning, focusing on breaking the 80% accuracy barrier on the challenging GSM8K benchmark for grade school math problems. To achieve this, the researchers propose combining high-quality datasets like TinyGSM with a verifier model that selects the best output from multiple candidate generations. The study explores synthetic data generation, prompt-engineered data, and a teacher-student scenario to enhance small model performance, introducing TinyGSM as a synthetic dataset that enables high accuracy on the GSM8K benchmark.
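
Selecting the best output from multiple candidates with a verifier can be sketched as follows. This is a minimal illustration, not the authors' implementation: `select_best` and `toy_verifier` are hypothetical names, and the toy scorer is a placeholder for what would in practice be a trained verifier model.

```python
from typing import Callable, Sequence


def select_best(candidates: Sequence[str], score: Callable[[str], float]) -> str:
    """Return the candidate that the verifier scores highest."""
    if not candidates:
        raise ValueError("need at least one candidate")
    return max(candidates, key=score)


def toy_verifier(solution: str) -> float:
    """Stand-in scorer (assumption): a real verifier would be a trained model."""
    # Hypothetical heuristic: prefer solutions that actually return a value.
    return 1.0 if "return" in solution else 0.0


# Two candidate generations for the same problem; only the second returns a value.
candidates = [
    "def solve():\n    total = 3 * 4\n",
    "def solve():\n    total = 3 * 4\n    return 20 - total\n",
]
best = select_best(candidates, toy_verifier)
```

The generator only has to produce one good candidate among several; the verifier carries the burden of recognizing it.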

TinyGSM, a synthetic dataset of grade school math problems with Python solutions, is generated entirely by GPT-3.5. The researchers fine-tune a 1.3B generation model and a 1.3B verifier model on TinyGSM; the verifier then selects the best output from multiple candidates, improving accuracy. Filtering maintains data quality by excluding problems that are too short or contain no numeric content. Experiments with different solution formats suggest that scaling the verifier is a more efficient use of model parameters, a finding the study connects to insights from GAN training. Overall, the study underscores that high-quality datasets and verifiers make high accuracy achievable with small language models.
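
A filter of the kind described above, dropping problems that are too short or contain no numbers, might look like the sketch below. The function name `keep_problem` and the length threshold are assumptions made for illustration; the article does not give the actual filtering rules.

```python
import re

MIN_CHARS = 20  # hypothetical threshold, not taken from the paper


def keep_problem(text: str, min_chars: int = MIN_CHARS) -> bool:
    """Keep a problem only if it is long enough and mentions at least one digit."""
    stripped = text.strip()
    has_number = re.search(r"\d", stripped) is not None
    return len(stripped) >= min_chars and has_number


problems = [
    "Tom has 5 apples and eats 2. How many apples are left?",
    "Solve it.",  # too short and no numbers
    "Describe how you feel about fractions in general terms.",  # no numbers
]
kept = [p for p in problems if keep_problem(p)]
```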

Fine-tuning a 1.3B generation model and a 1.3B verifier on TinyGSM achieves 81.5% accuracy on the GSM8K benchmark, surpassing much larger models. This performance rivals that of the GPT-3.5 teacher model that generated the training data, and the model proves robust, reaching 75.6% accuracy on SVAMP without further fine-tuning. The study emphasizes the verifier's efficacy in selecting the best response, suggesting that scaling it is a more efficient use of model parameters. High-quality data and the inclusion of irrelevant context in problems both contribute to improved small language model performance.
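
Because the solutions are Python programs, one plausible way to compute accuracy figures like these is to execute each generated solution and compare its result with a reference answer. The article does not describe the evaluation harness, so the sketch below is an assumption; `run_solution`, `accuracy`, and the entry-point name `solve` are hypothetical.

```python
from typing import Optional, Sequence


def run_solution(code: str, func_name: str = "solve") -> Optional[float]:
    """Execute generated code and call its entry point; return None on any failure."""
    namespace: dict = {}
    try:
        # Caution: exec runs arbitrary code. Real evaluations should sandbox this.
        exec(code, namespace)
        return float(namespace[func_name]())
    except Exception:
        return None


def accuracy(
    solutions: Sequence[str], references: Sequence[float], tol: float = 1e-6
) -> float:
    """Fraction of solutions whose executed result matches the reference answer."""
    if len(solutions) != len(references):
        raise ValueError("solutions and references must have the same length")
    if not solutions:
        return 0.0
    correct = 0
    for code, ref in zip(solutions, references):
        result = run_solution(code)
        if result is not None and abs(result - ref) <= tol:
            correct += 1
    return correct / len(solutions)


# Toy data: the first solution is correct, the second raises ZeroDivisionError.
solutions = [
    "def solve():\n    return 20 - 3 * 4\n",
    "def solve():\n    return 1 / 0\n",
]
references = [8.0, 5.0]
score = accuracy(solutions, references)
```

Solutions that crash or fail to define the entry point count as wrong, which keeps malformed generations from being silently skipped.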

https://arxiv.org/abs/2312.09241

In conclusion, the study highlights the potential of SLMs for grade school mathematical reasoning. By combining high-quality datasets like TinyGSM with a verifier model, SLMs can surpass larger models in accuracy on the GSM8K benchmark. The study also emphasizes that quality datasets and verifiers can help bridge the performance gap between student and teacher models. The results suggest that SLMs are a promising approach to efficient and effective mathematical reasoning.


Check out the Paper. All credit for this research goes to the researchers of this project.


Sana Hassan, a consulting intern at Marktechpost and dual-degree student at IIT Madras, is passionate about applying technology and AI to address real-world challenges. With a keen interest in solving practical problems, he brings a fresh perspective to the intersection of AI and real-life solutions.

