Close Menu
  • Home
  • Daily
  • AI
  • Crypto
  • Bitcoin
  • Stock Market
  • E-game
  • Casino
  • World
  • Affiliate News
  • English
    • Português
    • English
    • Español

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Mario Kart World Easter Egg Lets You Mirror the Free Roam Map

June 9, 2025

live updates for June 9 2025

June 8, 2025

Gyaltsen Norbu: China-appointed Panchen Lama vows to make Tibetan Buddhism more Chinese in meeting with Xi Jinping

June 8, 2025
Facebook X (Twitter) Instagram
MetaDaily – Breaking News in Crypto, Markets & Digital Trends
  • Home
  • Daily
  • AI
  • Crypto
  • Bitcoin
  • Stock Market
  • E-game
  • Casino
  • World
  • Affiliate News
  • English
    • Português
    • English
    • Español
MetaDaily – Breaking News in Crypto, Markets & Digital Trends
Home » DeepSeek’s distilled new R1 AI model can run on a single GPU
AI

DeepSeek’s distilled new R1 AI model can run on a single GPU

adminBy adminMay 29, 2025No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
Share
Facebook Twitter LinkedIn Pinterest Email


DeepSeek’s updated R1 reasoning AI model might be getting the bulk of the AI community’s attention this week. But the Chinese AI lab also released a smaller, “distilled” version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks.

The smaller updated R1, which was built using the Qwen3-8B model Alibaba launched in May as a foundation, performs better than Google’s Gemini 2.5 Flash on AIME 2025, a collection of challenging math questions.

DeepSeek-R1-0528-Qwen3-8B also nearly matches Microsoft’s recently released Phi 4 reasoning plus model on another math skills test, HMMT.

So-called distilled models like DeepSeek-R1-0528-Qwen3-8B are generally less capable than their full-sized counterparts. On the plus side, they’re far less computationally demanding. According to the cloud platform NodeShift, Qwen3-8B requires a GPU with 40GB-80GB of RAM to run (e.g., an Nvidia H100). The full-sized new R1 needs around a dozen 80GB GPUs.

DeepSeek trained DeepSeek-R1-0528-Qwen3-8B by taking text generated by the updated R1 and using it to fine-tune Qwen3-8B. In a dedicated web page for the model on the AI dev platform Hugging Face, DeepSeek describes DeepSeek-R1-0528-Qwen3-8B as “for both academic research on reasoning models and industrial development focused on small-scale models.”

DeepSeek-R1-0528-Qwen3-8B is available under a permissive MIT license, meaning it can be used commercially without restriction. Several hosts, including LM Studio, already offer the model through an API.



Source link

China deepseek
Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleEhud Olmert: Former Israeli PM blasts war in Gaza
Next Article Josh Brown says he’s not sure if Nike can ever turn it around
admin
  • Website

Related Posts

Lawyers could face ‘severe’ penalties for fake AI-generated citations, UK court warns

June 7, 2025

Trump administration takes aim at Biden and Obama cybersecurity rules

June 7, 2025

Week in Review: Why Anthropic cut access to Windsurf

June 7, 2025

Will Musk vs. Trump affect xAI’s $5 billion debt deal?

June 7, 2025
Leave A Reply Cancel Reply

Our Picks

Voluptatem aliquam adipisci dolor eaque

April 24, 2025

Funeral of Pope Francis Coincides with King’s Day Celebrations in the Netherlands and Curaçao

April 24, 2025

Curaçao’s Waste-to-Energy Plant Remains Unfeasible Due to High Costs

April 23, 2025

Dutch Ministers: No Immediate Threat from Venezuela to ABC Islands

April 23, 2025
Don't Miss
Affiliate Network News

The Sunday Times List of Best Places to Work in 2025

By adminMay 27, 20250

We’re incredibly proud to share that Awin has once again secured a spot on The…

The Sunday Times List of Best Places to Work in 2025

May 23, 2025

Awin Claims Best Affiliate Network or SaaS of the Year at 2025 Performance Marketing Awards

May 15, 2025

Global ThinkTank 2025: Who, What, Where

May 9, 2025
About Us
About Us

Welcome to MetaDaily.io — Your Daily Pulse on the Digital Frontier.

At MetaDaily.io, we bring you the latest, most relevant, and most exciting news from the world of affiliate networks, cryptocurrency, Bitcoin, egaming, and global markets. Whether you’re an investor, gamer, tech enthusiast, or digital entrepreneur, we provide the insights you need to stay ahead of the curve in this fast-moving digital era.

Our Picks

Crypto.com Sues Nevada Over Block on Sports Event Contracts

June 6, 2025

A New Chapter for the Industry

June 5, 2025

Japan’s House of Representatives Passes Online Casino Ban Bill 2025

June 4, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • Privacy Policy
  • Terms & Conditions
  • DMCA
© 2025 metadaily. Designed by metadaily.

Type above and press Enter to search. Press Esc to cancel.