Data Science News

What's moving in data science & applied AI.

Live headlines from the publications we trust most for signal over hype — KDnuggets, MIT News, MarkTechPost and the Google AI Blog. Refreshed on every page load directly from the source. No tracking, no ads, no algorithm.


Latest

Headlines

KDnuggets 3 Jun 2026

5 Fun Papers That Explain LLMs Clearly

Want to understand LLMs better? Start with these five foundational papers that explain how they work.

Read article

MarkTechPost 3 Jun 2026

Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0.15.2 with Streaming Tool Output

Hermes Desktop is a no-terminal GUI sharing one agent core, skills, and memory with the Hermes Agent CLI.

Read article

MarkTechPost 3 Jun 2026

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

NVIDIA released Cosmos 3, open omnimodal world models pairing an autoregressive VLM reasoner with a diffusion generator for physical AI.

Read article

MIT News — AI 3 Jun 2026

MIT researchers teach AI models to interpret charts

The new ChartNet training dataset could improve the accuracy of vision-language models that help analyze business trends or interpret scientific figures.

Read article

MarkTechPost 3 Jun 2026

How to Fine-Tune LFM2 Using QLoRA and DPO: A Complete Step-by-Step Coding Tutorial on Google Colab

Learn to fine-tune LFM2 with QLoRA, supervised fine-tuning, DPO, and adapter merging using TRL and PEFT on Colab.

Read article

MarkTechPost 2 Jun 2026

TinyFish Launches BigSet: An Open-Source Multi-Agent System That Builds Structured Live Datasets from Plain-English Descriptions

Describe a dataset in one sentence; BigSet's orchestrator and parallel sub-agents research the live web and return structured tables.

Read article

KDnuggets 2 Jun 2026

A Gentle Primer on LLM Explainability

This article discusses LLM explainability and outlines the advances, trends, and ongoing developments in this important field of study.

Read article

KDnuggets 2 Jun 2026

10 GitHub Repositories for Modern Database Systems and Tools

Explore 10 top open-source GitHub repositories for modern databases, analytics, SQL, caching, monitoring, replication, PostgreSQL, SQLite, and AI agent memory.

Read article

MarkTechPost 2 Jun 2026

Alibaba’s Qwen Team Launches Qwen3.7-Plus, Adding Vision, Deep Reasoning, Tool Invocation, and Autonomous Iteration on the Bailian Platform

Qwen3.7-Plus is Alibaba's multimodal agent model on Bailian, understanding images and video while adding self-programming and tool invocation.

Read article

MarkTechPost 2 Jun 2026

JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pipelines

JetBrains releases Mellum2, a 12B MoE model trained on 10.6 trillion tokens for AI workflows, under Apache 2.0.

Read article

MarkTechPost 2 Jun 2026

How to Speed Up Transformer Training Using NVIDIA Apex (FusedAdam, FusedLayerNorm) and Native torch.amp

We build NVIDIA Apex from source, detect fused kernels, and benchmark FusedAdam, FusedLayerNorm, and torch.amp in Transformer training.

Read article

MarkTechPost 1 Jun 2026

MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multimodality, and Agentic Coding

MiniMax M3 introduces MiniMax Sparse Attention, a 1M-token context window, and native image, video, and computer use support.

Read article

Google AI Blog 1 Jun 2026

How we used Gemini to build Google I/O 2026

Learn how Googlers used AI to produce Google I/O 2026.

Read article

KDnuggets 1 Jun 2026

Mocking a Year of IoT Sensor Time Series Data with Mimesis

In this guide, you will learn the process of generating a year's worth of daily temperature readings, mimicking a realistic seasonal curve, all together with device-level metadata, and ready to build ba…

Read article

KDnuggets 1 Jun 2026

5 Must-Know Python Concepts for Data Scientists

In this article, we will dive deep into five must-know Python concepts that will help you transition from writing clunky, slow spaghetti code to constructing lightning-fast, production-grade, and beautifully functional d…

Read article

Google AI Blog 29 May 2026

Take our I/O 2026 quiz, vibe coded in Google AI Studio.

We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements.

Read article

Google AI Blog 29 May 2026

9 demos of Gemini Omni and Gemini 3.5 in action

Watch 9 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.

Read article

KDnuggets 29 May 2026

Practical NLP in the Browser with Transformers.js

This tutorial covers three NLP tasks: text classification, zero-shot labelling, and question answering using Transformers.js's pipeline() API.

Read article

KDnuggets 29 May 2026

The ‘Entry-Level’ Gatekeeper: Auditing Job Descriptions with Textstat

This article shows how to use free, open-source tools like Python and its Textstat library to build a script that automates the process of capturing "gatekeeping language" in job descriptions before publishing them.

Read article

Google AI Blog 29 May 2026

Check out real-life AI prototypes from the Futures Lab.

University of Waterloo students develop AI prototypes like sign language tutors to reshape the future of education and work.

Read article

MIT News — AI 28 May 2026

Media Advisory: MIT to establish regional quantum hub

With $25 million investment from the Commonwealth of Massachusetts, MIT to build a new shared-use facility to serve as a statewide quantum toolbox.

Read article

Google AI Blog 28 May 2026

Catch up on 12 major I/O 2026 moments

Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.

Read article

KDnuggets 28 May 2026

Tweaking Local Language Model Settings with Ollama

In this article, we will go deep under the hood of Ollama's configuration engine, exploring how to fine-tune local language model parameters.

Read article

Google AI Blog 22 May 2026

Catch up on the Dialogues stage at Google I/O 2026.

A recap of the 2026 I/O Dialogues, where leaders discuss the future of AI, quantum computing, robotics and creativity.

Read article

Headlines, excerpts and links are republished here under fair-use editorial syndication from publicly available RSS feeds. All copyright remains with the original publishers; please follow the links above to read full articles on the source sites.
