Introducing Community Benchmarks on Kaggle

• Jan 14, 2026: Today’s AI models require more than static accuracy scores. • Community Benchmarks, a new capability on Kaggle, enables th

Customizing multiturn AI agents with reinforcement learning

• Leveraging existing environment simulators and reward functions based on verifiable ground truth boosts task success r

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

• Salesforce on Tuesday launched an entirely rebuilt version of Slackbot, the company’s workplace assistant, transforming it from a simple notification tool into what executives de

Anthropic launches Cowork, a Claude Desktop agent that works in your files - no coding required

• Anthropic released Cowork on Monday, a new AI agent capability that extends the power of its wildly successful Claude Code tool to non-technical users - and according to company

Fine-tuning vision-language models on memory-constrained devices

• A new hybrid optimization approach allows edge devices to fine-tune vision-language models using only forward pas

Nous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code moment

• Nous Research, the open-source artificial intelligence startup backed by crypto venture firm Paradigm, released a new competitive programming model on Monday that it says matches

The unseen work of building reliable AI agents

• AI agents must master countless low-level tasks before handling high-level requests. • A simple ‘book vacation’ command triggers hundreds of micro-interactions across legacy syst

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

• NVIDIA today released Cosmos Reason 2, the latest advancement in open, reasoning vision language models for phy

The creator of Claude Code just revealed his workflow, and developers are losing their minds

• When the creator of the world’s most advanced coding agent speaks, Silicon Valley doesn’t just listen - it takes notes. • For the past we

The 10 most viewed publications of 2025

• From foundation model safety frameworks and formal verification at cloud scale to advanced robotics and multimodal AI reasoning, these are

The 10 most viewed blog posts of 2025

• Time series forecasting has undergone a transformation with the emergence of foundation models, moving beyond traditional statistical methods that extrapolate from single time se

Dialogue Boost: How Amazon is using AI to enhance TV and movie dialogue

• New audio-processing technology is making entertainment more accessible for millions of viewers.

Amazon Nova Forge: "Open training" paradigm that empowers everyone to build their own frontier AI

• As foundation models (FMs) - such as Transformer-based large language models (LLMs) - have grown in popularity, there is a pattern we have seen repeatedly: a new model launches w

AutoGluon assistant: Zero-code AutoML through multiagent collaboration

• A multiagent architecture separates data perception, tool knowledge, execution history, and code generation

AI-native 6G: From networks to intelligence fabrics

• ‘Network language models’ will coordinate complex interactions among intelligent components, computational infrastructure, acc

Using LLMs to improve Amazon product listings

• Large language models are increasing the accuracy, reliability, and consistency of the product catalogue at scale.

The overthinking problem in AI

• Reasoning models can generate seven to 10 times as many tokens as necessary on simple tasks, creating unsustainable costs at scale. • Amazon’s visi

63 Amazon Research Award recipients announced

• Amazon Research Awards (ARA) provides unrestricted funds and AWS Promotional Credits to academic researchers investigating various research topics in multiple disciplines. • This

How Amazon uses AI agents to anticipate and counter cyber threats

• Amazon’s competitive-agent architecture creates a continuous improvement cycle that develops security protection

Making fairness in LLMs observable, quantifiable, and governable

• A new evaluation pipeline called FiSCo uncovers hidden biases and offers an assessment framework that evolves alo

Amazon launches private AI bug bounty to strengthen Nova models

• Partnering with professional security researchers and the academic community to build safer, more robust AI.

A conversation with Kevin Scott: What's next in AI

• Large AI models will keep boosting productivity, creativity, and job satisfaction. • Generative AI will drive scientific breakthroughs and tackle global challenges. • Microsoft’s

From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative

• Mattel designers use DALL·E 2 to generate Hot Wheels concepts via text prompts. • AI lets them iterate quickly, turning models into convertibles, color variants, and more. • DALL

Microsoft open sources its 'farm of the future' toolkit

• Microsoft Research opens Project FarmVibes, releasing FarmVibes.AI toolkit for data‑driven farming. • Andrew Nelson, a fifth‑generation farmer and software engineer, uses the too

How data and AI will transform contact centres for financial services

• Shift from cost‑driven KPIs to experience‑centric metrics boosts loyalty. • AI powers 24/7, empathetic support, replacing rigid 9‑5 scripts. • Unified digital, mobile, and physic

AI-equipped drones study dolphins on the edge of extinction

• AI-powered drones monitor endangered Māui dolphins, aiding conservation efforts in New Zealand. • Only 54 known individuals, making them one of the world’s most threatened marine

Online math tutoring service uses AI to help boost students' skills and confidence

• Eedi offers AI‑driven online math tutoring to address learning gaps post‑COVID. • Students start with a 10‑question dynamic quiz that maps their strengths and weaknesses. • Micro

AI-Mimi is building inclusive TV experiences for Deaf and Hard of Hearing users in Japan

• AI-Mimi delivers real‑time subtitles for Deaf/Hard‑of‑Hearing viewers in Japan. • Uses Microsoft Azure Cognitive Services combined with human input for live TV. • Addresses high

Microsoft's framework for building AI systems responsibly

• Microsoft releases Responsible AI Standard to guide ethical AI system development. • Framework emphasizes fairness, reliability, safety, privacy, inclusiveness, transparency, acc

Singapore develops Asia's first AI-based mobile app for shark and ray fin identification to combat illegal wildlife trade

• Singapore’s NParks, Microsoft, Conservation International launch Fin Finder, AI-powered shark and ray fin ID app. • First mobile app in Asia to visually identify illegal shark an

The opportunity at home - can AI drive innovation in personal assistant devices and sign language?

• Microsoft AI for Accessibility launched sign language workshop, attracting top researchers. • PhD student Abraham Glasser received 3‑year grant to enhance ASL interaction with sm