Optimizing Allreduce Operations for Modern Heterogeneous Architectures with Multiple Processes per GPU

Optimizing Allreduce Operations for Modern Heterogeneous Architectures with Multiple Processes per GPU

• Computer Science > Distributed, Parallel, and Cluster Computing [Submitted on 18 Aug 2025 (v1), last revised 24 Feb 2026 (this version, v2)] Title:Optimizing Allreduce Operations

Basics 2 Breakthroughs: Optimizing Materials for Next-Generation Microelectronics

Basics 2 Breakthroughs: Optimizing Materials for Next-Generation Microelectronics

• Basics 2 Breakthroughs: Optimizing Materials for Next-Generation Microelectronics Video Microelectronics The tiny microchips that power modern technologies are already an impress

Research & Labs · February 23, 2026 (updated February 25, 2026) · 2 min · 395 words
DeepCompile: A Compiler-Driven Approach to Optimizing Distributed Deep Learning Training

DeepCompile: A Compiler-Driven Approach to Optimizing Distributed Deep Learning Training

• Computer Science > Distributed, Parallel, and Cluster Computing [Submitted on 14 Apr 2025 (v1), last revised 19 Feb 2026 (this version, v2)] Title:DeepCompile: A Compiler-Driven

ROIX-Comp: Optimizing X-ray Computed Tomography Imaging Strategy for Data Reduction and Reconstruction

ROIX-Comp: Optimizing X-ray Computed Tomography Imaging Strategy for Data Reduction and Reconstruction

• Electrical Engineering and Systems Science > Image and Video Processing [Submitted on 17 Feb 2026] Title:ROIX-Comp: Optimizing X-ray Computed Tomography Imaging Strategy for Data

Optimizing Communication for Mixture-of-Experts Training with Hybrid Expert Parallel

Optimizing Communication for Mixture-of-Experts Training with Hybrid Expert Parallel

• In LLM training, Expert Parallel (EP) communication for hyperscale mixture-of-experts (MoE) models is challenging. • EP communication is essentially all-to-all, but due to its dy

Optimizing Our E2E Pipeline

Optimizing Our E2E Pipeline

• Optimizing Our E2E Pipeline How We Cut Our Build Times in Half In the world of DevOps and Developer Experience (DevXP), speed and efficiency can make a big difference on an engin

Engineering Blogs · April 14, 2025 (updated February 25, 2026) · 2 min · 272 words