<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Chip-Architecture on Tenu Tech Brief</title>
    <link>https://cluster-site.onrender.com/categories/chip-architecture/</link>
    <description>Recent content in Chip-Architecture on Tenu Tech Brief</description>
    <generator>Hugo -- 0.146.0</generator>
    <language>en-us</language>
    <lastBuildDate>Wed, 25 Feb 2026 17:44:48 +0000</lastBuildDate>
    <atom:link href="https://cluster-site.onrender.com/categories/chip-architecture/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Making Softmax More Efficient with NVIDIA Blackwell Ultra</title>
      <link>https://cluster-site.onrender.com/posts/making-softmax-more-efficient-with-nvidia-blackwell-ultra/</link>
      <pubDate>Wed, 25 Feb 2026 17:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/making-softmax-more-efficient-with-nvidia-blackwell-ultra/</guid>
      <description>• LLM context lengths are exploding, and architectures are moving toward complex attention schemes like Multi-Head Latent Attention (MLA) and Grouped Query Attention (GQA) • As a r</description>
    </item>
    <item>
      <title>Using NVFP4 Low-Precision Model Training for Higher Throughput Without Losing Accuracy</title>
      <link>https://cluster-site.onrender.com/posts/using-nvfp4-low-precision-model-training-for-higher-throughput-without-losing-accuracy/</link>
      <pubDate>Mon, 23 Feb 2026 18:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/using-nvfp4-low-precision-model-training-for-higher-throughput-without-losing-accuracy/</guid>
      <description>• As the sizes of AI models and datasets continue to increase, relying only on higher-precision BF16 training is no longer sufficient. • Key challenges such as training throughput</description>
    </item>
    <item>
      <title>Accelerating Data Processing with NVIDIA Multi-Instance GPU and NUMA Node Localization</title>
      <link>https://cluster-site.onrender.com/posts/accelerating-data-processing-with-nvidia-multi-instance-gpu-and-numa-node-localization/</link>
      <pubDate>Thu, 19 Feb 2026 17:30:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/accelerating-data-processing-with-nvidia-multi-instance-gpu-and-numa-node-localization/</guid>
      <description>• NVIDIA flagship data center GPUs in the NVIDIA Ampere, NVIDIA Hopper, and NVIDIA Blackwell families all feature non-uniform memory access (NUMA) behaviors, but expose a single me</description>
    </item>
    <item>
      <title>Unlocking Ultralow Latency Video for Tethered Drones with Semtech&#39;s BlueRiver® Audio Video Processor Technology</title>
      <link>https://cluster-site.onrender.com/posts/unlocking-ultralow-latency-video-for-tethered-drones-with-semtechs-blueriver-audio-video-processor-technology/</link>
      <pubDate>Wed, 18 Feb 2026 19:51:11 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/unlocking-ultralow-latency-video-for-tethered-drones-with-semtechs-blueriver-audio-video-processor-technology/</guid>
      <description>• Unlocking Ultralow Latency Video for Tethered Drones with Semtech&amp;rsquo;s BlueRiver® Audio Video Processor Technology Tethered drones are rapidly expanding across surveillance, industr</description>
    </item>
    <item>
      <title>Unlock Massive Token Throughput with GPU Fractioning in NVIDIA Run:ai</title>
      <link>https://cluster-site.onrender.com/posts/unlock-massive-token-throughput-with-gpu-fractioning-in-nvidia-runai/</link>
      <pubDate>Wed, 18 Feb 2026 18:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/unlock-massive-token-throughput-with-gpu-fractioning-in-nvidia-runai/</guid>
      <description>• As AI workloads scale, achieving high throughput, efficient resource usage, and predictable latency becomes essential. • NVIDIA Run:ai addresses these challenges through intellig</description>
    </item>
    <item>
      <title>Topping the GPU MODE Kernel Leaderboard with NVIDIA cuda.compute</title>
      <link>https://cluster-site.onrender.com/posts/topping-the-gpu-mode-kernel-leaderboard-with-nvidia-cuda.compute/</link>
      <pubDate>Wed, 18 Feb 2026 17:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/topping-the-gpu-mode-kernel-leaderboard-with-nvidia-cuda.compute/</guid>
      <description>• Topping the GPU MODE Kernel Leaderboard with NVIDIA cuda.compute The leaderboard scores how fast users&amp;rsquo; custom GPU kernels solve a set of standard problems like vector addition,</description>
    </item>
    <item>
      <title>How NVIDIA Extreme Hardware-Software Co-Design Delivered a Large Inference Boost for Sarvam AI&#39;s Sovereign Models</title>
      <link>https://cluster-site.onrender.com/posts/how-nvidia-extreme-hardware-software-co-design-delivered-a-large-inference-boost-for-sarvam-ais-sovereign-models/</link>
      <pubDate>Wed, 18 Feb 2026 16:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/how-nvidia-extreme-hardware-software-co-design-delivered-a-large-inference-boost-for-sarvam-ais-sovereign-models/</guid>
      <description>• As global AI adoption accelerates, developers face a growing challenge: delivering large language model (LLM) performance that meets real-world latency and cost requirements. • R</description>
    </item>
    <item>
      <title>Build AI-Ready Knowledge Systems Using 5 Essential Multimodal RAG Capabilities</title>
      <link>https://cluster-site.onrender.com/posts/build-ai-ready-knowledge-systems-using-5-essential-multimodal-rag-capabilities/</link>
      <pubDate>Tue, 17 Feb 2026 18:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/build-ai-ready-knowledge-systems-using-5-essential-multimodal-rag-capabilities/</guid>
      <description>• Enterprise data is inherently complex: real-world documents are multimodal, spanning text, tables, charts and graphs, images, diagrams, scanned pages, forms, and embedded metadat</description>
    </item>
    <item>
      <title>LoRa® and LoRaWAN® for Industrial Sensors: Enabling Smart, Connected Operations</title>
      <link>https://cluster-site.onrender.com/posts/lora-and-lorawan-for-industrial-sensors-enabling-smart-connected-operations/</link>
      <pubDate>Wed, 11 Feb 2026 17:15:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/lora-and-lorawan-for-industrial-sensors-enabling-smart-connected-operations/</guid>
      <description>• LoRa® and LoRaWAN® for Industrial Sensors: Enabling Smart, Connected Operations The industrial sector is experiencing a significant transformation as organizations strive to enha</description>
    </item>
    <item>
      <title>R²D²: Scaling Multimodal Robot Learning with NVIDIA Isaac Lab</title>
      <link>https://cluster-site.onrender.com/posts/rd-scaling-multimodal-robot-learning-with-nvidia-isaac-lab/</link>
      <pubDate>Tue, 10 Feb 2026 18:30:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/rd-scaling-multimodal-robot-learning-with-nvidia-isaac-lab/</guid>
      <description>• Building robust, intelligent robots requires testing them in complex environments. • However, gathering data in the physical world is expensive, slow, and often dangerous. • It i</description>
    </item>
    <item>
      <title>Using Accelerated Computing to Live-Steer Scientific Experiments at Massive Research Facilities</title>
      <link>https://cluster-site.onrender.com/posts/using-accelerated-computing-to-live-steer-scientific-experiments-at-massive-research-facilities/</link>
      <pubDate>Tue, 10 Feb 2026 17:30:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/using-accelerated-computing-to-live-steer-scientific-experiments-at-massive-research-facilities/</guid>
      <description>• Scientists and engineers who design and build unique scientific research facilities face similar challenges. • These include managing massive data rates that exceed current compu</description>
    </item>
    <item>
      <title>Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy</title>
      <link>https://cluster-site.onrender.com/posts/automating-inference-optimizations-with-nvidia-tensorrt-llm-autodeploy/</link>
      <pubDate>Mon, 09 Feb 2026 18:30:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/automating-inference-optimizations-with-nvidia-tensorrt-llm-autodeploy/</guid>
      <description>• NVIDIA TensorRT LLM enables developers to build high-performance inference engines for large language models (LLMs), but deploying a new architecture traditionally requires signi</description>
    </item>
    <item>
      <title>3 Ways NVFP4 Accelerates AI Training and Inference</title>
      <link>https://cluster-site.onrender.com/posts/3-ways-nvfp4-accelerates-ai-training-and-inference/</link>
      <pubDate>Fri, 06 Feb 2026 16:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/3-ways-nvfp4-accelerates-ai-training-and-inference/</guid>
      <description>• 3 Ways NVFP4 Accelerates AI Training and Inference L T F R E The latest AI models continue to grow in size and complexity, demanding increasing amounts of compute performance for</description>
    </item>
    <item>
      <title>How to Build License-Compliant Synthetic Data Pipelines for AI Model Distillation</title>
      <link>https://cluster-site.onrender.com/posts/how-to-build-license-compliant-synthetic-data-pipelines-for-ai-model-distillation/</link>
      <pubDate>Thu, 05 Feb 2026 18:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/how-to-build-license-compliant-synthetic-data-pipelines-for-ai-model-distillation/</guid>
      <description>• Specialized AI models are built to perform specific tasks or solve particular problems. • But if you&amp;rsquo;ve ever tried to fine-tune or distill a domain-specific model, you&amp;rsquo;ve probabl</description>
    </item>
    <item>
      <title>How Painkiller RTX Uses Generative AI to Modernize Game Assets at Scale</title>
      <link>https://cluster-site.onrender.com/posts/how-painkiller-rtx-uses-generative-ai-to-modernize-game-assets-at-scale/</link>
      <pubDate>Thu, 05 Feb 2026 14:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/how-painkiller-rtx-uses-generative-ai-to-modernize-game-assets-at-scale/</guid>
      <description>• Painkiller RTX sets a new standard for how small teams can balance massive visual ambition with limited resources by integrating generative AI. • By upscaling thousands of legacy</description>
    </item>
    <item>
      <title>Build with Kimi K2.5 Multimodal VLM Using NVIDIA GPU-Accelerated Endpoints</title>
      <link>https://cluster-site.onrender.com/posts/build-with-kimi-k2.5-multimodal-vlm-using-nvidia-gpu-accelerated-endpoints/</link>
      <pubDate>Wed, 04 Feb 2026 19:46:33 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/build-with-kimi-k2.5-multimodal-vlm-using-nvidia-gpu-accelerated-endpoints/</guid>
      <description>• Kimi K2.5 is a multimodal vision‑language model trained with Megatron‑LM. • It contains 1 trillion parameters, 384 experts, a single dense layer, and 3.2% activation per token. •</description>
    </item>
    <item>
      <title>How to Build a Document Processing Pipeline for RAG with Nemotron</title>
      <link>https://cluster-site.onrender.com/posts/how-to-build-a-document-processing-pipeline-for-rag-with-nemotron/</link>
      <pubDate>Wed, 04 Feb 2026 16:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/how-to-build-a-document-processing-pipeline-for-rag-with-nemotron/</guid>
      <description>• What if your AI agent could instantly parse complex PDFs, extract nested tables, and &amp;lsquo;see&amp;rsquo; data within charts as easily as reading a text file? • With NVIDIA Nemotron RAG, you ca</description>
    </item>
    <item>
      <title>Accelerating Long-Context Model Training in JAX and XLA</title>
      <link>https://cluster-site.onrender.com/posts/accelerating-long-context-model-training-in-jax-and-xla/</link>
      <pubDate>Tue, 03 Feb 2026 17:30:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/accelerating-long-context-model-training-in-jax-and-xla/</guid>
      <description>• Large language models (LLMs) are rapidly expanding their context windows, with recent models supporting sequences of 128K tokens, 256K tokens, and beyond. • However, training the</description>
    </item>
    <item>
      <title>Optimizing Communication for Mixture-of-Experts Training with Hybrid Expert Parallel</title>
      <link>https://cluster-site.onrender.com/posts/optimizing-communication-for-mixture-of-experts-training-with-hybrid-expert-parallel/</link>
      <pubDate>Mon, 02 Feb 2026 18:43:08 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/optimizing-communication-for-mixture-of-experts-training-with-hybrid-expert-parallel/</guid>
      <description>• In LLM training, Expert Parallel (EP) communication for hyperscale mixture-of-experts (MoE) models is challenging. • EP communication is essentially all-to-all, but due to its dy</description>
    </item>
    <item>
      <title>Advancing GPU Programming with the CUDA Tile IR Backend for OpenAI Triton</title>
      <link>https://cluster-site.onrender.com/posts/advancing-gpu-programming-with-the-cuda-tile-ir-backend-for-openai-triton/</link>
      <pubDate>Fri, 30 Jan 2026 20:01:47 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/advancing-gpu-programming-with-the-cuda-tile-ir-backend-for-openai-triton/</guid>
      <description>• NVIDIA CUDA Tile is a GPU-based programming model that targets portability for NVIDIA Tensor Cores, unlocking peak GPU performance. • One of the great things about CUDA Tile is t</description>
    </item>
    <item>
      <title>Establishing a Scalable Sparse Ecosystem with the Universal Sparse Tensor</title>
      <link>https://cluster-site.onrender.com/posts/establishing-a-scalable-sparse-ecosystem-with-the-universal-sparse-tensor/</link>
      <pubDate>Fri, 30 Jan 2026 18:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/establishing-a-scalable-sparse-ecosystem-with-the-universal-sparse-tensor/</guid>
      <description>• UST decouples tensor sparsity from memory representation, enabling flexible storage formats. • Developers describe storage via a DSL, focusing solely on sparsity patterns. • Comp</description>
    </item>
    <item>
      <title>Practical Security Guidance for Sandboxing Agentic Workflows and Managing Execution Risk</title>
      <link>https://cluster-site.onrender.com/posts/practical-security-guidance-for-sandboxing-agentic-workflows-and-managing-execution-risk/</link>
      <pubDate>Fri, 30 Jan 2026 16:13:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/practical-security-guidance-for-sandboxing-agentic-workflows-and-managing-execution-risk/</guid>
      <description>• AI coding agents enable developers to work faster by streamlining tasks and driving automated, test-driven development. • However, they also introduce a significant, often overlo</description>
    </item>
    <item>
      <title>Ensuring Balanced GPU Allocation in Kubernetes Clusters with Time-Based Fairshare</title>
      <link>https://cluster-site.onrender.com/posts/ensuring-balanced-gpu-allocation-in-kubernetes-clusters-with-time-based-fairshare/</link>
      <pubDate>Wed, 28 Jan 2026 17:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/ensuring-balanced-gpu-allocation-in-kubernetes-clusters-with-time-based-fairshare/</guid>
      <description>• NVIDIA Run:ai v2.24 introduces time-based fairshare scheduling for Kubernetes GPU clusters. • Scheduler tracks historical GPU usage, adjusting queue scores to balance long-term r</description>
    </item>
    <item>
      <title>Speeding Up Variable-Length Training with Dynamic Context Parallelism and NVIDIA Megatron Core</title>
      <link>https://cluster-site.onrender.com/posts/speeding-up-variable-length-training-with-dynamic-context-parallelism-and-nvidia-megatron-core/</link>
      <pubDate>Wed, 28 Jan 2026 16:28:06 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/speeding-up-variable-length-training-with-dynamic-context-parallelism-and-nvidia-megatron-core/</guid>
      <description>• This post introduces Dynamic Context Parallelism (Dynamic-CP), a scheduling approach in NVIDIA Megatron Core used for LLM post-training or DiT pre-training. • It dynamically sele</description>
    </item>
    <item>
      <title>Updating Classifier Evasion for Vision Language Models</title>
      <link>https://cluster-site.onrender.com/posts/updating-classifier-evasion-for-vision-language-models/</link>
      <pubDate>Wed, 28 Jan 2026 16:19:12 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/updating-classifier-evasion-for-vision-language-models/</guid>
      <description>• Advances in AI architectures have unlocked multimodal functionality, enabling transformer models to process multiple forms of data in the same context. • For instance, vision lan</description>
    </item>
    <item>
      <title>LoRa® Technology Revolutionizing Drone Communications</title>
      <link>https://cluster-site.onrender.com/posts/lora-technology-revolutionizing-drone-communications/</link>
      <pubDate>Tue, 27 Jan 2026 23:40:49 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/lora-technology-revolutionizing-drone-communications/</guid>
      <description>• LoRa&amp;rsquo;s Chirp Spread Spectrum delivers &amp;gt;10 km range, surpassing conventional RC links for drones. • Low power consumption keeps drone batteries light, extending flight times and m</description>
    </item>
    <item>
      <title>CES 2026: Semtech Showcases Next-Generation IoT Innovation with LoRa® and Edge AI Solutions</title>
      <link>https://cluster-site.onrender.com/posts/ces-2026-semtech-showcases-next-generation-iot-innovation-with-lora-and-edge-ai-solutions/</link>
      <pubDate>Tue, 27 Jan 2026 21:51:56 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/ces-2026-semtech-showcases-next-generation-iot-innovation-with-lora-and-edge-ai-solutions/</guid>
      <description>• Semtech unveiled its next-gen LoRa® Gen 4 platform at CES 2026, boosting throughput to 2.6 Mbps. • Multi‑protocol connectivity via LoRa Plus™ supports LoRaWAN®, Amazon Sidewalk,</description>
    </item>
    <item>
      <title>Racing to 6G: Key Takeaways from the MOPA Alliance Webinar on Optical Standardization</title>
      <link>https://cluster-site.onrender.com/posts/racing-to-6g-key-takeaways-from-the-mopa-alliance-webinar-on-optical-standardization/</link>
      <pubDate>Wed, 14 Jan 2026 19:43:45 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/racing-to-6g-key-takeaways-from-the-mopa-alliance-webinar-on-optical-standardization/</guid>
      <description>• Racing to 6G: Key Takeaways from the MOPA Alliance Webinar on Optical Standardization Industry leaders from Ericsson, Nokia, and Semtech recently discussed why mobile optical sta</description>
    </item>
    <item>
      <title>LoRaWAN® Takes Center Stage at Enlit Europe: A Smart Metering Milestone</title>
      <link>https://cluster-site.onrender.com/posts/lorawan-takes-center-stage-at-enlit-europe-a-smart-metering-milestone/</link>
      <pubDate>Wed, 07 Jan 2026 14:30:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/lorawan-takes-center-stage-at-enlit-europe-a-smart-metering-milestone/</guid>
      <description>• LoRaWAN® Takes Center Stage at Enlit Europe: A Smart Metering Milestone If there was any doubt that LoRaWAN® has secured its place in smart metering, Enlit Europe 2025 put that t</description>
    </item>
    <item>
      <title>LoRa® Holiday Blog 2025: Wrapping Up a Year of Innovation and Collaboration</title>
      <link>https://cluster-site.onrender.com/posts/lora-holiday-blog-2025-wrapping-up-a-year-of-innovation-and-collaboration/</link>
      <pubDate>Tue, 16 Dec 2025 14:15:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/lora-holiday-blog-2025-wrapping-up-a-year-of-innovation-and-collaboration/</guid>
      <description>• LoRa® Holiday Blog 2025: Wrapping Up a Year of Innovation and Collaboration Expanding the Boundaries of Internet of Things (IoT) Connectivity - Because Innovation Doesn&amp;rsquo;t Take a</description>
    </item>
    <item>
      <title>One-Channel Hub: A Compact LoRaWAN® Access Point for Cost-Effective Internet of Things (IoT)</title>
      <link>https://cluster-site.onrender.com/posts/one-channel-hub-a-compact-lorawan-access-point-for-cost-effective-internet-of-things-iot/</link>
      <pubDate>Fri, 12 Dec 2025 01:16:33 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/one-channel-hub-a-compact-lorawan-access-point-for-cost-effective-internet-of-things-iot/</guid>
      <description>• One-Channel Hub: A Compact LoRaWAN® Access Point for Cost-Effective Internet of Things (IoT) Semtech&amp;rsquo;s One-Channel Hub reference design enables affordableLoRaWAN®sensor networks,</description>
    </item>
    <item>
      <title>Advanced Surge Protection for Industrial 24V DC Systems</title>
      <link>https://cluster-site.onrender.com/posts/advanced-surge-protection-for-industrial-24v-dc-systems/</link>
      <pubDate>Wed, 10 Dec 2025 01:45:28 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/advanced-surge-protection-for-industrial-24v-dc-systems/</guid>
      <description>• Advanced Surge Protection for Industrial 24V DC Systems Protecting the 24-Volt Direct Current (DC) System - The Backbone of Industry 4.0 Industrial automation is rapidly evolving</description>
    </item>
    <item>
      <title>LoRaWAN® Reaches Critical Mass: 125 Million Devices and Accelerating Growth</title>
      <link>https://cluster-site.onrender.com/posts/lorawan-reaches-critical-mass-125-million-devices-and-accelerating-growth/</link>
      <pubDate>Tue, 18 Nov 2025 17:50:05 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/lorawan-reaches-critical-mass-125-million-devices-and-accelerating-growth/</guid>
      <description>• LoRaWAN® Reaches Critical Mass: 125 Million Devices and Accelerating Growth The Internet of Things (IoT) landscape is witnessing a significant milestone. • LoRaWAN®, the low-powe</description>
    </item>
  </channel>
</rss>
