<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Reinforcement Learning on Tenu Tech Brief</title>
    <link>https://cluster-site.onrender.com/tags/reinforcement-learning/</link>
    <description>Recent content in Reinforcement Learning on Tenu Tech Brief</description>
    <generator>Hugo -- 0.146.0</generator>
    <language>en-us</language>
    <lastBuildDate>Tue, 24 Feb 2026 06:06:02 +0000</lastBuildDate>
    <atom:link href="https://cluster-site.onrender.com/tags/reinforcement-learning/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Carbon-aware decentralized dynamic task offloading in MIMO-MEC networks via multi-agent reinforcement learning</title>
      <link>https://cluster-site.onrender.com/posts/carbon-aware-decentralized-dynamic-task-offloading-in-mimo-mec-networks-via-multi-agent-reinforcement-learning/</link>
      <pubDate>Tue, 24 Feb 2026 05:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/carbon-aware-decentralized-dynamic-task-offloading-in-mimo-mec-networks-via-multi-agent-reinforcement-learning/</guid>
      <description>• CADDTO-PPO introduces carbon‑aware decentralized task offloading for MIMO‑MEC networks in. • Uses multi‑agent proximal policy optimization to jointly minimize carbon emissions,</description>
    </item>
    <item>
      <title>The unseen work of building reliable AI agents</title>
      <link>https://cluster-site.onrender.com/posts/the-unseen-work-of-building-reliable-ai-agents/</link>
      <pubDate>Wed, 07 Jan 2026 17:04:36 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/the-unseen-work-of-building-reliable-ai-agents/</guid>
      <description>• AI agents must master countless low-level tasks before handling high-level requests. • A simple &amp;lsquo;book vacation&amp;rsquo; command triggers hundreds of micro-interactions across legacy syst</description>
    </item>
    <item>
      <title>Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment</title>
      <link>https://cluster-site.onrender.com/posts/scaling-up-reinforcement-learning-for-traffic-smoothing-a-100-av-highway-deployment/</link>
      <pubDate>Tue, 25 Mar 2025 09:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/scaling-up-reinforcement-learning-for-traffic-smoothing-a-100-av-highway-deployment/</guid>
      <description>• Deployed 100 RL-controlled AVs on rush‑hour highway to smooth congestion and cut fuel use. • Trained agents in fast, data‑driven simulations to maximize energy efficiency and thr</description>
    </item>
  </channel>
</rss>
