<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Reinforcementlearning on Tenu Tech Brief</title>
    <link>https://cluster-site.onrender.com/tags/reinforcementlearning/</link>
    <description>Recent content in Reinforcementlearning on Tenu Tech Brief</description>
    <generator>Hugo -- 0.146.0</generator>
    <language>en-us</language>
    <lastBuildDate>Tue, 24 Feb 2026 06:06:03 +0000</lastBuildDate>
    <atom:link href="https://cluster-site.onrender.com/tags/reinforcementlearning/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Deep Reinforcement Learning for Optimizing Energy Consumption in Smart Grid Systems</title>
      <link>https://cluster-site.onrender.com/posts/deep-reinforcement-learning-for-optimizing-energy-consumption-in-smart-grid-systems/</link>
      <pubDate>Tue, 24 Feb 2026 05:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/deep-reinforcement-learning-for-optimizing-energy-consumption-in-smart-grid-systems/</guid>
      <description>• PINNs replace costly smart grid simulators, reducing sample inefficiency in RL-based OPF solutions. • RL policy learning converges 50% faster using PINN surrogates versus traditi</description>
    </item>
    <item>
      <title>FineRef: Fine-Grained Error Reflection and Correction for Long-Form Generation with Citations</title>
      <link>https://cluster-site.onrender.com/posts/fineref-fine-grained-error-reflection-and-correction-for-long-form-generation-with-citations/</link>
      <pubDate>Tue, 24 Feb 2026 05:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/fineref-fine-grained-error-reflection-and-correction-for-long-form-generation-with-citations/</guid>
      <description>• FineRef introduces fine-grained error reflection for citation mismatch and irrelevance in long‑form LLM generation. • Two‑stage training: supervised fine‑tuning with attempt‑refl</description>
    </item>
    <item>
      <title>Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning</title>
      <link>https://cluster-site.onrender.com/posts/scaling-the-scaling-logic-agentic-meta-synthesis-of-logic-reasoning/</link>
      <pubDate>Tue, 17 Feb 2026 05:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/scaling-the-scaling-logic-agentic-meta-synthesis-of-logic-reasoning/</guid>
      <description>• RLVR scaling limited by scarce verifiable training signals, especially for complex logic tasks. • Logical reasoning offers formal constraints and programmatically checkable answe</description>
    </item>
    <item>
      <title>UniRG: Scaling medical imaging report generation with multimodal reinforcement learning</title>
      <link>https://cluster-site.onrender.com/posts/unirg-scaling-medical-imaging-report-generation-with-multimodal-reinforcement-learning/</link>
      <pubDate>Tue, 27 Jan 2026 17:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/unirg-scaling-medical-imaging-report-generation-with-multimodal-reinforcement-learning/</guid>
      <description>• AI-driven radiology report generation boosts provider efficiency and reduces reporting burden. • Traditional models overfit to institutional phrasing, limiting generalization to</description>
    </item>
    <item>
      <title>Multimodal reinforcement learning with agentic verifier for AI agents</title>
      <link>https://cluster-site.onrender.com/posts/multimodal-reinforcement-learning-with-agentic-verifier-for-ai-agents/</link>
      <pubDate>Tue, 20 Jan 2026 17:00:00 +0000</pubDate>
      <guid>https://cluster-site.onrender.com/posts/multimodal-reinforcement-learning-with-agentic-verifier-for-ai-agents/</guid>
      <description>• Argos trains multimodal RL agents to reward answers grounded in visual and temporal evidence, not just plausibility. • Automated verification selects specialized tools per answer</description>
    </item>
  </channel>
</rss>
