Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments

Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environmen

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 272 words
Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning

Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning View PDF HTML (e

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 255 words
Creating a digital poet

Creating a digital poet

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Creating a digital poet View PDF HTML (experimental)Abstract:Can a machine write good poetry? • Any po

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 244 words
EdgeNav-QE: QLoRA Quantization and Dynamic Early Exit for LAM-based Navigation on Edge Devices

EdgeNav-QE: QLoRA Quantization and Dynamic Early Exit for LAM-based Navigation on Edge Devices

• Computer Science > Robotics [Submitted on 12 Jan 2026] Title:EdgeNav-QE: QLoRA Quantization and Dynamic Early Exit for LAM-based Navigation on Edge Devices View PDF HTML (experim

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 267 words
EnterpriseGym Corecraft: Training Generalizable Agents on High-Fidelity RL Environments

EnterpriseGym Corecraft: Training Generalizable Agents on High-Fidelity RL Environments

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:EnterpriseGym Corecraft: Training Generalizable Agents on High-Fidelity RL Environments View PDF HTML

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 236 words
Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinical Intelligence Layer on the 2025 Endocrinology Board-Style Examination

Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinical Intelligence Layer on the 2025 Endocrinology Board-Style Examination

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinical Intelligence Layer on the 2025

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 221 words
Framework of Thoughts: A Foundation Framework for Dynamic and Optimized Reasoning based on Chains, Trees, and Graphs

Framework of Thoughts: A Foundation Framework for Dynamic and Optimized Reasoning based on Chains, Trees, and Graphs

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Framework of Thoughts: A Foundation Framework for Dynamic and Optimized Reasoning based on Chains, Tre

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 241 words
GPSBench: Do Large Language Models Understand GPS Coordinates?

GPSBench: Do Large Language Models Understand GPS Coordinates?

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:GPSBench: Do Large Language Models Understand GPS Coordinates? • View PDF HTML (experimental)Abstract:

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 249 words
How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment

How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:How Uncertain Is the Grade? • A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment Vi

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 249 words
Improving Interactive In-Context Learning from Natural Language Feedback

Improving Interactive In-Context Learning from Natural Language Feedback

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:Improving Interactive In-Context Learning from Natural Language Feedback View PDF HTML (experimental)A

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 224 words
Learning Personalized Agents from Human Feedback

Learning Personalized Agents from Human Feedback

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Learning Personalized Agents from Human Feedback View PDF HTML (experimental)Abstract:Modern AI agents

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 238 words
Leveraging Large Language Models for Causal Discovery: a Constraint-based, Argumentation-driven Approach

Leveraging Large Language Models for Causal Discovery: a Constraint-based, Argumentation-driven Approach

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Leveraging Large Language Models for Causal Discovery: a Constraint-based, Argumentation-driven Approa

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 251 words
Multi-agent cooperation through in-context co-player inference

Multi-agent cooperation through in-context co-player inference

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Multi-agent cooperation through in-context co-player inference View PDF HTML (experimental)Abstract:Ac

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 268 words
Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection

Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection View PDF HTML

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 271 words
Revolutionizing Long-Term Memory in AI: New Horizons with High-Capacity and High-Speed Storage

Revolutionizing Long-Term Memory in AI: New Horizons with High-Capacity and High-Speed Storage

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Revolutionizing Long-Term Memory in AI: New Horizons with High-Capacity and High-Speed Storage View PD

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 269 words
Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 226 words
Towards a Science of AI Agent Reliability

Towards a Science of AI Agent Reliability

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Towards a Science of AI Agent Reliability View PDF HTML (experimental)Abstract:AI agents are increasin

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 224 words
Towards Efficient Constraint Handling in Neural Solvers for Routing Problems

Towards Efficient Constraint Handling in Neural Solvers for Routing Problems

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:Towards Efficient Constraint Handling in Neural Solvers for Routing Problems View PDF HTML (experiment

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 267 words
Verifiable Semantics for Agent-to-Agent Communication

Verifiable Semantics for Agent-to-Agent Communication

• Computer Science > Artificial Intelligence [Submitted on 18 Feb 2026] Title:Verifiable Semantics for Agent-to-Agent Communication View PDF HTML (experimental)Abstract:Multiagent

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 219 words
What Persona Are We Missing? Identifying Unknown Relevant Personas for Faithful User Simulation

What Persona Are We Missing? Identifying Unknown Relevant Personas for Faithful User Simulation

• Computer Science > Human-Computer Interaction [Submitted on 3 Jan 2026] Title:What Persona Are We Missing? • Identifying Unknown Relevant Personas for Faithful User Simulation Vi

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 235 words
Identifying spatial single-cell-level interactions with graph transformer

Identifying spatial single-cell-level interactions with graph transformer

• Subjects Cellular signalling networks Multicellular systems Identifying cell-cell interactions from imaging-based spatial transcriptomics suffers from limited gene panels. • A ne

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 225 words

Attributing and situating knowledge cannot be left to language models

• Subjects Authorship Ethics Meticulous citation is a marker of well-researched, serious scholarship. • Citations do a lot more than attributing credit; they situate claims within

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 217 words
A federated graph learning method to realize multi-party collaboration for molecular discovery

A federated graph learning method to realize multi-party collaboration for molecular discovery

• Abstract Optimizing molecular resource utilization for molecular discovery requires collaborative efforts across research institutions and organizations to accelerate progress. •

Research · February 19, 2026 (updated February 19, 2026) · 2 min · 234 words

What matters in building vision-language-action models for generalist robots

• Nature Machine Intelligence, Published online: 11 February 2026; doi:10.1038/s42256-025-01168-7 Vision-language-action models recently emerged as a tool for robotics. • Here Li a

Research · February 19, 2026 (updated February 19, 2026) · 1 min · 70 words

When large language models are reliable for judging empathic communication

• Nature Machine Intelligence, Published online: 11 February 2026; doi:10.1038/s42256-025-01169-6 Kumar et al. • show that large language models (LLMs) nearly match expert reliabil

Research · February 19, 2026 (updated February 19, 2026) · 1 min · 99 words

Reusability Report: Evaluating the performance of a meta-learning foundation model on predicting the antibacterial activity of natural products

• Nature Machine Intelligence, Published online: 12 February 2026; doi:10.1038/s42256-026-01187-y This Reusability Report tests the ability of a foundation model, ActFound, to pred

Research · February 19, 2026 (updated February 19, 2026) · 1 min · 98 words

A flaw in using pretrained protein language models in protein-protein interaction inference models

• Nature Machine Intelligence, Published online: 13 February 2026; doi:10.1038/s42256-025-01176-7 The usage of pretrained protein language models (pLMs) is rapidly growing. • Howev

Research · February 19, 2026 (updated February 19, 2026) · 1 min · 98 words

Parallel hierarchical encoding of linguistic representations in the human auditory cortex and recurrent automatic speech recognition systems

• Nature Machine Intelligence, Published online: 17 February 2026; doi:10.1038/s42256-026-01185-0 Keshishian, Mischler et al. • report that a recurrent automatic speech recognition

Research · February 19, 2026 (updated February 19, 2026) · 1 min · 104 words

Grasshopper-inspired wing design improves gliding performance

• Science Robotics, Volume 11, Issue 111, February 2026.

Research · February 18, 2026 (updated February 19, 2026) · 1 min · 21 words

Low-voltage and high-output dielectric elastomer actuators for untethered soft machines working at 200 volts

• Science Robotics, Volume 11, Issue 111, February 2026.

Research · February 18, 2026 (updated February 19, 2026) · 1 min · 21 words

The codevelopment of soft robotics and assistive technology

• Science Robotics, Volume 11, Issue 111, February 2026.

Research · February 18, 2026 (updated February 19, 2026) · 1 min · 21 words
Five Berkeley Lab Scientists Receive DOE Early Career Research Awards

Five Berkeley Lab Scientists Receive DOE Early Career Research Awards

• Five Berkeley Lab Scientists Receive DOE Early Career Research Awards Article Emerging Ideas Five scientists from Lawrence Berkeley National Laboratory (Berkeley Lab) have receiv

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 269 words
AgriWorld:A World Tools Protocol Framework for Verifiable Agricultural Reasoning with Code-Executing LLM Agents

AgriWorld:A World Tools Protocol Framework for Verifiable Agricultural Reasoning with Code-Executing LLM Agents

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:AgriWorld:A World Tools Protocol Framework for Verifiable Agricultural Reasoning with Code-Executing L

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 241 words
Attention-gated U-Net model for semantic segmentation of brain tumors and feature extraction for survival prognosis

Attention-gated U-Net model for semantic segmentation of brain tumors and feature extraction for survival prognosis

• Computer Science > Artificial Intelligence [Submitted on 14 Feb 2026] Title:Attention-gated U-Net model for semantic segmentation of brain tumors and feature extraction for survi

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 262 words
Common Belief Revisited

Common Belief Revisited

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:Common Belief Revisited View PDF HTML (experimental)Abstract:Contrary to common belief, common belief

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 235 words
da Costa and Tarski meet Goguen and Carnap: a novel approach for ontological heterogeneity based on consequence systems

da Costa and Tarski meet Goguen and Carnap: a novel approach for ontological heterogeneity based on consequence systems

• Computer Science > Artificial Intelligence [Submitted on 16 Feb 2026] Title:da Costa and Tarski meet Goguen and Carnap: a novel approach for ontological heterogeneity based on co

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 272 words
EAA: Automating materials characterization with vision language model agents

EAA: Automating materials characterization with vision language model agents

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:EAA: Automating materials characterization with vision language model agents View PDF HTML (experiment

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 230 words
Enhancing Diversity and Feasibility: Joint Population Synthesis from Multi-source Data Using Generative Models

Enhancing Diversity and Feasibility: Joint Population Synthesis from Multi-source Data Using Generative Models

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:Enhancing Diversity and Feasibility: Joint Population Synthesis from Multi-source Data Using Generativ

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 244 words
GenAI-LA: Generative AI and Learning Analytics Workshop (LAK 2026), April 27--May 1, 2026, Bergen, Norway

GenAI-LA: Generative AI and Learning Analytics Workshop (LAK 2026), April 27--May 1, 2026, Bergen, Norway

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:GenAI-LA: Generative AI and Learning Analytics Workshop (LAK 2026), April 27–May 1, 2026, Bergen, Nor

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 256 words
Improving LLM Reliability through Hybrid Abstention and Adaptive Detection

Improving LLM Reliability through Hybrid Abstention and Adaptive Detection

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:Improving LLM Reliability through Hybrid Abstention and Adaptive Detection View PDF HTML (experimental

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 262 words
Mind the (DH) Gap! A Contrast in Risky Choices Between Reasoning and Conversational LLMs

Mind the (DH) Gap! A Contrast in Risky Choices Between Reasoning and Conversational LLMs

• Computer Science > Artificial Intelligence [Submitted on 16 Feb 2026] Title:Mind the (DH) Gap! • A Contrast in Risky Choices Between Reasoning and Conversational LLMs View PDF HT

Research · February 18, 2026 (updated February 19, 2026) · 1 min · 206 words
Panini: Continual Learning in Token Space via Structured Memory

Panini: Continual Learning in Token Space via Structured Memory

• Computer Science > Artificial Intelligence [Submitted on 16 Feb 2026] Title:Panini: Continual Learning in Token Space via Structured Memory View PDF HTML (experimental)Abstract:L

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 296 words
Predicting Invoice Dilution in Supply Chain Finance with Leakage Free Two Stage XGBoost, KAN (Kolmogorov Arnold Networks), and Ensemble Models

Predicting Invoice Dilution in Supply Chain Finance with Leakage Free Two Stage XGBoost, KAN (Kolmogorov Arnold Networks), and Ensemble Models

• Computer Science > Artificial Intelligence [Submitted on 16 Feb 2026] Title:Predicting Invoice Dilution in Supply Chain Finance with Leakage Free Two Stage XGBoost, KAN (Kolmogor

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 249 words
Protecting Language Models Against Unauthorized Distillation through Trace Rewriting

Protecting Language Models Against Unauthorized Distillation through Trace Rewriting

• Computer Science > Artificial Intelligence [Submitted on 16 Feb 2026] Title:Protecting Language Models Against Unauthorized Distillation through Trace Rewriting View PDF HTML (ex

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 235 words
Quantifying construct validity in large language model evaluations

Quantifying construct validity in large language model evaluations

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:Quantifying construct validity in large language model evaluations View PDF HTML (experimental)Abstrac

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 222 words
ResearchGym: Evaluating Language Model Agents on Real-World AI Research

ResearchGym: Evaluating Language Model Agents on Real-World AI Research

• Computer Science > Artificial Intelligence [Submitted on 16 Feb 2026] Title:ResearchGym: Evaluating Language Model Agents on Real-World AI Research View PDF HTML (experimental)Ab

Research · February 18, 2026 (updated February 19, 2026) · 1 min · 209 words
Secure and Energy-Efficient Wireless Agentic AI Networks

Secure and Energy-Efficient Wireless Agentic AI Networks

• Computer Science > Artificial Intelligence [Submitted on 16 Feb 2026] Title:Secure and Energy-Efficient Wireless Agentic AI Networks View PDF HTML (experimental)Abstract:In this

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 297 words
When Remembering and Planning are Worth it: Navigating under Change

When Remembering and Planning are Worth it: Navigating under Change

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:When Remembering and Planning are Worth it: Navigating under Change View PDF HTML (experimental)Abstra

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 293 words
World-Model-Augmented Web Agents with Action Correction

World-Model-Augmented Web Agents with Action Correction

• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026] Title:World-Model-Augmented Web Agents with Action Correction View PDF HTML (experimental)Abstract:Web agent

Research · February 18, 2026 (updated February 19, 2026) · 2 min · 276 words

Bioinspired adaptive pupil reflex based on liquid-metal shape-shifters for machine vision

• Science Robotics, Volume 11, Issue 111, February 2026.

Research · February 18, 2026 (updated February 19, 2026) · 1 min · 21 words

Scalable robot collective resilience by sharing resources

• Science Robotics, Volume 11, Issue 111, February 2026.

Research · February 18, 2026 (updated February 19, 2026) · 1 min · 21 words

Science on the Double: How an AI-Powered 'Digital Twin' Accelerates Chemistry and Materials Discoveries

• The post Science on the Double: How an AI-Powered ‘Digital Twin’ Accelerates Chemistry and Materials Discoveries appeared first on Berkeley Lab News Center . • The post What Is a

Research · February 17, 2026 (updated February 19, 2026) · 1 min · 203 words
What Is a Digital Twin?

What Is a Digital Twin?

• What Is a Digital Twin? • Video AI Digital twins are transforming how scientists study and improve complex systems - reducing the time between discovery and delivery. • Learn wha

Research · February 17, 2026 (updated February 18, 2026) · 1 min · 205 words
Authorization of prognostic AI medical devices

Authorization of prognostic AI medical devices

• Subjects Health policy Medical humanities Less than 2% of artificial intelligence devices authorized by the US Food and Drug Agency are prognostic, with prediction horizons rangi

Research · February 17, 2026 (updated February 19, 2026) · 2 min · 248 words
A Geometric Taxonomy of Hallucinations in LLMs

A Geometric Taxonomy of Hallucinations in LLMs

• Computer Science > Artificial Intelligence [Submitted on 26 Jan 2026] Title:A Geometric Taxonomy of Hallucinations in LLMs View PDF HTML (experimental)Abstract:The term ‘hallucin

Research · February 17, 2026 (updated February 19, 2026) · 1 min · 210 words
Agentic AI for Commercial Insurance Underwriting with Adversarial Self-Critique

Agentic AI for Commercial Insurance Underwriting with Adversarial Self-Critique

• Computer Science > Artificial Intelligence [Submitted on 21 Jan 2026] Title:Agentic AI for Commercial Insurance Underwriting with Adversarial Self-Critique View PDF HTML (experim

Research · February 17, 2026 (updated February 19, 2026) · 2 min · 218 words
AST-PAC: AST-guided Membership Inference for Code

AST-PAC: AST-guided Membership Inference for Code

• Computer Science > Artificial Intelligence [Submitted on 30 Jan 2026] Title:AST-PAC: AST-guided Membership Inference for Code View PDF HTML (experimental)Abstract:Code Large Lang

Research · February 17, 2026 (updated February 19, 2026) · 2 min · 216 words
BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors

BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors

• Computer Science > Artificial Intelligence [Submitted on 22 Jan 2026] Title:BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors View PDF HTML (experimental)Abstract:Large

Research · February 17, 2026 (updated February 19, 2026) · 2 min · 243 words
DPBench: Large Language Models Struggle with Simultaneous Coordination

DPBench: Large Language Models Struggle with Simultaneous Coordination

• Computer Science > Artificial Intelligence [Submitted on 2 Feb 2026] Title:DPBench: Large Language Models Struggle with Simultaneous Coordination View PDF HTML (experimental)Abst

Research · February 17, 2026 (updated February 19, 2026) · 2 min · 254 words
General learned delegation by clones

General learned delegation by clones

• Computer Science > Artificial Intelligence [Submitted on 3 Feb 2026] Title:General learned delegation by clones View PDF HTML (experimental)Abstract:Frontier language models impr

Research · February 17, 2026 (updated February 19, 2026) · 2 min · 286 words