INFORMS Open Forum

Back to discussions

Expand all | Collapse all

Is Deep Reinforcement Learning in Finance Overhyped - or Just Misapplied?

1. Is Deep Reinforcement Learning in Finance Overhyped - or Just Misapplied?

Like
Mujahid Merchant
Posted 03-17-2026 19:33
Is Deep Reinforcement Learning in Finance Overhyped - or Just Misapplied?

There's a growing body of work applying DRL to portfolio optimization, algorithmic trading, and even monetary policy. The results often look impressive on paper. But I keep coming back to a core tension: most DRL agents are trained and evaluated on the same distributional regime they were tested on. The moment the environment shifts - a crisis, a structural break, a new policy regime - the agent fails silently.

The problem isn't DRL itself. It's that we keep applying stationary assumptions to non-stationary environments and then calling it "adaptive AI."

The fix, I'd argue, isn't a better neural network. It's regime-awareness built into the architecture from the start - HMMs for latent state identification, LSTM for temporal dependencies, and entropy-regularized policies (like SAC) that don't overcommit to a single action distribution.

Curious where this community stands: Is DRL in finance genuinely advancing the field, or are we producing results that won't survive contact with real markets?

------------------------------
Mujahid Merchant
Independent Researcher | Risk Pricing, Securitization, Regime Aware AI
Springer and IEEE Published Author | Reviewer | PC Member, ATLC 2025 | INFORMS Member
Email: info@mujahidmerchant.com
Website: https://mujahidmerchant.comGoogle Scholar: https://scholar.google.com/citations?hl=en&user=yeNi_usAAAAJ
LinkedIn: https://www.linkedin.com/in/mujahid-merchant-338383104/
Medium: https://medium.com/@merchantmujahid1
ORCID: 0009-0007-3378-180X
------------------------------

INFORMS Open Forum

Is Deep Reinforcement Learning in Finance Overhyped - or Just Misapplied?

1. Is Deep Reinforcement Learning in Finance Overhyped - or Just Misapplied?

Related Content

Seeking collaborators on regime aware AI for risk pricing and decision systems

AI Isn't Failing-Our Information Readiness Is

Ignorance Isn't Bliss-It's Risk: A Human-Centered Path to Cyber Preparedness

IFORS Global Webinar: OR Developments in Africa

IFORS Fellows - Call for Nominations