INFORMS Open Forum

Is Deep Reinforcement Learning in Finance Overhyped - or Just Misapplied?

  • 1.  Is Deep Reinforcement Learning in Finance Overhyped - or Just Misapplied?

    Posted 2 hours ago
    Is Deep Reinforcement Learning in Finance Overhyped - or Just Misapplied?
     
    There's a growing body of work applying DRL to portfolio optimization, algorithmic trading, and even monetary policy. The results often look impressive on paper. But I keep coming back to a core tension: most DRL agents are trained and evaluated on the same distributional regime they were tested on. The moment the environment shifts - a crisis, a structural break, a new policy regime - the agent fails silently.
     
    The problem isn't DRL itself. It's that we keep applying stationary assumptions to non-stationary environments and then calling it "adaptive AI."
     
    The fix, I'd argue, isn't a better neural network. It's regime-awareness built into the architecture from the start - HMMs for latent state identification, LSTM for temporal dependencies, and entropy-regularized policies (like SAC) that don't overcommit to a single action distribution.
     
    Curious where this community stands: Is DRL in finance genuinely advancing the field, or are we producing results that won't survive contact with real markets?
     



    ------------------------------
    Mujahid Merchant
    Independent Researcher | Risk Pricing, Securitization, Regime Aware AI
    Springer and IEEE Published Author | Reviewer | PC Member, ATLC 2025 | INFORMS Member
    Email: info@mujahidmerchant.com
    Website: https://mujahidmerchant.comGoogle Scholar: https://scholar.google.com/citations?hl=en&user=yeNi_usAAAAJ
    LinkedIn: https://www.linkedin.com/in/mujahid-merchant-338383104/
    Medium: https://medium.com/@merchantmujahid1
    ORCID: 0009-0007-3378-180X
    ------------------------------