Reinforcement learning for IBM Z mainframe
A Gymnasium-compatible RL environment that finds MIPS optimisations in COBOL batch programs — using real z/Architecture execution, not estimates.
IBM Z manages $3 trillion in daily commerce. Enterprises running it are billed by the MIPS — every inefficient loop, every redundant compute, every unoptimised sort adds directly to the monthly bill. The engineers who know how to fix it are retiring and not being replaced. Refinery closes the loop: an RL agent that runs your COBOL against a real z/Architecture emulator, reads real SMF telemetry, and discovers optimisations no static analysis tool can find.