Don't Merge Departments, Prune Them
AI researchers proved that merging specialists always destroys precision. The same maths applies to your org chart.
Donât Merge Departments, Prune Them
Your organisation already is a neural network. You just havenât noticed.
Familiar Ground
Every company has specialists. Engineering builds. Marketing tells the story. Legal says no. Finance keeps score. These teams exist because specialisation works. Nobody debates that.
The debate starts when the budget shrinks. The CEO looks at the org chart and sees redundancy. Two teams doing âsimilarâ things. The consulting deck arrives: âConsolidate for efficiency.â The merger is announced. Marketing and Communications become âMarComms.â DevOps and Platform Engineering become âInfrastructure.â The headcount drops. The quarterly report looks better.
You have seen this. You may have survived it.
Counter-Signal
In October 2025, researchers at Cerebras published a paper called REAP (Router-weighted Expert Activation Pruning). They were trying to make large AI models smaller. These models, called Mixture-of-Experts, work exactly like your organisation: multiple specialist sub-networks, each trained to handle one type of problem, with a learned ârouterâ that sends each input to the right specialist.
The researchers tested two compression strategies. Merging (combine two specialists into one) and pruning (remove the least-used specialists entirely).
The result was unambiguous: pruning always won. On generative tasks, from 20-billion to 1-trillion parameters, across every benchmark.
And the reason is what matters.
âď¸ The Fusion
Here is where the maths of neural networks collides with the politics of organisational restructuring.
When you merge two AI experts into one, you destroy the routerâs ability to distinguish between them. The router was trained to send Token Type A to Expert 1 and Token Type B to Expert 2 for different reasons. Merge them into Expert 12, and the router can no longer recover the distinction. The researchers named this the irreducible merging error. Irreducible, because no amount of optimisation can fix it. The information loss is in the routing, not the experts.
When you merge two departments, the same thing happens. The people who used to know âask Team A for brand strategy, ask Team B for demand generationâ now face a merged âMarCommsâ and cannot recover the distinction. Some never learn the new routing. Others stop asking entirely. The institutional knowledge of who does what, knowledge that lived in the minds of the people who directed work to them, not in the teams themselves, is gone.
This is not a metaphor. It is the same mathematical structure. Specialised units with a learned routing function. Merge the units, break the routing. The error is irreducible because the routing intelligence was distributed across the organisation, and it was destroyed the moment the merger was announced.
Pruning works because it removes experts the model rarely activates. The remaining experts keep their routing precision intact. In organisational terms: eliminate the team nobody routes to. The teams that remain still know exactly who they are and what they do.
There is a deeper surprise. When the REAP researchers measured which experts had the highest âactivation massâ (the most routing traffic), they expected the newest, most recently added layers to dominate. They were wrong. Early layers, the foundational capabilities, carried the most activation mass. In organisations, this translates directly: your HR, training, and culture functions (the âfoundational layersâ that restructurings cut first) may be the highest-activation components you have.
The New Pattern
| Merging Mindset | Pruning Mindset |
|---|---|
| Combine similar teams for efficiency | Remove unused teams; keep routing intact |
| Cut foundational functions first (âoverheadâ) | Foundational layers carry the highest activation, cut last |
| Restructure based on industry benchmarks | Observe YOUR routing patterns before restructuring |
| Faster is better | Observe before optimising |
| Reduce headcount | Reduce routing confusion |
The REAP researchers also discovered something elegant. Instead of permanently removing experts (layoffs), you can cache them: keep all specialists available, but pre-load the frequently-used ones into fast memory. The organisational equivalent is moving low-activation teams to advisory, on-call, or part-time roles. Full knowledge preserved. Zero routing breakage. Lower cost.
This is not âkeep everyone on payroll.â It is a precise intervention. You measure activation (who gets routed to, how often, for what). You keep the high-activation teams in fast access. You move the low-activation teams to slower access. Nobodyâs knowledge is destroyed.
The Open Question
Your next restructuring is probably already on someoneâs calendar. Before you merge two teams, consider: do you know the routing intelligence that will be destroyed?
And if you cannot map it, should you be merging at all?
This fusion emerged from a STEAL on 0xSeroâs REAP MoE compression research (arXiv:2510.13999). The concept that grounded it lives in Organisation as Mixture-of-Experts.