My focus is bringing collective intelligence to LLM systems.
My current work connects semantic routing with the Workload-Router-Pool architecture: using workload signals, policy loops, and coordinated serving pools to optimize quality, cost, latency, privacy, and safety together.
- It is still a good thinking game.
- Designing system-level intelligence above LLMs.
- Routing is shifting from traffic control to behavior control.