WHITEPAPER

How an Energy Trading Platform Hit Sub-100ms Latency on AWS

P95 latency that used to drift past 100ms during peak now holds the target, and the trading desk stopped routing around the platform. Here is the network-to-serialization program that got them there.

Download WhitePaper

How a Healthcare Org Made Its Data AI-Ready Without Ripping and Replacing

Your Trading Platform's Latency Drifts Above 100ms During Peak.

The wrong status quo: chase a single culprit. Optimize one service while ignoring serialization, or tune the network while running on the wrong instance type, and you move the number a little and miss the target.
The better approach: fix the stack systematically across network, compute, and serialization in order, then verify every change in an environment that actually resembles production.

Download White Paper

The Numbers That Make This A Board-Level Conversation

Up to $100M a year

estimated value of a single millisecond of advantage to a major trading firm

72% reduction

trading-path P95 latency after the program, hitting and holding sub-100ms

31ms saved by Protobuf

two of the highest-leverage fixes, with transport-level round trips around 29 microseconds proven possible on AWS

The Three Moves That Made It Work

Stop single-cause thinking

A trade request crosses the network between instances, hits a CPU that may not be scheduled cleanly, gets serialized and deserialized.

Test in an environment that matches production

A load test on a quiet afternoon against half the real traffic, on differently configured instances, tells you almost nothing about peak behavior.

Know that AWS is not the ceiling

Plenty of teams assume the public cloud simply cannot do low latency. The evidence says otherwise.

The 10-Week Program That Got Them There

Weeks 1-3 - Network

Start where the biggest, cheapest wins usually are. Deploy with AZ awareness so chatty services are not paying cross-AZ round trips for every hop.

Weeks 4-7 - Compute

Right-size the instances with the vCPU and memory the workload actually needs, then make the machine deterministic.

Weeks 8-10 - Serialization

JSON is rarely the right format for a latency-critical path. It is verbose, slow to parse, and you pay for it on every message.

Weeks 11+ - Continuous load testing and regression defense

Stand up a load-test environment that mirrors production and runs continuously, so every future change is measured against the real latency profile before it ships.

P95 Hits And Holds The Target.

The reward is not just a better number on a dashboard. It is a platform the desk trusts enough to keep their flow on, which is the only verdict that counts.

Download White Paper

Frequently Asked Questions

Are these wins durable?

Yes, when the load-test environment runs continuously. Regressions get caught before they reach production. Without that, latency drifts back over time as changes accumulate.

Will this work for non-trading low-latency workloads?

Yes. The framework applies to any latency-sensitive workload: auction bidding, real-time game backends, ad serving. The layers are the same even when the business is not trading.

Where do the biggest milliseconds usually hide?

Network and serialization. AZ-aware routing recovered 18ms and the JSON-to-Protobuf switch recovered 31ms in this engagement. We measure where your milliseconds go first, then fix the biggest contributors in order rather than guessing.

What about UDP-based protocols?

Some trading paths benefit from UDP-based protocols like ZeroMQ or a custom binary transport. We have shipped those where the path justified it. They are a tool for the hottest flows, not a default for everything.

Can AWS really compete with on-prem or colo for latency?

For most platforms, yes. AWS documents up to 85% tail-latency reduction at p99.9 with network-optimized instances, and published benchmarks show transport-level round trips around 29 microseconds on AWS. The gap between a tuned and untuned setup is far larger than the gap between cloud and colo for the workloads most platforms actually run.