Timing Correlation Risks in Mixnet Delay Strategies

prem · March 5, 2026, 10:21am

Discussion: Timing Correlation Risks in Mixnet Delay Strategies

This summary captures a technical discussion regarding the implementation of mixing delays and their interaction with auxiliary mechanisms (such as RLN based DoS protection) for libp2p based mixnets. While the discussion originated around the Exponential Delay Strategy and RLN (Rate Limiting Nullifier) used as DoS protection, the insights are relevant for any pluggable delay or security mechanism within the mixnet.

1. The “Timing Floor” (Minimum Delay Interference)

In the current specification and implementation, DoS protection proof generation is executed in parallel with the mixing delay timer by the intermediate nodes (this is done as an optimization). In such cases, the effective time a packet is held/delayed by an intermediate mix node is dictated by:

$$\text{Effective Delay} = \max(\text{Sampled Delay}, \text{RLN Proof Generation Time})$$

The Security Risk:

If the sampled delay is smaller than the time taken for the RLN proof generation to complete, the delay of each packet processing by the intermediate node would always become fixed leading to predictability. An external observer can then correlate incoming and outgoing traffic which completely defeats the purpose of having random delays.

2. The “Timing Ceiling” (Vulnerabilities in Truncation)

For practical reasons the delays are truncated (nim-libp2p PR #2120) to avoid “long tail” of an exponential distribution that could lead to extreme latencies (e.g., > 60s). For the portion of packets that fall into the statistical tail (e.g., the top 5%), the delay is no longer random. These packets are all delayed by the exact same value. This again increases the chance of possible timing correlation by an external observer.

3. Proposed Mitigation

To eliminate predictable spikes at both the floor (processing) and the ceiling (practical limit), the consensus is to follow the approach below.

Handling the “Long Tail”: If a sampled delay exceeds the practical threshold (e.g., 5 seconds), the node should resample until it obtains a value within the valid range. This redistributes the probability weight across the rest of the curve, maintaining a smooth, unpredictable distribution.
Handling varying RLN proof generation times across hardware: Set a minimum value of delay based on average proof generation time+buffer (e.g ~100msecs based on logos-messaging RLN data +buffer).

Few open points:
1. Determine how much buffer time is appropriate.
2. fine-tune proof generation time to be used

Key Requirements for Delay Implementations

Stochastic Independence: The exit time should remain as close to the intended probability distribution as possible.
Avoid Truncation: Any logic that forces diverse samples into a single fixed value (max) introduces a timing correlation risk. Rely on resampling rather.
Distribution Integrity: Implementers must ensure that auxiliary mechanisms do not create artificial “spikes” that facilitate traffic analysis.

Contributors: @akshaya @bkomuves , @haelius , @mghazwi

prem · March 30, 2026, 12:38pm

Modified implementation a litte bit, rather than resampling, the bounds itself are changed to be within floor and ceiling.

github.com/vacp2p/nim-libp2p

fix: mix delay correlation risks

master ← codex/fix-mix-delay-correlation

opened 02:12PM - 27 Mar 26 UTC

chaitanyaprem

+142 -31

## Summary Implements the mix-delay timing-correlation mitigations discussed [h…ere](https://forum.research.logos.co/t/timing-correlation-risks-in-mixnet-delay-strategies/677). The change removes fixed-value artifacts at the practical upper bound and at the low end of spam-protected delays, while keeping sampling runtime bounded. ## Affected Areas - [x] Protocol Logic - exponential mix delays now sample directly from the bounded practical window instead of truncating to a fixed ceiling or relying on unbounded rejection retries. - [x] Protocol Logic - when per-hop spam protection is enabled and an `ExponentialDelayStrategy` leaves `minimumDelayMs = 0`, `MixProtocol` applies a default `100 ms` floor. - [x] Observability - warning text now reports when proof generation exceeds the sampled delay, which matches the actual comparison in the code. ## Compatibility & Downstream Validation NA ## Impact on Library Users - Changes behavior for users who enable per-hop spam protection with `ExponentialDelayStrategy` and leave `minimumDelayMs = 0`: `MixProtocol` now treats that as unset and applies a default `100 ms` floor. - Changes the bounded exponential-delay implementation from clamp/retry behavior to direct truncated sampling within the configured practical window. - No API breakage. ## Risk Assessment Low. The change is limited to the mix delay sampling path and is covered by unit tests plus spam-protection integration tests. ## Verification - `tests/libp2p/mix/test_delay_strategy.nim`: `10/10` passed - `tests/libp2p/mix/component/test_spam_protection.nim`: `3/3` passed - `tests/test_all.nim -d:path=mix`: `132/132` passed earlier in the branch before the later focused follow-ups ## References https://forum.research.logos.co/t/timing-correlation-risks-in-mixnet-delay-strategies/677