Experiment Scope for Testing the de-MLS Synchronization Assumption

TL;DR: Since the assumption that at least (2n/3) members see a message within \Delta seconds is critical for the de-MLS RFC, we propose measuring it experimentally for selected network sizes and \Delta values.

Assumption

We assume that at least (2n/3) of the members become synchronized within \Delta time.

Definition of Synchronization

For this experiment, we define synchronization as follows:

A message generated by one node is considered synchronized if it is seen by at least (2n/3) members within \Delta seconds.

In other words, after a node produces a message at time (t_0), we check whether at least (2n/3) members have seen that message by time (t_0 + \Delta).
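In code form, this check is a small predicate (a sketch; `is_synchronized` and the `seen_times` timestamp list are hypothetical names, with the threshold taken as ceil(2n/3)):

```python
import math

def is_synchronized(seen_times, t0, delta, n):
    # seen_times: receive timestamp (seconds) per member that saw the message.
    # The message is synchronized if at least ceil(2n/3) of the n members
    # saw it no later than t0 + delta.
    seen_in_time = sum(1 for t in seen_times if t <= t0 + delta)
    return seen_in_time >= math.ceil(2 * n / 3)
```

With n = 3 the threshold is 2, so two timely receipts suffice; at n = 1000 it is 667.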

Goal

The goal is to evaluate whether the network satisfies the above synchronization assumption under different network sizes and time bounds.

Proposed Parameters

  • n \in {200, 500, 750, 1000}
  • \Delta \in {3, 5, 7, 10} seconds

Experiment Outline

For each (n, \Delta) configuration:

  1. Initialize a network with (n) Waku/Logos messaging nodes.
  2. Select one node to generate a message.
  3. Broadcast the message at time (t_0).
  4. Measure how many nodes have seen the message by time (t_0 + \Delta).
  5. Check whether the message reached at least (2n/3) members within the time bound.
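The outline above can be sketched as a measurement loop over the parameter grid (a sketch: `broadcast` is a hypothetical stand-in for the real Waku harness, stubbed here with random latencies):

```python
import itertools
import random

NS = [200, 500, 750, 1000]
DELTAS = [3, 5, 7, 10]  # seconds

def broadcast(n):
    # Stub for the real measurement harness: inject one message into an
    # n-node network and collect per-node receive timestamps.
    t0 = 0.0
    return t0, [t0 + random.expovariate(1 / 2.0) for _ in range(n)]

def run_config(n, delta):
    t0, receive_times = broadcast(n)
    seen = sum(1 for t in receive_times if t - t0 <= delta)
    return {
        "n": n,
        "delta": delta,
        "fraction_seen": seen / n,
        "threshold_met": 3 * seen >= 2 * n,  # seen >= 2n/3, integer-safe
    }

results = [run_config(n, d) for n, d in itertools.product(NS, DELTAS)]
```

Each result dict maps directly onto the "Core Output" items: the fraction seen and whether the 2n/3 threshold was reached.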

Core Output

For each configuration, the experiment should report:

  • the fraction of nodes that saw the message within \Delta,
  • whether the (2n/3) threshold was reached,
  • the observed dissemination behavior across the tested parameter sets.

Discussion

Does this kind of experiment make sense? Any suggestions on what we could add or remove to make it more useful?


I think this setup is fairly easy to replicate and measure. Also very well defined, thanks.

We were just about to start with the new version v0.38.0.rc-1. Do you want us to measure this using just logos-messaging (waku)?

Sure, but I'm just wondering: what else would there be?

@Alberto I have some additional information for you, let me know if it makes sense

@ugur please check that I haven't messed up the RFC understanding

In short, aside from the network, we need to measure how the protocol itself affects the synchronisation time, and how the protocol parameters should depend on it.

When a message travels through de-MLS, it doesn’t just hop between Waku nodes. It goes through MLS encryption (ratchet tree operations that scale with O(log₂ n)), consensus voting where every member sends an individually encrypted vote, freeze rounds where commit candidates need to reach 2n/3 before deterministic selection can work, and so on. All of this adds time on top of the pure network propagation that the baseline experiment captures.

So even if Waku delivers a message to 2n/3 nodes in 3 seconds, the protocol might need 8 seconds end-to-end for everyone to actually be “synchronized” in the de-MLS sense, meaning they’ve decrypted, voted, and converged on the same state.

Simplified flow:


1. Member proposes "add Alice"

2. Proposal MLS-encrypted, broadcast via Waku

3. Every member decrypts and votes (n vote messages)

4. Consensus reached (2n/3 votes tallied)  ← 2n/3 must sync here

5. Steward batches approved proposals → CommitCandidate

6. CommitCandidate broadcast via Waku

7. ═══ FREEZE WINDOW (Δ) ═══  ← 2n/3 must sync here

8. Deterministic selection → MLS tree update → new epoch
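The 2n/3 checks at steps 4 and 7 can be done with integer arithmetic to avoid float thresholds (a sketch; `votes_for` is a hypothetical tally counter):

```python
def supermajority_reached(votes_for, n):
    # True once at least 2n/3 members are counted; 3*votes >= 2*n avoids floats.
    return 3 * votes_for >= 2 * n
```

At n = 1000 this fires at 667 votes (3 × 667 = 2001 ≥ 2000), not at 666.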

The baseline experiment covers step 6→7. The full protocol also needs time for steps 2→4 (consensus) and 8 (crypto). The connection:

protocol Δ  >=  network Δ  +  crypto overhead  +  consensus time  +  margin

                    ↑               ↑                    ↑

               Δ baseline    measurement A        measurement B
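As a worked instance of that inequality (the component split here is invented, chosen only to be consistent with the 3 s network / 8 s end-to-end example above):

```python
def protocol_delta_lower_bound(network_delta, crypto_overhead, consensus_time, margin):
    # All values in seconds; returns the minimum protocol-level Delta.
    return network_delta + crypto_overhead + consensus_time + margin

# Hypothetical split: 3 s network Delta + 1.5 s crypto + 2.5 s consensus + 1 s margin
bound = protocol_delta_lower_bound(3.0, 1.5, 2.5, 1.0)  # -> 8.0
```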

Proposed additional measurements

    What                              Why                                   How it connects to 2n/3
A   MLS crypto timing per operation   CPU floor at each group size          Added on top of every network Δ
B   Consensus convergence time        n vote messages → 2n/3 agreement      Must finish before freeze even starts
C   Full epoch round-trip             Proposal → commit → freeze → sync     The real end-to-end Δ
D   Join burst throughput             Many joins at once, peak load         Stress test of the assumption
E   Emergency proposal propagation    High-priority message to 2n/3         Time-critical protocol events
F   Network partition recovery        What happens when assumption breaks   Failure mode analysis

The batch size problem: how many proposals can one epoch handle?

This is something we need to figure out from the benchmarks. The steward computes MLS tree updates sequentially: each add or remove is a tree operation. If we allow too many proposals in one epoch, the steward spends most of the epoch just computing, and the group is effectively blocked.

Here’s the problem visualized (with some rough, illustrative numbers):

epoch = 30 seconds

Scenario A: 5 proposals per epoch (comfortable)
├── consensus voting ──── 5s ───
├── steward computes ── 0.5s ───
├── freeze window Δ ─── 15s ────
├── idle/chat ─────── 9.5s ─────  ← group works normally
                                   ✓ OK

Scenario B: 200 proposals per epoch (problem!)
├── consensus voting ── 10s ────
├── steward computes ── 20s ────  ← tree ops for 200 members
├── freeze window Δ ─── 15s ────
│                              │
└── total: 45s > 30s epoch ────┘  ← DOESN'T FIT
                                   ✗ Group is stuck

So we probably need a max batch size per epoch: the steward takes the first k approved proposals, commits those, and defers the rest to the next epoch. That limit depends on:

k_max = (epoch_budget - consensus_time - freeze_duration - margin) / T_per_tree_op(n)

Where T_per_tree_op(n) comes directly from measurement A. For example, if at n=500 each tree operation takes ~5ms, and we have 10 seconds of epoch budget after consensus and freeze:

k_max = 10s / 5ms = 2000  ← probably fine for adds at this size

But if the tree operation takes 50ms at n=1000 with larger payloads:

k_max = 10s / 50ms = 200  ← still okay, but getting tighter

The real concern is that tree operations might not scale linearly: if there’s overhead that compounds with batch size, the limit could be much lower. That’s exactly what measurement A should reveal.
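The two worked examples above can be reproduced with a small helper (a sketch; it works in milliseconds to keep the division exact, and assumes the linear per-operation cost model that measurement A still needs to confirm):

```python
def k_max(epoch_budget_ms, consensus_ms, freeze_ms, margin_ms, per_op_ms):
    # Max proposals per epoch given the time left after consensus,
    # freeze window, and safety margin are subtracted.
    headroom = epoch_budget_ms - consensus_ms - freeze_ms - margin_ms
    if per_op_ms <= 0 or headroom <= 0:
        return 0
    return headroom // per_op_ms

# 10 s of headroom after consensus/freeze/margin, as in the examples above:
k_max(30_000, 10_000, 5_000, 5_000, 5)   # 5 ms/op  -> 2000
k_max(30_000, 10_000, 5_000, 5_000, 50)  # 50 ms/op -> 200
```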

This matters for the join burst test (measurement D): if 200 people try to join at once but k_max is 50, then the first 50 join in epoch E, the next 50 in epoch E+1, and so on, spreading the load across 4 epochs.
The question becomes: is the user experience acceptable? And does the deferred queue work correctly? @ugur

What makes it worse at scale

Three factors compound as the group grows:

  1. Tree depth increases: MLS tree operations are O(log₂ n). At n=1000, each add/remove touches ~10 tree levels instead of ~7 at n=100. Individual operations get slower.

  2. CommitCandidate size grows: k proposals × MLS proposal bytes + 1 commit. A 100-proposal candidate could be tens of KB. Larger messages take longer to propagate through Waku, eating into the freeze window.

  3. Welcome messages grow linearly: each new joiner gets a Welcome containing the full ratchet tree (~350KB at n=1000). If 50 people join in one batch, that’s ~17MB of Welcome traffic from the steward alone.
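The three factors can be sanity-checked with quick arithmetic (a sketch; the ~350 KB Welcome size is the figure quoted above, not a measurement):

```python
import math

def tree_depth(n):
    # MLS ratchet tree depth grows as ceil(log2 n).
    return math.ceil(math.log2(n))

def welcome_traffic_kb(welcome_kb, joiners):
    # Total Welcome bytes the steward sends for one join batch.
    return welcome_kb * joiners

tree_depth(100)               # -> 7 levels
tree_depth(1000)              # -> 10 levels
welcome_traffic_kb(350, 50)   # -> 17500 KB, i.e. ~17 MB
```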


Timer tuning: finding the minimum epoch

All protocol timers derive from epoch_duration:

epoch_duration (default 30s)

  ├── freeze_duration = epoch / 2   ← this IS the protocol Δ
  ├── consensus_timeout = 15s       ← must finish before freeze
  └── join_timeout ≈ 2 × epoch

We want to find the minimum epoch that works for each group size. So we’d test:

  • Group sizes: n ∈ {200, 500, 750, 1000} (same as original experiment)
  • Epoch durations: {10, 15, 30, 60}s (which gives freeze Δ of {5, 7.5, 15, 30}s)
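The timer relationships and the proposed grid can be expressed directly (a sketch; the 15 s consensus_timeout is the hardcoded value from the timer tree above, kept fixed here):

```python
EPOCHS_S = [10, 15, 30, 60]

def derived_timers(epoch_s):
    return {
        "freeze_duration": epoch_s / 2,   # this is the protocol Delta
        "consensus_timeout": 15,          # currently hardcoded
        "join_timeout": 2 * epoch_s,
    }

freeze_deltas = [derived_timers(e)["freeze_duration"] for e in EPOCHS_S]
# freeze_deltas matches the grid: 5.0, 7.5, 15.0, 30.0 seconds
```

Note that at epoch = 10 s the hardcoded 15 s consensus_timeout already exceeds the 5 s pre-freeze half, which feeds directly into the configurability question in the discussion.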

Questions for discussion

  • Does extending the experiment this way make sense alongside the network-level tests?
  • For join burst: what’s a realistic “spike” size? 10 joins? 100? And should we cap the batch size per epoch based on measurement A results?
  • The Welcome message grows linearly with group size (~350KB at n=1000). Should we measure whether Waku handles messages that large reliably?
  • The consensus timeout is hardcoded at 15s. If benchmarks show it needs more at large n, should it become configurable?
  • For the batch size limit β€” should excess proposals be deferred automatically to the next epoch, or should there be backpressure on proposal creation (e.g., reject new proposals when the queue is full)?