Finding S-Boxes with Efficient Masking in Hardware
Whole Lotta Searchin' Goin' On
"That thing that AES uses."
"That thing that DES has."
\(H(x) = L(F(x)) \oplus A(x)\)
\(2^{16} \cdot 2^{20} \cdot 4713 \approx 2^{48}\)
But we can reduce to \(\approx 2^{46.5}\)
Minimize search space.
– First rule of Find Club
Minimize redundancy.
– Third rule of Find Club
"What's in the box!?"
"Good luck storming the castle."
"Power! Unlimited power!"
Per thread/work item/core/…:
"The root of all evil."
Let's talk about rule #3 again.
Precompute, precompute, precompute.
– Fourth rule of Find Club
Kids, don't try this at home.
Idea:
Better: but within the same parallel context.
sbox = [...] maximum = 0 for alpha in range(1, 256): hist = [0] * 256 for x in range(0, 256): beta = sbox[x] ^ sbox[x ^ alpha] hist[beta]++ maximum = max(maximum, hist[beta])
Terminate early, in concert.
– Fifth rule of Find Club
Optimize for throughput.
– Sixth rule of Find Club
Mind the latency.
– Seventh rule of Find Club
Welcome to Find Club. Enjoy your stay.