Minimize drbg initialization in ot extension, small optimizations and streams #249
Conversation
Simple functionality to adjust the length of a candidate byte array by either truncating it, or stretching it using some secure but deterministic strategy (in this implementation, SHA-256 and a counter).
A simple implementation of OTP using LengthAdjustment.java to adjust the length of keys to the length of the messages to be encrypted or decrypted.
- This makes it very easy to switch to parallel processing.
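The stretch-or-truncate strategy described above could be sketched roughly as follows. All names here are hypothetical illustrations of the idea, not the actual `LengthAdjustment` API: truncate when the candidate is long enough, otherwise stretch deterministically with SHA-256 over the candidate and an incrementing counter.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;

public final class LengthAdjustSketch {

  // Adjusts the candidate to exactly the requested length: truncates when the
  // candidate is too long, otherwise stretches it with SHA-256 and a counter.
  static byte[] adjust(byte[] candidate, int length) {
    if (candidate.length >= length) {
      return Arrays.copyOf(candidate, length); // simple truncation
    }
    try {
      MessageDigest sha = MessageDigest.getInstance("SHA-256");
      byte[] out = new byte[length];
      int offset = 0;
      // Each counter value yields a fresh 32-byte block, deterministically.
      for (int counter = 0; offset < length; counter++) {
        sha.reset();
        sha.update(candidate);
        sha.update(intToBytes(counter));
        byte[] block = sha.digest();
        int n = Math.min(block.length, length - offset);
        System.arraycopy(block, 0, out, offset, n);
        offset += n;
      }
      return out;
    } catch (NoSuchAlgorithmException e) {
      throw new IllegalStateException("SHA-256 is required on all JVMs", e);
    }
  }

  // Big-endian encoding of an int, used as the counter input to the hash.
  static byte[] intToBytes(int i) {
    return new byte[] {(byte) (i >>> 24), (byte) (i >>> 16), (byte) (i >>> 8), (byte) i};
  }
}
```

Determinism is the key property: calling `adjust` twice on the same candidate and length must yield the same output, so both parties of a protocol derive identical keys.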
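The OTP step itself is plain XOR once the key has been adjusted to the message length; encryption and decryption are the same operation. This is only a sketch of the idea, the real `PseudoOtp` API may differ:

```java
public final class OtpSketch {

  // XORs the first `length` bytes of data with the (already length-adjusted)
  // key. Applying the same key twice recovers the original data.
  static byte[] xor(byte[] data, byte[] key, int length) {
    byte[] out = new byte[length];
    for (int i = 0; i < length; i++) {
      out[i] = (byte) (data[i] ^ key[i]);
    }
    return out;
  }
}
```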
- Copies arrays bytewise instead of bitwise
- In the MultiplyWithoutReduction method we optimize by pre-computing each of the eight possible rotations of bytes in the b-vector.
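The rotation precomputation could be sketched as below. This treats the inputs as bit strings (MSB-first bytes) multiplied as GF(2) polynomials; the eight shifted copies of the b-vector are computed once up front, so handling each set bit of the a-vector only needs byte-aligned XORs. Names and representation are assumptions for illustration, not the actual `RotSharedImpl` code:

```java
public final class ClMulSketch {

  // Precomputes b shifted right by 0..7 bits, each one byte longer than b.
  static byte[][] precomputeShifts(byte[] b) {
    byte[][] shifted = new byte[8][];
    for (int s = 0; s < 8; s++) {
      shifted[s] = shiftRight(b, s);
    }
    return shifted;
  }

  // Shifts a bit string (MSB-first) right by s < 8 bits, growing by one byte.
  static byte[] shiftRight(byte[] b, int s) {
    byte[] res = new byte[b.length + 1];
    for (int i = 0; i < b.length; i++) {
      res[i] ^= (byte) ((b[i] & 0xFF) >>> s);      // high bits stay in place
      res[i + 1] ^= (byte) (b[i] << (8 - s));      // low bits spill over
    }
    return res;
  }

  // Carry-less multiplication: for each set bit of a, XOR the matching
  // precomputed rotation of b into the result at a byte offset.
  static byte[] multiplyWithoutReduction(byte[] a, byte[] b) {
    byte[][] shifted = precomputeShifts(b);
    byte[] res = new byte[a.length + b.length];
    for (int bit = 0; bit < a.length * 8; bit++) {
      if (((a[bit / 8] >>> (7 - bit % 8)) & 1) == 1) {
        byte[] rot = shifted[bit % 8];
        for (int j = 0; j < rot.length; j++) {
          res[bit / 8 + j] ^= rot[j];
        }
      }
    }
    return res;
  }
}
```

The gain over the naive approach is that the inner loop works a byte at a time instead of a bit at a time; the per-bit shifting work is amortized over all set bits of the a-vector.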
Codecov Report
```diff
@@            Coverage Diff             @@
##           master     #249      +/-   ##
============================================
+ Coverage     98.87%   98.89%   +0.02%
- Complexity     2812     2827      +15
============================================
  Files           318      320       +2
  Lines          8441     8446       +5
  Branches        695      691       -4
============================================
+ Hits           8346     8353       +7
  Misses           85       85
+ Partials         10        8       -2
```
Continue to review full report at Codecov.
Two things:
- It is very annoying that you bundled several changes in one branch - it is quite hard to see the minimization for all the streaming and reformatting. Please create separate branches in the future.
- Make sure the last lines are covered; there should not be a reason for not reaching 100%.
Otherwise fine. I had hoped for more gains, actually; it is still slow, sadly.
@pffrandsen I totally agree, this branch got a little out of hand. I will do better in the future. I also agree the performance gain is a little disappointing. One way to improve might be to use more parallelization. The addition of streams should make this quite easy to add. The coverage is at 100% already though. Not sure what else to test.
```java
    }
    return res;
    int amountToPreprocess = computeExtensionSize(choiceBits.getSize(), comSecParam, statSecParam);
    byte[] extraByteChoices = Arrays.copyOf(choiceBits.toByteArray(), amountToPreprocess / 8);
```
Shouldn't it be Byte.SIZE instead of 8, just to be consistent in the usage of constants :)
```java
   * @param i an integer
   * @return a corresponding byte array
   */
  private static byte[] intToBytes(int i) {
```
I guess that for (rather pedantic) consistency we should do a loop from 0 to Integer.BYTES and shift based on a multiple of Byte.SIZE, rather than having hardcoded 4 bytes for an int and 8 bits per byte. But I think that might be a bit too much to do, because we already have the necessary guarantees from Java that what is below will work.
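The constant-based loop version being discussed might look something like this (a sketch of the suggestion, not the code under review):

```java
public final class IntBytes {

  // Big-endian int-to-bytes conversion driven entirely by the platform
  // constants Integer.BYTES and Byte.SIZE, with no hardcoded 4 or 8.
  static byte[] intToBytes(int value) {
    byte[] bytes = new byte[Integer.BYTES];
    for (int j = 0; j < Integer.BYTES; j++) {
      int shift = Byte.SIZE * (Integer.BYTES - 1 - j);
      bytes[j] = (byte) (value >>> shift);
    }
    return bytes;
  }
}
```

As the comment notes, Java guarantees these sizes, so this is purely a style choice rather than a portability fix.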
I considered it! But I felt it would be simpler this way.
I must admit that I agree :)
```java
    byte[] adjusted1 = LengthAdjustment.adjust(candidate, adjustedLength);
    assertEquals(adjustedLength, adjusted1.length);
    byte[] adjusted2 = LengthAdjustment.adjust(candidate, adjustedLength);
    assertArrayEquals(adjusted1, adjusted2);
```
Perhaps add sanity test that random output looks random, e.g. not the zero-vector.
Done
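The sanity check suggested above boils down to asserting that the stretched output is not degenerate. A minimal helper for that kind of assertion could look like this (illustrative only, not the actual test code):

```java
public final class SanityCheck {

  // Returns true only if every byte of the vector is zero; a correctly
  // stretched output should essentially never be the all-zero vector.
  static boolean isAllZeros(byte[] v) {
    for (byte b : v) {
      if (b != 0) {
        return false;
      }
    }
    return true;
  }
}
```

In a test this would be used as something like `assertFalse(SanityCheck.isAllZeros(adjusted1))`.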
```java
      assertFalse(Arrays.equals(Arrays.copyOf(candidate, cipherLength), cipherText));
    }
    byte[] decryptedMessage = PseudoOtp.decrypt(cipherText, candidate, cipherLength);
    assertArrayEquals(Arrays.copyOf(message, cipherLength), decryptedMessage);
```
Perhaps we should also check that no zero-vectors have sneaked in here?
I think that is already covered by testing for equality of the original message.
I agree, handled with the extra check in the lengthAdjustmentTest
```java
    try {
      Drbg rand = new AesCtrDrbg(HelperForTests.seedOne);
      DhParameters params = new DhParameters();
      Ot otSender = new NaorPinkasOt(2, rand, network, params
          .computeSecureDhParams(1, 2, rand, network));
```
Do we want to remove the option of calling computeSecureDhParams on DhParameters? If not, then testing via this method should not have been removed from all the tests in this class. On the other hand, if we do want to remove the computeSecureDhParams option, then the code for this method should be removed from DhParameters.
Good point, I will go with just removing the computeSecureDhParams method, as we are never really using it anyway.
I think you have made a really nice refactoring, making the OT extension much more dynamic and in many cases easier to read. However, I do have a couple of questions/comments I would like some clarification on before final approval:
Why did you remove clarifying comments in NaorPinkasOt? In particular the comments clarifying exactly what math is going on. They were meant to make the code easier to read alongside the paper.
Regarding speed improvements: I assume Java is very good at optimising things, but in general, changing a lot of the code from imperative style to lambda expressions might result in some overhead? I am also a bit unsure why exactly you rewrote most of the transposition code into lambda expressions. Was it purely for cosmetic/coding-style reasons?
Also, depending on where the bottleneck is, an alternative could be to not do Eklundh, but simply the trivial approach to matrix transposition.
As Peter says, the decrease in coverage is a not-so-nice feature of the pull request; is it possible to add a few more tests to prevent this decrease in coverage?
Finally, I think your auto-formatting has been set to 100-character lines instead of 80?
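For reference, the "trivial approach" to bit-matrix transposition mentioned above is just: read bit (r, c), write bit (c, r). A sketch, assuming a row-major byte-array representation with MSB-first bytes and dimensions divisible by 8 (the library's actual Transpose class differs):

```java
public final class NaiveTranspose {

  // Transposes a rows x cols bit matrix, one bit at a time. O(rows * cols)
  // bit operations, versus Eklundh's recursive block swaps on whole words.
  static byte[][] transpose(byte[][] m, int rows, int cols) {
    byte[][] t = new byte[cols][rows / 8];
    for (int r = 0; r < rows; r++) {
      for (int c = 0; c < cols; c++) {
        int bit = (m[r][c / 8] >>> (7 - c % 8)) & 1;  // read bit (r, c)
        t[c][r / 8] |= (byte) (bit << (7 - r % 8));   // write bit (c, r)
      }
    }
    return t;
  }
}
```

Whether this beats Eklundh in practice depends on matrix size and cache behavior, which is presumably why it only matters "depending on where the bottleneck is."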
@jot2re I wrote some point by point comments below:

Removing comments in NaorPinkasOt
I felt that the code was already nicely documented by having code factored into well-named local methods, which were all well documented. So the comments did not seem to be needed. My personal taste is that in this case the code can actually be made harder to read by being cluttered with too many inline comments. I guess it is a matter of taste. Is there anything in particular where you feel the comments should be added back?

Stream overhead
I think you are right that using streams can have a bit of overhead, but as far as I can see it is not significant here. I really like the new stream API, and to me it makes the code very easy to understand, which is part of the reason why I rewrote it in that style. Another reason is that it makes it very easy to parallelize the computation (basically just add …).

Transpose
Yes, I mainly rewrote this to streams to make it more clear to myself. Once I reduced the number of DRBG initializations, it turned out transpose was the new bottleneck. To fix that I first "cleaned up" the code a bit to get a better idea of what was going on. As far as I can see transposing is no longer a bottleneck btw., so I think we should keep Eklundh. The new bottleneck appears to be ….

Coverage
I agree. I guess I just focused on the patch having 100% coverage. I think, however, that I may have decreased DhParameters coverage by using static parameters in the tests (which I would like to keep doing, because it really speeds up the tests).

Linelength
100 is the correct length: https://google.github.io/styleguide/javaguide.html#s4.4-column-limit
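The parallelization point above is worth a concrete illustration (generic example, not the PR's code): with the stream API, switching from sequential to multi-threaded execution is a single method call, while encounter order is still preserved by an ordered collect.

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

public final class ParallelSketch {

  // Computes the squares 0..n-1; the one .parallel() call is the entire
  // difference between sequential and fork/join-pooled execution.
  static List<Integer> squares(int n) {
    return IntStream.range(0, n)
        .parallel()                        // distribute work across threads
        .map(i -> i * i)
        .boxed()
        .collect(Collectors.toList());     // order of results is preserved
  }
}
```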
```java
    List<StrictBitVector> products = IntStream.range(0, alist.size())
        .mapToObj(i -> multiplyWithoutReduction(alist.get(i), blist.get(i)))
        .collect(Collectors.toList());
    res = products.stream().reduce(res, (a, b) -> {
```
I'd remove the extra collect step, i.e., call reduce on the stream resulting from mapToObj.
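The suggested shape folds the mapped stream directly instead of materializing an intermediate list. A self-contained sketch of the pattern, with plain ints standing in for StrictBitVector products so it runs on its own:

```java
import java.util.List;
import java.util.stream.IntStream;

public final class ReduceSketch {

  // XOR-accumulates the pairwise products of two lists in one pipeline:
  // map(...) feeds reduce(...) directly, with no intermediate collect step.
  static int xorOfProducts(List<Integer> as, List<Integer> bs) {
    return IntStream.range(0, as.size())
        .map(i -> as.get(i) * bs.get(i))
        .reduce(0, (x, y) -> x ^ y);
  }
}
```

Besides being shorter, this avoids allocating the intermediate list entirely.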
Good point.
- Tests of not zero in LengthAdjustmentTest
- Removed redundant collect
Having realized the DH creation just dropped out, I still think this creation should be present somewhere, so we are not forced to reinvent how to create it if we want to run on different parameters. It could be in a test class, in a comment, or as a main - just make it discoverable within this repo.
@pffrandsen The test in TestDhParameters generates DhParameters in order to test against the static parameters. Is that all you are asking for?
@GuutBoy Some feedback below:

Removing comments in NaorPinkasOt
It is fine. No need to put things back. I was just curious about the motivation :)

Stream overhead
If it doesn't have a significant impact then everything is fine. I have not used streams in Java before, but if it is that easy to parallelise using them, then it sounds like a great tool I/we should use much more in the future.

Transpose
Ok. I don't have any optimisation suggestions for RotSharedImpl.multiplyWithoutReduction(...) unfortunately.

Coverage
Looks good now :)

Linelength
This pull request:
- Minimizes DRBG initialization in the BristolRotBatch class in order to optimize OT extension performance. This is done by introducing a utility class to adjust the length of a random string to match a desired length. This works by, if necessary, stretching the string using a hash function, but otherwise using the provided string. Another utility uses this to implement a form of OTP encryption. (fixes Minimize DRBG initialization in MASCOT/OTE #248)
- Optimizes the Transpose utility class by copying byte arrays bytewise instead of bitwise
- Optimizes the MultiplyWithoutReduction method in the RotSharedImpl class by pre-computing all possible rotations of the bvec input.