
[WIP] binary bloat analysis #1223

Draft · wants to merge 2 commits into master

Conversation

pauldreik
Member

@pauldreik pauldreik commented Oct 10, 2020

This is for measuring binary size. As far as I know, no one has asked for it, but it may be interesting to

  • see which functions take up space
  • notice if the binary size suddenly increases

It is really quick to run, less than a minute.

However, I do not really know how to present the results or how to act on them; see djarek/bloaty-analyze#1
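One way to present the results might be a top-N list of the largest symbols. The sketch below parses CSV output of the kind bloaty can emit (e.g. `bloaty -d symbols --csv libsimdjson.so`); the exact column layout (`symbols,vmsize,filesize`) and the symbol names in the sample are assumptions, not real measurements.

```python
import csv
import io

# Hypothetical output of `bloaty -d symbols --csv libsimdjson.so`;
# the column names and all sizes below are made up for illustration.
sample_csv = """symbols,vmsize,filesize
simdjson::dom::parser::parse,41234,41234
simdjson::internal::jsoncharutils,10240,10240
[section .rodata],8192,8192
"""

def top_symbols(csv_text, n=10):
    """Return the n largest symbols by file size, descending."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    rows.sort(key=lambda r: int(r["filesize"]), reverse=True)
    return [(r["symbols"], int(r["filesize"])) for r in rows[:n]]

for name, size in top_symbols(sample_csv):
    print(f"{size:>8}  {name}")
```

Posting such a list as a PR comment would answer "which functions take up space" at a glance.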

@jkeiser
Member

jkeiser commented Oct 12, 2020

It's definitely a good idea to keep track of size over time ... that affects the ability to distribute.

Not part of this at all, but someday it would be nice to do a size comparison: instead of just comparing simdjson against other libraries for speed on certain tasks, make a separate executable for each library+task combo and compare size as well.

@pauldreik
Member Author

Would a reasonably small example be one that takes the twitter json and outputs the first tweet found that contains "cats"?
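The proposed task can be sketched in a few lines. The real benchmark would of course use simdjson's C++ API; this stand-in uses Python's standard library and a tiny made-up document in the shape of twitter.json (a top-level "statuses" array of objects with a "text" field).

```python
import json

# A tiny stand-in for twitter.json; ids and texts are invented.
doc = json.loads("""
{"statuses": [
  {"id": 1, "text": "hello world"},
  {"id": 2, "text": "I like cats"},
  {"id": 3, "text": "cats and dogs"}
]}
""")

def first_tweet_containing(doc, needle):
    """Return the first status whose text contains needle, or None."""
    for status in doc["statuses"]:
        if needle in status["text"]:
            return status
    return None

print(first_tweet_containing(doc, "cats"))
```

Compiling the equivalent C++ program per library would give the per-library+task size numbers @jkeiser suggested.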

I think there are several metrics that would be interesting to keep track of over time:

  • lines of code in total
  • fuzz coverage (I have a manually updated table here)
  • unit test coverage
  • performance (N platforms x M benchmarks)
  • binary size of the library
  • binary size of "hello world" similar to what @jkeiser proposes above

There was discussion of tracking performance earlier; perhaps it would be possible to store all these metrics somewhere?

The curl project tracks all sorts of things over time, we might get inspiration there.
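A minimal way to store these metrics over time would be one CSV row per commit, appended from CI. The column names and all values below are illustrative, not real project numbers.

```python
import csv
import os
import tempfile

# Hypothetical schema: one row per commit with a few of the metrics above.
FIELDS = ["commit", "lines_of_code", "lib_size_bytes", "hello_world_bytes"]

def append_metrics(path, row):
    """Append a metrics row, writing the header only if the file is new."""
    new_file = not os.path.exists(path)
    with open(path, "a", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        if new_file:
            writer.writeheader()
        writer.writerow(row)

path = os.path.join(tempfile.mkdtemp(), "metrics.csv")
append_metrics(path, {"commit": "abc1234", "lines_of_code": 120000,
                      "lib_size_bytes": 650000, "hello_world_bytes": 90000})
append_metrics(path, {"commit": "def5678", "lines_of_code": 121000,
                      "lib_size_bytes": 655000, "hello_world_bytes": 90500})
with open(path) as f:
    rows = list(csv.DictReader(f))
print(len(rows))
```

A file like this, committed to a stats branch or published as a CI artifact, would be enough to plot trends the way curl does.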

@jkeiser
Member

jkeiser commented Dec 22, 2020

For some reason I didn't see your response: I think it's reasonable to run this on simdjson.so at a minimum, and perhaps the parse executable.

For ondemand, perhaps the partial_tweets benchmarks? Perhaps benchmark_ondemand as a whole, which would potentially give us interesting comparisons.

@jkeiser
Member

jkeiser commented Mar 20, 2021

@pauldreik I'm fine leaving this pr around if you plan to get back to it; otherwise let's file an issue and get back to it when we have time :)

@pauldreik
Member Author

> @pauldreik I'm fine leaving this pr around if you plan to get back to it; otherwise let's file an issue and get back to it when we have time :)

I pinged the author of the bloaty action job, let's see if I get a response and if not, let's do as you suggest!

@pauldreik
Member Author

@jkeiser I fixed the bloaty job and it seems to work, but what should we do with the results? Should we reject the pull request if the binary size increases by more than X%?

@jkeiser
Member

jkeiser commented Jun 24, 2021

@pauldreik do you have any idea whether the results of this are generally stable? If so, I think it'd be reasonable to reject changes with a 20% size change (or at least flag the crap out of them). @lemire thoughts?

At the very, very least, we should run this in CI so we can go look at the results when we're worried. If it's expensive we can restrict to just master pushes.
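The gate being discussed is a small computation; a sketch, where the 20% threshold and the function name are just the numbers floated above, not a decided policy:

```python
# CI gate sketch: compare the PR build's binary size against the size
# recorded for master, and fail the check beyond a relative threshold.

def size_regression(baseline_bytes, current_bytes, threshold=0.20):
    """Return True if the size grew by more than `threshold` (a fraction)."""
    if baseline_bytes <= 0:
        raise ValueError("baseline size must be positive")
    growth = (current_bytes - baseline_bytes) / baseline_bytes
    return growth > threshold

# A 25% increase trips the gate; a 10% increase does not.
print(size_regression(1000, 1250))
print(size_regression(1000, 1100))
```

CI would call this with the baseline size stored for master and exit nonzero (or just post a warning comment) when it returns True.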

@lemire
Member

lemire commented Jun 24, 2021

@jkeiser Sure, sure.

@pauldreik
Member Author

pauldreik commented Jun 27, 2021 via email

@jkeiser jkeiser changed the title DON'T MERGE binary bloat analysis [WIP] binary bloat analysis Jun 28, 2021