src: improve base64 encoding performance #39701
Conversation
mscdex commented Aug 8, 2021 • edited
nodejs-github-bot commented Aug 8, 2021
benjamingr left a comment
The improvement (sure you tested with -O3 in both cases right?) is somewhat surprising but sure.
mscdex commented Aug 8, 2021 • edited
-O3 is the default in node builds, which I did not touch.
I'm not as surprised considering this is reading more bytes per loop iteration (and in one go) than before, which is not something I'm sure a compiler would necessarily know to do.
benjamingr commented Aug 8, 2021
Yeah, I of course believe you and assumed it was run with -O3 (otherwise I wouldn't have LGTMd). I'm just surprised about:
It sounds like a pretty safe and straightforward form of loop unrolling, which is a fairly simple/standard optimisation
```cpp
// Read in chunks of 8 bytes for as long as possible
while (i < n64) {
  const uint64_t dword = *reinterpret_cast<const uint64_t*>(src + i);
```
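For context, this is the classic byte-at-a-time base64 scheme that the wide-read loop above optimizes: every 3 input bytes become 4 output characters via 6-bit table lookups. The sketch below is an illustrative reference implementation, not the PR's actual code.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>

// Reference base64 encoder (illustrative, not the PR's code).
std::string base64_encode(const unsigned char* src, size_t n) {
  static const char tbl[] =
      "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";
  std::string out;
  size_t i = 0;
  // Each full group of 3 bytes packs into 24 bits, emitted as four
  // 6-bit indices into the lookup table.
  for (; i + 3 <= n; i += 3) {
    uint32_t v = (src[i] << 16) | (src[i + 1] << 8) | src[i + 2];
    out += tbl[(v >> 18) & 63];
    out += tbl[(v >> 12) & 63];
    out += tbl[(v >> 6) & 63];
    out += tbl[v & 63];
  }
  // 1 or 2 trailing bytes are padded with '='.
  if (i < n) {
    uint32_t v = src[i] << 16;
    if (i + 1 < n) v |= src[i + 1] << 8;
    out += tbl[(v >> 18) & 63];
    out += tbl[(v >> 12) & 63];
    out += (i + 1 < n) ? tbl[(v >> 6) & 63] : '=';
    out += '=';
  }
  return out;
}
```

Reading 8 source bytes per iteration amortizes load overhead across more of these table lookups than the byte-wise form allows.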
Some architectures don't support unaligned 64-bit access, so this could end up being somewhat slower on those. But still an optimization on most systems!
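As an aside, a common portable alternative to the cast-and-dereference is to load through `std::memcpy`: compilers typically lower it to a single unaligned load where the architecture permits one, and to byte loads where it doesn't, without the undefined behavior of dereferencing a misaligned `uint64_t*`. A minimal sketch (not part of this PR):

```cpp
#include <cstdint>
#include <cstring>

// Portable 8-byte load that is valid at any alignment; on x86-64 and
// ARMv8 this usually compiles to one plain load instruction.
uint64_t load_u64(const unsigned char* p) {
  uint64_t v;
  std::memcpy(&v, p, sizeof(v));
  return v;
}
```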
That's why I was curious to check on ARM. However, from what I'm seeing, ARMv7 and newer support unaligned reads, with ARMv7 supposedly not supporting them for only 2 instructions?
Have you tried reading 1 or 4 bytes at a time until 8-byte alignment is reached?
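The suggestion could be sketched as a three-phase loop: a byte-wise prologue until the pointer is 8-byte aligned, an aligned 64-bit bulk phase, and a byte-wise tail. The sketch below is hypothetical, not the PR's code; it sums bytes instead of emitting base64 so the loop structure stays visible, and it keeps the PR's `reinterpret_cast` load (now guaranteed aligned).

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical alignment-prologue structure (not the PR's code).
uint64_t sum_with_alignment_prologue(const unsigned char* src, size_t n) {
  uint64_t sum = 0;
  size_t i = 0;
  // Prologue: single bytes until src + i sits on an 8-byte boundary.
  while (i < n && (reinterpret_cast<uintptr_t>(src + i) & 7) != 0)
    sum += src[i++];
  // Bulk: aligned 64-bit loads, 8 bytes per iteration.
  for (; i + 8 <= n; i += 8) {
    uint64_t dword = *reinterpret_cast<const uint64_t*>(src + i);
    for (int b = 0; b < 8; ++b)
      sum += (dword >> (b * 8)) & 0xFF;  // process each byte of the word
  }
  // Tail: remaining bytes one at a time.
  while (i < n)
    sum += src[i++];
  return sum;
}
```

The trade-off is extra branching for short inputs, which is likely why it would want benchmarking on its own.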
mscdex commented Aug 11, 2021 • edited
Alright, after more wrangling than I wanted, I managed to get some armv7 binaries compiled and ran the benchmarks on a Pi 2B before and after the changes in this PR. I don't have an armv8 board available to test on.
Mesteery commented Aug 11, 2021
I can run the benchmark on a Pi 4B. Should I compare with master or 16.6.1?
Mesteery commented Aug 11, 2021
Compared with 16.6.1:
fhinkel commented Aug 17, 2021
Can you link the issue where this is too slow in the real world? Not sure a 15% perf improvement on a synthetic benchmark warrants this change.
mscdex commented Aug 17, 2021 • edited
"Synthetic" benchmarks are all we have at the moment, after the benchmarking WG was dechartered/decommissioned. IMO base64 encoding/decoding is not something that I'd exactly call an uncommon task, especially within the realm of web applications. In general, any sensible changes that provide a measurable improvement should be welcomed, especially considering the number of performance hits we continue to take on over time as features get added and as V8 evolves. The improvements all add up. Anyway, as far as base64 encoding goes, I also have an alternative PR here that you may be more interested in. 🤷♂️ |
mscdex commented Jul 10, 2022
Closing in favor of #39775
Benchmark results on a Core i7-3770K:
I was tempted to run the benchmark on ARM out of curiosity, but at the moment I don't have a machine with a new enough build environment (or one where I can easily swap the OS), and trying to cross-compile Node.js with a Linaro aarch64 toolchain is impossible.