lib: use slice to compact Readable buffer in streams #59676
Conversation
nodejs-github-bot commented Aug 29, 2025
Review requested:
codecovbot commented Aug 29, 2025 • edited
Codecov Report

✅ All modified and coverable lines are covered by tests.

```
@@            Coverage Diff             @@
##             main   #59676      +/-   ##
==========================================
  Coverage   89.89%   89.89%
==========================================
  Files         667      667
  Lines      196675   196852     +177
  Branches    38612    38652      +40
==========================================
+ Hits       176796   176966     +170
+ Misses      12332    12312      -20
- Partials     7547     7574      +27
```
daeyeon commented Sep 2, 2025
Benchmark CI: https://ci.nodejs.org/view/Node.js%20benchmark/job/benchmark-node-micro-benchmarks/1726/
ronag left a comment
This will cause a copy and an allocation; I don't see how this is an improvement. Slicing in JavaScript is not like in Go: it copies the elements rather than just creating a view.
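A minimal sketch of that copy semantics (standard JavaScript behavior, shown here purely for illustration):

```js
const buffer = ['a', 'b', 'c', 'd'];
const tail = buffer.slice(2); // copies ['c', 'd'] into a new array

tail[0] = 'X';
console.log(buffer); // ['a', 'b', 'c', 'd']: the original is untouched
console.log(tail);   // ['X', 'd']: an independent copy, not a view
```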
wlgh1553 commented Sep 2, 2025
@ronag Thanks for the review! 😀 My primary intention was to avoid the O(N) element-shifting cost incurred by `splice()`. This approach is also a proven optimization pattern, used in PR #59406 to solve the same issue. I see the Jenkins CI benchmark results as data that empirically validates this hypothesis in a real-world scenario.
ronag commented Sep 2, 2025
Why is it more efficient?
ronag commented Sep 2, 2025 • edited
Here is an illustration of what needs to happen in both cases (see the sketch below): `splice` does the exact same thing as `slice`, but without the extra allocation, unless there is some V8 magic involved that I am unaware of. I think the benchmark results of both this PR and the other one are within the margin of error.
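Sketched in plain JavaScript (illustrative only; the real built-ins run natively inside V8, and the helper names here are made up):

```js
// splice(0, idx): shift the surviving elements left in place, then shrink
// the array. O(N - idx) moves; the only allocation is the small array of
// removed elements that splice returns.
function compactWithSplice(buffer, idx) {
  for (let i = idx; i < buffer.length; i++) {
    buffer[i - idx] = buffer[i];
  }
  buffer.length -= idx;
  return buffer;
}

// slice(idx): allocate a brand-new array of length N - idx and copy the
// surviving elements into it. The same O(N - idx) copies, plus the large
// allocation.
function compactWithSlice(buffer, idx) {
  const out = new Array(buffer.length - idx);
  for (let i = idx; i < buffer.length; i++) {
    out[i - idx] = buffer[i];
  }
  return out;
}
```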
wlgh1553 commented Sep 2, 2025
Thank you for the detailed illustration with the code examples; it was very helpful for clarifying the logical operations. You are correct, and I agree that `splice` uses a smaller temporary allocation, since `slice` must allocate for the larger, remaining portion of the array.

I believe the "V8 magic" we're discussing is the most likely explanation for the consistent benchmark improvements we see for `slice` in both the CI for this PR and the previous one (#59406). To isolate this performance difference more clearly, I ran a more targeted local microbenchmark with a larger array:

```js
"use strict";

// Times a single call to `fn` and prints the elapsed wall-clock time.
function bench(label, fn) {
  const t0 = performance.now();
  fn();
  const t1 = performance.now();
  console.log(`${label}:`.padEnd(25), `${(t1 - t0).toFixed(2)} ms`);
}

function makeArray(size) {
  const arr = new Array(size);
  for (let i = 0; i < size; i++) {
    arr[i] = i;
  }
  return arr;
}

const ARRAY_SIZE = 20_000_000;
const REMOVE_COUNT = 100;
const RUNS = 100;

bench("Array.prototype.splice()", () => {
  for (let i = 0; i < RUNS; i++) {
    const arr = makeArray(ARRAY_SIZE);
    arr.splice(0, REMOVE_COUNT);
  }
});

bench("Array.prototype.slice()", () => {
  const arr = makeArray(ARRAY_SIZE);
  for (let i = 0; i < RUNS; i++) {
    const newArr = arr.slice(REMOVE_COUNT);
  }
});
```

The results are consistently and significantly in favor of `slice()`.

Test Environment:
This data strongly supports the idea that the high CPU cost of shifting millions of elements for `splice` outweighs the one-time allocation made by `slice`. This is the core rationale for my proposal: I believe trading the one-time memory allocation for a significant gain in CPU performance is a worthwhile improvement for this hot path in the streams implementation.
ronag commented Sep 2, 2025 • edited
I believe your benchmark is flawed. V8 is probably entirely optimizing away `newArr` and the `slice` call, since it has no side effect. With an observable side effect added (the `counter` accumulator below), `splice` comes out ahead:

```
clk: ~3.43 GHz
cpu: Apple M3 Pro
runtime: node 24.5.0 (arm64-darwin)

benchmark                   avg (min … max)           p75 / p99
------------------------------------------------------------------------
Array.prototype.splice()    42.12 µs/iter             36.21 µs
                            (24.92 µs … 1.39 ms)      192.87 µs
                            (14.66 kb … 285.54 kb)    157.71 kb

Array.prototype.slice()     56.49 µs/iter             56.07 µs
                            (48.54 µs … 73.75 µs)     71.57 µs
                            (77.85 b … 6.09 kb)       701.80 b
```

```js
import { bench, run } from 'mitata';

function makeArray(size) {
  const arr = new Array(size);
  for (let i = 0; i < size; i++) {
    arr[i] = i;
  }
  return arr;
}

const ARRAY_SIZE = 20_000;
const REMOVE_COUNT = 100;

// Accumulating into `counter` gives both benchmarks an observable side
// effect, so V8 cannot dead-code-eliminate the splice/slice results.
let counter = 0;

bench("Array.prototype.splice()", () => {
  const arr = makeArray(ARRAY_SIZE);
  counter += arr.splice(0, REMOVE_COUNT).length;
});

bench("Array.prototype.slice()", () => {
  const arr = makeArray(ARRAY_SIZE);
  counter += arr.slice(REMOVE_COUNT).length;
});

await run();

console.log(counter);
```
wlgh1553 commented Sep 3, 2025
Thank you for all the helpful feedback. You've helped me clear up my misunderstanding of the performance characteristics. Based on our discussion, I'll go ahead and close this PR. Thanks again!
Replace `splice(0, idx)` with `state.buffer = ArrayPrototypeSlice(state.buffer, idx)` when compacting the Readable buffer.

Rationale: avoid the O(N) element-shifting cost incurred by `splice`.

Benchmark (local):
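For reference, a sketch of what the change amounts to (illustrative; the stand-in setup below is hypothetical, while the real code lives in lib/internal/streams/readable.js and uses Node's primordials):

```js
// Hypothetical stand-in for Node's ArrayPrototypeSlice primordial.
const ArrayPrototypeSlice = Function.prototype.call.bind(Array.prototype.slice);

// Hypothetical state: `idx` fully-consumed chunks sit at the head of the buffer.
const state = { buffer: ['chunk0', 'chunk1', 'chunk2', 'chunk3'] };
const idx = 2;

// Before: state.buffer.splice(0, idx) shifts the survivors left in place.
// After (this PR): copy the tail into a new array and swap it in.
state.buffer = ArrayPrototypeSlice(state.buffer, idx);

console.log(state.buffer); // ['chunk2', 'chunk3']
```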