
An unusually large amount of memory usage #19

Closed · pflannery opened this issue Mar 18, 2015 · 27 comments

@pflannery (Contributor)

When creating 10000 small tasks, I find that the process memory peaks at around 170MB. When I run the same 10000 tasks with setImmediate, it doesn't go over 40MB.
So it seems that TaskGroup is generating an unusually large amount of memory overhead.

Here's the code I was testing with.

var TaskGroup = require("taskgroup").TaskGroup;
var testArray = Array(10000).join().split(',');
var grp = TaskGroup.create();

testArray.forEach(function(value, index) {
  grp.addTask(function() {
    console.log("task"+index);
  });
  /*
  setImmediate(function() {
    console.log("task"+index);
  });
  */
});

grp.run();

I only tested on Windows 8.1, so I'm not sure what a Linux box will output.
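
For reference, here is a minimal sketch (not part of the original report) of how the peak could be measured in-process by sampling process.memoryUsage() while the group runs; the 10ms interval and the use of heapUsed are arbitrary choices.

var TaskGroup = require("taskgroup").TaskGroup;

// Sketch: sample process.memoryUsage() while the tasks run and report the peak.
var peakHeap = 0;
var sampler = setInterval(function () {
  var used = process.memoryUsage().heapUsed;
  if (used > peakHeap) peakHeap = used;
}, 10);

var grp = TaskGroup.create();
Array(10000).join().split(',').forEach(function (value, index) {
  grp.addTask(function () {
    // task body left empty so only TaskGroup's own overhead is measured
  });
});

grp.on("completed", function () {
  clearInterval(sampler);
  console.log("peak heapUsed: " + (peakHeap / 1024 / 1024).toFixed(1) + " MB");
});

grp.run();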

pflannery changed the title from "Unusually large amount of memory usage" to "An unusually large amount of memory usage" on Mar 20, 2015
@balupton (Member)

Perhaps linked to #12

@balupton (Member)

To be honest, I would be open to migrating TaskGroup to a pre-processor that actually cares about performance. CoffeeScript no longer cares about it at all, which is really sad. Perhaps CoffeeScript Redux is an option; otherwise we could look at TypeScript, or maybe just native JavaScript (not that it's actually a good language, though it's better than it was 5 years ago).

ref: https://developers.google.com/v8/experiments

@pflannery I've started a discussion about this here: https://discuss.bevry.me/t/move-from-coffeescript-to-es6/30/1

@pflannery (Contributor, Author)

Seeing as the topic has gone and the SSL certificate is reported as invalid, I'll just reply here for now :)

I agree. I think moving to native ES5/ES6 will give us more control over what code we use. I'm all for it.
I find CoffeeScript classes are a real pain when analysing them in a heap snapshot because they all show up as ctor instead of the class names I define, which makes it very difficult.

I've been using v8-profiler to take snapshots of the test code I posted above, and below you can see that a Task instance is around 8KB and the _events property generated by the EventEmitter is adding 5-6KB per Task. So I'm wondering if it's worth replacing the event emitter with something that is much lighter in weight?

[screenshot: heap snapshot showing Task instances and their _events retained sizes]
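
To illustrate the "much lighter in weight" idea, here is a hypothetical bare-bones emitter. It is not TaskGroup's code, and whether it would actually shrink the retained size depends on what the listeners themselves close over (see the correction in the following comment).

// Hypothetical minimal emitter: only the on/emit subset, with the listener
// table created lazily on first use.
function TinyEmitter() {
  this._listeners = null;
}
TinyEmitter.prototype.on = function (name, fn) {
  if (!this._listeners) this._listeners = {};
  (this._listeners[name] || (this._listeners[name] = [])).push(fn);
  return this;
};
TinyEmitter.prototype.emit = function (name) {
  var fns = this._listeners && this._listeners[name];
  if (!fns) return false;
  var args = Array.prototype.slice.call(arguments, 1);
  for (var i = 0; i < fns.length; i++) {
    fns[i].apply(this, args);
  }
  return true;
};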

@pflannery (Contributor, Author)

Just want to correct myself slightly: EventEmitter isn't adding 5-6KB per task; it just seems to be referencing 5-6KB.

@balupton (Member)

@pflannery can you do a screencast sometime of how you come up with that profile information? I've spent ages trying to figure it out with no luck!

@pflannery (Contributor, Author)

Here's the code for generating the heap snapshot:

var fs = require('fs');
var profiler = require('v8-profiler');

function saveSnapshot(fileName) {
  var buffer = '';
  var snapshot = profiler.takeSnapshot("test");
  snapshot.serialize(
    function iterator(data, length) {
      buffer += data; // accumulate the serialized chunks
    },
    function complete() {
      fs.writeFileSync(fileName, buffer);
    }
  );
}

var TaskGroup = require("./taskgroup").TaskGroup;
var testArray = Array(10000).join().split(',');
var grp = TaskGroup.create();

testArray.forEach(function(value, index) {
  grp.addTask(function() {
    // console.log("task"+index);
  }); 
});

grp.on("completed", function(){
  saveSnapshot("completed.heapsnapshot")
});

grp.run();

@pflannery (Contributor, Author)

Also, here's a Memory Management Masterclass video: https://www.youtube.com/watch?v=LaxbdIyBkL0

@pflannery (Contributor, Author)

I've created a gist with JavaScript that generates the CPU profile and ensures the deopt reasons will render in the latest Chrome: https://gist.github.com/pflannery/8e38a06a844a7fc362dc

Setup instructions:
  • create a temp folder
  • npm install taskgroup
  • npm install v8-profiler
  • run node taskgroup-cpuprofiling.js
  • open the generated complete.cpuprofile file in Chrome DevTools -> Profiler

@balupton (Member)

Sweet. I'll give it a go (watching the talk right now). I'm wondering if it is best to just do the performance tracking inside the browser with a browserified/webpack'd build of taskgroup. Your thoughts?

@pflannery (Contributor, Author)

I've never used Browserify or webpack, so I don't know what the outcome would be.

@pflannery (Contributor, Author)

@balupton It wouldn't let me post more than three times on the discuss thread, and it won't let me add any more links to the post either, so I've replied here.

I've been analysing the heap generated from running my test, and I've seen that commenting out the following lines takes the heap snapshot from 114MB down to 31MB. (The snapshot was taken once, during the taskgroup completed event.)

Lines here and Lines here

So from that I think this confirms that the EventEmitter (taking up 53MB) is a big part of the problem, and it also seems that the Map copying (taking up 14MB) is adding to it too.

@balupton (Member)

Interesting. A few notes:

Commenting out those events means that the 31MB taskgroup doesn't actually work...

Perhaps the size reduction from the commented-out map options is actually because it disables domains.

My other thought is that the completed event will be too early, as the normal garbage collection won't have kicked in yet; it would have to be once the taskgroup has completed and the scope of the taskgroup instance has been forgotten. I'll do some evaluating of this today. Keep up the good work.
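
One way to separate "not yet collected" from "actually retained" (a sketch only, assuming Node is started with --expose-gc, and reusing grp and saveSnapshot from the earlier comment; the delay is arbitrary):

// Sketch: force a full GC before snapshotting so the snapshot reflects what is
// actually retained rather than what simply hasn't been collected yet.
// Requires running with `node --expose-gc`.
grp.on("completed", function () {
  setTimeout(function () {
    if (global.gc) global.gc();
    saveSnapshot("completed-after-gc.heapsnapshot");
  }, 100); // small arbitrary delay so pending callbacks can settle first
});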

@balupton (Member)

@pflannery I think you came across these rate limits: https://www.dropbox.com/s/sltz6p9gavq64vz/Screenshot%202015-03-24%2008.25.01.png?dl=0

Tomorrow it won't be a problem :-)

@balupton (Member)

@pflannery I've pushed some profiling things up to the es6 branch; you can now run:

cake compile
npm run-script profile

To generate the V8 profile report.

However, you can also run:

cake compile
npm run-script browserify

Then serve the web build to your browser and profile from there! Got Chrome and Firefox both profiling 👍

@balupton (Member)

Chrome's TaskGroup (CoffeeScript version) profile (13 seconds):

https://www.dropbox.com/s/lnh01tehmtsv2cb/taskgroup-coffeescript.cpuprofile?dl=0

Conclusions from evaluating the running of tasks:

  1. A lot of time is spent emitting events; as such we should avoid emitting unnecessary events (this is what you found by commenting out the event names)
  2. A lot of time is spent waiting between tasks due to the setImmediate/nextTick calls to prevent locking; as such we should look into alternative ways of queuing tasks without causing lockups (see the sketch after this list)
  3. Creating a domain in the browser (so an EventEmitter that wraps a try/catch) takes 1ms self time, that's a lot of time
  4. Task.prototype.exit takes 1ms self time, that's a lot of time
  5. Cycling through the nested events to listen for them takes 1ms of self time, that is a lot of time
  6. Instantiating an EventEmitter class takes 2ms
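
On point 2, one hypothetical direction (a sketch of the idea, not TaskGroup's implementation; BATCH_SIZE is arbitrary) is to yield to the event loop only every N tasks instead of after every task:

// Sketch: run task functions in batches, calling setImmediate only between
// batches. The stack stays bounded without paying one deferral per task.
var BATCH_SIZE = 100; // arbitrary

function runQueue(tasks, done) {
  var index = 0;
  (function runBatch() {
    var end = Math.min(index + BATCH_SIZE, tasks.length);
    while (index < end) {
      tasks[index](); // assumes synchronous task functions for simplicity
      index += 1;
    }
    if (index < tasks.length) {
      setImmediate(runBatch); // yield, then continue with the next batch
    } else {
      done();
    }
  })();
}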

@balupton (Member)

Chrome's TaskGroup (ES6 babel version) profile (8 seconds):

https://www.dropbox.com/s/alsdwj0u97sm09c/taskgroup-es6-babel.cpuprofile?dl=0

@balupton (Member)

Okay, I was able to get it down to 3.5 seconds by removing maps and disabling nested events by default.

Removing the next-tick stuff actually makes it go way slower, as the stack continues to grow.

@pflannery any idea how to evaluate the heap snapshots to tell if it has cleaned up properly? I have no clue how I'm meant to read those snapshots.

@pflannery (Contributor, Author)

Firstly, what I do is replace Babel's _applyConstructor code (at the very top of the TaskGroup output) so we can see the Task class in the heap list; otherwise the instances show up as Object.

// Replacement helper so instances keep their constructor's name (e.g. Task)
// in heap snapshots instead of appearing as Object.
var _applyConstructor = function (Constructor, args) {
  return new (Function.prototype.bind.apply(Constructor, [null].concat(args)));
};

The way I evaluate the heaps is to take three snapshots: one at the start, one just after run, and one when it's completed - like this example.
Then I use the comparison view between them, which shows what's been added or removed.

Also, if you take a snapshot when destroy is emitted, that will show you what's left in memory after cleanup.

The three-snapshot technique will go away soon, as there is now a Record Heap Allocations feature in Chrome which gives us a timeline of heap allocations, but it's not yet working in the current v8-profiler so we can't do it yet :'(
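
A sketch of that three-snapshot flow, reusing the saveSnapshot helper and grp from the earlier comment (the file names are arbitrary):

// Sketch of the three-snapshot comparison flow. Load the files into Chrome
// DevTools and switch to the "Comparison" view to see what was allocated or
// released between snapshots.
saveSnapshot("1-start.heapsnapshot");        // baseline, before running

grp.on("completed", function () {
  saveSnapshot("3-completed.heapsnapshot");  // after all tasks have completed
});

grp.run();
saveSnapshot("2-after-run.heapsnapshot");    // immediately after run() returns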

@balupton (Member)

> The three-snapshot technique will go away soon, as there is now a Record Heap Allocations feature in Chrome which gives us a timeline of heap allocations, but it's not yet working in the current v8-profiler so we can't do it yet :'(

Working on Chrome Canary with #19 (comment)

Will try your suggestion, hopefully it will make the snapshots make sense.

@pflannery (Contributor, Author)

Just seen that using async writeFile here was keeping all the snapshots and strings in memory, so each snapshot ended up containing the previous snapshots. I've created PR #20 to resolve this.

@balupton (Member)

> Just seen that using async writeFile here was keeping all the snapshots and strings in memory, so each snapshot ended up containing the previous snapshots. I've created PR #20 to resolve this.

Merged, but still getting segfaults on heaps (neither sync nor async works). I don't get segfaults on profile writing, sync or async (both work).

@pflannery (Contributor, Author)

> Merged, but still getting segfaults on heaps (neither sync nor async works). I don't get segfaults on profile writing, sync or async (both work).

For me it does grow large but doesn't segfault with either async or sync.

The latest changes seem to have cleared this problem. I wonder how much of this was removing domains and how much was removing CoffeeScript?

@balupton (Member)

@pflannery domains are still enabled by default in node land (just disabled in browser land)
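
For context, the per-task domain pattern being discussed (each domain is itself an EventEmitter wrapping error handling) looks roughly like this sketch; it is illustrative only, not TaskGroup's exact code:

var domainModule = require('domain');

// Sketch: run one task inside its own domain so errors thrown from the task's
// asynchronous callbacks are routed to the domain's 'error' handler instead of
// crashing the process. Creating one domain per task is the per-task cost
// mentioned in the profiling notes above.
function runTaskIsolated(taskFn, done) {
  var d = domainModule.create();
  d.on('error', function (err) {
    done(err); // report the failure instead of letting it become uncaught
  });
  d.run(function () {
    taskFn(function (err, result) {
      done(err, result);
    });
  });
}

// Usage: the task signals completion via its callback.
runTaskIsolated(function (complete) {
  setImmediate(function () {
    complete(null, 'ok');
  });
}, function (err, result) {
  console.log(err || result);
});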

@balupton (Member)

@pflannery do the latest commits make this any better? Notably: 9edcd0f#commitcomment-10406680

@pflannery (Contributor, Author)

@balupton see my reply to your commit comments.

@balupton closed this as completed on Jun 4, 2016.