Tutorial: Get started with CPU profiling

The example application

The example application repeatedly tries to create a path in the file system (we use createDirectories() from NIO2 for that). If an attempt fails, no exception is raised. We measure the throughput using an improvised benchmark.

Average count: 3527 op
Spent time: 3052 ms

Setup

Run with profiler

Analyze the profile

To our surprise, the createDirectories() method did not account for most of the execution time. Our homemade benchmark took about the same amount of time to execute! Furthermore, if we look one frame above, we see that this is primarily because of the removeIf() method, which accounts for almost all the time of its parent method, update().

public static int update(Deque<Long> events, long nanos, long interval) {
    events.add(nanos);
    events.removeIf(aTime -> aTime < nanos - interval);
    return events.size();
}

Apparently, removeIf() takes so long to execute because it iterates over the entire collection, even though it doesn’t really need to.
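To make the cost concrete, here is a minimal sketch that counts how often removeIf() evaluates its predicate. The class name RemoveIfScan and the helper predicateCalls() are hypothetical and not part of the example application; the sketch assumes an ArrayDeque holding timestamps in ascending order.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical helper, not part of the tutorial's application.
final class RemoveIfScan {

    // Fills a deque with `size` ascending timestamps, removes those below
    // `threshold` with removeIf(), and returns the number of predicate calls.
    static int predicateCalls(int size, long threshold) {
        Deque<Long> events = new ArrayDeque<>();
        for (long t = 0; t < size; t++) {
            events.add(t);
        }
        AtomicInteger calls = new AtomicInteger();
        events.removeIf(t -> {
            calls.incrementAndGet();   // count every evaluation
            return t < threshold;      // only the oldest entries match
        });
        return calls.get();
    }

    public static void main(String[] args) {
        // Only 10 of the 1000 elements are removed, yet the predicate
        // is evaluated for the whole collection.
        System.out.println("Predicate calls: " + predicateCalls(1_000, 10));
    }
}
```

Even though only the first few elements can match, the predicate runs for every element, so the work per call grows with the size of the window.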

Optimize the code and verify the results

Since we’re using an ordered collection, and events are added in chronological order, we can be sure that all elements subject to removal are always at the head of the queue. If we replace removeIf() with a loop that breaks once it reaches events that it is not going to remove, we can potentially improve performance:

while (events.peekFirst() < nanos - interval) {
    events.removeFirst();
}
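For reference, the loop can be placed into the full method roughly as follows. This is a sketch rather than the tutorial's exact code: the class name SlidingWindow is hypothetical, and the extra null check is an added precaution for the case where the deque could become empty (for example, with a negative interval).

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical wrapper class for the optimized update() method.
final class SlidingWindow {

    // Records a new event and drops events older than `interval`,
    // stopping at the first event that is still inside the window.
    static int update(Deque<Long> events, long nanos, long interval) {
        events.add(nanos);
        Long oldest;
        // peekFirst() returns null on an empty deque, so guard before unboxing.
        while ((oldest = events.peekFirst()) != null && oldest < nanos - interval) {
            events.removeFirst();
        }
        return events.size();
    }

    public static void main(String[] args) {
        Deque<Long> events = new ArrayDeque<>();
        update(events, 0, 100);
        update(events, 50, 100);
        // The event at 0 is older than 120 - 100 and is dropped.
        System.out.println("Events in window: " + update(events, 120, 100));
    }
}
```

Each call now touches only the elements it actually removes, plus one that it keeps.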

When we profile the application again and search for the update() method, we find that it now occupies only a tiny part of the graph and no longer carries a monstrous overhead. createDirectories() now occupies a more considerable share of application time.

Average count: 9237 op
Spent time: 1090 ms

Further optimizations

We could stop now and pat ourselves on the back, but what’s going on with our createDirectories() method? Does the flame graph say Exception?

If we examine the top part of the stack where createDirectories() is invoked, we see a lot of native frames that seem to deal with exceptions. But our app didn’t crash, and we didn’t handle any, so why is that happening?

Let’s try to avoid this and wrap the call to createDirectories() in a Files.exists() check:

Path p = Paths.get("./a/b");
if (!Files.exists(p)) {
    Files.createDirectories(p);
}

Average count: 48453 op
Spent time: 143 ms
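The text does not state where the exception-related frames come from. One plausible explanation, which is an assumption about JDK internals that may differ between versions, is that createDirectories() relies on createDirectory(), which reports an existing path by throwing FileAlreadyExistsException, and that this exception is caught before it reaches our code. The sketch below shows the documented behavior of createDirectory() on an existing directory together with the guarded variant; the class and method names are hypothetical.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.FileAlreadyExistsException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical demo class, not part of the tutorial's application.
final class ExistingDirectoryDemo {

    // Creates a scratch directory so the demo does not touch the project tree.
    static Path newTempDir() {
        try {
            return Files.createTempDirectory("profiling-demo");
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Returns true if createDirectory() throws for a path that already exists.
    static boolean createDirectoryThrows(Path existing) {
        try {
            Files.createDirectory(existing);
            return false;
        } catch (FileAlreadyExistsException e) {
            return true;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Guarded variant: skips the creation attempt when the path exists.
    static Path ensureDirectory(Path dir) {
        try {
            if (!Files.exists(dir)) {
                Files.createDirectories(dir);
            }
            return dir;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Path tmp = newTempDir();
        System.out.println("createDirectory throws: " + createDirectoryThrows(tmp));
        System.out.println("Ensured: " + ensureDirectory(tmp.resolve("a").resolve("b")));
    }
}
```

Note that the exists-then-create sequence is not atomic: another process may create or delete the path between the two calls, which is acceptable here because createDirectories() itself tolerates an existing directory.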
