manifold-scheduler-pool can be a bottleneck on higher loads #479

Closed
alexander-yakushev opened this issue Feb 8, 2019 · 4 comments · Fixed by #664
Comments

@alexander-yakushev
Contributor

alexander-yakushev commented Feb 8, 2019

Feel free to mark this as a minor issue. I don't have a generic solution right now; I'm just throwing around some ideas.

The manifold-scheduler-pool that is used whenever manifold.time/in is called (and, transitively, by every manifold.deferred/timeout!) is based on Java's ScheduledThreadPoolExecutor. That executor uses a blocking queue, is not terribly efficient when facing a 100k+/sec scheduling rate, and grows a sizeable tail in the queue. Aleph is quite trigger-happy about creating timeouts, especially when you use a client and set :pool-timeout, :connection-timeout, and other similar parameters.
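To make the pattern concrete, here's a minimal JDK-only sketch (class and variable names are mine, not Aleph's) of what each timeout-carrying request effectively does against a ScheduledThreadPoolExecutor-backed clock: every request enqueues a timeout task, and nearly all of those tasks are later cancelled because the request completed in time.

```java
import java.util.concurrent.*;

public class TimeoutSketch {
    public static void main(String[] args) throws Exception {
        // Single-threaded scheduler, the same shape as Manifold's default clock.
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();

        // Every "request" enqueues a timeout task into the executor's internal
        // DelayedWorkQueue (a lock-guarded binary heap); at 100k+/sec the lock
        // and the O(log n) heap operations become the bottleneck.
        int requests = 10_000;
        ScheduledFuture<?>[] timeouts = new ScheduledFuture<?>[requests];
        for (int i = 0; i < requests; i++) {
            timeouts[i] = scheduler.schedule(() -> {}, 5, TimeUnit.SECONDS);
        }

        // Most requests complete before their deadline, so most timeouts are
        // cancelled -- yet each one still paid the enqueue cost.
        int cancelled = 0;
        for (ScheduledFuture<?> t : timeouts) {
            if (t.cancel(false)) cancelled++;
        }
        System.out.println("cancelled " + cancelled + " of " + requests);
        scheduler.shutdownNow();
    }
}
```

The point of the sketch is the churn: the scheduler pays for every insertion and cancellation even though almost no timeout ever fires.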

The immediate solution I came up with is replacing the default scheduler with Netty's HashedWheelTimer, which uses JCTools' lockless MPSC queue under the hood. No extra dependencies are needed since Netty is already there; however, this approach cannot be blindly applied to Manifold itself, as people might use it without Aleph (and Netty). Perhaps we could detect on pool initialization whether Netty is present and create a more efficient executor then? I don't know.

Anyway, here's a hack for those who might run into a similar problem:

;; Assumed requires/imports for the snippet below (`thread-factory` is a local
;; helper that wraps a name-generating fn into a java.util.concurrent.ThreadFactory):
;;   (:require [manifold.time :as mtime])
;;   (:import (io.netty.util HashedWheelTimer TimerTask) ; Netty's TimerTask, not java.util's
;;            (java.util.concurrent Executors TimeUnit)
;;            (manifold.time IClock))

(def ^:private hashed-timer-clock
  "A replacement for the default manifold.time clock that is used for scheduling
  timeouts. We do many timeouts and need something faster. Netty's HashedWheelTimer
  is a good replacement, based on JCTools' MPSC lockless queue."
  (let [;; 10 milliseconds is the minimal resolution - should be enough.
        timer (HashedWheelTimer.
               (thread-factory (fn [] "manifold-timeout-scheduler"))
               10 TimeUnit/MILLISECONDS 1024)
        ;; We also create a regular ScheduledExecutor to serve IClock's `every`
        ;; functionality since the wheel timer doesn't provide it. Those periodic
        ;; schedulings are less frequent, so we can afford using the default.
        periodic-clock (mtime/scheduled-executor->clock
                        (Executors/newSingleThreadScheduledExecutor
                         (thread-factory (fn [] "manifold-periodic-scheduler"))))]
    (reify IClock
      (in [_ interval f]
        ;; `interval` arrives in milliseconds; `newTimeout` wants a long.
        (.newTimeout timer (reify TimerTask (run [_ _] (f)))
                     (long interval) TimeUnit/MILLISECONDS))
      (every [_ delay period f]
        (.every ^IClock periodic-clock delay period f)))))

;; And now, hack into the root of a dynamic var. Kids, don't try this at home.
(alter-var-root #'mtime/*clock* (constantly hashed-timer-clock))
@kachayev
Collaborator

kachayev commented Feb 8, 2019

I've been looking into HashedWheelTimer for quite some time to replace manifold timeouts. I did some experiments using AsyncHttpClient as an example.

Netty does not use this timer internally, though; it turns out that in practice it's better to leverage the same task queue that's used for all I/O tasks. I think it would be a decent improvement to move connection/request etc. timeouts to Netty's executor, the same way it's done for the WebSocket handshake timeout handling, here.
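A rough JDK-only sketch of that idea (names are hypothetical): Netty's EventLoop implements ScheduledExecutorService, so a plain single-threaded scheduled executor stands in for it here. Scheduling the timeout on the same thread that completes the request means the schedule/cancel pair never crosses threads.

```java
import java.util.concurrent.*;

public class EventLoopTimeoutSketch {
    public static void main(String[] args) throws Exception {
        // Stand-in for a Netty EventLoop: the real EventLoop interface extends
        // ScheduledExecutorService, so request timeouts can be scheduled on
        // the same thread that performs the channel's I/O work.
        ScheduledExecutorService eventLoop = Executors.newSingleThreadScheduledExecutor();

        CompletableFuture<String> response = new CompletableFuture<>();

        // Schedule the timeout on the event loop itself ...
        ScheduledFuture<?> timeout = eventLoop.schedule(
            () -> response.completeExceptionally(new TimeoutException("request timed out")),
            5, TimeUnit.SECONDS);

        // ... and complete the response + cancel the timeout from that same
        // thread, so there's no cross-thread contention on the task queue.
        eventLoop.execute(() -> {
            response.complete("200 OK");
            timeout.cancel(false);
        });

        System.out.println(response.get());
        eventLoop.shutdown();
    }
}
```

This is only an illustration of the thread-affinity argument, not how Aleph's handlers are actually wired.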

The harder part is testing & benchmarking. If you're volunteering to measure the performance improvements at 100k+/sec, I'm more than glad to help with the implementation.

Regarding Manifold itself, I assume that the scheduler there aims to be a general-purpose scheduler. It's hard to make a general-purpose implementation that works well in an I/O-bound context; a tailored solution might be an order of magnitude better. So I would say that "better performance for Aleph timeouts" and "better Manifold scheduler" are 2 separate tasks. Having a scheduler with a single thread to execute callbacks might be tough, and not only from a performance/throughput point of view; it's one of the problems we spotted a while ago in our projects.

@arnaudgeiser
Collaborator

arnaudgeiser commented Jan 25, 2023

I would propose putting the content of this issue into a cljdoc article.

@KingMob
Collaborator

KingMob commented Jan 27, 2023

What are you thinking? Something to implement for people who need faster timers?

@arnaudgeiser
Collaborator

Yes, something along those lines.

- Advanced use cases
  - Replace internal scheduler
