Skip to content

Scheduler

https://docs.dask.org/en/latest/scheduling.html

https://dask.pydata.org/en/latest/scheduler-overview.html#configuring-the-schedulers

schedulers

  • threaded (threads): a scheduler backed by a thread pool

  • processes (processes): a scheduler backed by a process pool

  • synchronous (single-threaded): a synchronous scheduler, good for debugging

  • distributed (distributed): a distributed scheduler for executing graphs on multiple machines

synchronous scheduler

good for debugging

# As a context manager
with dask.config.set(scheduler='single-threaded'):
    x.sum().compute()

# Set globally: scheduler='processes', num_workers=4
dask.config.set(scheduler='single-threaded')
x.sum().compute()