Topology aware task stealing for on-chip NUMA multi-core processors

dc.contributor.author Vikranth, B.
dc.contributor.author Wankar, Rajeev
dc.contributor.author Raghavendra Rao, C.
dc.date.accessioned 2022-03-27T05:59:25Z
dc.date.available 2022-03-27T05:59:25Z
dc.date.issued 2013-01-01
dc.description.abstract "The On Chip NUMA Architectures (OCNA) introduce a new challenge namely memory-latency to the scheduling methods. The language run-times and libraries try to explore the processing power of these multiple cores by mapping the user-created tasks on to these cores by using suitable scheduling algorithms with load balancing support to improve throughput. The popular load balancing techniques used are work-sharing and work-stealing and many run-time systems such as Cilk, TBB and wool implement task stealing algorithm to schedule the tasks on to the cores by multiplexing the program generated tasks on to the native worker threads supported by the operating system. But the task stealing strategy applied in present run-time systems assumes the sharing the last level cache (LLC) and common shared bus among all cores on Chip Multi Processor. It tries to optimize the utilization without considering the presence of multiple On Die DRAM controllers and their topological arrangements. Current task stealing technique also suffers from problem of randomly choosing the victim worker queue. In this paper we address these issues and propose a solution for these problems by suggesting few optimizations. Our proposed task stealing strategy dynamically analyzes the topology of the underlying hardware connections and models the group of cores and connections as a logical topology tree. This logical tree is translated into multiple worker pools called stealing domains. By restricting the task stealing within these domains, this strategy is implemented and shows an average of 1.24 times better performance on NAS Parallel Benchmark programs compared to popular runtimes Cilk and OpenMP. © 2013 The Authors. Published by Elsevier B.V.
dc.identifier.citation Procedia Computer Science. v.18
dc.identifier.issn 18770509
dc.identifier.uri 10.1016/j.procs.2013.05.201
dc.identifier.uri https://www.sciencedirect.com/science/article/abs/pii/S187705091300344X
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/9029
dc.subject Load balancing
dc.subject Multi-core
dc.subject On chip NUMA multi core
dc.subject Task stealing
dc.subject Work sharing
dc.subject Worker queue
dc.title Topology aware task stealing for on-chip NUMA multi-core processors
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: