Acar, U., G. Blelloch, and R. Blumofe (2000). “The Data Locality of Work Stealing.” Proceedings of the Twelfth Annual ACM Symposium on Parallel Algorithms and Architectures, 1–12.
Amdahl, G. M. (1967, April). “Validity of the single-processor approach to achieving large scale computing capabilities.” AFIPS Conference Proceedings, 30. Reston, VA: AFIPS Press, 483–485.
An, P., A. Jula, et al. (2003). “STAPL: An Adaptive, Generic Parallel C++ Library.” Workshop on Languages and Compilers for Parallel Computing, 2001. Lecture Notes in Computer Science 2624, 193–208.
Austern, M. H., R. A. Towle, and A. A. Stepanov (1996). “Range partition adaptors: a mechanism for parallelizing STL.” ACM SIGAPP Applied Computing Review, 4, 1, 5–6.
Blumofe, R. D., and D. Papadopoulos (1998). “Hood: A User-Level Threads Library for Multiprogrammed Multiprocessors.” From http://citeseer.ist.psu.edu/blumofe98hood.html.
Blumofe, R. D., C. F. Joerg, et al. (1996). “Cilk: An Efficient Multithreaded Runtime System.” Proceedings of the 5th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 207–216. http://supertech.csail.mit.edu/papers/cilkjpdc96.pdf.
Boehm, H. (2006, June). “An Atomic Operations Library for C++.” C++ standards committee document N2047.
Butenhof, D. R. (1997). Programming with POSIX Threads. Reading, MA: Addison-Wesley.
Flynn, M. J. (1972, September). “Some Computer Organizations and Their Effectiveness.” IEEE Transactions on Computers, C-21, 9, 948–960.
Garcia, R., ...