PARALLEL COMPUTING FOR DATA SCIENCE: WITH EXAMPLES IN R, C++ AND CUDA

PARALLEL COMPUTING FOR DATA SCIENCE: WITH EXAMPLES IN R, C++ AND CUDA

Editorial:
CRC PRESS
Año de edición:
Materia
Lectura
ISBN:
978-1-4665-8701-4
Páginas:
328
N. de edición:
1
Idioma:
Inglés
Disponibilidad:
Disponible en 2-3 semanas

Descuento:

-5%

Antes:

60,00 €

Despues:

57,00 €

• Introduction to Parallel Processing in R
• "Why Is My Program So Slow?": Obstacles to Speed
• Principles of Parallel Loop SchedulingThe Shared Memory Paradigm: A Gentle Introduction through R
• The Shared Memory Paradigm: C Level
• The Shared Memory Paradigm: GPUsThrust and RthThe Message Passing ParadigmMapReduce
• ComputationParallel Sorting and Merging
• Parallel Prefix Scan
• Parallel Matrix OperationsInherently Statistical Approaches: Subset Methods
• Appendix A: Review of Matrix Algebra
• Appendix B: R Quick Start
• Appendix C: Introduction to C for R Programmers

Parallel Computing for Data Science: With Examples in R, C++ and CUDA is one of the first parallel computing books to concentrate exclusively on parallel data structures, algorithms, software tools, and applications in data science. It includes examples not only from the classic "n observations, p variables" matrix format but also from time series, network graph models, and numerous other structures common in data science. The examples illustrate the range of issues encountered in parallel programming.
With the main focus on computation, the book shows how to compute on three types of platforms: multicore systems, clusters, and graphics processing units (GPUs). It also discusses software packages that span more than one type of hardware and can be used from more than one type of programming language. Readers will find that the foundation established in this book will generalize well to other languages, such as Python and Julia.

Features
• Focuses on applications in the data sciences, including statistics, data mining, and machine learning
• Discusses structures common in data science, such as network data models
• Emphasizes general principles throughout, such as avoiding factors that reduce the speed of parallel programs
• Covers the main types of computing platforms: multicore, cluster, and GPU
• Explains how the Thrust package eases the programming of multicore machines and GPUs and enables the same code to be used on either platform
• Provides code for the examples on the author’s web page