×


 x 

Shopping cart
Ilya Ganelin - Spark: Big Data Cluster Computing in Production - 9781119254010 - V9781119254010
Stock image for illustration purposes only - book cover, edition or condition may vary.

Spark: Big Data Cluster Computing in Production

€ 50.99
€ 50.85
FREE Delivery in Ireland
Description for Spark: Big Data Cluster Computing in Production Paperback. Num Pages: 260 pages. BIC Classification: UK. Category: (P) Professional & Vocational. Weight in Grams: 666.
Production-targeted Spark guidance with real-world use cases

Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance toward using lightning-fast big-data clustering in production. Written by an expert team well-known in the big data community, this book walks you through the challenges in moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, ML Lib, YARN, and Mesos, with ... Read more

Spark has become the tool of choice for many Big Data problems, with more active contributors than any other Apache Software project. General introductory books abound, but this book is the first to provide deep insight and real-world advice on using Spark in production. Specific guidance, expert tips, and invaluable foresight make this guide an incredibly useful resource for real production settings.

  • Review Spark hardware requirements and estimate cluster size
  • Gain insight from real-world production use cases
  • Tighten security, schedule resources, and fine-tune performance
  • Overcome common problems encountered using Spark in production

Spark works with other big data tools including MapReduce and Hadoop, and uses languages you already know like Java, Scala, Python, and R. Lightning speed makes Spark too good to pass up, but understanding limitations and challenges in advance goes a long way toward easing actual production implementation. Spark: Big Data Cluster Computing in Production tells you everything you need to know, with real-world production insight and expert guidance, tips, and tricks.

Show Less

Product Details

Publisher
John Wiley & Sons Inc
Format
Paperback
Publication date
2016
Condition
New
Weight
381g
Number of Pages
216
Place of Publication
New York, United States
ISBN
9781119254010
SKU
V9781119254010
Shipping Time
Usually ships in 7 to 11 working days
Ref
99-15

About Ilya Ganelin
Ilya Ganelin is a data engineer working at Capital One Data Innovation Lab. Ilya is an active contributor to the core components of Apache Spark and a committer to Apache Apex. Ema Orhian is a Big Data Engineer interested in scaling algorithms. She is the main committer on jaws-spark-sql-rest, a data warehouse explorer on top of Spark SQL. ... Read more

Reviews for Spark: Big Data Cluster Computing in Production

Goodreads reviews for Spark: Big Data Cluster Computing in Production


Subscribe to our newsletter

News on special offers, signed editions & more!