Books [Alan F Gates] Programming Pig

tttx · 18 Фев 2024

DESCRIPTION:

This book is an ideal learning tool and reference for Apache Pig, the programming language that helps you describe and run large data projects on Hadoop. With Pig, you can analyze data without having to create a full-fledged application making it easy for you to experiment with new data sets.

It shows newcomers how to get started, and teaches intermediate users the benefits of using Pig Latin, the data flow language for building and maintaining pipelines for processing data. Advanced users learn how to build complex data processing pipelines with Pig's macros and modularity features, and discover how to build systems for complex data processing needs by embedding Pig Latin into scripting languages.

Learn the advantages and disadvantages of using Pig instead of MapReduce
Understand how Pig fits in with other Hadoop components, such as HDFS, Hive, MapReduce, and HBase
Follow examples that explain built-in Pig Latin functions, and data operators such as join and group
Use grunt, the shell that Pig provides for exploring and working with HDFS
Get performance tuning tips for running Pig Latin scripts on Hadoop clusters in less time
Extend Pig with powerful user defined functions written in Java or Python

About the Authors:

Alan Gates, a member of Yahoo's Pig development team, is responsible for company's implementation of the language, including programming interfaces and the overall design. He has presented Pig at numerous conferences and user groups, universities, and at companies using Pig. Alan oversaw the rewriting of nearly the entire code base when Pig moved from a research project to a production project.

INFORMATION PAGE:

Авторизуйтесь, чтобы посмотреть скрытый контент

DOWNLOAD:

Авторизуйтесь, чтобы посмотреть скрытый контент

Books [Alan F Gates] Programming Pig

tttx

Помощник Администратора

О нас

Полезные ссылки

Наши контакты

Пользователи онлайн