PySpark SQL Recipes: With HiveQL, Dataframe and Graphframes

PySpark SQL Recipes: With HiveQL, Dataframe and Graphframes

Raju Kumar Mishra, Sundar Rajan Raman
5.0 / 5.0
1 comment
이 책이 얼마나 마음에 드셨습니까?
파일의 품질이 어떻습니까?
책의 품질을 평가하시려면 책을 다운로드하시기 바랍니다
다운로드된 파일들의 품질이 어떻습니까?

Carry out data analysis with PySpark SQL, graphframes, and graph data processing using a problem-solution approach. This book provides solutions to problems related to dataframes, data manipulation summarization, and exploratory analysis. You will improve your skills in graph data analysis using graphframes and see how to optimize your PySpark SQL code.

PySpark SQL Recipes starts with recipes on creating dataframes from different types of data source, data aggregation and summarization, and exploratory data analysis using PySpark SQL. You’ll also discover how to solve problems in graph analysis using graphframes.

On completing this book, you’ll have ready-made code for all your PySpark SQL tasks, including creating dataframes using data from different file formats as well as from SQL or NoSQL databases.

What You Will Learn

  • Understand PySpark SQL and its advanced features

  • Use SQL and HiveQL with PySpark SQL

  • Work with structured streaming

  • Optimize PySpark SQL 

  • Master graphframes and graph processing

Who This Book Is For

Data scientists, Python programmers, and SQL programmers.

카테고리:
년:
2019
출판사:
Apress
언어:
english
ISBN 10:
148424334X
ISBN 13:
9781484243343
파일:
PDF, 4.60 MB
IPFS:
CID , CID Blake2b
english, 2019
이 도서의 다운로드는 권리 소유자의 요구에 따라 불가합니다

Beware of he who would deny you access to information, for in his heart he dreams himself your master

Pravin Lal

주로 사용되는 용어