News

Demonstration of Python programming with Apache Parquet. The file write_parquet.py is a program that generates a small example Parquet file. It shows the ease of creating Parquet files with Python ...
Extract: ler os arquivos (.json) e concatenar Transform: criar uma nova coluna 'Total' = 'Quantidade' X 'Venda' Load: salvar o novo Data Frame (decidir por 2 caminhos, csv ou parquet) Obs.: para ...
Apache Parquet, which provides columnar storage in Hadoop, is now a top-level Apache Software Foundation (ASF)-sponsored project, paving the way for its more advanced use in the Hadoop ecosystem.
Stefanie Molin's new book, “Hands-On Data Analysis with Pandas" is about using the powerful pandas library to get started with machine learning in Python.