Annual Revenue Vs. Executive Pay for Recipients of U.S. Federal Funds; uses Scala Spark in Zeppelin notebook.
-
Updated
Sep 15, 2017
Annual Revenue Vs. Executive Pay for Recipients of U.S. Federal Funds; uses Scala Spark in Zeppelin notebook.
Daily scraps the data from rpi-imager-stats
A simple toolkit to transform datasource generate by img2dataset from parquet file to Huggingface dataset.
Example usage of python with parquet for IoT data storage and analytics
Dresser-le-procureur.com
a tiny sample about how to build a parquet file from whatever you need.
Project involved the development of a data pipeline using airflow and python. The data pipeline ingested trending movies' and distributors' data from imdb and box office, cleansed, formatted, combined and indexed the data on elastic search. Also, a dashboard was created from the data using kibana analytics. The tools and libraries used in this p…
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .
Add a description, image, and links to the parquet topic page so that developers can more easily learn about it.
To associate your repository with the parquet topic, visit your repo's landing page and select "manage topics."