Project on MapReduce for the Μ111 - Big Data Management course, NKUA, Spring 2023.
-
Updated
Jul 21, 2023 - TeX
Project on MapReduce for the Μ111 - Big Data Management course, NKUA, Spring 2023.
Glue Data Quality Example - Deploy to your AWS Account w/ Terraform to test
Managing large data sets projects (Data Science)
Upstream classifier image preprocessing
Daily scraps the data from rpi-imager-stats
Data Engineering project on how to build Data Lake on S3 using Chicago Taxi Dataset
create files which formats are like "orc", "parquet", "xlsx", "json" and so on with Python
Udacity Data Engeneering Nanodegree Program - My Submission of Project: Data Lake
Simple utility package to convert EDF/EDF+ files into Apache Parquet format.
Proyecto Integrador: Big Data | Bootcamp Henry: Carrera Data Science | Cohorte DataFT 17
A summative coursework for CSC8101 Engineering for AI
Academic Machine Learning (6 months) Sessional Project
ECE NTUA Assignment
FegTec é uma empresa fictícia que quer transferir arquivos parquet contendo dados dos clientes da nuvem AWS para a Google Cloud
Practice of Python skill
NOAA data pipeline, queryable from the browser
Little demo project on how to read parquet files using the Avro libraries
Read music app sparkify data from s3 and perform transformation in Spark and save the results into Parquet files.
Data Engineering Nano Degree Capstone Project
Add a description, image, and links to the parquet-files topic page so that developers can more easily learn about it.
To associate your repository with the parquet-files topic, visit your repo's landing page and select "manage topics."