💜🌈📊 A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Docker 🌺
Updated Jun 7, 2024 - Jupyter Notebook
A Python script extracts data from Zillow and stores it in an initial S3 bucket. Lambda functions then handle the flow: copying the data to a processing bucket and transforming it from JSON to CSV. The final CSV data lands in another S3 bucket, ready to be loaded into Amazon Redshift for in-depth analysis, with QuickSight for visualizations.
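The Lambda transformation step in that flow can be sketched as a pure stdlib function. This is a minimal sketch, not the project's actual handler: the record fields are hypothetical, and real Zillow payloads would need field selection and flattening of nested values.

```python
import csv
import io
import json


def json_to_csv(json_payload: str) -> str:
    """Convert a JSON array of flat records into CSV text.

    Simplified stand-in for the Lambda transform step; assumes each
    record shares the keys of the first one.
    """
    records = json.loads(json_payload)
    if not records:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(records[0].keys()))
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Two hypothetical listing records standing in for the scraped payload.
payload = json.dumps([
    {"zpid": "1", "price": 450000, "city": "Seattle"},
    {"zpid": "2", "price": 320000, "city": "Austin"},
])
csv_text = json_to_csv(payload)
```

In a real Lambda handler this function would sit between a boto3 `get_object` read from the processing bucket and a `put_object` write to the destination bucket.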
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
ML pipeline to categorize emergency messages based on the needs communicated by the sender.
Data engineering project implementing a customer suppression/quarantine table using PySpark, Spark SQL, Pandas, and APIs on Databricks.
This process illustrates how to structure and manipulate relational databases effectively, demonstrating key SQL operations and transformations within an Informatica environment. The provided images and detailed SQL commands serve as a comprehensive guide for implementing and understanding these database management tasks.
Aids for the public as a web app.
A CLI tool for transforming large RDF datasets using pure SPARQL.
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
One ETL tool to rule them all
A framework for writing Unstract Tools/Apps
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
Move your data with ease.
With the help of this repository you can evaluate my professional level. Everything I know about Data Engineering is stored here.
Crypto scraping project built around an ETL process: data is scraped from a website, transformed, and loaded into a SQL Server database.
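A scrape-transform-load flow of that shape can be sketched as follows. The record fields and table name are hypothetical, the extract step is stubbed with sample data, and sqlite3 stands in for SQL Server to keep the example self-contained; a real pipeline would fetch and parse the site's HTML and load through a SQL Server driver such as pyodbc.

```python
import sqlite3


def transform(raw_rows):
    """Normalize scraped rows: strip currency formatting, cast to float."""
    cleaned = []
    for row in raw_rows:
        cleaned.append({
            "symbol": row["symbol"].strip().upper(),
            "price_usd": float(row["price"].replace("$", "").replace(",", "")),
        })
    return cleaned


def load(rows, conn):
    """Insert transformed rows into a prices table, creating it if needed."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS crypto_prices (symbol TEXT, price_usd REAL)"
    )
    conn.executemany(
        "INSERT INTO crypto_prices (symbol, price_usd) "
        "VALUES (:symbol, :price_usd)",
        rows,
    )
    conn.commit()


# Extract step stubbed with sample rows; a real scraper would fetch them.
raw = [
    {"symbol": " btc", "price": "$64,250.10"},
    {"symbol": "eth", "price": "$3,100.00"},
]
conn = sqlite3.connect(":memory:")
load(transform(raw), conn)
```

Keeping the transform a pure function of its input makes the messy part of the pipeline trivially unit-testable, independent of both the scraper and the database.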
Jayvee is a domain-specific language and runtime for automated processing of data pipelines
Copy data from Azure Blob Storage to Amazon S3 using code. View Azure costs using Amazon QuickSight