Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
Updated
May 31, 2024 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
A plugin for Apache Airflow that allows you to edit DAGs in browser
Orchestrate your Databricks notebooks in Airflow and execute them as Databricks Workflows
Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.
Workflow Engine for Kubernetes
AWS Summit 2022 ASEAN --- COM203 Using IaC with Terraform to provision Big Data Platform on Amazon EMR
Production Grade Terraform for Provisioning Infrastructure
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Build a data pipeline that scrapes data from a website, processes it, and stores in database ready for analysis.
This is the process of automatically extracting data from websites. This can include text, images, and other media types from various web pages.
Data Science and Web Development Playground
Fast iterative local development and testing of Apache Airflow workflows
Collaborative and hybrid recommendation systems
Add a description, image, and links to the airflow topic page so that developers can more easily learn about it.
To associate your repository with the airflow topic, visit your repo's landing page and select "manage topics."