The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
-
Updated
May 31, 2024 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
ODK Web Forms enables form filling and submission editing of ODK forms in a web browser. It's coming soon! ✨
A web scraper for TikTok using Playwright
This repository contains the project materials for optimising e-commerce conversion rates through comprehensive data analysis. Leveraging SQL, MySQL, Power BI, and other tools, explore key factors influencing website performance. From data collection to actionable insights and recommendations,
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
Georegistry + Data Collection + Microplanning
Unified and privacy-centric event data collection for digital analytics
The open source high performance ELT framework powered by Apache Arrow
70+ CLI tools to build, browse, and blend your media library. An index for your archive.
This repository contains a collection of data science projects which I did during the IBM Data Science Professional certification programme. Each project demonstrates different aspects of data science, data analysis, data visualization and machine learning.
ODK Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments around the world. Contribute and make the world a better place! ✨📋✨
SDK for XIA LLC's Pixie products
Fast and differentiable particle accelerator optics simulation for reinforcement learning and optimisation applications.
News Collector from AWS/Azure/GCP
Image Classification, Object Detection, Image Segmentation, Instance Segmentation and Pose Estimation
⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink
Song Describer is a data collection platform for annotating music with textual descriptions.
The core library that many of the ODK tools are built around. It's written in Java, implements the ODK XForms spec, and runs on mobile devices and cloud servers. ✨🏗✨
Add a description, image, and links to the data-collection topic page so that developers can more easily learn about it.
To associate your repository with the data-collection topic, visit your repo's landing page and select "manage topics."