Distributed DataFrame for Python designed for the cloud, powered by Rust
Updated May 28, 2024 - Rust
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
ClickHouse® is a real-time analytics DBMS
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Data-Centric Pipelines and Data Versioning
oneAPI Data Analytics Library (oneDAL)
SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.
Postgres for Search and Analytics
The Open Source Feature Store for Machine Learning
Arkime is an open source, large-scale, full packet capturing, indexing, and database system.
An open source time-series database for fast ingest and SQL queries
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
Low-code tool for automating actions on real-time data | Stream processing for the users.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)