1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
Updated
May 23, 2024 - Python
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
PySpark-Tutorial provides basic algorithms using PySpark
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
A Data Analysis Board in Vue.
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
Powerful & Easy way for big data discovery
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.
This is about learning courses in Coursera. All the answers given written by myself
Big data projects implemented by Maniram yadav
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
ARAKAT - Big Data Analysis and Business Intelligence Application Development Platform
open source tools for interaction with IBM PAIRS:
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
Visual, interactive queries against big databases
This project analyses and correlates student performance with different attributes. Then at last, it determines most suitable algorithm from bunch of them.
Add a description, image, and links to the big-data-analytics topic page so that developers can more easily learn about it.
To associate your repository with the big-data-analytics topic, visit your repo's landing page and select "manage topics."