datacleaning

Star

Here are 979 public repositories matching this topic...

great-expectations / great_expectations

Star

Always know what to expect from your data.

Updated May 31, 2024
Python

OpenRefine / OpenRefine

Sponsor

Star

OpenRefine is a free, open source power tool for working with messy data and improving it

java data-science reconciliation wikidata opendata journalism data-analysis data-wrangling datamining datajournalism datacleaning datacleansing

Updated May 31, 2024
Java

muditprakash / Solvency

Star

As the name suggests, this application helps banks decide whether a loan should be sanctioned by assessing various factors from the borrower's profile.

flask grid numpy pandas seaborn xgboost matplotlib datacleaning gridsearchcv explorat

Updated May 30, 2024
Jupyter Notebook

Ackson507 / Visual-Analytics-Projects

Star

This involves representing data graphically through charts, graphs, maps, and other visual elements. Interactive dashboards and reports that allows users to ask ad-hoc questions, test hypotheses, and gain deeper insights by engaging with the visual data directly.

dashboard excel powerbi datavisualization datacleaning

Updated May 30, 2024

wilferalexander / Covid19

Star

BootCamp del profe alejo trabajaremos el problema de covid

python data machine-learning numpy seaborn data-analysis mat visu datacleaning ciencia-de-dados

Updated May 29, 2024
Jupyter Notebook

Ryan-Iacovone / SQL-Practice-R

Star

Practice SQL querying and statements!

r sql datacleaning

Updated May 29, 2024

DataKitchen / data-observability-installer

Star

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

Updated May 28, 2024
Python

kashifmughal05 / Flight_Price_Prediction_using_ML

Star

In this notebook, I have done Data Cleaning, Data Wrangling, EDA and Feature Engineering. After that I trained the dataset using Machine Learning Algorithm Random Forest Regressor.

data-science machine-learning random-forest exploratory-data-analysis machine-learning-algorithms ml eda prediction datawrangling predictive-modeling datacleaning random-forest-regression

Updated May 28, 2024
Jupyter Notebook

Hariharann16 / -Employee-details-Dashboard-for-HR

Star

This dashboard includes interactive visualizations and reports on employee attendance, preferred working modes, and utilization of work-from-home and sick leave policies, demonstrating the impact on strategic decision-making and employee wellbeing

dashboard powerbi datavisualization dataanalysis datacleaning datatransformation

Updated May 28, 2024

riasingh16 / Data-Cleaning-and-EDA-Project

Star

Data Cleaning and Exploratory Data Analysis with SQL in MySQL Workbench

sql exploratory-data-analysis datascience cte joins datacleaning dataanalytics mys

Updated May 28, 2024

PiyushChaudhari99 / Customer_Segmentation_Analysis

Star

This repository build to showcase a project on customer personality analysis. In this I have classified the customer into two clusters.

python data-science machine-learning clustering scikit-learn jupyter-notebook data-visualization model-selection datacleaning streamlit project-repository modeldeployment

Updated May 27, 2024
Jupyter Notebook

Hariharann16 / -HR-Analytics-Dashboard-Development-using-Power-B

Star

Developed a comprehensive HR analytics dashboard using Power BI. Addressed the challenges faced by HR professionals in volatile market conditions, including fluctuating salaries, employee retention, and recruitment. Integrated real-time data visualization and insights to aid decision-making processes.

dashboard data-visualization data-analysis powerbi datacleaning

Updated May 26, 2024

Livingston-k / cleanPyData

Star

cleanPyData is a Python package for data cleaning and preprocessing. It handles missing values, normalizes data, extracts features, and detects outliers, making your data ready for analysis or machine learning.

python package data-science machine-learning pypi pip datacleaning kaddulivingstone cleanpydata cleanpydata-package

Updated May 25, 2024
Python

saahen-sriyan-mishra / Python-Data-Analysis

Star

Here I conducted EDA on a diverse datasets, including movies, sales, and gaming data. Did data cleaning, visualization, and interpretation using libraries like pandas, NumPy, Matplotlib, and Seaborn to extract actionable insights for informed decision-making processes.

data-science exploratory-data-analysis data-visualisation dataanalysis datacleaning datafiltering

Updated May 25, 2024
Jupyter Notebook

Mbasha36 / Vrinda_Store

Star

Vrinda Store

data-science data excel data-visualization data-structures powerbi datacleaning datamodeling

Updated May 24, 2024

rm-rimsha / CustomerChurn-DataAnalysis-and-PredictionModel

Star

This project revolves around the analysis and prediction of customer churn using a dataset sourced from Kaggle. The dataset underwent thorough cleaning in Python to address missing values, invalid data types, and normalization tasks. The analysis was subsequently visualized using Power BI. Lastly, a Random Forest model was employed for prediction,

python numpy sklearn pandas powerbi prediction-model dataanalysis datacleaning randomforestclassifier powerbi-dashboards