OpenRefine is a free, open source power tool for working with messy data and improving it
-
Updated
May 29, 2024 - Java
OpenRefine is a free, open source power tool for working with messy data and improving it
A Scalable Data Cleaning Library for PySpark.
Data visualisations in Power BI
Examples for Optimus a Data Cleansing Library for Big Data.
-This project targets the textual analysis of Egyptian movie plot summaries that were curated from online sources, covering the four golden decades of Egyptian Cinema.
Table Enforcer is my attempt to apply a sort of "test driven development" workflow to data cleaning and validation. A python package to facilitate the iterative process of developing and using schema-like representations of DataFrames in pandas for recoding and validating instances of these data.
This course by University of Michigan introduces the basics of the python programming environment, including fundamental python programming techniques such as lambdas, reading and manipulating csv files, and the numpy library. The course will also introduces data manipulation and cleaning techniques using python pandas data science library.
GitHub Repo of our Tidyverse workshop organized on Sep 8, 2022
sales_analysis
Data cleaning, analysing in excel and finally creating a dashboard in Tableau as part of the KPMG virtual internship.
A data mining project for data Exploration of Airbnb dataset, for 2019.
we use keras and tensorflow and sklearn to classify health level of student by using Nursey UCI Dataset
My codes and insight based on data provided open source on the internet. I want to provide comprehensive data insight and analysis of Hotel Booking data for a whole year to maximise impact for both the company and the customer. I also develop several machine learning algorithm and do an in depth evaluation of each and every model selected
Advance Guide Of Cleaning & 20+ ways of cleaning data with python
cleaned data from walmart by removing null data, standardizing columns and filled null value with average
Versatile data analyst skilled in extracting actionable insights from complex datasets. Proficient in statistical analysis, data visualization, and trend identification. Proven track record in transforming raw data into strategic business recommendations.
Add a description, image, and links to the datacleansing topic page so that developers can more easily learn about it.
To associate your repository with the datacleansing topic, visit your repo's landing page and select "manage topics."