Python package to make URL extraction, generalization, validation, and filtration easy.
-
Updated
Jun 9, 2024 - Python
Python package to make URL extraction, generalization, validation, and filtration easy.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Wrangler Transform: A DMD system for transforming Big Data
Data Science Foundations I | Exploratory Data Analysis in Python | Inspect, Clean, and Validate a Dataset | EDA: Inspect, Clean, and Validate a Dataset
Main Repository
Used SQL, Power BI to make insightful dashboard
Ashley Bythell - Python
This repository houses a curated collection of projects designed to highlight my expertise in data analytics.
Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
📊Customer Personality Analysis, using various Data Mining techniques and Machine Learning algorithms.
Predictive modeling project by implementing KNN regression model.
Interactive Dashboard Web-app :
Analyze exit surveys from the employees of DETE and TAFE institutes in Australia.
Java DSL for (online) deduplication
Two Mixed Integer Programs for cleaning a data file.
Statistical analysis comparing team play in the NBA regular season and playoffs. Linear Regression algorithm to predict players playoffs points per game based on their regular season stats. Collaborated with Stephan MacDougall.
Quizzes & Assignment Solutions for Applied Data Science with Python Specialization on Coursera. Also included a few resources on side that I found helpful.
Add a description, image, and links to the data-cleansing topic page so that developers can more easily learn about it.
To associate your repository with the data-cleansing topic, visit your repo's landing page and select "manage topics."