Exploratory Data Analysis

In statistics, exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task

Techniques for performing EDA

Some of the most common methods for performing the exploratory data analysis are given below. Definitely there are a lot of other methods.

A - Univariate and bivariate analysis
B - Missing value analysis
C - Outlier detection analysis
D - Percentile based outlier removal
E - Correlation analysis
F - Covariance analysis

This is one example result after the exploratory data analysis on a sample dataset

Proposed methods

Here I am discussing mainly four advanced python libraries for performing the exploratory data analysis. They are

1 - pandas-profiling
2 - streamlit
3 - sweetviz
4 - wordcloud

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
pandas-profiling		pandas-profiling
streamlit		streamlit
sweetviz		sweetviz
word cloud		word cloud
README.md		README.md
eda.png		eda.png
pairplot.ipynb		pairplot.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pandas-profiling

pandas-profiling

streamlit

streamlit

sweetviz

sweetviz

word cloud

word cloud

README.md

README.md

eda.png

eda.png

pairplot.ipynb

pairplot.ipynb

Repository files navigation

Exploratory Data Analysis

Techniques for performing EDA

Proposed methods

About

Releases

Packages

Languages

pradeepdev-1995/EDA-Methods

Folders and files

Latest commit

History

Repository files navigation

Exploratory Data Analysis

Techniques for performing EDA

Proposed methods

About

Topics

Resources

Stars

Watchers

Forks

Languages