Pokedex

The project is to try out web scraping on a cleanly formatted web page. The scraping is done using BeautifulSoup4. The data extracted from the web page is then converted into a pandas dataframe and saved as a .csv file for further analysis.

The project is organized as follows:

The first step is to scrape the pokedex data from a web page and save it to a file
The second step is to do exploratory data analysis to get a feel for the data

Exploring the data

This process is massively simplified by utilizing tools such as ChatGPT and GitHub Copilot. Asking ChatGPT about interesting and relevant things to look for in the data is a perfect start to an exploratory data analysis. GitHub Copilot works really well when you have a grasp of how you want to write your code as it is good at completing your code snippets/comments.
Another new tool that has arrived is "data wrangler" by Microsoft. Data wrangler works like a user interface that lets users explore and manipulate a pandas dataframe without the need to write code. The processing steps are recorded and converted to pandas code that the user then can copy and use.

Data

The data is taken from https://pokemondb.net/pokedex/all and the original table looks like this: Source: https://pokemondb.net/pokedex/all

Install

Make sure to have a virtual environment readym either through conda or pip. After you have activated the virtual environment you can install the dependencies found in the requirements.txt file. To install the necessary libraries, you can run pip install -r requirements.txt or python3 -m pip install -r requirements.txt.

Future Ideas

Add a column legendary, specifying whether or not the Pokémon is a legendary Pokémon.
Add a column generation, specifying which generation the Pokémon was released with.
Add other interesting columns with data

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
images		images
.gitignore		.gitignore
EDA.ipynb		EDA.ipynb
README.md		README.md
chord_diagram.ipynb		chord_diagram.ipynb
pokemon.csv		pokemon.csv
requirements.txt		requirements.txt
scrape_pokemon.ipynb		scrape_pokemon.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

images

images

.gitignore

.gitignore

EDA.ipynb

EDA.ipynb

README.md

README.md

chord_diagram.ipynb

chord_diagram.ipynb

pokemon.csv

pokemon.csv

requirements.txt

requirements.txt

scrape_pokemon.ipynb

scrape_pokemon.ipynb

Repository files navigation

Pokedex

Exploring the data

Data

Install

Future Ideas

About

Releases

Packages

Languages

OscPop/Pokedex

Folders and files

Latest commit

History

Repository files navigation

Pokedex

Exploring the data

Data

Install

Future Ideas

About

Topics

Resources

Stars

Watchers

Forks

Languages