Character name scraping for the Harry Potter series, named entity recognition on the Harry Potter books, and network analysis of the characters

ahbis99/harry_potter_network_analysis

Harry Potter Network Analysis

A network and community analysis of the Harry Potter characters, built with Python.

Table of contents

  • General info
  • Technologies
  • Visualizations from project
  • Inspiration

General info

The aim of this project is to measure the importance of characters in the Harry Potter book series and to build network and community analyses from their appearances throughout the books. Analyzing the characters' roles and relationships within the narrative can offer insight into the underlying themes of the books and into the impact each character has on the story as a whole. Network analysis identifies the key players in the story and the ways in which they interact with one another, while community analysis groups characters into clusters based on how closely they are connected.
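As an illustration of how character appearances can become a network, here is a minimal sketch that counts co-occurrences in a sliding window of sentences. It is not the project's actual code: the function name `cooccurrence_edges`, the `window` parameter, and the sample data are hypothetical, and it assumes the character names in each sentence have already been extracted.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_edges(sentence_characters, window=2):
    """Count how often two characters appear within the same sentence window.

    sentence_characters: one list of character names per sentence.
    window: number of consecutive sentences treated as one context
            (an assumed parameter, not taken from the project).
    """
    edges = Counter()
    n_windows = max(len(sentence_characters) - window + 1, 1)
    for start in range(n_windows):
        # Collect every character mentioned in this window of sentences.
        present = set()
        for names in sentence_characters[start:start + window]:
            present.update(names)
        # Each unordered pair in the window adds 1 to that edge's weight.
        for a, b in combinations(sorted(present), 2):
            edges[(a, b)] += 1
    return edges


# Hypothetical sample: names found in four consecutive sentences.
sentences = [["Harry", "Ron"], ["Hermione"], [], ["Harry", "Draco"]]
edges = cooccurrence_edges(sentences, window=2)
for pair, weight in sorted(edges.items()):
    print(pair, weight)
```

Because the windows overlap, a pair that stays close together over several sentences is counted more than once, which is one simple way to give frequently interacting characters heavier edges.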

Technologies

The project is written in Python. The main packages used are:

  • Selenium (Data scraping)
  • pandas (Data analysis and cleaning)
  • NumPy (Data analysis and cleaning)
  • spaCy (Named Entity Recognition - NLP)
  • NetworkX (Network analysis)
  • Matplotlib (Data visualization)
  • Pyvis (Network visualization)
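To show how NetworkX can cover both the importance and the community steps, here is a small sketch on a toy graph. The edge weights are made up for illustration, and degree centrality and greedy modularity maximization are assumed choices; the project may use different measures or a different community detection algorithm.

```python
import networkx as nx
from networkx.algorithms.community import greedy_modularity_communities

# Hypothetical weighted edges (character, character, co-occurrence count).
weighted_edges = [
    ("Harry", "Ron", 5),
    ("Harry", "Hermione", 5),
    ("Ron", "Hermione", 5),
    ("Draco", "Crabbe", 3),
    ("Draco", "Goyle", 3),
    ("Crabbe", "Goyle", 3),
    ("Harry", "Draco", 1),
]

graph = nx.Graph()
graph.add_weighted_edges_from(weighted_edges)

# Degree centrality: fraction of other characters each one is linked to.
# It ignores edge weights.
centrality = nx.degree_centrality(graph)

# Community detection by greedy modularity maximization, using edge weights.
communities = [
    set(group) for group in greedy_modularity_communities(graph, weight="weight")
]

for name, score in sorted(centrality.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: {score:.2f}")
for index, group in enumerate(communities):
    print(index, sorted(group))
```

On this toy graph the two tightly connected trios end up in separate communities, joined only by the weak Harry-Draco edge.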

Visualizations from project

Network visualization of Harry Potter characters

[Figure: harry_potter_network]

Network visualization of Harry Potter characters with communities

[Figure: harry_potter_community]

Inspiration

This project was inspired by a YouTube video from the Thu Vu Data Analytics channel (link to the video).
