Skip to content

A Exploratory Data Analysis of the result of the prestigious Smart India Hackathon 2023 using the data scraped from the official website

Notifications You must be signed in to change notification settings

DuanBoomer/Smart-India-Hackathon-Result-Analysis

Repository files navigation

Analysis of the Smart India Hackathon result 2023

The data used for this analysis is scrapped from the official SIH website and is publicly available here github link

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

just some default configurations to make the outputs look pretty

plt.style.use('ggplot')
sns.set_palette('cool')
sns.set(rc={"figure.figsize":(12, 7)})
sns.set_context("paper", font_scale=1.5, rc={"lines.linewidth": 2.5})
pd.set_option('display.max_columns', 200)

Getting the Dataset

df = pd.read_excel(r'sih-2023-result.xlsx', index_col = 0)
df
ORGANISATION PSID PS CATEGORY TEAM ID IDEA ID TEAM NAME TEAM LEADER NAME COLLEGE ID COLLEGE NODAL CENTER Winning Status Prize Money
S.NO.
1 AICTE SIH1458 Software 27179 50141 Swasthya Sahayak PRAVEEN KUSHWAHA 158737 AJAY KUMAR GARG ENGINEERING COLLEGE, GHAZIABAD... O P Jindal University, Raigarh,Chhattisgarh,Ra... Winner 100000
2 AICTE SIH1459 Software 37933 48853 Team.Phoenix HARSH DHARIWAL 103534 JAYPEE INSTITUTE OF INFORMATION TECHNOLOGY, NO... O P Jindal University, Raigarh,Chhattisgarh,Ra... Winner 100000
3 AICTE SIH1460 Software 16793 47672 Bit Lords V1 TANMAI KIRAN KAMAT 139772 SHRI VILE PARLE KELAVANI MANDALS DWARKADAS J. ... O P Jindal University, Raigarh,Chhattisgarh,Ra... Winner 100000
4 AICTE SIH1461 Software 15122 18545 AssetSentinels PRATHAM POOJARI 139723 MAHAVIR EDUCATION TRUSTS SHAH AND ANCHOR KUTCH... O P Jindal University, Raigarh,Chhattisgarh,Ra... Winner 100000
5 AICTE SIH1463 Software 28137 33801 Sane No More SNEHA JANARTHANAN 150210 SRI KRISHNA COLLEGE OF ENGINEERING AND TECHNOL... O P Jindal University, Raigarh,Chhattisgarh,Ra... Winner 100000
... ... ... ... ... ... ... ... ... ... ... ... ...
271 Open Innovation 0 Software 298 4826 CodeElixir_T187 AGNISHA BHATTA 164138 TECHNO INTERNATIONAL NEW TOWN Veer Surendra Sai University of Technology, Od... Second Runnerup 50000
272 AICTE, MIC-Student Innovation SIH1484 Software 6451 46658 SeedShare APURVA NARAYAN 149873 SRI SIVASUBRAMANIYA NADAR COLLEGE OF ENGINEERI... Prasad V Potluri Siddhartha Institute of Techn... AWS Winner 50000
273 AICTE, MIC-Student Innovation SIH1486 Software 11415 15645 Destination Designer GAURAV TIWARI 143391 NATIONAL INSTITUTE OF SCIENCE AND TECHNOLOGY N... Rungta College of Engineering and Technology, ... AWS Winner 50000
274 AICTE, MIC-Student Innovation SIH1488 Software 15372 21270 CodeCopter ADITYA HAKANI 109898 3025-SHRI BHAGUBHAI MAFATLAL POLYTECHNIC, MUMB... Sreenidhi Institute of Science & Technolog... AWS Winner 50000
275 AICTE, MIC-Student Innovation SIH1493 Software 18569 11066 Grand Line SHREYAS M MURAGODMATH 102942 KLE TECHNOLOGICAL UNIVERSITY,KARNATAKA,DHARWAD GMR Institute Of Technology,Andhra Pradesh,Rajam AWS Winner 50000

275 rows × 12 columns

df.nunique()
ORGANISATION         36
PSID                228
PS CATEGORY           2
TEAM ID             275
IDEA ID             275
TEAM NAME           275
TEAM LEADER NAME    273
COLLEGE ID          201
COLLEGE             213
NODAL CENTER         48
Winning Status        7
Prize Money           5
dtype: int64

The Dataset itself is clean so we don't need to much preprocessing in it.

We will directly jump to exploratory data analysis (EDA)

What is the total money that was given out to the winners?

import locale
locale.setlocale(locale.LC_MONETARY, 'en_IN')

val = df['Prize Money'].sum()
print(locale.currency(val, grouping=True))
₹ 2,27,10,000.00

More than 2 crores rupees were spent just on prizes

What is the distribution of problem statements into the two categories?

sns.countplot(x = df['PS CATEGORY'], palette='flare').set_title("Problem Statement Categories");

png

The software solutions outnumber hardware solutions by more than double.

Looks like the government requires more software solutions then hardware.

What is the average distribution of problem statements in nodal centres??

sns.boxplot(list(df.groupby(by='NODAL CENTER')['PSID'].count()), palette='flare').set(xticklabels=[], title="Average problem statments in nodal centers", ylabel="number of problems");

png

On average 5-6 problem statements are assigned to each nodal center

Which orgs has the most problem statements?

fig, axes = plt.subplots(figsize = (7,10))
vals = df['ORGANISATION'].value_counts()
axes = sns.barplot(y = vals.index, x=vals.values, palette='flare').set(xlabel="problem statements", title="Problems by each ministry")

png

As expected AICTE has the most number of problem statements because of there student innovation track in which the students can submit any of there idea. They are followed by Ministry of Power and Jal Shakti.

What are the top 6 orgs which gave away the most money??

vals = df[['ORGANISATION', 'Prize Money']].groupby(by='ORGANISATION')['Prize Money'].sum().sort_values(ascending=False)[:6]
sns.barplot(y=vals.index, x=vals.values, palette='flare').set(xlabel="capital (in lakhs)", ylabel="", title="The top 6 ministries in expenditure");

png

The number of problem statements of a ministry directly correlate with the expenditure on prizes meaning more problems, more money.

or was it more money, more problems 😉

Which colleges had the most participants??

import textwrap
def wrap_labels(ax, width, break_long_words=False):
    labels = []
    for label in ax.get_yticklabels():
        text = label.get_text()
        labels.append(textwrap.fill(text, width=width,
                      break_long_words=break_long_words))
    ax.set_yticklabels(labels, rotation=0)
    return ax
vals = df['COLLEGE'].value_counts()[:6]
ax = sns.barplot(y=vals.index, x=vals.values, palette='flare')
ax = wrap_labels(ax, 50)
ax.set(title='Teams from each college', xlabel='number of teams');

png

A wopping 10 teams were sent by BRACTVIT, Pune.

Do Nodal Centres also participate in the hackathon??

if len(df.query('COLLEGE == `NODAL CENTER`')) > 0:
    print("Yeah nodal centres also participate")
else:
    print("No nodal centres don't participate")
No nodal centres don't participate

What is the ranking of the Nodal centres based on the number of problem statements they hosted?

print('Problem Statements in each nodel centre')
vals = pd.DataFrame(df['NODAL CENTER'].value_counts())
vals.index.names = ['NODAL CENTRES']
vals.rename(columns = {'NODAL CENTER':'PROBLEM STATEMENTS'}, inplace = True)
vals
Problem Statements in each nodel centre
PROBLEM STATEMENTS
NODAL CENTRES
Lovely Professional University,Punjab,Jalandhar 10
Nalla Malla Reddy Engineering College,Telangana,Hyderabad 9
Gujarat Technological University, Ahmedabad,Gujarat,Ahmedabad 8
P.S.N.A. College of Engineering and Technology,Tamil Nadu,Dindigul 7
Sreenidhi Institute of Science & Technology,Telangana,Hyderabad 7
The National Institute of Engineering,Karnataka,Mysuru 7
Anand Institute of Higher Technology,Tamil Nadu,Chennai 7
Galgotias University ,Uttar Pradesh,Greater Noida 7
Chandigarh Engineering College-CGC Landran,Punjab,Mohali 7
Amity University Uttar Pradesh, Noida,Uttar Pradesh,Noida 7
Prasad V Potluri Siddhartha Institute of Technology,Andhra Pradesh,Vijayawada 6
Oriental Institute of Science and Technology,Madhya Pradesh,Bhopal 6
Sri Venkateswara College of Engineering and Technology ,Andhra Pradesh,Chittoor 6
Techno Main Salt Lake,West Bengal,Kolkata 6
Coimbatore Innovation and Business Incubator (Forge),Tamil Nadu,Coimbatore 6
Noida Institute of Engineering and Technology, Greater Noida,Uttar Pradesh,Greater Noida 6
Rungta College of Engineering and Technology, Bhilai,Chhattisgarh,Bhilai 6
Aditya Engineering College,Andhra Pradesh,Surampalem 6
IES College of Technology,Madhya Pradesh,Bhopal 6
New Horizon College of Engineering, Bangalore,Karnataka,Bangalore 6
G H Raisoni College of Engineering,Maharashtra,Nagpur 6
VNR Vignana Jyothi Institute of Engineering & Technology,Telangana,Hyderabad 6
Manav Rachna International Institute of Research and Studies,Haryana,Faridabad 6
Poornima Institute of Engineering & Technology,Rajasthan,Jaipur 6
Vidyavardhaka College of Engineering,Karnataka,Mysuru 6
GMR Institute Of Technology,Andhra Pradesh,Rajam 6
Lakshmi Narain College of Technology,Madhya Pradesh,Bhopal 6
The Assam Royal Global University,Assam,Guwahati 5
GIET University, Gunupur, Odisha,Odisha,Gunupur 5
St. Joseph'S College of Engineering,Tamil Nadu,Chennai 5
Techno India NJR Institute of Technology,Rajasthan,Udaipur 5
O P Jindal University, Raigarh,Chhattisgarh,Raigarh 5
Kolhapur Institute of Technology'S College of Engineering (Autonomous), Kolhapur,Maharashtra,Kolhapur 5
Vignana Bharathi Institute of Technology,Telangana,Hyderabad 5
QIS College of Engineering and Technology,Andhra Pradesh,Ongole 5
P. R. Pote Patil College of Engineering & Management, Amravati,Maharashtra,Amravati 5
Chandigarh Engineering College, Jhanjeri, Mohali,Punjab,Mohali 5
Manipal University Jaipur,Rajasthan,Jaipur 5
MIT Art, Design and Technology University, Pune,Maharashtra,Pune 5
Bhilai Institute of Technology, Durg,Chhattisgarh,Durg 5
Techno India Unversity,West Bengal,Kolkata 4
Prin L. N. Welingkar Institute of Management Development & Research (PGDM),Maharashtra,Mumbai 4
Shobhit Institute of Engineering and Technology ,Uttar Pradesh,Meerut 4
C V Raman Global University,Odisha,Bhubaneswar 4
Swami Keshvanand Institute of Technology, Management & Gramothan,Rajasthan,Jaipur 4
Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal,Karnataka,Manipal 4
Chennai Institute of Technology,Tamil Nadu,Chennai 4
Veer Surendra Sai University of Technology, Odisha, Sambalpur 4


PyPI Downloads

About

A Exploratory Data Analysis of the result of the prestigious Smart India Hackathon 2023 using the data scraped from the official website

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published