Whatever we have done in this project, we did it together! A strong team makes a successful project, after all!
Hi, I am Shibam Roy, a 15-year-old school student with knowledge of C++, Python, data science, and some basic DSA.
Hello, I am Ankush Roy, a 15-year-old student with vast knowledge of mathematics and C++.
Hello, I'm Swadhin Maharana, a passionate programmer, quick learner, and aspiring entrepreneur. Highly skilled in problem-solving and ready to create innovative solutions.
Frontend developer, a quick learner, with a passion for problem-solving, a keen interest in ML, and a data science enthusiast.
We took a simple approach to the problem: think it through properly, then follow a series of steps and phases to complete the project.
We started by planning how to work on the data. This phase also included forming our team and assigning a role to each team member.
We explored the data to check its columns, its length, and other basic features of the dataset. In this phase we also checked whether the dataset has null values and, if so, how many. We also looked at the occurrences of each type of value in a column; for example, medal_type consists of Bronze, Silver, and Gold.
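The checks described above can be sketched in a few lines of plain Python. This is only an illustration, not our actual notebook code: the sample rows are made up, and the helper names null_counts and value_counts are hypothetical. Only the column names come from the dataset.

```python
from collections import Counter

# Hypothetical sample rows; only the column names come from the dataset.
rows = [
    {"country_name": "USA", "medal_type": "Gold", "participant_title": None},
    {"country_name": "USA", "medal_type": "Silver", "participant_title": "Team A"},
    {"country_name": "India", "medal_type": "Bronze", "participant_title": None},
    {"country_name": "Japan", "medal_type": "Gold", "participant_title": None},
]


def null_counts(rows):
    """Count missing (None or empty string) values per column."""
    counts = Counter()
    for row in rows:
        for column, value in row.items():
            if value is None or value == "":
                counts[column] += 1
    return dict(counts)


def value_counts(rows, column):
    """Count how often each non-missing value occurs in one column."""
    return Counter(
        row[column] for row in rows if row[column] not in (None, "")
    )


print("rows:", len(rows))
print("columns:", list(rows[0]))
print("nulls per column:", null_counts(rows))
print("medal_type values:", dict(value_counts(rows, "medal_type")))
```

Columns with no missing values simply do not appear in the null count, which makes the problem columns easy to spot.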
We started by cleaning all the null data points in the dataset; most of the null values were found in the participant_title column. We filled as many as possible using the athletes_url column, as it contained the names of the athletes. We filled the remaining values with country_name or, where that was not possible, dropped them.
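A minimal sketch of this fill order (athlete URL first, then country name, otherwise drop) might look like the following. The URL layout, with the athlete's name as the last hyphen-separated path segment, is an assumption made for illustration, and the helper names name_from_url and fill_titles are hypothetical.

```python
from urllib.parse import urlparse


def name_from_url(url):
    """Guess an athlete name from the last path segment of a URL.

    Assumes a layout like https://example.org/athletes/jane-doe, which
    is a made-up format used only for this sketch.
    """
    if not url:
        return None
    slug = urlparse(url).path.rstrip("/").split("/")[-1]
    return slug.replace("-", " ").title() or None


def fill_titles(rows):
    """Fill participant_title from the URL, then the country, else drop."""
    cleaned = []
    for row in rows:
        title = (
            row.get("participant_title")
            or name_from_url(row.get("athletes_url"))
            or row.get("country_name")
        )
        if title:  # rows that still have no title are dropped
            cleaned.append({**row, "participant_title": title})
    return cleaned


# Hypothetical rows covering each case.
sample = [
    {"participant_title": None,
     "athletes_url": "https://example.org/athletes/jane-doe",
     "country_name": "USA"},
    {"participant_title": None, "athletes_url": None, "country_name": "India"},
    {"participant_title": None, "athletes_url": None, "country_name": None},
]

for row in fill_titles(sample):
    print(row["participant_title"])
```

The third sample row has neither a URL nor a country name, so it is left out of the result.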
In this phase we analyzed the data by various means, such as correlation matrices and other visualizations. Besides visualization techniques, we ran different queries on the data, which helped us find various insights. We found very helpful insights, such as the dominance of the USA and the strong correlation of continent/country position with medals.
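To show what a correlation matrix measures, here is a small standard-library sketch of the Pearson correlation between every pair of numeric columns. The numbers are invented for illustration and are not taken from our dataset; the function names pearson and correlation_matrix are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined for a constant column
    return cov / sqrt(var_x * var_y)


def correlation_matrix(columns):
    """Map each pair of column names to its Pearson correlation."""
    names = list(columns)
    return {
        a: {b: pearson(columns[a], columns[b]) for b in names}
        for a in names
    }


# Made-up numbers purely for illustration.
columns = {
    "population_millions": [10, 20, 30, 40],
    "medals": [2, 4, 6, 8],
    "rank": [4, 3, 2, 1],
}

matrix = correlation_matrix(columns)
for name, row in matrix.items():
    print(name, {other: round(value, 2) for other, value in row.items()})
```

Values close to 1 or -1 indicate a strong linear relationship, and values near 0 indicate a weak one.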
In this phase, after the analysis, we made powerful visualizations that depict the given dataset clearly. We plotted a few graphs that show some insights into the data.
After all our work, and with our findings in hand, we trained a machine learning model using the Random Forest Regressor algorithm. The trained model isn't very accurate, but it gives a basic idea of the factors on which a country's success in the Olympics depends. However, due to budget constraints, we couldn't run this model on this website (it's a static website).
After all this hard work, we gained a great deal of experience and learned to work together as a strong team. Thanks to GeeksForGeeks, we were able to make this great project!
We were able to find various things in the data that can help us identify a country's rate of success. Here are the insights we found from the given dataset and from some external data.
For this project we used the dataset provided by GeeksForGeeks, and for further exploration we scraped data from the internet. We also used world population data from Kaggle.
To understand and represent our data visually, we made multiple attractive graphs. Here are the graphs (and one correlation matrix):
Contains the top 10 countries by the number of Olympic medals they received.
This contains the correlation between multiple features of our data.
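The ranking behind the top 10 countries graph can be reproduced with a simple tally. The sketch below uses invented rows and a hypothetical helper named top_countries; it counts one medal per row, which is an assumption about how the dataset is laid out.

```python
from collections import Counter


def top_countries(rows, n=10):
    """Return the n countries with the most medal rows, highest first."""
    tally = Counter(
        row["country_name"] for row in rows if row.get("medal_type")
    )
    return tally.most_common(n)


# Hypothetical rows; each row is assumed to represent one medal.
medal_rows = [
    {"country_name": "USA", "medal_type": "Gold"},
    {"country_name": "USA", "medal_type": "Silver"},
    {"country_name": "USA", "medal_type": "Bronze"},
    {"country_name": "Japan", "medal_type": "Gold"},
    {"country_name": "Japan", "medal_type": "Bronze"},
    {"country_name": "India", "medal_type": "Silver"},
]

for country, medals in top_countries(medal_rows, n=3):
    print(country, medals)
```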
We, the team of passionate data enthusiasts, are humbled and immensely grateful for the incredible opportunity to participate in the Hackathon on Data Analysis organized by GeeksForGeeks. This exhilarating event has been an unforgettable journey that allowed us to explore the world of data analysis and showcase our skills as a team.
First and foremost, we extend our heartfelt gratitude to GeeksForGeeks for organizing such a fantastic hackathon. The platform provided us with an exceptional arena to apply our data analysis expertise, learn from industry experts, and challenge ourselves to new heights.
We cannot thank the organizers, mentors, and volunteers enough for their hard work and dedication in making this hackathon a resounding success. Their support and guidance throughout the event have been invaluable, pushing us to go above and beyond in our pursuit of data-driven solutions.
A special thank you goes to the dataset providers for giving us access to the rich and vast Olympics data. This dataset fueled our curiosity and allowed us to dive deep into our analysis, uncovering meaningful insights and trends.
Our heartfelt appreciation also goes to each member of our team. It has been an incredible journey collaborating with such talented and like-minded individuals. Together, we navigated through complex data challenges, brainstormed innovative ideas, and leveraged each other's strengths to achieve our goals.
Participating in this hackathon has been a transformative experience for our team, fostering camaraderie, knowledge-sharing, and growth. We are truly grateful for the memories created and the skills honed during this remarkable event.
Once again, our sincere thanks to GeeksForGeeks for organizing this extraordinary hackathon, and to everyone involved for creating an unforgettable experience in the realm of data analysis and exploration.
With heartfelt appreciation,
Shibam Roy, Ankush Roy, Swadhin Maharana, Debdutta Burman
GeeksForGeeks Hackathon Participants
You can contact us via the following details.
Email- royshibam9826@gmail.com
Phone no.- 8787777952
Email- ankush3411111@gmail.com
Email- noreplycursorhigh@gmail.com
Email- debdutta0401@gmail.com