This project is no longer accepting applications. Subscribe to our newsletter to be notified of new projects!
Transform raw data into actionable insights and interactive dashboards, while mastering foundational data science techniques such as exploratory data analysis (EDA) & data visualization.
In today's data-driven world, being able to transform raw data into valuable insights is a critical skill. In this Build Project, you’ll wear the hat of a Data Scientist and tackle the challenge of creating interactive dashboards that showcase business insights from raw data. Under the supervision of an experienced industry expert, you will develop essential skills in SQL, Python, machine learning, and data visualization. You’ll become familiar with industry-standard tools like sklearn and git, while applying these skills to a real-world data problem. This project provides a unique opportunity to experience the full data science lifecycle, from raw data to visualization, simulating the operations of a real data science team.
This workshop will introduce you to the project scope and goals. You will set up a development environment and learn the fundamentals of version control using Git. By the end of this session, you will be ready to collaborate and manage your project in a GitHub repository.
In this session, you will learn the basics of SQL for data storage and retrieval. You will work with databases, practicing how to create tables, insert data, and run queries to retrieve the information needed for analysis.
This workshop will focus on exploratory data analysis (EDA) using Python libraries such as Pandas and Numpy. You will learn to clean and preprocess datasets, discover patterns, and generate descriptive statistics and visualizations that summarize the data.
You will get an introduction to basic machine learning concepts and techniques. Using the sklearn library, you will build simple models such as linear regression and decision trees to gain insight from the dataset.
This workshop will dive deeper into advanced machine learning techniques, covering model evaluation and optimization. You will learn to fine-tune model performance using techniques such as cross-validation and hyperparameter tuning.
You will focus on creating interactive data visualizations using Plotly. Additionally, you’ll learn how to use Streamlit to turn data analysis and machine learning models into a shareable web application.
You will refine the interactive dashboards and ensure project files are correctly versioned using Git. Additionally, you will receive feedback to improve the functionality and appearance of the dashboards.
In the final workshop, you will present your completed interactive dashboards. You will walk through analysis, visualizations, and machine learning models, receiving feedback from peers and the industry expert.
Get access to all of our Build projects, including this one, by creating your Build account!
Get started by submitting your application.
We'll notify you when projects reopen. In the meantime, you can explore our resources and learn more about our Fellows.
Pratyush Kundu is a Data Science Build Fellow at Open Avenues, where he works with students leading projects in Data Science and Automation. Pratyush is a Trading Operations Analyst at Virtu Financial, where he focuses on analyzing and improving operational efficiencies for the firm’s algorithmic trading infrastructure by dissecting data to derive actionable insights, writing scripts using Python and SQL to build processes and automate analyses and manage operational risk by implementing and improving monitoring tools. Pratyush has over 3 years of experience in applying data science principles to drive business decisions. Prior to joining Virtu Financial, he stretched his wings at Natera and Citibank applying and honing his programming skills and analytical acumen in the disciplines of Biotechnology and Finance. He holds a Bachelor’s Degree in Business with concentrations in Data Science and Statistics. A fun fact about Pratyush is that he knows all the good Vegan spots in NYC, even though he isn’t Vegan.