Your application

Please complete the following fields to be considered for this project.

Please fill in this required field.
Please fill in this required field.
Please fill in this required field.
Please fill in this required field.
Please fill in this required field.
How much commitment will you have to this project?
Please select an option.
Are you available to dedicate 1-2 hours per week to the Build Project?
Please select an option.
Your application has been 
successfully submitted!
Explore more projects
Close
You already submitted an application for this project.
Explore more projects
Close
There was an error submitting your form. Please try again later or contact us.
Oops! Something went wrong while submitting the form.

This project is no longer accepting applications. Subscribe to our newsletter to be notified of new projects!

Get updates
Develop a Data Management Package Set
Alexey Mizyak
Alexey Mizyak
Get updates
Register today
Apply now

Develop a Data Management Package Set

Use Python and Airflow to implement a set of tools that copies, sanitizes, minimizes and restores between data sources.

Register today
Apply now
Fridays
 at
6:00
P.M.
 ET /
3:00
P.M.
PT
8 weeks, 2-3 hours per week
Intermediate
No experience required
No experience required
Some experience required
Degree and experience required

Description

Data is the most critical asset for almost any type of business. Protecting this data is essential, but allowing production data to flow into less-secure environments can expose internal operations to risks. In this Build Project, you will take on the role of a Software Developer to implement a simple yet flexible set of data management instruments. This set of tools will automate the transformation, sanitization, obfuscation, and minimization of production data, ensuring its safety. Using Python and Airflow, you will design a tool that manages various data sources, such as relational and non-relational databases.  This is an exciting opportunity to develop practical skills and create a project that showcases your ability to manage and protect data effectively.

Session timeline

  • Applications open
    August 1, 2024
  • Application deadline
    August 25, 2024
  • Project start date
    Week of July 8, 2024
    Week of
    September 9, 2024
  • Project end date
    Week of

What you will learn

  • How to apply data sanitization, obfuscation, and minimization techniques to protect sensitive information.
  • How to utilize relational and/or non-relational database systems (e.g., MySQL, MSSQL, MongoDB, MariaDB) for data manipulation and management.
  • How to prepare professional documentation and present technical solutions effectively in a format resembling job interview.

Project workshops

1
Introduction
2
Architecture Overview
3
Core Module Implementation
4
Data Copy Module
5
Data Sanitization Module
6
Data Obfuscation Module
7
Data Minimization Module
8
Presentations

Prerequisites

  • Basic knowledge of Python programming: Understanding of fundamental programming concepts, including object-oriented programming (OOP), data structures, and standard libraries. This knowledge is typically covered in courses like "Introduction to Programming with Python" or "Intermediate Python".
  • Basic understanding of databases: Familiarity with CRUD (Create, Read, Update, Delete) and bulk operations on both relational and non-relational databases. Good to have next (or equivalent) courses done: "Database Management Systems", "Introduction to Databases" etc. Basic knowledge of Apache Airflow: Understanding of workflow automation, DAGs (Directed Acyclic Graphs), and task scheduling. This is typically covered in courses like "Introduction to Data Engineering" or "Data Pipelines with Apache Airflow".
  • Commitment and time management: Ability to actively participate in all project meetings and work independently for at least 1hour per week.
  • Problem-solving skills: Capability to address and overcome challenges encountered during software development, demonstrating creativity and persistence.

Sign up today

Get access to all of our Build projects, including this one, by creating your Build account!

Register today
Log in

Apply to

Alexey

's project today!

Get started by submitting your application.

Apply now

Stay updated!

Subscribe to our newsletter to be notified when projects reopen!

Please fill in this required field.
By clicking “Subscribe” you agree to our Terms of Services and Privacy Policy.

Thanks for subscribing!

We'll notify you when projects reopen. In the meantime, you can explore our resources and learn more about our Fellows.

Discover our articles
There was an error submitting your form. Please try again later or contact us.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
About the expert

I'm a Computer Science Fellow at Open Avenues Foundation, where I'm working with students leading projects in software development and computer science.

Adding to that, I'm a Software Architect at MyCarrier LLC and where I design enterprise applications for the freight industry.

I have over 11 years of experience in software development, project and people management, and more than 5 years in software architecture.

I hold a master's degree in computer science and am pursuing a master's degree in scuba diving (as one of my favorite hobbies).

Visit
Alexey
's Linkedin
More like this Project