This project is no longer accepting applications.
Use Python and Airflow to implement a set of tools that copy, sanitize, minimize, and restore data across data sources.
Data is the most critical asset for almost any type of business. Protecting this data is essential, and allowing production data to flow into less-secure environments can expose internal operations to risk. In this Build Project, you will take on the role of a Software Developer to implement a simple yet flexible set of data management tools. These tools will automate the transformation, sanitization, obfuscation, and minimization of production data so that it can be used safely outside production. Using Python and Airflow, you will design the toolset to work with various data sources, such as relational and non-relational databases. This is an exciting opportunity to develop practical skills and create a project that showcases your ability to manage and protect data effectively.
Get to know the Build Fellow and other students, ask questions about the project requirements, and prepare your workspace.
Design an Architecture Overview diagram for the proposed solution.
Implement the core package of the data management toolset in Python. Use Apache Airflow to set up basic DAG structures.
Develop a data copy package using predefined parameters and Airflow DAGs.
Implement the data sanitization package based on predefined parameters. Integrate it with the core and data copy packages.
Develop the data obfuscation module using predefined parameters.
Implement the data minimization module based on predefined parameters.
Polish your project deliverables and present them to the Build Fellow and other students in the final group session.
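To make the milestones above more concrete, here is a minimal sketch of how the sanitization, obfuscation, and minimization steps might fit together in plain Python. It is not the project's actual design: the sample rows, the function names (`sanitize`, `obfuscate`, `minimize`, `run_pipeline`), the salt value, and the choice of a truncated SHA-256 digest are all assumptions made for illustration. In an Airflow setup, each function could plausibly become its own task in a DAG.

```python
import hashlib
import random

# Hypothetical sample rows standing in for a production table.
ROWS = [
    {"id": 1, "email": "ann@example.com", "notes": "  VIP customer \n"},
    {"id": 2, "email": "bob@example.com", "notes": "late payer"},
    {"id": 3, "email": "cy@example.com", "notes": "\tnew account"},
    {"id": 4, "email": "dee@example.com", "notes": "closed  "},
]


def sanitize(row):
    """Strip surrounding whitespace from every string field."""
    return {k: v.strip() if isinstance(v, str) else v for k, v in row.items()}


def obfuscate(row, fields=("email",), salt="demo-salt"):
    """Replace sensitive fields with a truncated, salted SHA-256 digest.

    The salt and the 12-character truncation are illustrative choices only.
    """
    out = dict(row)
    for field in fields:
        raw = (salt + str(row[field])).encode("utf-8")
        out[field] = hashlib.sha256(raw).hexdigest()[:12]
    return out


def minimize(rows, fraction=0.5, seed=42):
    """Keep a reproducible random subset of the rows."""
    rng = random.Random(seed)
    keep = max(1, int(len(rows) * fraction))
    return rng.sample(rows, keep)


def run_pipeline(rows):
    """Sanitize and obfuscate each row, then shrink the dataset."""
    cleaned = [obfuscate(sanitize(row)) for row in rows]
    return minimize(cleaned)


if __name__ == "__main__":
    for row in run_pipeline(ROWS):
        print(row)
```

Keeping each step as a small function that takes rows in and returns rows out is one way to let the steps be tested on their own and reordered or skipped per data source.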
I'm a Computer Science Fellow at Open Avenues Foundation, where I work with students and lead projects in software development and computer science.
I'm also a Software Architect at MyCarrier LLC, where I design enterprise applications for the freight industry.
I have over 11 years of experience in software development, project and people management, and more than 5 years in software architecture.
I hold a master's degree in computer science and am pursuing a "master's degree" in scuba diving, one of my favorite hobbies.