Created the architecture and design of this project. The project involves moving unstructured source data (sales data & media data) in the form of flat files as structured data from Amazon Web Services (AWS) S3 environment to AWS Redshift platform using Amazon EMR and Data Pipeline for automation. The intent of the project is to understand the use of Big Data Technologies (Mongodb, Redshift and S3) with business use cases using Sales and Social media data.