![Management and Processing of Discographic Data with Amazon Elastic MapReduce](https://writelatex.s3.amazonaws.com/published_ver/2252.jpeg?X-Amz-Expires=14400&X-Amz-Date=20240727T025719Z&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAWJBOALPNFPV7PVH5/20240727/us-east-1/s3/aws4_request&X-Amz-SignedHeaders=host&X-Amz-Signature=ecbf9d580fa1b5888b0d95c5b638bfa48608b2dd3510be0bbfaaa18ed0378da7)
Management and Processing of Discographic Data with Amazon Elastic MapReduce
Author
Pierluigi Videsott
Last Updated
hace 9 años
License
Creative Commons CC BY 4.0
Abstract
The purpose of this report is to explain how – by leveraging on the capabilities of the amazon web services – it is possible to manage and process a set of data that is too large and complex for traditional data processing techniques and technologies.
The report discusses the implementation of a set of services – from the retrieval of external data to its transformation, through the storage on non relational databases and finally the parallel computation on an external cluster – meant for the management of discographic information in order to easily join different data in an agile manner and subsequently perform additional processing based on the joined output.
![Management and Processing of Discographic Data with Amazon Elastic MapReduce](https://writelatex.s3.amazonaws.com/published_ver/2252.jpeg?X-Amz-Expires=14400&X-Amz-Date=20240727T025719Z&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAWJBOALPNFPV7PVH5/20240727/us-east-1/s3/aws4_request&X-Amz-SignedHeaders=host&X-Amz-Signature=ecbf9d580fa1b5888b0d95c5b638bfa48608b2dd3510be0bbfaaa18ed0378da7)