Data Migration & Performance Improvement of large data processing

Enabled a leading data processing company to process large batches of pharmacies & pharmacy supplier data quickly with reduced IT costs


About the Client

The client is a leading data processing company in Australia. The client is connected to more than 4,500 pharmacies across Australia and wanted to improve access to information across the pharmacy supply chain. The objective was to help pharmacies identify opportunities in both their dispensary and retail through reporting and analytics, as well as support rebate payments from suppliers and patient adherence programs.

The Business Challenge

The client wanted to process multiple batches of data quickly where one data batch could contain up to a billion records. The client wanted to reduce the processing time that was at 2.2 hours for a billion records. The client also wanted to prevent any bottleneck situation due to the high volume of data.

What Aptus Data Labs Did

We migrated the client’s existing 5-node Vertica Cluster platform to Apache Spark in Hortonworks on AWS Cluster to improve the processing time and quickly adapt to new features in the future along with cost reduction.

The Impact Aptus Data Labs Made

The new analytics platform boosted the performance by 62% and reduced the data processing time. It also reduced IT costs by 400% and helped the client to handle large volumes of data smoothly.

The Business and Technology Approach

Aptus Data Labs used the following methodology for environment migration and to resolve the existing challenge. Aptus Data Labs:.

Tools used

Apache Spark Cluster, AWS, HDP platform, Spark, & Vertica

The Outcome

The migrated analytics platform reduced the processing time from 2.2 hours for a billion records to 1 hour for 1.2 billion records that boosted the performance by 62%. The analytics platform reduced IT costs significantly using open source technologies. The platform used the yarn cluster to ensure high availability and high efficiency of the system. It also enabled the client to handle massive volumes of data smoothly without any break in the performance.

Related Case Studies

Download Case study​

Download Case study​