success story

A global insurance provider accelerates time-to-market and improves analytics with Azure Databricks

The Challenge

The client, a global insurance provider, faced significant difficulties due to prolonged three-day ETL schedules and frequent ETL failures. These issues caused delays in generating quarter-end reports, extended development cycles, and resulted in inefficiencies in deploying new reports. Additionally, these delays increased infrastructure costs, creating further operational challenges.

The Solution

Virtusa implemented a modern Lakehouse architecture on Azure Databricks to address the client’s data challenges. This solution consolidated over 10 diverse data sources, each containing approximately 1 billion records, with a total data volume of 22TB.

Key components of the solution included:

  • Azure Data Factory (ADF): Used for orchestration to streamline data workflows.
  • Databricks Delta Tables: Enabled high-speed data processing for ingestion, transformation, and integration.
  • PySpark and Python: Facilitated advanced data transformations and accelerated data loading into Azure Synapse (Data Warehouse).
  • Medallion Architecture: Implemented a multi-layered data architecture (Bronze, Silver, Gold) to enhance data organization and improve data quality, reliability, and performance across all processing stages.
  • Databricks Auto Scaling Clusters: Enabled dynamic scaling of compute resources, optimizing processing speed and reducing operational costs during peak data loads.
  • Continuous Integration and Continuous Deployment (CI/CD) Pipelines: Automated the deployment of ETL workflows, reducing manual intervention, minimizing errors, and ensuring timely delivery of critical reports.
A global insurance provider accelerates time-to-market and improves analytics with Azure Databricks
Image

VIDEO

Virtusa’s genAI expertise in the Insurance sector

Watch Ronald Trella, speaking live at ITC 2024

Learn more about our GenAI offerings

Learn more Click to Learn more
The Benefit

Post-deployment, the client significantly improved their TTM-ETL schedule, reducing the duration from 3 days to 8 hours, including:

Increased agility and scalability

Reduced infrastructure costs

Faster analytics and insights for business

Faster delivery cycles

New AI/ML initiatives on skill recommendations and policy search

Unity Catalog Migration Studio™

Learn more about Unity Catalog Migration Studio™ powered by the Databricks Data Intelligence Platform

Related content