Telecom Italia Brasil (TIM), the Brazilian subsidiary of Telecom Italia, was required to archive large and growing volumes of wireless call details records (CDRs) to comply with government regulations. Although it was a significant cost to store all of this data, the company identified an opportunity to leverage the data to provide better customer service, grow its business and ensure it was getting the expected return on billions invested in capital expenses. To enable this goal, Zaloni architected and built a managed Hadoop data lake to serve as the single repository for all traffic-related, inventory and provisioning data (CDRs, SNMP, server logs).
Before the data lake: The company’s wireless network generated 4 terabytes of data per day from voice, data and SMS CDRs. This data was created by 11 different servers and switches with 8-10 different record layouts. As a result, there was no centralized repository to store all these CDRs. In addition, the upstream mediation system, which is responsible for merging CDR records into a single record for each session, was sending duplicate CDRs or not stitching the call records completely.
With the data lake: TIM was better able to perform load balancing, government reporting, network analysis, and parallel processing that removed duplicate data and reconstructed call records from partial or lost records. With a managed data lake, the telecom company not only gained a compliance solution, but was able to more efficiently manage the network and continue to provide a great customer experience as the volume of subscriber usage continued to grow significantly.
Results: TIM was able to double data ingestion, to 8 terabytes/day. Also, the company avoided costly fines and penalties by meeting the immediate need for government reporting requirements, and gained insights into network utilization to avoid network congestion in near real-time.