Introduction

The telecom project comprises building a Data Lake as well as data migration for a 25 years old telecom company. The client is a renowned telecom company and has a good presence in Asia, Europe, Africa, and some parts of North America. Their customer base is widely spread across India, Bangladesh, African countries, Latin America, and Middle-Eastern countries. The data resides in various data sources, more than 205. The data includes customer data (even secured), customer location data, tower locations for specific geographies, customer care complaints, feedback, and many other types of data.

Data is in CSV, PSV, TEXT, MSG, audio files, XML, JSON, and even in RDBMS (MySQL, Sybase, Oracle) databases in structured format.

The first ...

Get Cloud Analytics with Google Cloud Platform now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.