Pranjith Prabhakaran
Data Engineering Lead at Turbolab Technologies
India
Work Experience
Turbolab Technologies
May 2022 - Present · 3 yrs, 1 month
India
- Job Details:
  - Led and architected a medallion-based data platform (Kafka, Spark, Airflow) with Iceberg, S3, and Nessie for lakehouse storage, enabling real-time and batch analytics at scale.
  - Led, designed, and developed a news intelligence pipeline that processes 100k+ articles/day, integrating streaming (Kafka/Spark) and ML models to automate insights.
  - Modernized the legacy data warehouse by migrating from Solr-Cassandra to Elasticsearch, improving query performance by 40% and enabling real-time analytics.
  - Built a no-code web scraping platform that enabled structured data extraction from 900+ unstructured sources, cutting analytics turnaround by 70%.
Turbolab Technologies
May 2019 - May 2022 · 3 yrs
India
- Job Details:
  - Delivered a revenue-driving data marketplace for retail location intelligence, boosting company revenue by 35% in 6 months.
  - Built real-time ETL/ELT pipelines with Spark, Kafka, and Elasticsearch to support scalable analytics and business operations.
  - Designed predictive data summaries using Python and SQL with geospatial retail data, enhancing product recommendations and revenue by 35% in 6 months.
Turbolab Technologies
May 2017 - May 2019 · 2 yrs
India
- Job Details:
  - Developed a scalable web scraping framework that unified 1500+ site-specific scrapers into a single algorithm, cutting maintenance costs by 80%.
  - Created a modular data extraction tool, accelerating ingestion and internal product delivery.
  - Maintained ETL pipelines using Kafka and Spark Streaming.
  - Drove ~30% annual revenue growth through innovative data products, cost optimization, and system scalability improvements, enabling the team to consistently achieve its annual goals.
Languages
English
Fluent