Pranjith Prabhakaran

Data Engineering Lead at Turbolab Technologies

India

Work Experience

  • Data Engineering Lead

    Turbolab Technologies

    May 2022 - Present (3 yrs, 1 month)

    India

    • Job Details: Architected and led a medallion-based data platform (Kafka, Spark, Airflow) with Iceberg, S3, and Nessie for lakehouse storage, enabling real-time and batch analytics at scale. Designed and led development of a news intelligence pipeline processing 100k+ articles/day, integrating streaming (Kafka/Spark) and ML models to automate insights. Modernized the legacy data warehouse by migrating from Solr-Cassandra to Elasticsearch, improving query performance by 40% and enabling real-time analytics. Built a no-code web scraping platform that enabled structured data extraction from 900+ unstructured sources, cutting analytics turnaround by 70%.
  • Data Engineer

    Turbolab Technologies

    May 2019 - May 2022 (3 yrs)

    India

    • Job Details: Delivered a revenue-driving data marketplace for retail location intelligence, boosting company revenue by 35% in 6 months. Built real-time ETL/ELT pipelines with Spark, Kafka, and Elasticsearch to support scalable analytics and business operations. Designed predictive data summaries over geospatial retail data using Python and SQL, improving product recommendations and raising revenue by 35% in 6 months.
  • Associate Software Engineer

    Turbolab Technologies

    May 2017 - May 2019 (2 yrs)

    India

    • Job Details: Developed a scalable web scraping framework that unified 1500+ site-specific scrapers into a single algorithm, cutting maintenance costs by 80%. Created a modular data extraction tool, accelerating ingestion and internal product delivery. Maintained ETL pipelines using Kafka and Spark Streaming. Drove ~30% annual revenue growth through innovative data products, cost optimization, and system scalability improvements, enabling the team to consistently achieve annual goals.
Skills

    • Data Warehousing
    • Python
    • Pandas
    • NLTK
    • PostgreSQL
    • Cassandra
    • Redis
    • MongoDB
    • Solr
    • PySpark

    Languages

    • English

      Fluent