profile-pic

Prasanna Kumar

Technical Data Engineering Leader with 17+ years of experience driving end-to-end architecture, platform engineering, and data infrastructure transformation across Fintech, Blockchain, and AdTech domains.
  • Role

    Associate Director, Data Strategy, Akka & Engineering

  • Years of Experience

    17 years

Skillsets

  • LangChain
  • Presto
  • Prefect
  • PostgreSQL
  • Pinecone
  • OpenMetadata
  • OpenAI API
  • Neural Networks
  • Nessie
  • Model Registry
  • MLFlow
  • Prometheus
  • Kubernetes
  • Kubeflow
  • Kafka Streams
  • Kafka connect
  • Jenkins
  • Java
  • Hugging Face Transformers
  • Hive
  • Hadoop
  • Spring MVC
  • Golang fiber framework
  • Lakekeeper
  • Weaviate
  • Vertex AI
  • Trino
  • Terraform
  • Tecton
  • Sql mesh
  • SQL
  • Great Expectations
  • Spark Structured Streaming
  • semantic search
  • Scala
  • Sagemaker Pipelines
  • Rust
  • Redis
  • RAG pipelines
  • Python
  • Prompt Engineering
  • ArgoCD
  • BigQuery
  • Backend-for-frontend
  • AWS S3
  • AWS Redshift
  • AWS Lambda
  • AWS Lake Formation
  • AWS Glue
  • AWS Athena
  • AWS
  • Cassandra
  • Apache Spark
  • Apache Ranger
  • Apache Pinot
  • Apache Kafka
  • Apache Iceberg
  • Apache Hudi
  • Apache Flink
  • Apache Druid
  • Apache Beam
  • FAISS
  • Google Dataflow
  • Go
  • GitLab CI/CD
  • GitLab
  • Github
  • GCP Pub/Sub
  • GCP
  • Flink cdc
  • feast
  • Apache Atlas
  • Evidently ai
  • EMR
  • Denoising autoencoders
  • Delta Lake
  • Debezium
  • dbt
  • Datadog
  • Coralogix
  • ClickHouse

Professional Summary

17Years
  • Jun, 2022 - Present3 yr 6 months

    Associate Director, Data Strategy & Engineering

    Lendingkart
  • Nov, 2021 - May, 2022 6 months

    Engineering Manager, Data Platform

    Merkle Science
  • Oct, 2018 - Nov, 20213 yr 1 month

    Senior Big Data Engineer

    Extreme Innovations
  • Dec, 2012 - May, 20141 yr 5 months

    Senior Engineer

    CSS Corp
  • Jun, 2014 - Nov, 20151 yr 5 months

    Senior Big Data Engineer

    Glassbeam
  • Dec, 2015 - Sep, 20182 yr 9 months

    Big Data Architect

    Ionos Networks
  • Feb, 2010 - Nov, 20122 yr 9 months

    Software Engineer

    DnB TransUnion
  • May, 2006 - Feb, 20103 yr 9 months

    Java Developer

    Jayam Tech

Applications & Tools Known

  • icon-tool

    Delta Lake

  • icon-tool

    Kafka

  • icon-tool

    Debezium

  • icon-tool

    Spark

  • icon-tool

    Trino

  • icon-tool

    AWS S3

  • icon-tool

    AWS Glue

  • icon-tool

    AWS Lambda

  • icon-tool

    AWS EMR

  • icon-tool

    AWS Lake Formation

  • icon-tool

    Prometheus

  • icon-tool

    Grafana

  • icon-tool

    OpenTelemetry

  • icon-tool

    Airflow

  • icon-tool

    Jenkins

  • icon-tool

    Terraform

  • icon-tool

    CloudFormation

  • icon-tool

    Kubernetes

  • icon-tool

    Metabase

  • icon-tool

    PowerBI

  • icon-tool

    GitLab

  • icon-tool

    Apache Beam

  • icon-tool

    Hadoop

  • icon-tool

    FastAPI

  • icon-tool

    SonarQube

Work History

17Years

Associate Director, Data Strategy & Engineering

Lendingkart
Jun, 2022 - Present3 yr 6 months
    Built a unified lakehouse and real-time ingestion foundation consolidating 500+ pipelines to a single governed source of truth. Established DataOps with >99.5% pipeline and platform uptime and standardized on-call and L1/L2 processes. Migrated compute to Kubernetes with spot instances, reducing cloud spend by 60% and improving throughput. Implemented enterprise data catalog, lineage, and PII encryption to meet RBI and internal audit mandates; achieved 100% audit compliance. Enabled standardized model onboarding and CI/CD, cutting deployment cycles by 50% and improving feature serving latency. Partnered cross-functionally to operationalize AI/ML decisioning, reducing manual underwriting and improving approval turnaround. Built a semantic layer to streamline BI access, increasing analyst efficiency by 30% and improving data trust via data contracts. Ingested high-volume event streams (1M+ daily calls) and delivered core operational metrics, improving lead conversion by 15%.

Engineering Manager, Data Platform

Merkle Science
Nov, 2021 - May, 2022 6 months
    Led the architecture for low-latency blockchain monitoring to ingest and process billions of transactions in real-time. Built stateless Golang services on Kubernetes with distributed state for resilience and scale. Established streaming and batch pipelines for reliable landing in analytical stores and downstream analysis. Delivered internal tools that simplified heavy queries and accelerated investigations, reducing turnaround time. Integrated graph-based entity resolution to improve fraud detection accuracy by 30%. Reduced ETL batch processing time by 38% and cut resource usage, delivering faster data availability for analytics teams within 4 hours. Streamlined data lineage tooling, enabling analysts to complete regulatory reporting 25% faster, reducing external audit hours by 120 annually. Scaled data ingestion pipelines to handle a 3x peak load while maintaining 99.95% uptime, ensuring uninterrupted analytics access for teams globally. Raised test coverage to 92%, strengthened CI pipelines and reduced production incidents by 40%, improving deployment stability for stakeholders across all environments.

Senior Big Data Engineer

Extreme Innovations
Oct, 2018 - Nov, 20213 yr 1 month
    Designed and operated high-frequency pipelines to process market data in real-time across 200+ symbols. Implemented neural feature extraction with denoising autoencoders to enhance signal quality for downstream models. Delivered sub-second SLAs for critical components supporting time-sensitive operations. Hardened streaming jobs and monitoring to ensure consistent throughput and recoverability. Optimized ETL workloads through memory tuning cutting batch processing time from 12 minutes to 3 minutes for a measurable 75% efficiency gain. Scaled streaming topology to handle peak market data bursts by deploying partitioned processing and backpressure achieving 99.95% throughput without dropped messages. Implemented automated data quality checks and lineage tracing reducing production defects by 60% within 2 months and significantly stabilizing pipelines.

Big Data Architect

Ionos Networks
Dec, 2015 - Sep, 20182 yr 9 months
    Architected a Kappa-based system ingesting 10,000+ beacons per second from 5,000+ IoT devices. Built pipelines with Kafka and Spark Streaming to transform data and persist into scalable stores. Enabled models for indoor positioning, movement analytics, and programmatic behavior insights for business stakeholders. Tuned topics, partitions, and schema strategies to support sustained growth and reliability.

Senior Big Data Engineer

Glassbeam
Jun, 2014 - Nov, 20151 yr 5 months
    Built data pipelines in Scala and Akka to convert logs into structured datasets for a Cassandra-based warehouse, enabling actionable insights.

Senior Engineer

CSS Corp
Dec, 2012 - May, 20141 yr 5 months
    Led the build-out of a Hadoop-based data lake for telecom analytics.

Software Engineer

DnB TransUnion
Feb, 2010 - Nov, 20122 yr 9 months
    Developed Java applications using Spring MVC for legacy credit scoring and fraud detection on structured bureau datasets.

Java Developer

Jayam Tech
May, 2006 - Feb, 20103 yr 9 months
    Java Developer on enterprise applications.

Achievements

  • Modernizing the Legacy Stack
  • CUB Pipeline - CDC Un-Nest Base
  • CASE - CDC-Driven Automatic Schema Evolution
  • DESCRIBE - Trino-Based Query Execution & Reporting
  • Conversational AI for Query Automation

Major Projects

1Projects

LakeGPT NLP-to-SQL Engine for Trino

    An in-house LLM-powered NL-to-SQL engine that enables analysts to query Trino using plain English, advancing AI-assisted data consumption. Increased analytics productivity by 35% through natural-language querying and governed schema awareness integrated with metadata. Improved data accessibility and reduced analytics & visualization time by 3 hrs via automated documentation and query templates.

Education

  • Advanced PG Level Certification in Computational Data Science

    Indian Institute of Science (2025)
  • B.Tech, Information Technology

    T.J. Institute of Technology (2005)