profile-pic

SANTHOSH R

🔹 Senior Data Engineer at Harman Connected Services Corporation India Pvt. Ltd | June 2023 - Present

In my role at Harman, I design, develop, and deploy scalable and reliable data pipelines and solutions for a diverse range of clients. Leveraging my expertise in Azure Databricks, Delta Lake, and Apache Airflow, I ensure seamless data integration and processing. Collaborating closely with cross-functional teams, I prioritize data quality, security, and performance, consistently delivering value to our customers.


🔸 Previous Experience:

Before joining Harman, I spent over four years at Sysvine Technologies, progressing from Software Engineer to Senior Software Engineer. During this time, I contributed significantly to numerous projects spanning data engineering, software development, and cloud computing. My adeptness with a variety of technologies and tools enabled me to drive impactful outcomes.


🎓 Education:

Bachelor of Engineering in Electrical, Electronics, and Communications Engineering from Anna University Chennai.


💡 Passionate About Data Engineering:

Driven by a passion for data engineering, I continuously seek to expand my knowledge and skills by embracing new technologies and tools. My enthusiasm for learning fuels my personal and professional growth, empowering me to tackle challenges with confidence.


🏆 Recognition:

Recipient of the EMR award multiple times, as well as accolades such as "Gem of the Year" and Awards of Appreciation. These acknowledgments underscore my commitment to excellence and my ability to deliver results consistently.

  • Role

    Senior Data Engineer

  • Years of Experience

    5 years

Skillsets

  • Java
  • Spark
  • Snowflake
  • Shell Scripting
  • Scala
  • Python
  • PostgreSQL
  • Oracle
  • MySQL
  • JavaScript
  • Airflow
  • Hive
  • HBase
  • Delta
  • Databricks
  • Azure hdinsights
  • Azure datafactory
  • Azure
  • AWS
  • Apache Hadoop

Professional Summary

5Years
  • Jun, 2023 - Jan, 2024 7 months

    Senior Data Engineer

    Harman Connected Services
  • Aug, 2018 - May, 20234 yr 9 months

    Software Engineer III (Data Engineer)

    Sysvine Technologies

Applications & Tools Known

  • icon-tool

    Hadoop

  • icon-tool

    Databricks

  • icon-tool

    Airflow

  • icon-tool

    PostgreSQL

  • icon-tool

    Spark

  • icon-tool

    Hive

  • icon-tool

    Scala

  • icon-tool

    Azure Data Factory

  • icon-tool

    Javascript

  • icon-tool

    Java

  • icon-tool

    NodeJS

  • icon-tool

    C++

  • icon-tool

    Google Docs

  • icon-tool

    Confluence

Work History

5Years

Senior Data Engineer

Harman Connected Services
Jun, 2023 - Jan, 2024 7 months
    Spearheaded the migration of a critical data project from Ab Initio to Spark with delta, demonstrating strong proficiency in both technologies. Built data pipelines using Spark with delta and orchestrated using Airflow. Developed a custom ruler engine & parser using PySpark, PostgreSQL, & Delta to enable business operatives to increase data accuracy by creating & applying rules for enriching final data.

Software Engineer III (Data Engineer)

Sysvine Technologies
Aug, 2018 - May, 20234 yr 9 months
    Facilitated Fintech & Retail analytics projects, efficiently handling massive data volumes using Hadoop & Databricks. Enhanced data accuracy, reliability, & reduced processing time by streamlining data pipelines through HDFS, Hive, Spark, & Azure Data Factory. Enriched and processed structured & semi-structured data using ETL & data warehousing methods with Delta Lake, Oracle, & MySQL databases. Led data ingestion, processing, & visualization initiatives using Python & Scala, enhancing decision-making processes with in-house visualization tools. Identified & resolved data inconsistencies using SQL, Python, Spark, & Delta to enhance data-driven insights & accuracy as well as improve decision-making within the organization. Utilized SQL, Python, & Spark to optimize the reporting process by developing efficient data models for analytics & reporting. Expedited project completion rates by developing & maintaining technical documentation using Google Docs & Confluence. Created a custom framework in javascript to render the captcha in all the modern and legacy browsers. Designed image stitching library to create the images for captcha with desired different patterns. Integrated Singapore stock exchange by developing a message handler as a bridge between the stock exchange and the application. Orchestrated Azure Data Factory pipelines to enable insights on stock exchange data through streamlined data migration & transformation processes.

Achievements

  • Extra Mile Recognition multiple times
  • Award of Appreciation | 2020
  • Gem of the Year | 2018

Major Projects

2Projects

Retail Analytics

    Built data pipelines using Spark with delta and orchestrated using Airflow. Developed a custom ruler engine & parser using PySpark, PostgreSQL, & Delta to enable business operatives to increase data accuracy by creating & applying rules for enriching final data

Fintech & Retail analytics

    Enhanced data accuracy, reliability, & reduced processing time by streamlining data pipelines through HDFS, Hive, Spark, & Azure Data Factory

Education

  • B.E. in Electronics and Communication

    Anna University, Arunai Engineering College (2018)