profile-pic

Rohan Jethure

Big Data Developer

• Total experience: 8+ years of experience in software design and development with Hadoop, Spark, Scala, Microsoft Azure (Eventhub, Azure HDInsight, Azure AppInsight, Key-vault, BlobStorage, Cosmos DB), Jenkins, Hive

• Experience in building low latency data pipeline using spark streaming

• Worked on various phases of Product Development Life Cycle in variety of technical areas like design, development, production Support.

• Worked in programming languages - Java, Scala and shell scripting.

• Worked on ETL side to extract, transform, load data.

• Worked on GIT Repository to maintain the code.

• Explored the Azure services like Appinsight, key-vault, Blob Storage, Cosmos DB, Eventhubs

• To ingest the data from source eventhub using Spark-Scala

• Transform the ingested data and Store the output data to Target cosmosDB and update the checkpoint location to Azure Blob-storage

• Validated the data from target eventhub and debugging the code.

• Run the Streaming job on Azure HDinsight cluster.

• Have done unit testing by writing test cases.

• Have done code coverage on sonarquebe.

• Have done code review on GIT while sending merge request.

  • Role

    Senior Associate Technology

  • Years of Experience

    7.5 years

Skillsets

  • Unit Testing
  • Design
  • ETL
  • Big Data
  • Programming

Professional Summary

7.5Years
  • Nov, 2021 - Present3 yr 8 months

    Senior Associate Technology

    Synechron
  • Jul, 2016 - Nov, 20215 yr 4 months

    Lead Software Engineer

    Persistent Systems Ltd.

Applications & Tools Known

  • icon-tool

    Scala

  • icon-tool

    Apache Spark

  • icon-tool

    Intellij

  • icon-tool

    Kafka

  • icon-tool

    Kubernetes

  • icon-tool

    Docker

  • icon-tool

    Logstash

  • icon-tool

    ElasticSearch

  • icon-tool

    Azure Cosmos DB

  • icon-tool

    Jenkins

  • icon-tool

    Chef

  • icon-tool

    Eclipse

Work History

7.5Years

Senior Associate Technology

Synechron
Nov, 2021 - Present3 yr 8 months
    Financial data movement from landing zone to in-memory data grid (vmware Gemfire).

Lead Software Engineer

Persistent Systems Ltd.
Jul, 2016 - Nov, 20215 yr 4 months
    Developed applications, including 'DELL - Parquet file generator' and 'DELL - Azure Cosmos Writer'.

Major Projects

5Projects

Capital markets Data pipeline

    Data moved from raw files to Oracle DB, then to HDFS in Avro format, further processed to Hbase tables, and finally pushed to vmware Gemfire.

DELL - Parquet file generator

    Developed spark batch job to generate parquet files and integrated with Kafka, Logstash, and ElasticSearch for metadata.

DELL - Azure Cosmos Writer

    Developed spark streaming application to persist messages from Azure eventhub into cosmosDB.

Kantar - Data Pipeline for Audience Measurement

    Data extrapolation for TV rating points, using Jenkins, Chef, Shell for data pipeline and Java with Spark for business logic.

Water Metering and Quality Management System (IoT)

    IoT system to track Water Quality and metering, using Arduino, Raspberry PI3, XBEE, and integrated with AWS.

Education

  • Bachelor of Technology (Computer Engineering)

    Vishwakarma Institute of Technology

Certifications

  • Microsoft certified: azure fundamentals (az-900)