profile-pic

Aman Raj

A Big Data enthusiast with more then 3 years of experience as a Hadoop developer having in depth understanding of Hadoop ecosystem including spark-Scala, spark-sql, Hive, Oozie, Hbase, Kafka. Hands on experience in analysis, Design, Coding and testing. Dealt with extracting data from different sources and performing ETL operation on it. Also integrating my Big Data experience with cloud. I got certified in Google cloud Associate cloud engineer (GCP-ACE) and progressing on Professional Data engineer certification.

  • Role

    BigData & Alexa Engineer

  • Years of Experience

    3 years

Skillsets

  • DBMS
  • Apache Kafka
  • Kafka
  • Hadoop
  • Unit Testing
  • testing
  • Google Cloud Platform
  • REST API
  • GCP
  • On
  • Apache Hbase
  • Dataflow
  • Github
  • Cloud
  • Git
  • Apache
  • Scala
  • Google Cloud - 3 Years
  • Jenkins
  • ETL
  • SQL
  • JUnit
  • API
  • C
  • Lambda Function
  • Java
  • AWS
  • R
  • Lambda
  • BigQuery
  • Apache Spark
  • Spark - 3 Years
  • Spark - 3 Years

Professional Summary

3Years
  • Jun, 2023 - Present2 yr 6 months

    BigData Engineer)(1st

    Huawei Technologies
  • Mar, 2021 - Aug, 20232 yr 5 months

    Project Engineer

    Wipro Limited

Work History

3Years

BigData Engineer)(1st

Huawei Technologies
Jun, 2023 - Present2 yr 6 months
    • As a Bigdata R&D Developer, working on Apache HBase open source project.
    • Enhanced the existing features to meet customer requirements
    • Contributed in improving query processing and finding bugs in open source code.
    • Used Junit as testing framework for Unit testing.
    • Included new features based on Indexing, querying multiple get request at once through REST API etc.
    • Apache Kafka and Apache Spark Streaming

Project Engineer

Wipro Limited
Mar, 2021 - Aug, 20232 yr 5 months
    • Worked as a developer, performing ETL operation in Hadoop.
    • Used Apache Kafka-HDFS integration to ingest Structured/Semi-Structured data from different sources into HDFS, which is further used by Apache Spark to implement Business logic through various joins and transformation.
    • Used Hive as Data warehouse and HQL to analysis the data.
    • Dealing with data pipeline Used Apache Oozie workflow which made ETL process significantly fast.
    • Ensured 100% of data was processed correctly and transferred on time


    Web Applications

    Google Cloud Platform (GCP)

    Amazon Web Services (AWS)

    Git

    MapReduce

    Big Data

    Communication

    Unit Testing

    SQL

    Jenkins

    Data Warehousing

    Amazon S3

    Cloud Computing

    Apache Kafka

    BigTable

    Apache Spark

    Data Engineering

    GitHub

    Hive

    Continuous Integration and Continuous Delivery (CI/CD)

    HBase

    Google BigQuery

    Hadoop

    IBM UrbanCode Deploy (uDeploy)

    Scala

    Extract, Transform, Load (ETL)

    MySQL

    Linux

Achievements

  • Google Cloud Platform certified- Associate cloud Engineer

Major Projects

1Projects

Amazon Alexa

Jan, 2024 - Nov, 20251 yr 10 months

    It is live news skill developed in Amazon Alexa developer console. It plays news by using http live streaming audio link. It uses AWS lambda function to run server side code

Certifications

  • NPTEL (By IIT Kharagpur