profile-pic
Vetted Talent

Prem Vijaykar

Vetted Talent
Seeking quality environment that enables me to come up with the emerging as well as latest technologies that play vital role for the organization's growth and which gives me the scope of widening the spectrum of my skills and knowledge and to successful employee in the emerging cutting edge technologies.
  • Role

    Senior Data Engineer

  • Years of Experience

    4.6 years

Skillsets

  • Apache Airflow
  • Azure DataBricks
  • DWH Concepts
  • Informatica
  • Linux
  • Python
  • Snowflake
  • SQL
  • dbt
  • PowerCenter
  • PySpark

Vetted For

9Skills
  • Roles & Skills
  • Results
  • Details
  • icon-skill_image
    Senior Data Engineer With Snowflake (Remote)AI Screening
  • 50%
    icon-arrow-down
  • Skills assessed :Azure Synapse, Communication Skills, DevOps, CI/CD, ELT, Snowflake, Snowflake SQL, Azure Data Factory, Data Modelling
  • Score: 45/90

Professional Summary

4.6Years
  • May, 2025 - Present1 yr

    Senior Data Engineer

    Datafortune
  • Jan, 2023 - May, 20252 yr 4 months

    Data Engineer

    Tech Mahindra

Applications & Tools Known

  • icon-tool

    Extract, Transform, Load (ETL)

  • icon-tool

    Microsoft Azure

  • icon-tool

    Data Lakes

  • icon-tool

    Azure Data Factory

  • icon-tool

    Data Management

  • icon-tool

    Data Architecture

  • icon-tool

    Data Modeling

  • icon-tool

    DataStage

  • icon-tool

    SQL

  • icon-tool

    Teradata

  • icon-tool

    Python

  • icon-tool

    Star Schema

  • icon-tool

    Data Ingestion

  • icon-tool

    Modeling Tools

  • icon-tool

    Batch Processing

  • icon-tool

    Query Tuning

  • icon-tool

    UDF

  • icon-tool

    Stored Procedures

  • icon-tool

    Datasets

  • icon-tool

    Data Quality Assurance

  • icon-tool

    Data Quality

Work History

4.6Years

Senior Data Engineer

Datafortune
May, 2025 - Present1 yr
    Designed and delivered AI-driven POCs including oil spill detection and face recognition using Python, Snowflake, and Cortex AI models. Ingested and processed unstructured data into Snowflake, enabling AI-based oil spill detection using Cortex AI. Automated data ingestion by developing Python scripts with Microsoft Graph API to fetch email attachments and load them into Snowflake. Built an AI face detection solution for an airline to track individuals with unique IDs, detect suspicious activities/objects, and identify known vs. unknown faces.

Data Engineer

Tech Mahindra
Jan, 2023 - May, 20252 yr 4 months
    Worked on a Telecom Domain Migration Project responsible to successfully implement Unified Data Platform where we migrated teradata objects to Azure Databricks and then monitoring the jobs using the orchestration tool Airflow. Migration of teradata objects to Azure Databricks. Worked on creation of Ingestion and Transformation Pipelines in Azure Data Factory. Monitoring pipelines using the orchestration tool Apache Airflow and making sure right data being Extracted and Loaded. Proficient in Python and PySpark for data transformation and optimizing data quality for larger dataset. Achievements: Successful migration of Teradata objects to Azure Databricks. Quality of delivery, Met the SLAs, Zero escalations from customer.

Major Projects

2Projects

Optus UDP-Migration

AI Based Symmetric Answer Evaluation System \x0c"

Education

  • Bachelor of Engineering

    Prof.Ram Meghe Institute Of Technology And Research (2020)

Certifications

  • Microsoft Certified: Azure Data Engineer Associate

    Microsoft
  • Microsoft Certified: Azure Data Engineer Associate

    Microsoft
  • Microsoft Certified: Azure Data Engineer Associate

    Microsoft

AI-interview Questions & Answers

Thanks for watching!

Hi, I am Prem Vijayakumar. I have three years of experience as a data engineer. Currently, I am working in...

I need more time. Version control like gate to track changes to your SQL scripts and other artifacts related to data transformation, scripting, writing data transformation in SQL scripts or stored procedures by keeping the transformation in script. We can easily track changes and roll back to previous version if needed. Naming conventions, establish clean, clear naming conventions for your other objects for your SQL scripts, stored procedures, and other objects. Documentation, document your data transformation. Testing, implementing automated testing for your data transformation. Continuous integration, continuous deployment, integrate version control system with CI-CD pipeline to automate deployment and testing of data transformation.

What do you recommend for the use of try-catch block, Snowflake's try-catch block similar, and support for try-catch block. These blocks are used to encapsulate the code that might raise an error and catch specific types of errors to handle them appropriately. Logging and alerting, log error messages and other information to a logging table or an external logging system. Transaction management wraps executable statements within explicit transactions to ensure data consistency, graceful error handling, error recovery, monitoring, and metrics, testing and validation. These are the practices that would be recommended for Snowflake for error handling in Snowflake.

And we're gonna get started. To optimize ELT processes while landing semi-structured data, the right storage is key. You should store semi-structured data in Snowflake using variant data types, which allows flexibility in handling different types of semi-structured data, such as JSON and XML. You can consider using separate tables or stages for different types of semi-structured data to optimize querying performance. Snowflake's clustering and partitioning feature can optimize query performance. When semi-structured data clustering can help organize data physically on disk, reducing the amount of data scanned during queries, partitioning can improve query performance by dividing data into smaller, more manageable chunks. For data ingestion, use Snowflake's data ingestion, specifically the COPY INTO command, for efficient bulk data ingestion from various sources, including semi-structured data files stored in cloud platforms like Amazon S3 or Azure Blob Storage. This command supports parallel data loading, compression, and automatic file format detection. These are all the ways to optimize ELT processes.

It's an interesting system to run, sound and audio feedback is good. The board is very sound. What's interesting about it is that I actually tapped with my hand. This system provides revolutionary sound. If you look at my hands on my laptop post. Snowflake information schema, Snowflake performance dashboards, Snowflake worksheets, Snowflake query profiling are the techniques that are used to ensure Snowflake performance training and query optimization.

The board is attempting to create a materialized view from the raw data table where the status is active and the created date is within the last 30 days. The issue is with the closing curly brace which could cause a syntax error, plus the "get data" function is a SQL Server function; if the database system doesn't support it, the query will fail, also the date diff function usage might differ based on the database system. After that, there is a missing closing parenthesis after the table reference "raw data".

I'm going to do a little bit of a tour of the facility.

clustering keys, unique constraint, materialized views, ATL process optimizations, Snowflake's deduplication feature, and monitoring and optimization. With these techniques and utilizing Snowflake's built-in features, we can implement data deduplication effectively while minimizing the impact on query performance.

To use a data built-in tool in conjunction with Snowflake to transform data for complex reporting needs, we can use these general steps: setup, project initialization, connection configuration, modeling, testing, documentation, running dbt, deployment, scheduled runs, monitoring, and maintenance.

connectivity, ADF, ADM, ADM2, ADM3, ADM4, ADM5, ADM6, ADM7, ADM8, ADM9, ADM10, ADM11, ADM9, ADM10, ADM11, ADM11+, ADM5, ADM10+, ADM11+, ADM5, ADM6, ADM5, ADM7, ADM8, ADM1, ADM6, ADM5, ADM7. So the HTTP interfere data platform offers various connectors including HTTPS, REST, and web activities which can be used to interact with third-party APIs. Authentication. Many APIs require authentication. ADF supports various authentication methods, allowing you to securely authenticate with some APIs. Paginate results to limit the number of records returned. In each response, ADF allows you to handle pagination using looping constructs or custom scripts within pipeline activities to retrieve all desired data, rate limits, error handling, monitoring, and logging, custom activities, data processing.