Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

S K Resume

Download as pdf or txt
Download as pdf or txt
You are on page 1of 1

Shefali Kamalnakhawa

Jersey City, NJ 07306 | shefalipk@outlook.com | 201-736-5795 | HackerRank | LinkedIn | Github

EDUCATION
Pace University, Seidenberg School of Computer Science and Information Systems | Master’s in data science |GPA: 3.93 |New York, NY| Dec 2021
PVPP College of Engineering | Bachelor of Engineering (B.E.) in Information Technology | GPA: 3.0|Maharashtra, India| May 2018

CERTIFICATION
Microsoft Certified: Azure Fundamentals| Microsoft Certified: Azure Data Engineer |Oracle Cloud Infrastructure Certified Data Engineer Professional

OBJECTIVE
Dynamic and results-oriented Data Engineer with a 4+years of experience in architecting and implementing cutting-edge data solutions. Seeking a
challenging role to leverage expertise in data pipeline design, optimization, and automation. Adept at utilizing leading-edge technologies, such as
Azure Data Factory, Databricks, and Snowflake, to drive innovation and deliver tangible business outcomes.

TECHNICAL SKILLS
Programming Languages: Python, SQL, R, HTML, CSS, JavaScript, Node.JS, Angular, React, PL/SQL.
Data Engineer Skills: ETL, Data modeling (star & snowflake schema), Datawarehouse (Snowflake & Amazon Redshift), DBMS (MYSQL & PostgreSQL),
Clouds (Azure, Amazon & Google), Version Control (GitHub), Visualization tool(PowerBI)
Libraries & Algorithms used: TensorFlow, Timeseries, NumPy, Pandas, Matplotlib, Seaborn, Sklearn, Linear Regression,
Tools & Platforms: Microsoft Azure, Databricks, Denodo, Snowflake, Tableau, Power BI, GitHub, GCP, Neo4j, AWS, MS office, Unix, SAS, MS Visual
Studio, FIGMA, MIRO, JIRA.

EXPERIENCE
GlaxoSmithKline, Data Engineer Feb 2022- Present
• Designed and implemented end-to-end data pipelines using Azure Data Factory, Databricks, and Snowflake, resulting in a 30%
reduction in data processing time.
• Developed and maintained ETL pipelines on Azure Data Factory (ADO) to ingest, transform, and load data from Oracle, Denodo and
Hyperion to obtain PowerBI report for pharmaceutical data.
• Improved data quality & accuracy by implementing automated data validation & cleansing routines, leading to a 20% reduction in
data errors.
• Collaborated with data architects to optimize Azure-based data warehousing solutions, resulting in a 30% reduction in query
response times.
• Developed star and snowflake schemas to structure data, optimizing query performance and supporting complex analytics.
• Integrated SQL queries seamlessly into ETL pipelines, ensuring smooth data flow between various systems and platforms.
• Utilized Power BI DAX language to create custom measures and calculated columns for advanced analytics.
• Designed & Implemented CI/CD pipelines in Azure DevOps to automate the deployment of data ingestion and transformation
processes.
• Utilized Denodo data virtualization to provide real-time access to pharmaceutical data across the organization, resulting in a 40%
reduction in data latency.
• Orchestrated end-to-end deployment workflows, including source code integration, testing, and deployment to prod environments.
• Conducted regular performance tuning and optimization of data pipelines, achieving a 99.9% data availability rate.
• Designed & established CI/CD pipelines tailored to the reporting team's needs, automating the build & deployment processes for
vaccine-related reports.
• Set up monitoring and alerting within Azure DevOps to track deployment metrics, including success rates and deployment durations.

Jaro FinCap Pvt Ltd., Maharashtra, India, Data Analyst (Data Science Team) March 2017 — January 2019
• Proficiently analyzed loan data using SQL queries, generating day-to-day reports for strategic decision-making.
• Evaluated analytical model findings within loan product reports, supporting informed decision-making.
• Standardized ETL processes and automated data extraction, reducing manual reporting efforts and saving labor monthly.
• Generated daily loan product reports, identified bottlenecks, and improved customer navigation times within loan applications.
• Leveraged Python for data visualizations and created interactive reports and dashboards using Tableau, Power BI, and other
visualization tools.

ACADEMIC PROJECTS
Analysis of Sales of Video Game Genre (Tableau dashboard, Python, KNN, Unit test, SDLC) Project link
• Implemented Data Science Life Cycle (DSLC), including Business Problem, Data Collection, Research paper, Data Preparation (data cleaning,
visualization), Build a model, Testing (Unit test on code), Deploy.Developed & implemented predictive model using machine learning
algorithm KNN (Brute & K-d tree) to predict game sales.
“Texter”- Unstructured (Twitter) & Structured (Netflix) Data (Python, Google Colab) Project link
• Employed modeling techniques, including building a Naive Bayes classifier in Python on Google Colab, to interpret data from Twitter,
achieving an 89.9% accuracy rate in determining the relevancy of Tweets containing the term "Python" through F-measure evaluation of
precision and recall.

You might also like