Akshay Godugu Phone: (424) 272-5152: Required Skills/Experience # Years
Akshay Godugu Phone: (424) 272-5152: Required Skills/Experience # Years
Akshay Godugu Phone: (424) 272-5152: Required Skills/Experience # Years
Phone : (424)272-5152
Email : akshaygodugu45@gmail.com.
Summary
Around 8 Years of experience in Machine Learning, Data mining with large datasets of Structured and
Unstructured data, Data Acquisition, Data Validation, Predictive modeling, Data Visualization.
Used Pandas, NumPy, Seaborn, SciPy, Matplotlib, Scikit-learn, NLTK in Python for developing various
machine learning algorithms and utilized machine learning algorithms such as Linear Regression,
Multivariate Regression, Naive Bayes, Random Forests, K-Means, & KNN for Data Analysis.
Responsible for design and development of advanced R/Python programs to prepare transform and harmonize
data sets in preparation for modeling.
Hands on experience in implementing LDA, Naive Bayes and skilled in Random Forests, Decision Trees,
Linear and Logistic Regression, SVM, Clustering, neural networks, Principle Component Analysis and
good knowledge on Recommender Systems.
Proficient in Statistical Modeling and Machine Learning techniques (Linear, Logistics, Decision Trees,
Random Forest, SVM, K-Nearest Neighbors, Bayesian, XG Boost) in Forecasting/ Predictive Analytics,
Segmentation methodologies, Regression based models, Hypothesis testing, Factor analysis/ PCA, Ensembles.
Expertise in transforming business requirements into Analytical Models, Designing Algorithms, Building
Models, Developing Data Mining and reporting solutions that scales across massive volume of structured
and unstructured data.
Developed Logical Data Architecture with adherence to Enterprise Architecture.
Strong experience in Software Development Life Cycle (SDLC) including Requirements Analysis, Design
Specification and Testing as per Cycle in both Waterfall and Agile methodologies.
Adept in statistical programming languages like Rand also Python including Big Data technologies like
Hadoop, Hive.
Skilled in usingdplyr and pandas in R and Python for performing Exploratory data analysis.
Experience working with data modeling tools like Erwin, Power Designer and ERStudio.
Experience in designing star schema, Snow flake schema for Data Warehouse, ODS architecture.
Experience in designing stunning visualizations using Tableau software and publishing and presenting
dashboards, Storyline on web and desktop platforms.
Improved fraud prediction performance by using random forest and gradient boosting for feature selection
with Python Scikit-learn.
Designed and implemented system architecture for Amazon EC2 based cloud-hosted solution for the client.
Analysed large data sets apply machine learning techniques and develop predictive models, statistical models
and developing and enhancing statistical models by leveraging best-in-class modelling techniques.
Wrote Python modules to extract/load asset data from the MySQL source database.
Highly skilled in using Hadoop (pig and Hive) for basic analysis and extraction of data in the infrastructure to
provide data summarization.
Highly skilled in using visualization tools like Tableau, ggplot2 and d3.JS for creating dashboards.
Worked and extracted data from various database sources like Oracle, SQL Server, DB2, Regularly accessing
JIRA tool and other internal issue trackers for the Project development.
Skilled in System Analysis, E-R/Dimensional Data Modeling, Database Design and implementing RDBMS
specific features.
Knowledge of working with Proof of Concepts (PoC's) and gap analysis and gathered necessary data for
analysis from different sources, prepared data for data exploration using Data Munging and Teradata.
Well experienced in Normalization & De-Normalization techniques for optimum performance in relational
and dimensional database environments.
Technical Skills:
Scripting/programming R (dplyr, ggplot2, shiny, plotly), Python (Numpy, Scipy, Pandas, Scikit-learn,
language Matplotlib, NLTK, Beautiful Soup, Selenium, Python IDE), Pyspark
Machine learning/Deep Classification, Regression(Linear, Logistic, Elastic Net), Clustering analyses using
learning neuralnets (MLP), RF, KNN, SVM, GLM, MLR, Logit, K-means algorithms
Database management RDBMS (Microsoft SQL server, Oracle DB, Teradata)
systems
Big Data MySQL, Spark, Hadoop/MapReduce, Hive, Impala
Statistical Analysis Tools SAS Studio, SAS Enterprise Guide, SAS Enterprise Miner, Python, R, ggplot2,
dplyr,cart, scipy,sklearn
Data storage/processing Hadoop And Spark
framework
Data Tableau, Power BI and shiny
visualization/reporting
Operating System Windows, Unix
Case Tools Erwin &ERStudio
Professional Experience:
EDUCATION
JawaharlaNehru Technological University, Hyderabad, India Aug 2008 – May 2012