Data Scientist/ Machine Learning Engineer: Summary
Data Scientist/ Machine Learning Engineer: Summary
Data Scientist/ Machine Learning Engineer: Summary
MANOJ KUMAR
chalamalamanojkumar@gmail.com
2168009620
Data Scientist/ Machine Learning Engineer
SUMMARY:
Data Scientist with around 6 years of experience in areas including Data Analysis, Statistical Analysis,
Machine Learning, Deep Learning, Data mining with large data sets of structured and unstructured data
Developed various Machine Learning applications with Python Scientific Stack and R.
Experienced with Deep Learning frameworks like Scikit Learn, Tensorflow and Keras.
Experienced DataAnalyst with solid understanding of Data Mapping, Data warehousing (OLTP, OLAP),
DataMining, DataGovernance and Data management services with Quality Assurance.
Experience with Machine Learning algorithms such as logistic regression, KNN, SVM, random forest, neural
network, linear regression, lasso regression and k-means.
Experience in implementing data analysis with various analytic tools, such as Anaconda 4.0 Jupiter Notebook
4.X, R 3.0 (ggplot2, dplyr, Caret) and Excel
Adept in Statistical Data Analysis, Exploratory Data Analysis, Machine Learning, Data Mining, Java and Data
visualization using R, Python, Base SAS, SAS Enterprise Guide and SAS Enterprise Miner, Tableau and SQL
Experienced the full software lifecycle in SDLC, Agile, DevOps and Scrum methodologies including Experience
in Big Data technologies like Spark 1.6, Spark SQL, PySpark, Hadoop 2.X, HDFS, Hive 1.X.
Strong skills in statistical methodologies such as A/B test, experiment design, hypothesis test, ANOVA
Working Experience on Python 3.5/2.7such as NumPy, SQLAlchemy, Beautiful soup, pickle, Pyside,
Pymongo, SciPy, PyTables.
Highly skilled in using visualization tools like Tableau, ggplot2 and d3.js for creating dashboard.
Experience in foundational machine learning models and concepts: regression, random forest, boosting, and
deep learning.
Good knowledge of Hadoop architecture and various components such as HDFS, Job Tracker, Task Tracker,
Name Node, Data Node, Secondary Name Node, MapReduce concepts, and ecosystems including Hive and
Pig.
Ability to write and optimize diverse SQL queries, working knowledge of RDBMS like SQL Server 2008, NoSQL
databases like MongoDB
Experience in Data Warehousing including Data Modeling, Data Architecture, Data Integration (ETL/ELT) and
Business Intelligence.
Good Experience in using various Python libraries (Beautiful Soup, NumPy, Scipy, matplotlib, Python-
twitter, Pandas, MySQL dB for database connectivity).
Having experienced in Big Datatechnologies including Apache Spark, HDFS, Hive, and MongoDB.
Used the version control tools like Git2.X and build tools like Apache Maven/Ant.
Proficient in data mining tools like R, SAS, Python, SQL, Excel, Java,ecosystems Staff leadership and Java
development.
Good Knowledge and experience in deep learning algorithms such as Artificial Neural network (ANN),
Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN), LSTM and RNN based speech
recognition using TensorFlow.
Strong working knowledge with SQL,SQLServer, Oracle, SAS, Tableau and Jupyter while handling various
applications in multiple projects
EDUCATION:
Bachelor’s in Computer Science, SASTRA University, India, 2011
TECHNICAL SKILLS:
Programming & Scripting R (Packages: Stats, Zoo, Matrix, data, table, OpenSSL), Python, SQL, C, C++,
languages JAVA, JCL, COBOL, HTML, CSS, JSP, Java Script, Scala
Cloud Technologies AWS (EC2, S3, RDS, EBS,VPC, IAM, Security Groups), Microsoft Azure, Rackspace
Database SQL, MySQL, TSQL, MS Access, Oracle, Hive, MongoDB, Cassandra, PostgreSQL
Statistical Software SPSS, R, SAS
Development Tool R Studio, Notepad++, Python, Jupiter, Spyder IDE
Python Packages Numpy, SciPy, Pandas, scikit-learn, Matplotlib, seaborn, statsmodels, Keras,
TensorFlow, Theano, TensorFlow, NLTK, Scrapy
Techniques Machine learning, Regression, Clustering, Data mining
Data Science/Data Generalized Linear Models, Logistic Regressions, Boxplots, K-Means, Clustering,
Analysis Tools & SVN, PuTTY, WinSCP, Redmine (Bug Tracking, Documentation, Scrum), Neural
Techniques networks, AI, Teradata, Tableau
Algorithms Skills Machine Learning, Neural Networks, Deep Learning, NLP, Bayesian Learning,
Optimization, Prediction, Pattern Identification, Data / Text mining, Regression,
Logistic Regression, Bayesian Belief, Clustering, Classification, Statistical
modeling
Machine Learning Naïve Bayes, Decision trees, Regression models, Random Forests, Time-series,
K-means
Operating Systems Windows, Linux, Unix, Macintosh HD, Red Hat
WORK EXPERIENCE: