Syllabus
Syllabus
1ST YEAR
Theory
1 DSI-101 Fundamental of 3 0 0 3 3
Computing
2 DSI-102 Mathematics for Data 3 1 0 4 4
Scientists-I
3 DSI(HU)-103 English Communication 3 1 0 4 4
4 DSI-104 Introduction to Data 3 0 0 3 3
Science
5 DSI-105 Computational Statistics-I 3 0 0 3 3
Practical
1 DSI-191 Fundamental of 0 0 2 2 1
Computing
2 DSI-192 Introduction to Data 0 0 2 2 1
Science (with tool based
excel Lab)
3 DSI-193 Computational 0 0 2 2 1
Statistics-I (R-Lab)
Total 20
SEMESTER II
Theory
1 DSI-201 Mathematics for Data 3 1 0 4 4
Scientists-II
2 DSI-202 Data Structures and 3 0 0 3 3
Algorithms
3 DSI(HU)-203 Environmental Science 3 1 0 4 4
4 DSI-204 Computational Statistics- 0 0 3 3
3
II
5 DSI-205 Management Concepts & 3 1 0 4 4
Marketing Research
Practical
1 DSI-291 Data Structures and 0 0 2 2 1
Algorithms
2 DSI-292 Computational Statistics- 0 2 2 1
0
II (R-Lab)
Total 20
1 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
2NDYEAR
2 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
3RDYEAR
3 DSI-593 Elective-2 0 0 2 2 1
Total 20
SEMESTER VI
1 Advanced Data Analytics (Stream, 3 0 0 3 3
DSI-601 Sensor (IoT) and Spatio-temporal
Analysis) & Real Time data
Processing
2 DSI(PE)-602A Advance Neural Networks & Deep 3 1 0 4 4
Learning
DSI(PE)-602B Predictive Modelling (with SAS,
SPSS)
DSI(PE)-602C Data Mining & Data Warehouse
Practical
1 DSI-691 Project 0 0 24 24 12
2 DSI-692 Advanced Data Analytics (Stream, Sensor 0 0 2 2 1
and Spatio-temporal Analysis) & Real
Time data Processing
Total 20
Total credits in the Program: 120 credits
Honours degree requirements: Additional 16 credits from MOOCS of the MOOCS basket
3 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Detailed Syllabus
SEMESTER-I
Module 2- General problem Solving concepts: Algorithm and Flowchart for problem
solving with Sequential Logic Structure, Decisions and Loops, time & space complexity;
Imperative languages: Introduction to imperative language; syntax and constructs of a
specific language (ANSI C). [4L]
Module 3- Types Operator and Expressions with discussion of variable naming and
Hungarian Notation: Variable Names, Data Type and Sizes (Little Endian Big Endian),
Constants, Declarations, Arithmetic Operators, Relational Operators, Logical Operators,
Type Conversion, Increment Decrement Operators, Bitwise Operators, Assignment Operators
and Expressions, Precedence and Order of Evaluation, proper variable naming and Hungarian
Notation [4L]
Module 6- Arrays and Pointers: Arrays, Pointers and address, Pointers and Function
Arguments, Pointers, Address Arithmetic, character Pointers and Functions, Pointer Arrays,
Pointer to Pointer, Multi-dimensional array and Row/column major formats, Initialization of
Pointer Arrays, Command line arguments, Pointer to functions, complicated declarations and
how they are evaluated. [8L]
Module 8- Input and Output: Standard I/O, Formatted Output – printf, Formated Input –
scanf, Variable length argument list, file access including FILE structure, fopen, stdin, sdtout
and stderr, Error Handling including exit, perror and error.h, Line I/O, related miscellaneous
4 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
functions, scope of advance C, a brief introduction of VDU basics, Mouse programming, C-
assembly. [6L]
Laboratory Outcomes
• To formulate the algorithms for simple problems
• To translate given algorithms to a working and correct program
• To be able to correct syntax errors as reported by the compilers
• To be able to identify and correct logical errors encountered at run time
• To be able to write iterative as well as recursive programs
• To be able to represent data in arrays, strings and structures and manipulate
them through a program
• To be able to declare pointers of different types and use them in defining self-
referential structures.
• To be able to create, read and write to and from simple text files.
Programming Method: Debugging, macro, User defined Header, User defined Library
Function, make file
Competitive Programming Laboratory
1. Algorithm and flowcharts of small problems like GCD
2. Structured code writing with:
a. Small but tricky codes
b. Proper parameter passing
c. Command line Arguments
d. Variable parameter
e. Pointer to functions
f. User defined header
g. Make file utility
h. Multi file program and user defined libraries
i. Interesting substring matching / searching programs
j. Parshing related assignments
Familiarization of Computer Hardware & components
Text Books:
1. Herbert Schildt, “C: The Complete Reference”, Fourth Edition, McGraw Hill.
2. B. Gottfried, “Programming in C”, Second Edition, Schaum Outline Series.
Reference Books:
1. B. W. Kernighan and D. M. Ritchi, The ‘C Programming Language”, Second Edition, PHI.
2. Yashavant Kanetkar, “Let Us C”, BPB Publications.
5 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Module-1: Boolean algebra: Introduction of Boolean algebra, truth table, basic logic gate,
basic postulates of Boolean algebra, principle of duality, canonical form, Karnaugh map.
[10L]
Module-2: Calculus: Basic concept of Differential calculus and integral calculus, application
of double and triple integral. [10L]
Module-4: Combinatorics: Basic counting, balls and bins problems, generating functions,
recurrence relations. Proof techniques, principle of mathematical induction, pigeonhole
principle. [12L]
Text Books:
1. I. N. Herstein, “Topics in Algebra”, John Wiley and Sons.
2. M. Morris Mano, “Digital Logic & Computer Design”, Pearson
3. B. S. Grewal, “Higher Engineering Mathematics”, Khanna Publication, Delhi.
Reference Books:
1. Gilbert Strang: Introduction to linear algebra
2. Peter V. O'Neil, “Advanced Engineering Mathematics”, Seventh Edition, Thomson
Learning.
3. M. D. Greenberg, “Advanced Engineering Mathematics”, Second Edition, Pearson
Education.
4. P. N. Wartikar and J. N. Wartikar, “Applied Mathematics”. Vol. I & II, Vidyarthi
Prakashan.
6 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Unit/ Module 2: Spelling and Punctuation/ Spelling Pitfalls, Grammar Revisited - Review of
parts of speech. Proper pronunciation from language lab. Hearing fluent English and
identifying and answering questions. Understanding the proper way to utilize punctuation and
spelling Pitfalls. (8L)
Unit/ Module 3: Functional English - Language functions: descriptive, expressive and social,
Types of language functions: to inform, enquire, attract, influence, regulate and entertain.
Understanding the importance of communication. Communication in an organization. Types
of communication (8L)
Unit/ Module 4: Reading Skills - Strategies for developing reading skills, Skimming and
scanning, Predicting, Inferring, Reading critically. Reading passages, comprehension and
letters. Reading with proper pronunciation. (6L)
Unit/ Module 5: Book reading, Shakespearian Literature reading. Reading silently, sub-
vocalization, Reading at speeds of at least 250 words per minute, Inferring meaning or
content after reading the heading, Guessing meaning of unfamiliar words from context,
Identifying the central idea as well as supporting ideas, Spelling pitfalls, Preparing notes in
diagrammatic form after reading a text, showing the central idea and supporting ideas and the
relationships between them. (10L)
Text Books:
1. Scot Ober, Contemporary business communication, fifth edition, biztantra.
Reference Books:
1. Alan Mc’carthy and O’dell-English vocabulary in use
7 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Module-1: Introduction to data science, Exploratory data analysis, Linear regression and
regularization, Model selection and evaluation (12L)
Module-2: Classification: kNN, decision trees, SVM; Ensemble methods: random forests,
Naïve Bayes and logistic regression (12L)
Module-4: Text mining and information retrieval, Network Analysis, Recommender systems
(8L)
Laboratory:
Measuring association between variables: Topics include types of data, data cleaning,
recoding and sorting, data visualization, summarizing data and an introduction to analysis of
relationships between variables (Excel).
Pricing analytics: focuses on utilizing Excel to estimate demand curves and determine profit
maximizing pricing strategies that result in higher sales and improved profitability.
Turning customer feedback into actionable insight: analysis will begin with a sales forecast
using historical data (Excel)
Text Books:
1. James, G., Witten, D., Hastie, T., Tibshirani, R. An introduction to statistical learning with
applications in R. Springer, 2013.
2. 2. Han, J., Kamber, M., Pei, J. Data mining concepts and techniques. Morgan Kaufmann,
2011.
3. 3. Hastie, T., Tibshirani, R., Friedman, J. The Elements of Statistical Learning, 2nd edition. —
Springer, 2009.
Reference Books:
1. Murphy, K. Machine Learning: A Probabilistic Perspective. - MIT Press, 2012.
2. F. Provost, T Fawcett, “Data Science for business”, 2013
8 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Module-3: Sampling Techniques: Random sampling. Sampling from finite and infinite
populations. Estimates and standard error (sampling with replacement and sampling without
replacement), Sampling distribution of sample mean, stratified random sampling (10L)
Text Books:
1. S. M. Ross, “Introduction of Probability Models”, Academic Press, N.Y.
2. A. Goon, M. Gupta and B. Dasgupta, “Fundamentals of Statistics”, vol. I & II, World
Press.
Reference Books:
1. S. M. Ross, “A first course in Probability”, Prentice Hall.
2. I. R. Miller, J.E. Freund and R. Johnson, “Probability and Statistics for Engineers”.
Fourth Edition, PHI.
3. A. M. Mood, F.A. Graybill and D.C. Boes, “Introduction to the Theory of Statistics”,
McGraw Hill Education.
9 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
SEMESTER-II
Text Books:
1. B. S. Grewal.-Higher Engineering Mathematics
Reference Books:
1. Peter V. O'Neil.-Advanced Engineering Mathematics, 7th Edition
2. Michael. D. Greenberg -Advanced Engineering Mathematics, 2nd Edition
3. Gilbert Strang -Introduction to linear algebra, 5th Edition
4. P. N. Wartikar& J. N. Wartikar -Applied Mathematics (Vol. I & II)
5. R C Gonzalez and R E Woods -Digital Image Processing
10 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Paper: Data Structures and Algorithms
Code: DSI-202
Contacts Hours / Week: 3L
Credits: 3
Laboratory
1. Towers of Hanoi using user defined stacks.
2. Reading, writing, and addition of polynomials.
3. Line editors with line count, word count showing on the screen.
4. Trees with all operations.
5. All graph algorithms.
6. Saving / retrieving non-linear data structure in/from a file
Text Books:
1. E. Horowitz and S. Sahni, Fundamentals of Data Structures, 1977.
2. Alfred V. Aho, John E. Hopperoft, Jeffrey D. UIlman, Data Structures and Algorithms
Reference Books:
1. Donald E. Knuth-The Art of Computer Programming: Volume 1: Fundamental
Algorithms
2. Thomas, H. Cormen, Charles E. Leiserson, Ronald L. Rivest, Clifford Stein-Introduction
to Algorithms
3. Pat Morin-Open Data Structures: An Introduction (Open Paths to Enriched Learning),
31st ed. Edition
11 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Module–1: Defination, Principles and scope of Environmental Science. Earth, Man and
Environment. Ecosystems, Mass and Energy transfer across the various interfaces, material
balance. First and Second law of thermodynamics, heat transfer processes. Scale of
Meteorology, pressure, temperature, precipitation, humidity, radiation and wind. Atmospheric
stability, inversions and mixing height, windroses. Natural resources, conservation and
sustainable development. (4L)
Module–3: Definition, Principles and scope of ecology, Human ecology and Human
settlement, Evolution, Origin of life and speciation. Ecosystem, Common flora and fauna in
India Aquatic Microflora of Atmosphere : Air Sampling techniques, Identification of
aeroallergens. Air-borne diseases and allergies. Environmental Biotechnology : Fermentation
Technology, Vermiculture technology, Biofertilizer technology. (5L)
Module–4:
Environmental Geosciences, Earth’s Processes and Geological Hazards, Mineral Resources
and Environment : Resources and Reserves, Minerals and Population. Oceans as new areas
for exploration of mineral resources. Ocean ore and recycling of resources. Environmental
impact of exploitation, processing and smelting of minerals. Water Resources and
Environment, Resources of oceans. Ocean pollution by toxic wastes. Human use of surface
and groundwaters. Groundwater pollution. Landuse Planning, Environmental Geochemistry,
Biogeochemical factors in environmental health. Human use, trace elements and health.
Possible effects of imbalance of some trace elements. Diseases induced by human use of
land. Principles of Remote sensing and its application of Environmental Sciences.
Application of GIS in Environmental Management. (6L)
Module-5:
Sun as source of energy; solar radiation and its spectral characteristics; Fossil fuels–
classifaction, composition, physico-chemical characteristics and energy content of coal,
petroleum and natural gas. Principles of generation of hydroelectric power, nuclear energy–
fission and fusion; magnetohydrodynamic power, bio-energy–energy from biomass and
biogas, anaerobic digestion; energy use pattern in different parts of the world. Environmental
implication of energy use; impacts of large-scale exploitation of Solar, Wind, Hydro and
Ocean energy (3L)
12 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Module-6:
Air : Natural and anthropogenic sources of pollution. Acid Rain, Air Quality Standards.
Water : Types, sources and consequences of water pollution. Physico-chemical and
Bacteriological sampling and analysis of water quality. Standards, sewage and waste water
treatment and recycling. Water quality standard. Soil, Soil Pollutiong Control. Industrial
waste effluents and heavy metals, Noise : Sources of noise pollution, measurements of noise
and Indices. Marine : Sources of marine pollution and control. Criteria employed for disposal
of pollutants in marine system–coastal management. Radioactive and Thermal Pollution. (4L)
Module-7:
Introduction to environmental impact analysis, Environmental impact Statement and
Environmental Management Plan. EIA guidelines 1994, Notification of Government of India.
Impact Assessment Methodologies. Generalized approach to impact analysis. Procedure for
reviewing Environmental impact analysis and statement. Guidelines for Environmental audit.
Introduction of Environmental planning. Base line information and predictions (land, water,
atmosphere, energy, etc.) Restoration and rehabilitation technologies. Landuse policy for
India. Urban planning for India. Rural planning and landuse pattern. Concept and strategies
of sustainable development. Cost-Benefit analysis. Environmental priorities in India and
sustainable development. (3L)
Module-8:
Sources and generation of solid wastes, their characterization, Hazardous Waste Management
and Handling Rules, 1989, Resource Management, Disaster Management and Risk analysis.
Environment protection–issues and problems, International and National efforts for
Environment Protection, Provision of Constitution of India regarding Environment (Article
48A and 58A) Environmental Policy Resolution, Legislation, Public Policy Strategies in
Pollution Control, Wildlife Protection Act, Forest Conservation Act, Indian Forests Act
(Revised) 1982, Air (Prevention and Control of Pollution) Act, Motor Vehicle Act. 1988, The
Water (Prevention and Control of Pollution) Act, 1974, Public Liability Insurance Act, 1991
and Rules 1991. (3L)
Module-9:
Basic elements and tools of statistical analysis; Probability, sampling, measurement and
distribution of attributes; Distribution– Normal, t and x2, Poisson and Binomial; Arithmetic,
Geometric and Harmonic means; Introduction to environmental system analysis; Approaches
to development of models; linear simple and multiple regression models, validation and
forecasting. Models of population growth and interactions–Lotka-Volterra model, Leslie’s
matrix model, point source stream pollution model, box model, Gaussian plume model. (3L)
Module-10:
Environmental Education and Awareness. Environmental Ethics and Global imperatives.
Context : Narmada Dam, Tehri Dam, Almetti Dam, Soil Erosion, Formation and reclamation
of Usar, Alkaline and Saline Soil. Waste lands and their reclamation, Vehicular pollution and
urban air quality. Depletion of Nature resources. Biodiversity conservation and Agenda-21.
Waste disposal, recycling and power generation, Fly ash utilization. Rain water harvesting.
Wet lands conservation. Epidemiological issues (e.g., Goitre, Fluorosis, Arsenic) (4L)
13 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Text Books:
1. Enger, E. and Smith, B., Environmental Science: A Study of Interrelationships,
Publisher: McGraw-Hill Higher Education; 12th edition, 2010.
2. Pepper, I.L., Gerba, C.P. & Brusseau, M.L. 2011. Environmental and Pollution
Science. Academic Press.
Reference Books:
1. Richard T Wright, Environmental Science: Towards a Sustainable Future, Prentice-
Hall Inc., 2008.
14 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Module-1: Linear Statistical Models: Simple linear regression & correlation, multiple
regression & multiple correlation, Analysis of variance (6L)
Module-2: Estimation: Point estimation, criteria for good estimates (un-biasedness,
consistency), Methods of estimation including maximum likelihood estimation. (8L)
Module-3: Sufficient Statistic: Concept & examples, complete sufficiency, their application
in estimation (5L)
Module-4: Test of hypothesis: Concept & formulation, Type I and Type II errors, Neyman
Pearson lemma, Procedures of testing (6L)
Module-5: Non-parametric Inference: Comparison with parametric inference, Use of order
statistics. Sign test, Wilcoxon signed rank test, Mann-Whitney test, Run test, Kolmogorov-
Smirnov test. Spearman’s and Kendall’s test. Tolerance region (10L)
Module-6: Basics of Time Series Analysis & Forecasting: Stationary, ARIMA Models:
Identification, Estimation and Forecasting. (5L)
Text Books:
1. I.R. Miller, J.E. Freund and R. Johnson. Probability and Statistics for Engineers (4th
Edition)
2. A. Goon, M. Gupta and B.Dasgupta. Fundamentals of Statistics (Vol. I & Vol. II)
3. Chris Chatfield. The Analysis of Time Series: An Introduction
Reference Books:
1. D.C. Montgomery & E.Peck- Introduction to Linear Regression Analysis
2. A.M. Mood, F.A. Graybill& D.C. Boes. Introduction to the Theory of Statistics
3. N. Draper & H. Smith- Applied Regression Analysis
4. Garrett Grolemund- Hands-on Programming with R
5. Jared P. Lander- R for Everyone: Advanced Analytics and Graphics
Laboratory:
R statistical programming language: Introduction to R, Functions, Control flow and Loops,
Working with Vectors and Matrices, Reading in Data, Writing Data, Working with Data,
Manipulating Data, Simulation, Linear model, Data Frame, Graphics in R
15 |15 P a g e
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY, WB
(Formerly West Bengal University of Technology)
Syllabus for B.Sc.in Data Science (Industry Induced)
(Effective for Students Admitted in Academic Session 2019-2020)
Module-2: Scanning the Business Environment The value chain, Core Competencies
Strategic Planning Process, PESTEL Competition Analysis SWOT Analysis (6L)
Module-4: Buyer Behavior Consumer Behavior Consumer buying process model What
Influences Consumer Behavior Key Psychological Processes The Buying Decision Process:
The Five Stage Model Other Theories of Consumer Decision Making Industrial Buyer
Behavior Concept of Buying Center Industrial buying process model Influence of Economic
and Behavioral Factors Influence of Procurement Organization Role of Negotiation Process
(12L)
Text Books:
1. K. H. Erickson, Marketing Management Concepts and Tools: A Simple Introduction
2. Malhotra, Dash-Marketing Research,
Reference Books:
16 |15 P a g e