0% found this document useful (0 votes)

3 views

UNIT 2_Advanced Data Structures

This document provides an overview of advanced data structures in R, covering basic mathematical operations, variable assignment, and data types including numeric, character, Date/POSIXct, and logical types. It explains how to manipulate variables, including removal and checking their types, as well as the concept of vectors in R. The document emphasizes the flexibility of R as a programming language and its vectorized operations, which allow for efficient data manipulation.

Uploaded by

DEVIBALA SUBRAMANIAN

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

UNIT 2_Advanced Data Structures

Uploaded by

DEVIBALA SUBRAMANIAN

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 23

UNIT 2

ADVANCED DATA
STRUCTURES ON R
Ms Devibala Subramanian
Assistant Professor
PG & Research Department of Computer Science
Sri Ramakrishna College of Arts and Science
Coimbatore
Basics of R

R is a powerful tool for all manner of calculations, data manipulation and scientific computations.

Like most languages R has its share of mathematical capability, variables, functions and data
types.

Basic Math
Being a statistical programming language, R can certainly be used to do basic math.

In the console there is a right angle bracket (>) where code should be entered.

Simply test R by running

>1+1
[1] 2

If this returns 2, then everything is great; if not, then something is very, very wrong.
Complicated expressions:

>1+2+3
[1] 6

>3*7*2
[1] 42

>4/2
[1] 2

>4/3
[1] 1.333333

These follow the basic order of operations: Parenthesis, Exponents, Multiplication, Division,
Addition and Subtraction (PEMDAS).

This means operations inside parentheses take priority over other operations. Next on the priority
list is exponentiation. After that, multiplication and division are performed, followed by addition
and subtraction.
This is why the first two lines in the following code have the same result, while the third is
different.

>4*6+5
[1] 29

> (4 * 6) + 5
[1] 29

> 4 * (6 + 5)
[1] 44

So far there is white space in between each operator, such as * and /.

This is not necessary but is encouraged as good coding practice.

Variables

Variables are an integral part of any programming language and R offers a great deal of flexibility.

Unlike statically typed languages such as C++, R does not require variable types to be declared.

A variable can take on any available data type.

It can also hold any R object such as a function, the result of an analysis or a plot.

A single variable can at one point hold a number, then later hold a character and then later a
number again.

Variable Assignment

There are a number of ways to assign a value to a variable, and again, this does not depend
on the type of value being assigned.
The valid assignment operators are <- and =, with the first being preferred.

For example, let’s save 2 to the variable x and 5 to the variable y.

> x <- 2
>x
[1] 2

>y=5
>y
[1] 5

The arrow operator can also point in the other direction.

> 3 -> z
>z
[1] 3
The assignment operation can be used successively to assign a value to multiple variables
simultaneously.

> a <- b <- 7

>a
[1] 7

>b
[1] 7

A more laborious, though sometimes necessary, way to assign variables is to use the assign
function.

> assign("j", 4)
>j
[1] 4
Variable names can contain any combination of alphanumeric characters along with periods (:)
and underscores ( _ ).

However, they cannot start with a number or an underscore.

The most common form of assignment in the R community is the left arrow .

It make sense, as the variable is sort of pointing to its value.

There is also a particularly nice benefit for people coming from languages like SQL, where a
single equal sign (=) tests for equality.

It is generally considered best practice to use actual names, usually nouns, for variables instead of
single letters.

This provides more information to the person reading the code.

Removing Variables
For various reasons a variable may need to be removed. This is easily done using remove or its
shortcut rm.

>j
[1] 4
> rm(j)

> # now it is gone

Error in eval(expr, envir, enclos): object 'j' not found

This frees up memory so that R can store more objects, although it does not necessarily free up
memory for the operating system.

To guarantee that, use gc, which performs garbage collection, releasing unused memory to the
operating system.

R automatically does garbage collection periodically, so this function is not essential.

Variable names are case sensitive

> theVariable <- 17

> theVariable
[1] 17

> THEVARIABLE

Error in eval(expr, envir, enclos): object 'THEVARIABLE' not found

Data Types

There are numerous data types in R that store various kinds of data.

The four main types of data most likely to be used are numeric, character (string), Date/POSIXct
(time-based) and logical (TRUE/FALSE).

The type of data contained in a variable is checked with the class function.

> class(x)
[1] "numeric"
Numeric Data

R excels at running numbers, so numeric data is the most common type in R.

The most commonly used numeric data is numeric.

This is similar to a float or double in other languages.

It handles integers and decimals, both positive and negative, and of course, zero.

A numeric value stored in a variable is automatically assumed to be numeric.

Testing whether a variable is numeric is done with the function is.numeric.

> is.numeric(x)
[1] TRUE
Another important, type is integer.

As the name implies this is for whole numbers only, no decimals.

To set an integer to a variable it is necessary to append the value with an L.

As with checking for a numeric, the is.integer function is used.

> i <- 5L
>i
[1] 5

> is.integer(i)
[1] TRUE

Do note that, even though i is an integer, it will also pass a numeric check.

> is.numeric(i)
[1] TRUE
R promotes integers to numeric when needed. This is obvious when multiplying an integer by a
numeric, but importantly it works when dividing an integer by another integer, resulting in a
decimal number.

> class(4L)
[1] "integer"
> class(2.8)
[1] "numeric"
> 4L * 2.8
[1] 11.2
> class(4L * 2.8)
[1] "numeric"
> class(5L)
[1] "integer"
> class(2L)
[1] "integer"
> 5L / 2L
[1] 2.5
> class(5L / 2L)
[1] "numeric"
Character Data

Even though it is not explicitly mathematical, the character (string) data type is very common in
statistical analysis and must be handled with care.

R has two primary ways of handling character data: character and factor.

While they may seem similar on the surface, they are treated quite differently.

> x <- "data"

>x
[1] "data "

> y <- factor("data")

>y
[1] data
Levels: data

x contains the word “data” encapsulated in quotes, while y has the word “data” without quotes and
a second line of information about the levels of y.
Characters are case sensitive, so “Data” is different from “data” or “DATA”.

To find the length of a character (or numeric) use the nchar function.

> nchar(x)
[1] 4
> nchar("hello")
[1] 5
> nchar(3)
[1] 1
> nchar(452)
[1] 3

This will not work for factor data.

> nchar(y)

Error in nchar(y): 'nchar()' requires a character vector

Dates

Dealing with dates and times can be difficult in any language, and to further complicate matters R has
numerous different types of dates.

The most useful are Date and POSIXct.

Date stores just a date while POSIXct stores a date and time. Both objects are actually represented as the
number of days (Date) or seconds (POSIXct) since January 1, 1970.

> date1 <- as.Date("2012-06-28")

> date1
[1] "2012-06-28“

> class(date1)
[1] "Date“

> as.numeric(date1)
[1] 15519
> date2 <- as.POSIXct("2012-06-28 17:42")
> date2

[1] "2012-06-28 17:42:00 EDT"

> class(date2)

[1] "POSIXct" "POSIXt"

> as.numeric(date2)
[1] 1340919720

Easier manipulation of date and time objects can be accomplished using the lubridate and chron
packages.

Using functions such as as.numeric or as.Date does not merely change the formatting of an
object but actually changes the underlying type.

> class(date1)
[1] "Date "

> class(as.numeric(date1))
[1] "numeric"
Logical

Logicals are a way of representing data that can be either TRUE or FALSE.

Numerically, TRUE is the same as 1 and FALSE is the same as 0. So TRUE 5 equals 5 while
FALSE 5 equals 0.

> TRUE * 5
[1] 5
> FALSE * 5
[1] 0

Similar to other types, logicals have their own test, using the is.logical function.

> k <- TRUE

> class(k)
[1] "logical“

> is.logical(k)
[1] TRUE
R provides T and F as shortcuts for TRUE and FALSE, respectively, but it is best practice not
to use them, as they are simply variables storing the values TRUE and FALSE and can be
overwritten, which can cause a great deal of frustration as seen in the following example.

> TRUE
[1] TRUE
>T
[1] TRUE

> class(T)
[1] "logical"
> T <- 7
>T
[1] 7

> class(T)
[1] "numeric"
Logicals can result from comparing two numbers, or characters.

> # does 2 equal 3?

> 2 == 3
[1] FALSE

> # does 2 not equal three?

> 2 != 3
[1] TRUE

> # is two less than three?

>2<3
[1] TRUE

> # is two less than or equal to three?

> 2 <= 3
[1] TRUE
> # is two greater than three?
>2>3
[1] FALSE

> # is two greater than or equal to three?

> 2 >= 3
[1] FALSE

> # is "data" equal to "stats"?

> "data" == "stats"
[1] FALSE

> # is "data" less than "stats"?

> "data" < "stats"
[1] TRUE
Vectors

A vector is a collection of elements, all of the same type.

For instance, c(1, 3, 2, 1, 5) is a vector consisting of the numbers 1; 3; 2; 1; 5, in that order.

Similarly, c("R", "Excel", "SAS", "Excel") is a vector of the character elements, “R”, “Excel”,
“SAS”, and “Excel”.

A vector cannot be of mixed type.

Vectors play a crucial, and helpful, role in R.

More than being simple containers, vectors in R are special in that R is a vectorized language.

That means operations are applied to each element of the vector automatically, without the need to
loop through the vector.

This is a powerful concept that may seem foreign to people coming from other languages, but it is
one of the greatest things about R.
Vectors do not have a dimension, meaning there is no such thing as a column vector or row vector.

These vectors are not like the mathematical vector, where there is a difference between row and
column orientation.

The most common way to create a vector is with c.

The “c” stands for combine because multiple elements are being combined into a vector.

> x <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 10)

>x
[1] 1 2 3 4 5 6 7 8 9 10

Lista - Shkolla e Policise
No ratings yet
Lista - Shkolla e Policise
206 pages
R - A Practical Course
No ratings yet
R - A Practical Course
42 pages
Basic-coding-syntax-and-structure-in-R---version-2
No ratings yet
Basic-coding-syntax-and-structure-in-R---version-2
19 pages
R Programming 2
No ratings yet
R Programming 2
11 pages
Eda
No ratings yet
Eda
188 pages
r Programming
No ratings yet
r Programming
56 pages
R Software - Notes
No ratings yet
R Software - Notes
18 pages
Data Science With R
No ratings yet
Data Science With R
46 pages
Basics of R Programming - Part 2
No ratings yet
Basics of R Programming - Part 2
7 pages
BR PDF File K
No ratings yet
BR PDF File K
100 pages
Unit-2-Start Learning R
No ratings yet
Unit-2-Start Learning R
10 pages
Introduction To Rlogistic
No ratings yet
Introduction To Rlogistic
135 pages
Statistics With R Unit 1
No ratings yet
Statistics With R Unit 1
25 pages
R-Basic Concepts
No ratings yet
R-Basic Concepts
67 pages
R Intro
No ratings yet
R Intro
227 pages
R Course Notes
No ratings yet
R Course Notes
10 pages
02 Basic Operators1
No ratings yet
02 Basic Operators1
22 pages
Maths Assinment
No ratings yet
Maths Assinment
84 pages
R Lab
No ratings yet
R Lab
114 pages
r_programme notes
No ratings yet
r_programme notes
57 pages
Introduction To R Programming
No ratings yet
Introduction To R Programming
14 pages
R20 - R Program - P
No ratings yet
R20 - R Program - P
29 pages
R Programming
No ratings yet
R Programming
48 pages
Unit 1.1
No ratings yet
Unit 1.1
85 pages
BRM PRACTICAL FILE H--
No ratings yet
BRM PRACTICAL FILE H--
37 pages
Introduction To R
No ratings yet
Introduction To R
34 pages
Data Analytics Using R
100% (1)
Data Analytics Using R
27 pages
13.1 Course notes - Section II, III, IV
No ratings yet
13.1 Course notes - Section II, III, IV
12 pages
R - Classes (AutoRecovered)
No ratings yet
R - Classes (AutoRecovered)
37 pages
Introduction To R: Alka Vaidya Nibm
No ratings yet
Introduction To R: Alka Vaidya Nibm
50 pages
DSRS BR
No ratings yet
DSRS BR
25 pages
A Report On R Name-Kaveena ROLL NO-12EE46
No ratings yet
A Report On R Name-Kaveena ROLL NO-12EE46
10 pages
R Tutorial
No ratings yet
R Tutorial
25 pages
Introduction to r Chap 2
No ratings yet
Introduction to r Chap 2
30 pages
SCTR Unit 1
No ratings yet
SCTR Unit 1
36 pages
Unit I - R Programming
No ratings yet
Unit I - R Programming
33 pages
R Programming
No ratings yet
R Programming
59 pages
Module 1: Unit - 1.1: Introduction To Analytics or R Programming
No ratings yet
Module 1: Unit - 1.1: Introduction To Analytics or R Programming
26 pages
8 - R Introduction
No ratings yet
8 - R Introduction
56 pages
Data in R
No ratings yet
Data in R
7 pages
Introduction To R Installation: Data Types Value Examples
No ratings yet
Introduction To R Installation: Data Types Value Examples
9 pages
R Project
0% (1)
R Project
25 pages
Lecture 1
No ratings yet
Lecture 1
42 pages
CH 4 Data Analytics With R and Weak Machine Learning
No ratings yet
CH 4 Data Analytics With R and Weak Machine Learning
82 pages
Module 1-1
No ratings yet
Module 1-1
38 pages
Starting With R - 1
No ratings yet
Starting With R - 1
1 page
UNIT-I
No ratings yet
UNIT-I
45 pages
Session 1-1
No ratings yet
Session 1-1
7 pages
Introduction To R
No ratings yet
Introduction To R
39 pages
R Nuts and Bolts
No ratings yet
R Nuts and Bolts
9 pages
Introduction to Analytics and R file
No ratings yet
Introduction to Analytics and R file
29 pages
Data Types
No ratings yet
Data Types
27 pages
R prgramming
No ratings yet
R prgramming
121 pages
R Programming
No ratings yet
R Programming
61 pages
R prog lab manual theory.docx
No ratings yet
R prog lab manual theory.docx
16 pages
R Programming: © 2016 SMART Training Resources Pvt. LTD
No ratings yet
R Programming: © 2016 SMART Training Resources Pvt. LTD
28 pages
RM practical(2)
No ratings yet
RM practical(2)
38 pages
Untitled
No ratings yet
Untitled
59 pages
Data Types in R Programming
No ratings yet
Data Types in R Programming
9 pages
Introduction To R
No ratings yet
Introduction To R
20 pages
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Asymmetric-waveguide-Assisted 3-DB Broadband Directional Coupler
No ratings yet
Asymmetric-waveguide-Assisted 3-DB Broadband Directional Coupler
3 pages
CD4512
No ratings yet
CD4512
8 pages
MRN 1391 - Ranps-Sibayak - 2021 - TDD
No ratings yet
MRN 1391 - Ranps-Sibayak - 2021 - TDD
7 pages
Model-for-the-Prediction-of-Default-Risk-of-Funding-Requests-Using-Data-Mining-Sameh-Ali-2
No ratings yet
Model-for-the-Prediction-of-Default-Risk-of-Funding-Requests-Using-Data-Mining-Sameh-Ali-2
8 pages
CRM Documentation: Date Prepared: Prepared By: Release No: Application Name
No ratings yet
CRM Documentation: Date Prepared: Prepared By: Release No: Application Name
12 pages
Acculab Scale VI-400 Brochure 2
No ratings yet
Acculab Scale VI-400 Brochure 2
1 page
Project Report - VHDL MUX
100% (1)
Project Report - VHDL MUX
4 pages
Metal Thermal Interface Material for the Next Generation FCBGA_[Kim 等]_2021
No ratings yet
Metal Thermal Interface Material for the Next Generation FCBGA_[Kim 等]_2021
6 pages
Product Brochure - 2024 AT-VIBE HONG KONG
No ratings yet
Product Brochure - 2024 AT-VIBE HONG KONG
10 pages
1Z0-1072-20 - DumpsTool - Mansoor
No ratings yet
1Z0-1072-20 - DumpsTool - Mansoor
3 pages
ResumeSallojuSrivani
No ratings yet
ResumeSallojuSrivani
2 pages
Mastering-Web-Development-in-2025
No ratings yet
Mastering-Web-Development-in-2025
10 pages
Request Benefit Payments TWC
No ratings yet
Request Benefit Payments TWC
30 pages
Module 4 and Activity 4 (Hyperbola)
No ratings yet
Module 4 and Activity 4 (Hyperbola)
18 pages
6.111 Introductory Digital Systems Laboratory: Due: Thu, 09/15/16
No ratings yet
6.111 Introductory Digital Systems Laboratory: Due: Thu, 09/15/16
2 pages
Spesifikasi Perangkat: 1. Personal Computer (Low Specification)
No ratings yet
Spesifikasi Perangkat: 1. Personal Computer (Low Specification)
4 pages
Infotech English for Computer Users 3rd Ed 3rd Edition Santiago Remacha Esteras 2024 Scribd Download
100% (6)
Infotech English for Computer Users 3rd Ed 3rd Edition Santiago Remacha Esteras 2024 Scribd Download
50 pages
Good Afl File
No ratings yet
Good Afl File
2 pages
CAM Lab Manual PDF
No ratings yet
CAM Lab Manual PDF
110 pages
Advanced Java 02 - Collections
No ratings yet
Advanced Java 02 - Collections
7 pages
Siemens Polydoros SX 50 80 Generator Service Manual
No ratings yet
Siemens Polydoros SX 50 80 Generator Service Manual
7 pages
Linear Control System Lab Practice # 07 Bode Plots 1. Frequency Domain Analysis
No ratings yet
Linear Control System Lab Practice # 07 Bode Plots 1. Frequency Domain Analysis
2 pages
SQUELCH Circuit
No ratings yet
SQUELCH Circuit
5 pages
7 Ways Fix - Stuck in Windows Automatic Repair Loop
No ratings yet
7 Ways Fix - Stuck in Windows Automatic Repair Loop
3 pages
Application of Ict in Every Day Life 1
No ratings yet
Application of Ict in Every Day Life 1
2 pages
Notice of Material Breach - Nice Ride Minnesota PDF
No ratings yet
Notice of Material Breach - Nice Ride Minnesota PDF
2 pages
The 5G Operator: Platforms, Partnerships, and IT Strategies For Monetizing 5G
No ratings yet
The 5G Operator: Platforms, Partnerships, and IT Strategies For Monetizing 5G
23 pages
Install Apache PHP5 MySQL5.6 Debian 9.6
No ratings yet
Install Apache PHP5 MySQL5.6 Debian 9.6
5 pages