R Programming Language Notes

This document provides an overview of the R programming language and its use for statistical analysis and data science. It compares R to Python and discusses some key features of R, including its use of data frames as the primary data type and its focus on statistical analysis, graphics, and data analysis. The document then provides examples of loading and manipulating data frames in R, including selecting rows and columns, sorting, aggregation, and joining data. It also demonstrates some plotting and string manipulation functions.

Uploaded by

Foster Karmon

R programming language

R is a programming language and software environment for statistical computing, graphics, and, more recently, data analysis.

● Dynamically typed and interpreted, like Python

● The primary data type is the “data frame,” which is similar to a relational table or a pandas DataFrame.

R
● Easier for experienced programmers
● Tends to be favored by academics, researchers, hard-core data scientists
● Shorter code for complex analysis, statistics, graphics
● Extremely slow!

Python
● Good for beginners and experienced programmers
● Used by software engineers of all types
● Better integrated for general-purpose coding
● Not especially fast

Open CSV files and load data into data frames

In [ ]:
C1 = open('Cities.csv').read()
C2 = open('Countries.csv').read()
In [ ]:
%%R -i C1 -i C2
cities <- read.csv(text=C1)
countries <- read.csv(text=C2)
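
The text= argument tells read.csv to parse a string rather than open a file, which is why the file contents can be read in Python first and then handed to R. A minimal sketch in plain R, assuming a few made-up rows (the real Cities.csv is not reproduced here, and the name cities_demo is hypothetical):

```r
# Hypothetical CSV content; values are invented for illustration only.
csv_text <- "city,country,latitude,longitude,temperature
Aalborg,Denmark,57.03,9.92,7.52
Aberdeen,United Kingdom,57.17,-2.08,8.10
Abisko,Sweden,63.35,18.83,0.20"

# text= makes read.csv parse the string itself instead of opening a file
cities_demo <- read.csv(text = csv_text)

print(nrow(cities_demo))   # number of rows
print(ncol(cities_demo))   # number of columns
print(cities_demo[1, 2])   # row 1, column 2
```

Outside a notebook, read.csv('Cities.csv') would read the file directly without the Python step.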

Data frame introduction

Inside the square brackets, the first index (before the comma) refers to rows and the second index (after the comma) refers to columns.
In [ ]:
%%R
cities[1,2]

#Prints the number of rows and the number of columns

In [ ]:
%%R
print(nrow(cities))
print(ncol(cities))

#Prints first row, and all columns

In [ ]:
%%R
cities[1,]

#Prints rows 1 through 10

In [ ]:
%%R
cities[1:10,]

#Loops through the first 10 rows, printing the ith row of cities on each iteration.
The braces {} play the role that indentation plays in a Python for loop.
In [ ]:
%%R
for (i in 1:10) { print(cities[i,]) }

#After comma is columns, so this prints the 2nd column.

In [ ]:
%%R
cities[,2]
# change to cities[,4]
#Prints 5th row, 4th column
In [ ]:
%%R
cities[5,4]
# change to cities[5:10,2:4]

#Will print first 10 rows

In [ ]:
%%R
head(cities,10)
# also show number of rows, tail()

#Will print last 10 rows

In [ ]:
%%R
tail(cities,10)
# also show number of rows, tail()

Basic data operations

#Select single column

In [ ]:
%%R
cities[,'city']
# change to cities['city']
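
The two forms above select the same column but return different kinds of object. A small sketch, assuming a made-up data frame named df (hypothetical, not the course data):

```r
# Hypothetical toy data frame, for illustration only
df <- data.frame(
  city = c("A", "B", "C"),
  temperature = c(7.5, 8.1, 0.2),
  stringsAsFactors = FALSE
)

as_vector <- df[, "city"]   # with the comma: a plain vector of values
as_frame  <- df["city"]     # without the comma: a one-column data frame

print(class(as_vector))
print(class(as_frame))

# drop = FALSE keeps the data frame shape even with the comma form
also_frame <- df[, "city", drop = FALSE]
print(class(also_frame))
```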

#Select multiple columns (c() combines the column names into a character vector)

In [ ]:
%%R
cities[,c('city','temperature')]

#Select all rows in the dataframe where the longitude is less than 0
In [ ]:
%%R
cities[cities$longitude < 0,]
#Select rows and columns
In [ ]:
%%R
cities[cities$latitude > 50 & cities$temperature > 9,
c('city','latitude','temperature')]

#Sort the rows by temperature (note that order() goes BEFORE the comma because it is the rows we sort)
Passing decreasing = TRUE to order() reverses the sort order
In [ ]:
%%R
cities[order(cities$temperature, decreasing = TRUE),]
# descending country with ascending temperature?
# can use - on string columns after converting them to numbers, e.g. with xtfrm(); as.numeric() only works if the column is a factor

#Sort by increasing country, then by increasing temperature within each country (like grouping)
# ascending count
%%R
cities[order(cities$country,cities$temperature),]
NOTE: if we add a minus sign - before cities$temperature, the temperature is sorted in decreasing order while the country
remains increasing

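# If we want to do the same thing but sort by decreasing country and increasing temperature, we
have to turn the country names into numbers first, because the minus sign expects numbers.
as.numeric() does this only when country is a factor (the read.csv default before R 4.0); on plain
strings it returns NA, so xtfrm() is used here because it works in both cases.
# ascending count
%%R
cities[order(-xtfrm(cities$country),cities$temperature),]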
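
A minimal sketch of sorting a text column in descending order together with a numeric column in ascending order, assuming a made-up data frame df (hypothetical). It shows two base R options: negating xtfrm(), and passing a vector to decreasing, which order() accepts only with method = "radix".

```r
# Hypothetical toy data, for illustration only
df <- data.frame(
  country = c("Spain", "Austria", "Spain", "Austria"),
  temperature = c(17.9, 6.1, 14.2, 9.3),
  stringsAsFactors = FALSE
)

# Option 1: xtfrm() turns the strings into sortable numbers, so - can reverse them
sorted <- df[order(-xtfrm(df$country), df$temperature), ]
print(sorted)

# Option 2: one decreasing flag per sort key (requires method = "radix")
sorted2 <- df[order(df$country, df$temperature,
                    decreasing = c(TRUE, FALSE), method = "radix"), ]
print(sorted2)
```

Both options list the Spain rows first (coolest to warmest), followed by the Austria rows.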

# Selection and sorting both go before the comma, so to combine them we work in two steps.
First we store the selected rows and columns in a “temporary” data frame cities2.
Then we order cities2 by decreasing temperature.
In [ ]:
%%R
cities2 <- cities[cities$longitude < 0 & cities$temperature > 12,
c('city','temperature')]
cities2[order(-cities2$temperature),]
Your Turn
Find all countries that are not in the EU and don't have coastline, together with their populations,
sorted by country name in reverse alphabetical order. Note: equality uses '==' and strings can be
single (') or double (") quoted.
In [6]:
%%R
countries2 <- countries[countries$EU =='no' & countries$coastline == 'no',
c('country','population')]
countries2[order(countries2$country, decreasing=TRUE),]

Aggregation

#Overall average temperature

In [ ]:
%%R
mean(cities$temperature)

#Average temperature of cities in each country

In [ ]:
%%R
aggregate(cities$temperature, by=list(cities$country), FUN=mean)
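
With the by=list(...) form, the result columns are labeled Group.1 and x. The formula interface of aggregate() is an alternative that keeps the original column names. A sketch on made-up data (df and by_country are hypothetical names):

```r
# Hypothetical toy data, for illustration only
df <- data.frame(
  country = c("Spain", "Austria", "Spain", "Austria"),
  temperature = c(18, 6, 14, 10),
  stringsAsFactors = FALSE
)

# Read as: temperature, grouped by country
by_country <- aggregate(temperature ~ country, data = df, FUN = mean)
print(by_country)
```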

#Overall min and Max

In [ ]:
%%R
print(min(cities$temperature))
print(max(cities$temperature))

#Grouped aggregation by EU and Coastline

In [ ]:
%%R
aggregate(countries$population, by=list(countries$EU,countries$coastline),
FUN=mean)

EU Coastline Average
1 no no 4.35375
2 yes no 6.99000
3 no yes 19.59571
4 yes yes 21.37818
#Number of cities west of the Prime Meridian (i.e., longitude < 0) - error then fix
In [ ]:
%%R
cities2 <- cities[cities$longitude < 0,]
nrow(cities2)
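
The same count can be obtained without a temporary data frame, because a comparison yields TRUE/FALSE values and sum() counts the TRUEs. A small sketch on made-up data (cities_demo is a hypothetical name; the longitudes are approximate):

```r
# Hypothetical toy data, for illustration only
cities_demo <- data.frame(
  city = c("Lisbon", "Vienna", "Dublin", "Athens"),
  longitude = c(-9.14, 16.37, -6.26, 23.73),
  stringsAsFactors = FALSE
)

# TRUE counts as 1 and FALSE as 0, so the sum is the number of matching rows
n_west <- sum(cities_demo$longitude < 0)
print(n_west)
```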

Your Turn
Considering only cities with latitude < 40, find the average temperature for each country. Then
considering only cities with latitude > 60, find the average temperature for each country. Remember
print() is needed to see a result unless it's the last line.
In [12]:
%%R
south <- cities[cities$latitude < 40,]
north <- cities[cities$latitude > 60,]
print(aggregate(south$temperature, by=list(south$country), FUN=mean))
print(aggregate(north$temperature, by=list(north$country), FUN=mean))

Joining

#Cities not in the EU with latitude > 50; return city, country, latitude, and whether country has
coastline
In [ ]:
%%R
citiesext <- merge(cities,countries)
citiesext[citiesext$EU == 'no' & citiesext$latitude > 50,
c('city','country','latitude','coastline')]
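
When no by argument is given, merge() joins on every column name the two data frames share, which here is assumed to be country. Naming the key explicitly makes the join easier to read. A sketch on made-up data (cities_demo, countries_demo, and the approximate latitudes are illustrative):

```r
# Hypothetical toy data, for illustration only
cities_demo <- data.frame(
  city = c("Graz", "Oslo", "Bern"),
  country = c("Austria", "Norway", "Switzerland"),
  latitude = c(47.07, 59.91, 46.95),
  stringsAsFactors = FALSE
)
countries_demo <- data.frame(
  country = c("Austria", "Norway", "Switzerland"),
  EU = c("yes", "no", "no"),
  coastline = c("no", "yes", "no"),
  stringsAsFactors = FALSE
)

# Join on the shared key, then select rows and columns as before
joined <- merge(cities_demo, countries_demo, by = "country")
result <- joined[joined$EU == "no" & joined$latitude > 50,
                 c("city", "country", "latitude", "coastline")]
print(result)
```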
Miscellaneous features

#String operations - countries with 'ia' in their name

In [ ]:
%%R
countries[grepl('ia',countries$country),]

#Add fahrenheit column

In [ ]:
%%R
cities['fahrenheit'] <- (cities$temperature * 9/5) + 32
head(cities, 10)

#Print using cat( )

In [ ]:
%%R
cat('Minimum temperature:', min(cities$temperature), '\n')
cat('Maximum temperature:', max(cities$temperature), '\n')

Plotting
Scatterplots

#Temperature versus latitude

In [ ]:
%%R
plot(cities$latitude, cities$temperature)
# add xlab='latitude', ylab='temperature', col='blue', pch=16

#Latitude versus longitude colored by temperature

In [ ]:
%%R
for (i in 1:nrow(cities))
{ if (cities[i,'temperature'] < 7) cities[i,'category'] <- 'blue'
else if (cities[i,'temperature'] < 11) cities[i,'category'] <- 'yellow'
else cities[i,'category'] <- 'red'
}
plot(cities$longitude, cities$latitude, xlab='longitude', ylab='latitude',
col=cities$category, pch=16)
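
The row-by-row loop can also be written as a single vectorized expression, which is the more common R style. A sketch using nested ifelse() with the same thresholds, on made-up temperatures (df is a hypothetical name):

```r
# Hypothetical toy data, for illustration only
df <- data.frame(temperature = c(3.2, 7.0, 10.9, 11.0, 15.4))

# ifelse() tests the whole column at once: < 7 blue, < 11 yellow, else red
df$category <- ifelse(df$temperature < 7, "blue",
                      ifelse(df$temperature < 11, "yellow", "red"))
print(df)
```

The resulting category column can be passed to col= in plot() exactly as in the loop version.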
Bar charts

#Bar chart showing populations of countries with 'ia' in their name

In [ ]:
%%R
bars <- countries[grepl('ia',countries$country), 'country']
heights <- countries[grepl('ia',countries$country), 'population']
barplot(heights, names.arg=bars, xlab='country', ylab='population',
col='blue')
# add las=2 for vertical labels

Pie charts

#Pie chart showing number of EU countries versus non-EU countries

In [ ]:
%%R
slices <- c(nrow(countries[countries$EU == 'yes',]),
nrow(countries[countries$EU == 'no',]))
labels <- c('EU', 'not EU')
pie(slices, labels)
# add col=c('blue','red')
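
The two counts can also be produced in one step with table(), which tallies how often each value occurs. A sketch on made-up data (countries_demo is a hypothetical name):

```r
# Hypothetical toy data, for illustration only
countries_demo <- data.frame(
  EU = c("yes", "no", "yes", "yes", "no"),
  stringsAsFactors = FALSE
)

# One count per distinct value, labeled by that value
counts <- table(countries_demo$EU)
print(counts)

# Not run here: pie(counts) would draw one slice per value
```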
