0% found this document useful (0 votes)

57 views

Lesson 04 Text Files

This document provides an overview of common Linux text processing tools: - Section 4.1 describes text file viewing tools like head, tail, cat - Section 4.2 covers the grep tool for searching text files - Section 4.3 defines regular expressions used with grep and other tools - Section 4.4 presents awk for data extraction and reporting - Section 5.5 introduces sed for editing text files The document gives examples of using each tool to view, search, extract, and edit parts of text files.

Uploaded by

Taha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

57 views

Lesson 04 Text Files

Uploaded by

Taha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Lesson 04 Text Files

Created @December 8, 2021 6:38 AM

Class

Type

Materials

Reviewed

Last Update @December 8, 2021 10:23 PM

4.1 Text tools
4.2 grep (Generic Regular Expression Parser)
4.3 Regular Expressions
POSIX:
4.4 awk
4.5 sed (Stream Editor)

4.1 Text tools

more // read file contents
less // more advance features tham "more" // can browe forward (space bar) and back
ward (Page Up)
head // show the first 10 lines
tail // show the last 10 lines
-n nn // to specify exact number of lines
cat
-A : shows all non-printable characters (tab, end of line, ...)
-b : line numbers
-s : supress repeated embty lines
tac // same as cat, but in reverse order, funny command
cut // filter output
sort // sort output
tr // translate // works like find & replace

head -n 5 /etc/passwd
head -n 10 /etc/passwd | tail -n 1 // show line number 10
tail -n 3 /etc/passwd
tail -f /var/log/messages

$ head -n 5 /etc/passwd | tail -n 1 // show line number 5 from the file /e

tc/passwd

Lesson 04 Text Files 1

lp:x:4:7:lp:/var/spool/lpd:/sbin/nologin

cut -f 3 -d : /etc/passwd | less // cut field number 3, where delimiter is ":"

4.2 grep (Generic Regular Expression

Parser)
find text in a file or in an output

ps -aux | grep ssh

grep linda * 2> /dev/null // search for linda, in all files, in the current dire
ctory
// it will show file names & the line containing "lind
a"
grep '\<root\>' * 2> /dev/null // search for "root", in all files, in the current d
irectory
grep -l linda * 2> /dev/null // l : less, show list of files only
grep -i linda * // -i : ignor case
grep -A5 linda /etc/passwd // print the following 5 lines after finding linda //
useful in logs
grep -B5 linda /etc/passwd // print the previous 5 lines before finding linda //
useful in logs
grep -R root /etc // Recursively find the word root
grep -Rl root /etc 2> /dev/null | less // l : less

egrep '^[[:alpha:]]{3}$' * 2> /dev/null // egrep all lines that are exactly 3
alphabets
grep '^...$' * 2> /dev/null // grep all lines that are exactly 3 c
haracters
$ grep '^endif$' * 2> /dev/null // find exactlty "endif"
grep '\<endif\>' * 2> /dev/null // find exactlty "endif"

4.3 Regular Expressions

globbing : applies to file name

Regular Expression : applies to search patterns for a text inside

a file

Lesson 04 Text Files 2

grep 'a*' a* // first 'a*' is Regular expression, to search for the pattern 'a*'
inside the file
// second a* is globbing, to search for files with a*

Regular expressions are used with:

grep

vim

awk

sed

POSIX:
The Portable Operating System Interface is a family of standards specified by the
IEEE Computer Society for maintaining compatibility between operating systems.

The goal of POSIX is to ease the task of cross-platform software development by
establishing a set of guidelines for operating system vendors to follow. Ideally, a
developer should have to write a program only once to run on all POSIX-compliant
systems.

man 7 regex // Regular Expression

$ cat regtext
b
bt
bit
bite
boot
bloat
boat

Lesson 04 Text Files 3

Regular expression must be
between single quotes ' ',
'b.*t'

The period . matches any single character.

Anchoring
The caret ^ and the dollar sign $ are meta-characters that respectively match the
empty string at the beginning and end of a
line.
The Backslash Character and Special Expressions
The symbols \< and \> respectively match the empty string at the beginning and end
of a word.
The symbol \b matches the empty string at the edge of a word, and \B
matches the empty string provided it's not at the edge of a word.

The symbol \w is a synonym for [[:alnum:]] and \W is a synonym for [^[:alnum:]].

Repetition
A regular expression may be followed by one of several repetition operators:
? The preceding item is optional and matched at most once.
* The preceding item will be matched zero or more times.
+ The preceding item will be matched one or more times.
{n} The preceding item is matched exactly n times.
{n,} The preceding item is matched n or more times.
{,m} The preceding item is matched at most m times. This is a GNU extension.
{n,m} The preceding item is matched at least n times, but not more than m times.

* is a repetition operator for

zero or more

Lesson 04 Text Files 4

? is an Extended Regular
Expression. ? did not work * is a repetition operator for
with grep, it works with egrep. zero or more

* is a repetition operator for

zero or more. boat does not
match, because * means that
"o" (the preceding character)
is repeated zero or more
times.

4.4 awk
awk is specialized in data extraction and reporting (could be sent to a printer).

$ awk -F : '/linda/ { print $4 }' /etc/passwd // -F : the delimiter, $4 is the

field number 4
1001

awk -F : '{ print $NF }' /etc/passwd // $NF number of fields, print the last fie
ld in the line.
// useful when number of fields are not the sam
e in all lines.
/bin/bash
/sbin/nologin
/sbin/nologin
/sbin/nologin
/sbin/nologin
/bin/sync
/sbin/shutdown
/sbin/halt
/sbin/nologin

// print the last column of ps -aux

$ ps -aux | awk '{ print $NF }'

$ ls -l /etc | awk '/pass/ { print }' | less

-rw-r--r--. 1 root root 2598 Dec 6 16:04 passwd
-rw-r--r--. 1 root root 2557 Dec 4 23:41 passwd-
(END)

Lesson 04 Text Files 5

$ ls -l /etc | grep pass
-rw-r--r--. 1 root root 2598 Dec 6 16:04 passwd
-rw-r--r--. 1 root root 2557 Dec 4 23:41 passwd-

4.5 sed (Stream Editor)

$ cat sedfile
one
two
three
four
five

$ sed -n 4p sedfile // -n 4p print line number 4

four

$ sed -i s/four/FOUR/g sedfile // -i write directly to the file, // s substi

tute and replace
// without -i it will write to the stdout
$ cat sedfile
one
two
three
FOUR
five

$
$ sed -n 4p sedfile
FOUR

$ sed -i -e '2d' sedfile // -i modify the file, 2d delete line number 2

$ cat sedfile
one
three
FOUR
five

Lesson 04 Text Files 6

Top Unix Interview Questions - Part 1
No ratings yet
Top Unix Interview Questions - Part 1
37 pages
MATHEMATICS Parallel Scientific Computation
No ratings yet
MATHEMATICS Parallel Scientific Computation
324 pages
Web Developer Resume
0% (1)
Web Developer Resume
3 pages
PEP-8 Tutorial - Code Standards in Python PDF
No ratings yet
PEP-8 Tutorial - Code Standards in Python PDF
20 pages
DevOps AccessingFilesAndRegex
No ratings yet
DevOps AccessingFilesAndRegex
7 pages
Using Grep, TR and Sed With Regular Expressions
No ratings yet
Using Grep, TR and Sed With Regular Expressions
7 pages
Lab Sheet 6
No ratings yet
Lab Sheet 6
6 pages
DAC - COS - Last Day Slides
No ratings yet
DAC - COS - Last Day Slides
73 pages
Module 9 - grep, sed & awk - LFCP
No ratings yet
Module 9 - grep, sed & awk - LFCP
5 pages
UNIT-3 USP
No ratings yet
UNIT-3 USP
82 pages
CH 8 Exercises
No ratings yet
CH 8 Exercises
8 pages
L5 - Reg Exp
No ratings yet
L5 - Reg Exp
38 pages
Introduction To Unix1.2
No ratings yet
Introduction To Unix1.2
216 pages
Linux
No ratings yet
Linux
7 pages
Grep' Command Examples
No ratings yet
Grep' Command Examples
11 pages
Unit 3 Linux Regular Expression
No ratings yet
Unit 3 Linux Regular Expression
15 pages
Grep - Tutorial
No ratings yet
Grep - Tutorial
9 pages
20.10 Filters-Text Processing Commands
No ratings yet
20.10 Filters-Text Processing Commands
14 pages
Unix Shell Scripting Chapter - 1: List Files That Begin With A Lowercase Letter and Don't End With A Digit
No ratings yet
Unix Shell Scripting Chapter - 1: List Files That Begin With A Lowercase Letter and Don't End With A Digit
10 pages
Bash Ch01
No ratings yet
Bash Ch01
14 pages
Linux Command Line Cheat Sheet: Awk Checksums Cut File Grep Head Sed Sor T WC XXD
No ratings yet
Linux Command Line Cheat Sheet: Awk Checksums Cut File Grep Head Sed Sor T WC XXD
7 pages
Software Carpentry
No ratings yet
Software Carpentry
83 pages
Unix ETL Interview Questions
No ratings yet
Unix ETL Interview Questions
5 pages
Redhat Linux Essential
No ratings yet
Redhat Linux Essential
16 pages
Final Study Notes
No ratings yet
Final Study Notes
36 pages
Pipingfile
No ratings yet
Pipingfile
11 pages
Linux Commands
No ratings yet
Linux Commands
6 pages
Commands
No ratings yet
Commands
20 pages
Unit - IV
No ratings yet
Unit - IV
30 pages
Regular Expressions in Grep Command With 10 Examples - Part I
No ratings yet
Regular Expressions in Grep Command With 10 Examples - Part I
5 pages
Unix Commands
No ratings yet
Unix Commands
5 pages
Grep, Awk and Sed - Three VERY Useful Command-Line Utilities
No ratings yet
Grep, Awk and Sed - Three VERY Useful Command-Line Utilities
9 pages
L3 - Grep ND Egrep
No ratings yet
L3 - Grep ND Egrep
26 pages
Linux Admin 1 Commands
No ratings yet
Linux Admin 1 Commands
36 pages
Module 5
No ratings yet
Module 5
14 pages
Search and Sort Tools
No ratings yet
Search and Sort Tools
8 pages
Linux Practical2
No ratings yet
Linux Practical2
12 pages
Lab01-LinuxUnix Utilities-Bash Programming
No ratings yet
Lab01-LinuxUnix Utilities-Bash Programming
4 pages
Chapter 8: Regular Expressions
No ratings yet
Chapter 8: Regular Expressions
24 pages
Amjathfinal
No ratings yet
Amjathfinal
113 pages
Grep Awk Sed
No ratings yet
Grep Awk Sed
9 pages
Unix - Commands
No ratings yet
Unix - Commands
24 pages
Linux CLInotes
No ratings yet
Linux CLInotes
15 pages
UNIX Shells by Example (PDFDrive)
No ratings yet
UNIX Shells by Example (PDFDrive)
1,194 pages
PR 6
No ratings yet
PR 6
4 pages
Lab8
No ratings yet
Lab8
6 pages
Sed Grep Cmds 2
No ratings yet
Sed Grep Cmds 2
5 pages
Unix Utilities: Grep, Sed, and Awk
100% (1)
Unix Utilities: Grep, Sed, and Awk
81 pages
Linux Com Ds
No ratings yet
Linux Com Ds
19 pages
Linux Commands Handbook
No ratings yet
Linux Commands Handbook
22 pages
Chapter 4 - Regular Expression
No ratings yet
Chapter 4 - Regular Expression
6 pages
Sed - Awk
No ratings yet
Sed - Awk
7 pages
Lec 05
No ratings yet
Lec 05
39 pages
Unix Important Command
No ratings yet
Unix Important Command
3 pages
SW LAB 10 Filter
No ratings yet
SW LAB 10 Filter
45 pages
Lab03.Processing Text Streams
No ratings yet
Lab03.Processing Text Streams
12 pages
UNIX Filters
No ratings yet
UNIX Filters
18 pages
Linux Regular Expression
No ratings yet
Linux Regular Expression
3 pages
Perl One-Liners: 130 Programs That Get Things Done
From Everand
Perl One-Liners: 130 Programs That Get Things Done
Peteris Krumins
4/5 (3)
Bash Command Line Pro Tips
From Everand
Bash Command Line Pro Tips
Jason Cannon
4.5/5 (8)
Mastering Shell Commands On Linux
From Everand
Mastering Shell Commands On Linux
Urko Galen
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
From Everand
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Lesson 03 Essential File Management Tools
No ratings yet
Lesson 03 Essential File Management Tools
6 pages
FAA ICAO Flight Planning Interface Ref Guide
No ratings yet
FAA ICAO Flight Planning Interface Ref Guide
73 pages
3-Theoretical Investigation On Refractive Index Sensor Based On Bragg Grating in Micro Nano Fiber
No ratings yet
3-Theoretical Investigation On Refractive Index Sensor Based On Bragg Grating in Micro Nano Fiber
3 pages
Gavrila
No ratings yet
Gavrila
4 pages
3a Kasami Sequences
No ratings yet
3a Kasami Sequences
4 pages
Low-Power and High-Performance 1-Bit CMOS Full-Adder Cell: Keivan Navi and Omid Kavehei
No ratings yet
Low-Power and High-Performance 1-Bit CMOS Full-Adder Cell: Keivan Navi and Omid Kavehei
7 pages
Annex L Explanations
No ratings yet
Annex L Explanations
8 pages
Deep Learning - Handwritten Digit Recognition Using Python
No ratings yet
Deep Learning - Handwritten Digit Recognition Using Python
46 pages
Python Computing Problem Set by Abhijit Kar Gupta
100% (1)
Python Computing Problem Set by Abhijit Kar Gupta
3 pages
Java Programming Lab Manual R18 JNTUH
No ratings yet
Java Programming Lab Manual R18 JNTUH
47 pages
Unit IV
No ratings yet
Unit IV
30 pages
19cs405 Operating System
No ratings yet
19cs405 Operating System
3 pages
Assemply Language Program For DC Motor Interfacing
No ratings yet
Assemply Language Program For DC Motor Interfacing
2 pages
Iimjobs Sandeep Kumar
No ratings yet
Iimjobs Sandeep Kumar
1 page
B ATPG
No ratings yet
B ATPG
98 pages
Java Chap5 User Defined Data Types (Prof. Ananda M Ghosh.)
100% (1)
Java Chap5 User Defined Data Types (Prof. Ananda M Ghosh.)
13 pages
Tibco Ems - LB&FT
50% (2)
Tibco Ems - LB&FT
18 pages
Q1. Write A Java Program To Design A Following GUI (Use Swing) (Marks 30)
No ratings yet
Q1. Write A Java Program To Design A Following GUI (Use Swing) (Marks 30)
6 pages
Question: MIPS A) Consider The C Statement: A (B + D) + (B - C) + (C + D)
No ratings yet
Question: MIPS A) Consider The C Statement: A (B + D) + (B - C) + (C + D)
2 pages
CCS4 Tutorial
No ratings yet
CCS4 Tutorial
22 pages
50 TOP SAP ABAP Multiple Choice Questions and Answers PDF - SAP ABAP Interview Questions and Answers
0% (1)
50 TOP SAP ABAP Multiple Choice Questions and Answers PDF - SAP ABAP Interview Questions and Answers
10 pages
Computer Vision, 3.3 Counting Objects - Linda Shapiro
No ratings yet
Computer Vision, 3.3 Counting Objects - Linda Shapiro
9 pages
Assembler 166
No ratings yet
Assembler 166
352 pages
Software-Engineering (Set 9)
No ratings yet
Software-Engineering (Set 9)
21 pages
30 Decidable and Undecidable-Revised
No ratings yet
30 Decidable and Undecidable-Revised
8 pages
Introduction To Compication in Computing
No ratings yet
Introduction To Compication in Computing
67 pages
ASSIGNMENT 5 Compressed
No ratings yet
ASSIGNMENT 5 Compressed
14 pages
C++ - Is It Possible To Redirect Child Process's Stdout To Another File in Parent Process - Stack Overflow
No ratings yet
C++ - Is It Possible To Redirect Child Process's Stdout To Another File in Parent Process - Stack Overflow
3 pages
Scheme of Study BS (IT) 2019-23
No ratings yet
Scheme of Study BS (IT) 2019-23
21 pages
Tutorial Chapter 2
No ratings yet
Tutorial Chapter 2
2 pages
Cambridge IGCSE: Computer Science 0478/21
No ratings yet
Cambridge IGCSE: Computer Science 0478/21
16 pages
Survey User Module With JSF 2.0
No ratings yet
Survey User Module With JSF 2.0
5 pages
CS242 - Term Project PDF
No ratings yet
CS242 - Term Project PDF
5 pages

Lesson 04 Text Files

Uploaded by

Lesson 04 Text Files

Uploaded by

Lesson 04 Text Files

Created @December 8, 2021 6:38 AM

Last Update @December 8, 2021 10:23 PM

4.1 Text tools

$ head -n 5 /etc/passwd | tail -n 1 // show line number 5 from the file /e

Lesson 04 Text Files 1

cut -f 3 -d : /etc/passwd | less // cut field number 3, where delimiter is ":"

4.2 grep (Generic Regular Expression

ps -aux | grep ssh

4.3 Regular Expressions

Regular Expression : applies to search patterns for a text inside

Lesson 04 Text Files 2

Regular expressions are used with:

man 7 regex // Regular Expression

Lesson 04 Text Files 3

The period . matches any single character.

The symbol \w is a synonym for [[:alnum:]] and \W is a synonym for [^[:alnum:]].

* is a repetition operator for

Lesson 04 Text Files 4

* is a repetition operator for

$ awk -F : '/linda/ { print $4 }' /etc/passwd // -F : the delimiter, $4 is the

// print the last column of ps -aux

$ ls -l /etc | awk '/pass/ { print }' | less

Lesson 04 Text Files 5

4.5 sed (Stream Editor)

$ sed -n 4p sedfile // -n 4p print line number 4

$ sed -i s/four/FOUR/g sedfile // -i write directly to the file, // s substi

$ sed -i -e '2d' sedfile // -i modify the file, 2d delete line number 2

Lesson 04 Text Files 6

You might also like