0% found this document useful (0 votes)

376 views

Python Code Examples

The document provides code examples for several Python tasks including: 1) Word spotting in a text file and printing matching words. 2) Creating a dictionary of first names from a text file. 3) Computing accuracy results by comparing words in two text files and calculating accuracy metrics. 4) Creating word histograms by counting word frequencies in text. 5) Extracting people names and company names from text using regular expressions.

Uploaded by

Asaf Katz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

376 views

Python Code Examples

Uploaded by

Asaf Katz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 30

Python Code Examples

Word Spotting
import sys fname1 = "c:\Python Course\ex1.txt" for line in open(fname1,'r').readlines(): for word in line.split(): if word.endswith('ing'): print word

Creating a Dictionary of First Names

def createNameDict(): dictNameFile=open('project/dictionaries/names.txt','r') dictContent=dictNameFile.read() #read all the file dictWords=dictContent.split(",") #return a list with the words nameDict={} # initialize a dictionary for word in dictWords: nameDict[word.strip()]=" " #enters each word to the dctionary. return nameDict

Computing Accuracy Results I

# anfiles.py # Program to analyze the results of speaker identification. # Illustrates Python dictionarys import string, glob, sys def main(): # read correct file and test file fname1 = sys.argv[1] fname2 = sys.argv[2]

text1 = open(fname1,'r').read() text1 = string.lower(text1) words1 = string.split(text1)

correct_len = len(words1)
text2 = open(fname2,'r').read() text2 = string.lower(text2) words2 = string.split(text2)

Computing Accuracy Results II

# construct a dictionary of correct results correct = {} for w in words1: correct[w] = 1

for i in range(correct_len): in_count = 0 portion2 = words2[:i+1] for w in portion2: if correct.get(w,0) > 0: in_count+=1 accuracy = float(in_count)/float(len(portion2)) print "%5d, %5d,%.2f" % (len(portion2), in_count, accuracy)
if __name__ == '__main__': main()

Word Histograms
import sre, string pattern = sre.compile( r'[a-zA-Z]+' ) def countwords(text): dict = {} try: iterator = pattern.finditer(text) for match in iterator: word = match.group() try: dict[word] = dict[word] + 1 except KeyError: dict[word] = 1 except sre.error: pass # triggers when first index goes to -1, terminates loop.

Word Histograms
items = [] for word in dict.keys(): items.append( (dict[word], word) ) items.sort() items.reverse() return items # if run as a script, count words in stdin. if __name__ == "__main__": import sys x = countwords( sys.stdin.read() ) s = map(str, x) t = string.joinfields(s, "\n") print t

Extracting People Names and Company Names

import string, sre, glob, sys def createNameDict(): dictNameFile=open('names.txt','r') dictContent=dictNameFile.read() #read all the file dictWords=dictContent.split(",") #return a list with the words nameDict={} # initialize a dictionary for word in dictWords: nameDict[word.strip()]=" " #enters each word to the dctionary. return nameDict

def main(): # read file fname1 = sys.argv[1] text1 = open(fname1,'r').read() namesDic = createNameDict() CompanySuffix = sre.compile(r'corp | ltd | inc | corporation | gmbh | ag | sa ', sre.IGNORECASE) pattern = sre.compile( r'([A-Z]\w+[ .,-]+)+'

Extracting People Names and Company Names

r'(corp|CORP|Corp|ltd|Ltd|LTD|inc|Inc|INC|corporation|Corporation|CORPORATION|gmbh| GMBH|ag|AG|sa|SA)' r'(\.?)') pattern1 = sre.compile( r'([A-Z]\w+[\s.-]*){2,4}' ) #Companies capitalWords=sre.finditer(pattern,text1) for match in capitalWords: CapSeq = match.group() print CapSeq #People capitalWords1=sre.finditer(pattern1,text1) for match in capitalWords1: wordList=match.group().split() #check name in names dictionary if namesDic.has_key(wordList[0].strip()): print match.group() if __name__ == '__main__': main()

NLTK
NLTK defines a basic infrastructure that can be used

to build NLP programs in Python. It provides:

Basic classes for representing data relevant to natural language processing. Standard interfaces for performing tasks, such as tokenization, tagging, and parsing. Standard implementations for each task, which can be combined to solve complex problems. Extensive documentation, including tutorials and reference documentation.

RE Show
>>> from nltk.util import re_show >>> string = """ ... Its probably worth paying a premium for funds that invest in markets ... that are partially closed to foreign investors, such as South Korea, ... ... """ >>> re_show(t..., string) I{ts }probably wor{th p}aying a premium for funds {that} inves{t in} markets {that} are par{tial}ly closed {to f}oreign inves{tors}, such as Sou{th K}orea, ... >>>

Classes in Python

Defining Classes
>>> class SimpleClass: ... def __init__(self, initial_value): ... self.data = initial_value ... def set(self, value): ... self.data = value ... def get(self): ... print self.data ... >>> x = SimpleClass(4)

Inheritance
B is a subclass of A >>> class B(A): ... def __init__(self):

SimpleTokenizer implements the interface of TokenizerI >>> class SimpleTokenizer(TokenizerI): ... def tokenize(self, str): ... words = str.split() ... return [Token(words[i], Location(i)) ... for i in range(len(words))]

Inheritance Example
class point: def __init__(self, x=0, y=0): self.x, self.y = x, y

class cartesian(point): def distanceToOrigin(self): return floor(sqrt(self.x**2 + self.y**2)) class manhattan(point): def distanceToOrigin(self): return self.x + self.y

Sets

Sets in Python
The sets module provides classes for constructing

and manipulating unordered collections of unique elements. Common uses include:

membership testing, removing duplicates from a sequence, and computing standard math operations on sets such as intersection, union, difference, and symmetric difference.

Like other collections, sets support x in set, len(set),

and for x in set. Being an unordered collection, sets do not record element position or order of insertion. Accordingly, sets do not support indexing, slicing, or other sequence-like behavior.

Some Details about Implementation

Most set applications use the Set class which

provides every set method except for __hash__(). For advanced applications requiring a hash method, the ImmutableSet class adds a __hash__() method but omits methods which alter the contents of the set. The set classes are implemented using dictionaries. As a result, sets cannot contain mutable elements such as lists or dictionaries. However, they can contain immutable collections such as tuples or instances of ImmutableSet. For convenience in implementing sets of sets, inner sets are automatically converted to immutable form, for example, Set([Set(['dog'])]) is transformed to Set([ImmutableSet(['dog'])]).

Set Operations
Operation
len(s)

Equivalent

Result

cardinality of set s
test x for membership in s test x for non-membership in s
s <= t s >= t

x in s x not in s s.issubset(t) s.issuperset(t) s.union(t) s.intersection(t) s.difference(t) s.symmetric_differenc e(t) s.copy()

test whether every element in s is in t

test whether every element in t is in s new set with elements from both s and t new set with elements common to s and t new set with elements in s but not in t new set with elements in either s or t but not both new set with a shallow copy of s

s|t s&t s-t s^t

Operations not for ImmutableSet

Operation
s.union_update( t) s.intersection_u pdate(t) s.difference_up date(t) s.symmetric_dif
ference_up date(t)

Equivalent

Result

s |= t
s &= t s -= t s ^= t

return set s with elements added from t

return set s keeping only elements also found in t return set s after removing elements found in t return set s with elements from s or t but not both add element x to set s

s.add(x)

s.remove(x)
s.discard(x) s.pop()

remove x from set s; raises KeyError if not present

removes x from set s if present remove and return an arbitrary element from s; raises KeyError if empty

Set Examples
>>> from sets import Set >>> engineers = Set(['John', 'Jane', 'Jack', 'Janice']) >>> programmers = Set(['Jack', 'Sam', 'Susan', 'Janice']) >>> managers = Set(['Jane', 'Jack', 'Susan', 'Zack']) >>> employees = engineers | programmers | managers # union >>> engineering_management = engineers & managers # intersection >>> fulltime_management = managers - engineers - programmers # difference >>> engineers.add('Marvin') # add element >>> print engineers Set(['Jane', 'Marvin', 'Janice', 'John', 'Jack']) >>> employees.issuperset(engineers) # superset test False

Set Examples
>>> employees.union_update(engineers) # update from another set >>> employees.issuperset(engineers) True >>> for group in [engineers, programmers, managers, employees]: ... group.discard('Susan') # unconditionally remove element ... print group ... Set(['Jane', 'Marvin', 'Janice', 'John', 'Jack']) Set(['Janice', 'Jack', 'Sam']) Set(['Jane', 'Zack', 'Jack']) Set(['Jack', 'Sam', 'Jane', 'Marvin', 'Janice', 'John', 'Zack'])

Google API
Get it from

http://sourceforge.net/projects/pygoogle/ A Python wrapper for the Google web API. Allows you to do Google searches, retrieve pages from the Google cache, and ask Google for spelling suggestions.

Utilizing the Google API - I

import sys import string import codecs import google print '<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">' print '<head>' print ' <title>Google with Python</title>' print '</head>' print '<body>'

print '<h1>Google with Python</h1>'

google.LICENSE_KEY = '[YOUR GOOGLE LICENSE KEY]' sys.stdout = codecs.lookup('utf-8')[-1](sys.stdout) query = Your Query" data = google.doGoogleSearch(query)

Utilizing the Google API - II

print '<p><strong>1-10 of "' + query + '" total results for ' print str(data.meta.estimatedTotalResultsCount) + '</strong></p>' for result in data.results: title = result.title title = title.replace('<b>', '<strong>') title = title.replace('</b>', '</strong>')

snippet = result.snippet snippet = snippet.replace('<b>','<strong>') snippet = snippet.replace('</b>','</strong>') snippet = snippet.replace('<br>','<br />') print '<h2><a href="' + result.URL + '">' + title + '</a></h2>' print '<p>' + snippet + '</p>' print '</body> print '</html>'

Yahoo API
http://pysearch.sourceforge.net/

http://python.codezoo.com/pub/component/41

93?category=198 This project implements a Python API for the Yahoo Search Webservices API. pYsearch is an OO abstraction of the web services, with emphasis on ease of use and extensibility.

URLLIB
This module provides a high-level interface

for fetching data across the World Wide Web. In particular, the urlopen() function is similar to the built-in function open(), but accepts Universal Resource Locators (URLs) instead of filenames. Some restrictions apply -- it can only open URLs for reading, and no seek operations are available.

Urllib Syntax
# Use http://www.someproxy.com:3128 for http

proxying proxies = {'http': 'http://www.someproxy.com:3128'} filehandle = urllib.urlopen(some_url, proxies=proxies) # Don't use any proxies filehandle = urllib.urlopen(some_url, proxies={})

URLLIB Examples
Here is an example session that uses the "GET" method to

retrieve a URL containing parameters: >>> import urllib >>> params = urllib.urlencode({'spam': 1, 'eggs': 2, 'bacon': 0}) >>> f = urllib.urlopen("http://www.musi-cal.com/cgi-bin/query?%s" % params) >>> print f.read() The following example uses the "POST" method instead: >>> import urllib >>> params = urllib.urlencode({'spam': 1, 'eggs': 2, 'bacon': 0}) >>> f = urllib.urlopen("http://www.musi-cal.com/cgi-bin/query", params) >>> print f.read()

What is a Proxy
A proxy server is a computer that offers a computer

network service to allow clients to make indirect network connections to other network services. A client connects to the proxy server, then requests a connection, file, or other resource available on a different server. The proxy provides the resource either by connecting to the specified server or by serving it from a cache. In some cases, the proxy may alter the client's request or the server's response for various purposes. A proxy server can also serve as a firewall.

Python 3 Cheat Sheet
94% (51)
Python 3 Cheat Sheet
2 pages
12 Comp Sci 1 Revision Notes Pythan Advanced Prog
No ratings yet
12 Comp Sci 1 Revision Notes Pythan Advanced Prog
5 pages
Beginners Python Cheat Sheet
83% (6)
Beginners Python Cheat Sheet
28 pages
Python 3.2 Reference Card
100% (12)
Python 3.2 Reference Card
2 pages
Python 3 Object Oriented Programming
From Everand
Python 3 Object Oriented Programming
Dusty Phillips
4/5 (9)
Python Refcard
100% (5)
Python Refcard
2 pages
Sanyo PLC-XP57 SM PDF
No ratings yet
Sanyo PLC-XP57 SM PDF
122 pages
Deloitte TR Job Evaluation
No ratings yet
Deloitte TR Job Evaluation
7 pages
Python Cheatsheet
100% (1)
Python Cheatsheet
1 page
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
From Everand
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
Charlie Masterson
No ratings yet
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Python3 Reference Cheat Sheet
100% (1)
Python3 Reference Cheat Sheet
2 pages
Python Programming Examples
No ratings yet
Python Programming Examples
19 pages
Python 3 Cheat Sheet
71% (7)
Python 3 Cheat Sheet
16 pages
Python by Example - 9 June 2014
100% (2)
Python by Example - 9 June 2014
82 pages
Python - Tutorial: #!/usr/bin/python Print "Hello, Python!"
No ratings yet
Python - Tutorial: #!/usr/bin/python Print "Hello, Python!"
174 pages
Python CheatSheet PDF
No ratings yet
Python CheatSheet PDF
1 page
CheatSheet Python 3 Complex Data Types
No ratings yet
CheatSheet Python 3 Complex Data Types
1 page
Machine Learning Python
100% (1)
Machine Learning Python
9 pages
Introductory Notes: Matplotlib: Preliminaries
No ratings yet
Introductory Notes: Matplotlib: Preliminaries
11 pages
Win32com - Goermezer.de-The Python Script Collection For Windows - Controlling Applications Via Sendkeys
No ratings yet
Win32com - Goermezer.de-The Python Script Collection For Windows - Controlling Applications Via Sendkeys
3 pages
Python Matplotlib Cheat Sheet
No ratings yet
Python Matplotlib Cheat Sheet
1 page
Python Keywords, Identifiers and Variables - Fundamentals
No ratings yet
Python Keywords, Identifiers and Variables - Fundamentals
179 pages
Python Notes PDF
No ratings yet
Python Notes PDF
7 pages
Python Quick Reference Card
94% (17)
Python Quick Reference Card
17 pages
NumPy, SciPy and MatPlotLib
100% (1)
NumPy, SciPy and MatPlotLib
18 pages
Intermediate Python Cheat Sheet
No ratings yet
Intermediate Python Cheat Sheet
3 pages
Python Example Programs
100% (1)
Python Example Programs
5 pages
Algorithms in Python
100% (1)
Algorithms in Python
218 pages
Zerotohero Python3 170809030243
100% (1)
Zerotohero Python3 170809030243
96 pages
Beginners Python Cheat Sheet PCC BW
No ratings yet
Beginners Python Cheat Sheet PCC BW
2 pages
The Python Standard Library
No ratings yet
The Python Standard Library
8 pages
Python Notes
No ratings yet
Python Notes
4 pages
Python Iterators
No ratings yet
Python Iterators
9 pages
Intro PDF
No ratings yet
Intro PDF
97 pages
Pandas
No ratings yet
Pandas
4 pages
Python Cheat Sheets
97% (33)
Python Cheat Sheets
11 pages
Python I Compiled Notes
100% (3)
Python I Compiled Notes
321 pages
Python Refcard
No ratings yet
Python Refcard
3 pages
Python ZTM Cheatsheet: Andrei Neagoie
100% (1)
Python ZTM Cheatsheet: Andrei Neagoie
38 pages
Python For Web Scraping - Week 3: 1 Installing A Module
No ratings yet
Python For Web Scraping - Week 3: 1 Installing A Module
4 pages
Numpy Basics: Arithmetic Operations
No ratings yet
Numpy Basics: Arithmetic Operations
6 pages
Matlab Cheatsheet
No ratings yet
Matlab Cheatsheet
2 pages
Python Cheat Sheet
No ratings yet
Python Cheat Sheet
16 pages
Quick Unix Reference
No ratings yet
Quick Unix Reference
6 pages
Python Tools Utilities
100% (1)
Python Tools Utilities
3 pages
Python Tutorial Codes
100% (2)
Python Tutorial Codes
255 pages
Python 2 Python 3
100% (1)
Python 2 Python 3
4 pages
Python 3 Cheat Sheet v3
100% (4)
Python 3 Cheat Sheet v3
13 pages
GUI Using Python
100% (1)
GUI Using Python
25 pages
Python Extra Tutorial
50% (2)
Python Extra Tutorial
172 pages
Python Programming
100% (15)
Python Programming
58 pages
Python Programming Illustrated For Beginners & Intermediates“Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!
From Everand
Python Programming Illustrated For Beginners & Intermediates“Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!
William Sullivan
3/5 (1)
Python Unleashed: Mastering the Art of Efficient Coding
From Everand
Python Unleashed: Mastering the Art of Efficient Coding
James Livingston
No ratings yet
Python and SQLite Development
From Everand
Python and SQLite Development
Agus Kurniawan
No ratings yet
Basic Exercises for Competitive Programming: Python
From Everand
Basic Exercises for Competitive Programming: Python
Jan Pol
No ratings yet
NumPy Beginner's Guide
From Everand
NumPy Beginner's Guide
Ivan Idris
5/5 (3)
Problem Solving in C and Python: Programming Exercises and Solutions, Part 1
From Everand
Problem Solving in C and Python: Programming Exercises and Solutions, Part 1
Yana Kortsarts
4.5/5 (2)
nss-1
No ratings yet
nss-1
2 pages
Python Ultimate Guide
100% (1)
Python Ultimate Guide
10 pages
Class02 - Copy
No ratings yet
Class02 - Copy
8 pages
Python Refcard
No ratings yet
Python Refcard
2 pages
Power Bi Embebido
No ratings yet
Power Bi Embebido
54 pages
Informatika Kelas 11
No ratings yet
Informatika Kelas 11
6 pages
Customer Relationship Management: Brijesh Kumar Delhi Business School New Delhi
No ratings yet
Customer Relationship Management: Brijesh Kumar Delhi Business School New Delhi
20 pages
Abu Dhabi Bus Time & Fare
50% (2)
Abu Dhabi Bus Time & Fare
21 pages
The Definitive Guide to Mastering the Psychology of Trading by Mark Douglas
No ratings yet
The Definitive Guide to Mastering the Psychology of Trading by Mark Douglas
319 pages
Cisco NCS 2000 Series Troubleshooting Guide, Release 10.x.x: Americas Headquarters
0% (1)
Cisco NCS 2000 Series Troubleshooting Guide, Release 10.x.x: Americas Headquarters
558 pages
JNTUH B.tech 3 Year ECE R16 Syllabus
No ratings yet
JNTUH B.tech 3 Year ECE R16 Syllabus
36 pages
HUAWEI ALE-L21C432B564 Software Upgrade Guideline PDF
No ratings yet
HUAWEI ALE-L21C432B564 Software Upgrade Guideline PDF
8 pages
Ram Resume
No ratings yet
Ram Resume
2 pages
User's Manual / Bedienungsanleitung Gebruikershandleiding / Manuel D'utilisation Manual Do Utilizador / Manual Del Usuario Manuale Dell'utente
No ratings yet
User's Manual / Bedienungsanleitung Gebruikershandleiding / Manuel D'utilisation Manual Do Utilizador / Manual Del Usuario Manuale Dell'utente
56 pages
File Structures and Algorithms
No ratings yet
File Structures and Algorithms
5 pages
Object Oriented Analysis: Ey Ne Ed To
No ratings yet
Object Oriented Analysis: Ey Ne Ed To
36 pages
Ucc 27524
No ratings yet
Ucc 27524
46 pages
PHD Thesis Artificial Intelligence
100% (3)
PHD Thesis Artificial Intelligence
5 pages
Beginning iPhone Development Exploring the iPhone SDK 1st Edition Jeff Lamarche pdf download
100% (1)
Beginning iPhone Development Exploring the iPhone SDK 1st Edition Jeff Lamarche pdf download
76 pages
Zhang Tilly Coping_With_Semiconductor_Containment Gavekal August 2023
No ratings yet
Zhang Tilly Coping_With_Semiconductor_Containment Gavekal August 2023
4 pages
MIS of Metro
75% (4)
MIS of Metro
30 pages
Rebecca Pfender: Professional Experience
No ratings yet
Rebecca Pfender: Professional Experience
1 page
(Non-QU) Linear Algebra by DR - Gabriel Nagy PDF
No ratings yet
(Non-QU) Linear Algebra by DR - Gabriel Nagy PDF
362 pages
Computational Network Science An Algorithmic Approach 1st Edition Hexmoor All Chapter Instant Download
100% (4)
Computational Network Science An Algorithmic Approach 1st Edition Hexmoor All Chapter Instant Download
62 pages
LogixPro CISE 313 Automation Devices and Electronics Lab Manual
No ratings yet
LogixPro CISE 313 Automation Devices and Electronics Lab Manual
36 pages
Sweet Love
No ratings yet
Sweet Love
8 pages
Spreadsheets For Marketing & Sales Tracking - Data Analysis Tools Using Ms Excel
No ratings yet
Spreadsheets For Marketing & Sales Tracking - Data Analysis Tools Using Ms Excel
26 pages
2 Lecture Principles of Negotiation
No ratings yet
2 Lecture Principles of Negotiation
25 pages
Use Cases for Fleet in Logistics
No ratings yet
Use Cases for Fleet in Logistics
4 pages
LKPD Bahasa Inggris Kelas Vii KD 3
No ratings yet
LKPD Bahasa Inggris Kelas Vii KD 3
31 pages
ISO-IEC-IEEE-29119-2-2021
No ratings yet
ISO-IEC-IEEE-29119-2-2021
15 pages
BIG-D Introduction to Artificial Intelligence
No ratings yet
BIG-D Introduction to Artificial Intelligence
19 pages

Python Code Examples

Uploaded by

Python Code Examples

Uploaded by

Python Code Examples

Creating a Dictionary of First Names

Computing Accuracy Results I

text1 = open(fname1,'r').read() text1 = string.lower(text1) words1 = string.split(text1)

Computing Accuracy Results II

Extracting People Names and Company Names

Extracting People Names and Company Names

to build NLP programs in Python. It provides:

and manipulating unordered collections of unique elements. Common uses include:

Like other collections, sets support x in set, len(set),

Some Details about Implementation

x in s x not in s s.issubset(t) s.issuperset(t) s.union(t) s.intersection(t) s.difference(t) s.symmetric_differenc e(t) s.copy()

test whether every element in s is in t

s|t s&t s-t s^t

Operations not for ImmutableSet

return set s with elements added from t

remove x from set s; raises KeyError if not present

Utilizing the Google API - I

print '<h1>Google with Python</h1>'

Utilizing the Google API - II

You might also like