This document summarizes the key steps in the locality sensitive hashing (LSH) algorithm for finding similar documents: 1. Documents are converted to sets of shingles (sequences of tokens) to represent them as high-dimensional data points. 2. MinHashing is applied to generate signatures (hashes) for each document such that similar documents are likely to have the same signatures. This compresses the data into a signature matrix. 3. LSH uses the signature matrix to hash similar documents into the same buckets with high probability, finding candidate pairs for further similarity evaluation and filtering out dissimilar pairs from consideration. This improves the computation efficiency over directly comparing all pairs.