SeekDeeper: Minimal Implementations of Popular AI Models

Motivation

Official code repositories often include many engineering details, which can be overwhelming for beginners. This repository aims to implement various models with as little code as possible using PyTorch, making it easier for learners to understand and reproduce results. Additionally, most tutorials lack a complete workflow, focusing only on the model without considering data loading and training. This makes it difficult for beginners to apply their knowledge in practice.

Models

Model	Paper	Official or Reference Repository
Transformer	Attention Is All You Need	https://github.com/hyunwoongko/transformer
GPT	Improving Language Understanding by Generative Pre-Training	https://github.com/openai/finetune-transformer-lm https://github.com/openai/gpt-2 https://github.com/karpathy/nanoGPT https://github.com/karpathy/minGPT
GPT-2	Language Models are Unsupervised Multitask Learners
GAN	Generative Adversarial Networks	https://github.com/goodfeli/adversarial
DCGAN	Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks	https://github.com/Newmu/dcgan_code
WGAN-GP	Improved Training of Wasserstein GANs	https://github.com/igul222/improved_wgan_training

Directory Structure

For each model, the typical directory structure is as follows:

<model name>/
├── checkpoints/
├── modules/
├── datasets/
├── images/
├── README.md
├── data.py
├── config.py
├── train.ipynb
└── inference.ipynb

checkpoints/: Contains pre-trained model weights for direct use in inference.ipynb. Sometimes, pre-trained parameters from official repositories are loaded directly.
modules/: Contains modules necessary for model implementation.
datasets/: Contains datasets required for training or inference validation, which may sometimes be downloaded to this directory via code.
images/: Contains images for README.md of this model.
README.md: Introduces the implemented task and describes the implementation details.
data.py: Defines Dataset, Dataloader, or data preprocessing.
config.py: Defines hyperparameters needed for the experiment.
train.ipynb: Clearly presents the process from data loading, preprocessing, to training and evaluation.
inference.ipynb: Loads model parameters from the checkpoints/ directory for inference.

License

This project uses the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
gan		gan
gpt		gpt
gpt2		gpt2
template		template
transformer		transformer
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SeekDeeper: Minimal Implementations of Popular AI Models

Motivation

Models

Directory Structure

License

About

Releases

Packages

Languages

License

BUGERR/SeekDeeper

Folders and files

Latest commit

History

Repository files navigation

SeekDeeper: Minimal Implementations of Popular AI Models

Motivation

Models

Directory Structure

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages