# Deep Learning Collection

Welcome to the Programming Ocean Academy's Deep Learning Repository.
This project is a comprehensive educational suite showcasing a variety of generative models implemented with PyTorch, ranging from foundational architectures to modern, cutting-edge designs.


## Objective

This repository serves as an academic and teaching-oriented resource for understanding, building, and visualizing deep generative models. It is designed to help students, researchers, and practitioners explore the diversity of generative learning approaches in a modular and clear format.


## Repository Structure

Each folder represents a specific category of generative or neural architecture:

| Folder Name | Description |
|---|---|
| `auto-regressive-models` | PixelCNN and related sequential density estimators |
| `cnn` | Basic CNN models for image recognition |
| `diffusion` | Denoising Diffusion Probabilistic Models (DDPM, DDIM) |
| `dit-models` | Diffusion Transformers (DiT) |
| `energy-based-models` | EBMs trained with Langevin dynamics |
| `flow-based-models` | RealNVP, Glow, and other invertible models |
| `gans` | GAN, DCGAN, WGAN, and conditional variants |
| `latent-manifold-auto-encoder` | Latent space exploration with VAEs and AEs |
| `multi-model` | Cross-modal tasks (e.g., text-to-image, image captioning) |
| `restricted-boltzmann-machine` | Contrastive Divergence and RBMs |
| `rnn` | Recurrent networks (LSTM, GRU) |
| `score-based-generative-convolution` | Score-matching models with CNN backbones |
| `score-based-generative-models` | Langevin and NCSN-style samplers |
| `time-series` | Forecasting models for temporal data |
| `transformer` | Sequence models and transformers (Vanilla, GPT) |
| `variational-auto-encoder` | VAEs and conditional variants |
| `vision-transformer` | ViT for image understanding |

## Highlighted Projects

### 1. Diffusion Models

> "A Concise Implementation of Denoising Diffusion Probabilistic Models for Generative Image Synthesis in PyTorch"

- U-Net architecture with Gaussian noise scheduling
- Reverse sampling with denoising
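The closed-form forward process behind the noise schedule can be sketched framework-agnostically in NumPy; the schedule constants below follow common DDPM defaults and the variable names are illustrative, not taken from the notebook:

```python
import numpy as np

# Linear beta schedule (common DDPM defaults; the exact values are assumptions).
T = 1000
betas = np.linspace(1e-4, 0.02, T)
alphas = 1.0 - betas
alpha_bars = np.cumprod(alphas)  # cumulative signal-retention factor

def q_sample(x0, t, rng):
    # Closed-form forward process: x_t = sqrt(a_bar_t) * x_0 + sqrt(1 - a_bar_t) * eps.
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(alpha_bars[t]) * x0 + np.sqrt(1.0 - alpha_bars[t]) * eps, eps

rng = np.random.default_rng(0)
x0 = rng.standard_normal((28, 28))  # stand-in for a normalized MNIST image
x_t, eps = q_sample(x0, 500, rng)   # noisy image halfway through the chain
```

Because `alpha_bars` shrinks toward zero, late timesteps are almost pure noise; the U-Net is trained to predict `eps` from `x_t` so the chain can be reversed.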

### 2. GANs

> "Adversarial Image Synthesis with Generative Networks: A PyTorch Implementation of GANs on MNIST"

- Generator and Discriminator loop
- Real versus generated image comparison
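The adversarial objective driving that loop can be sketched in NumPy on hypothetical discriminator outputs; the probabilities below are illustrative, not taken from the notebook:

```python
import numpy as np

def bce(pred, target):
    # Binary cross-entropy on probabilities, averaged over the batch.
    eps = 1e-12
    return -np.mean(target * np.log(pred + eps) + (1 - target) * np.log(1 - pred + eps))

# Hypothetical discriminator probabilities for one batch.
d_real = np.array([0.9, 0.8, 0.95])   # D(x) on real images
d_fake = np.array([0.1, 0.2, 0.05])   # D(G(z)) on generated images

# Discriminator: push real outputs toward 1 and fake outputs toward 0.
loss_d = bce(d_real, np.ones_like(d_real)) + bce(d_fake, np.zeros_like(d_fake))

# Generator (non-saturating form): push D(G(z)) toward 1.
loss_g = bce(d_fake, np.ones_like(d_fake))
```

When the discriminator is confident, as here, the generator loss is large, which is exactly the gradient signal that drives the generator to improve.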

### 3. VAEs

> "Latent Variable Modeling and Image Generation with Variational Autoencoders: A PyTorch-Based Study on MNIST"

- Reparameterization trick
- Sampling and interpolation
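The reparameterization trick and the closed-form KL term of the VAE objective can be sketched in NumPy (shapes and values below are illustrative):

```python
import numpy as np

def reparameterize(mu, log_var, rng):
    # z = mu + sigma * eps: sampling stays differentiable w.r.t. mu and log_var.
    sigma = np.exp(0.5 * log_var)
    return mu + sigma * rng.standard_normal(mu.shape)

def kl_to_standard_normal(mu, log_var):
    # Closed-form KL(q(z|x) || N(0, I)) for a diagonal Gaussian posterior.
    return -0.5 * np.sum(1.0 + log_var - mu**2 - np.exp(log_var))

rng = np.random.default_rng(0)
mu, log_var = np.zeros((8, 2)), np.zeros((8, 2))  # toy encoder outputs
z = reparameterize(mu, log_var, rng)
```

The KL term vanishes exactly when the posterior equals the prior and grows as the encoder pushes `mu` or `log_var` away from it, which is what regularizes the latent space enough to make interpolation meaningful.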

### 4. Score-Based Models

> "Unsupervised Image Synthesis via Score Matching and Langevin Dynamics: A Score-Based Generative Framework on MNIST"

- Trainable score networks
- MCMC sampling
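The Langevin MCMC sampler can be sketched in NumPy against a toy distribution whose score is known analytically, a 1-D Gaussian; in the actual models the analytic score is replaced by the trained score network (step size and iteration counts below are illustrative):

```python
import numpy as np

def langevin_sample(score_fn, x0, step, n_steps, rng):
    # Unadjusted Langevin dynamics: gradient ascent on log-density plus noise.
    x = x0
    for _ in range(n_steps):
        x = x + step * score_fn(x) + np.sqrt(2.0 * step) * rng.standard_normal(x.shape)
    return x

# Toy target N(mu, sigma^2), whose score is -(x - mu) / sigma^2.
mu, sigma = 3.0, 1.0
score = lambda x: -(x - mu) / sigma**2

rng = np.random.default_rng(0)
samples = langevin_sample(score, rng.standard_normal(5000), step=0.1, n_steps=500, rng=rng)
```

After enough steps the chain forgets its initialization and the samples concentrate around the target mean and spread, which is the same mechanism NCSN-style samplers use with an annealed noise schedule.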

### 5. Text-to-Image (Mini DALL·E)

> "Learning Discrete Visual Representations from Textual Descriptions: A Simplified VQ-VAE Framework for Text-to-Image Generation"

- VQ-VAE with Transformer
- Captioned image generation (color, shape, objects)
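The core VQ-VAE step is a nearest-neighbour codebook lookup that turns continuous encoder outputs into the discrete tokens the Transformer models. A NumPy sketch with an illustrative two-entry codebook (real models learn `K` entries end-to-end, with a straight-through gradient):

```python
import numpy as np

def vector_quantize(z, codebook):
    # Replace each encoder vector with its nearest codebook entry.
    # z: (N, D) encoder outputs, codebook: (K, D) learned embeddings.
    dists = ((z[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)  # (N, K)
    idx = dists.argmin(axis=1)          # discrete token per vector
    return codebook[idx], idx

# Illustrative codebook and encoder outputs.
codebook = np.array([[0.0, 0.0], [1.0, 1.0]])
z = np.array([[0.1, -0.1], [0.9, 1.2]])
z_q, idx = vector_quantize(z, codebook)
```

The resulting index sequence is what the caption-conditioned Transformer learns to generate; decoding the predicted indices back through the codebook and VQ-VAE decoder produces the image.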

### 6. Image Captioning

> "Visual Grounding through Language: A Minimalist Encoder-Decoder Framework for Image Captioning with Attention in PyTorch"

- ResNet encoder combined with LSTM and soft attention
- Caption generation for synthetic scenes
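One step of the soft-attention mechanism, where the LSTM state selects which spatial features to attend to, can be sketched in NumPy; the dot-product scoring and the 7x7x256 feature-map shape are assumptions for illustration:

```python
import numpy as np

def soft_attention(query, features):
    # One decoding step: score each spatial location, softmax, weighted sum.
    # query: (D,) decoder hidden state, features: (L, D) flattened encoder feature map.
    scores = features @ query                  # (L,) dot-product scores
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                   # softmax over locations
    context = weights @ features               # (D,) attended context vector
    return context, weights

rng = np.random.default_rng(0)
features = rng.standard_normal((49, 256))  # e.g. a flattened 7x7 ResNet feature map
query = rng.standard_normal(256)
context, weights = soft_attention(query, features)
```

At each word, the context vector is concatenated with the LSTM input, so the decoder grounds every generated token in a different weighted view of the image.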

## Usage

All notebooks are written for clarity and modularity.

```bash
# Clone the repository
git clone https://github.com/Programming-Ocean-Academy/deep-learning.git
```

Open any `.ipynb` file in JupyterLab, Google Colab, or VSCode and run it directly.

---

## Contributing

We welcome contributions to extend this educational repository:

- Add new generative model examples
- Improve visualizations or metrics
- Refactor notebooks into scripts or modules

---

## License

MIT License. Free for personal, educational, and research use.

---

## Acknowledgements

Inspired by work from:

- OpenAI, DeepMind, LucidRain
- The PyTorch community and tutorials
- Original research papers on DALL·E, VQ-VAE, and DDPM

---

Enjoy exploring generative deep learning.
