High-Performance Deep Learning with SLURM

Overview

This repository provides a guide to deep learning with PyTorch, along with best practices for running workloads on an HPC cluster using SLURM. It includes:

Deep Learning Basics: Jupyter notebooks covering foundational concepts.
SLURM Job Scheduling: Guides and scripts for distributed training.
Module Management: Best practices for handling dependencies on HPC clusters.

Repository Structure

/01_introduction/
 ├── 01_SLURM.md                            # SLURM job scheduling guide
 ├── 02_Modules.md                          # Guide to managing modules
 ├── 03_introduction to DeepLearning.ipynb  # Jupyter notebook on DL basics
 ├── 04_slurm cheatbook.pdf                 # SLURM command reference
 └── README.md                              # Project documentation

🔹 Deep Learning Topics Covered

Understanding Tensors in PyTorch
Forward & Backward Propagation
Loss Functions & Optimization
Leveraging NVIDIA Tensor Cores from PyTorch
Building a Simple Neural Network
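
To make the forward pass, loss, backward pass, and optimization step concrete, here is a minimal, dependency-free sketch in plain Python. It is not taken from the notebook: the function names (`forward`, `mse_loss`, `backward`, `train`), the toy data, and the hyperparameters are all assumptions chosen for illustration. In PyTorch, autograd would compute the gradients that are written out by hand here.

```python
# Illustrative sketch only: a single linear neuron y_hat = w * x + b
# trained with plain gradient descent. All names and values are hypothetical.


def forward(w, b, xs):
    """Forward pass: compute a prediction for every input."""
    return [w * x + b for x in xs]


def mse_loss(preds, targets):
    """Mean squared error between predictions and targets."""
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(targets)


def backward(xs, preds, targets):
    """Backward pass: analytic gradients of the MSE loss w.r.t. w and b."""
    n = len(targets)
    grad_w = sum(2 * (p - t) * x for x, p, t in zip(xs, preds, targets)) / n
    grad_b = sum(2 * (p - t) for p, t in zip(preds, targets)) / n
    return grad_w, grad_b


def train(xs, targets, lr=0.05, epochs=500):
    """Run gradient descent and record the loss before each update."""
    w, b = 0.0, 0.0
    history = []
    for _ in range(epochs):
        preds = forward(w, b, xs)
        history.append(mse_loss(preds, targets))
        grad_w, grad_b = backward(xs, preds, targets)
        # Optimization step: move the parameters against the gradient.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b, history


# Toy data generated from y = 2x + 1 (assumed for illustration).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
targets = [2 * x + 1 for x in xs]

w, b, history = train(xs, targets)
print(f"w={w:.3f}, b={b:.3f}, final loss={history[-1]:.6f}")
```

With this toy data the learned parameters should approach `w = 2` and `b = 1`. A learning rate that is too large would make the loss diverge instead, which is one reason the choice of optimizer settings matters.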

🔹 SLURM & HPC Topics Covered

Managing Job Queues & Partitions
Writing & Submitting SLURM Jobs
Monitoring & Debugging Jobs
Using SLURM for Distributed Training
Managing Dependencies with Modules
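
As a rough illustration of how job submission and module loading fit together, the sketch below assembles a SLURM batch script as a string. The helper `make_sbatch_script`, the partition name `gpu`, the module name `cuda`, and the training script `train.py` are hypothetical placeholders; partitions and module names differ between clusters, so check yours with `sinfo` and `module avail`. The `#SBATCH` directives and the `%x`/`%j` filename patterns are standard `sbatch` options.

```python
# Illustrative sketch only: build a batch script for a multi-GPU job.
# The helper name and all argument values below are assumptions.


def make_sbatch_script(job_name, partition, nodes, gpus_per_node,
                       time_limit, modules, command):
    """Return the text of a SLURM batch script that runs one task per GPU."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --partition={partition}",
        f"#SBATCH --nodes={nodes}",
        f"#SBATCH --ntasks-per-node={gpus_per_node}",
        f"#SBATCH --gres=gpu:{gpus_per_node}",
        f"#SBATCH --time={time_limit}",
        "#SBATCH --output=%x_%j.out",  # %x = job name, %j = job ID
        "",
        "module purge",  # start from a clean module environment
    ]
    lines += [f"module load {name}" for name in modules]
    lines += ["", f"srun {command}"]
    return "\n".join(lines) + "\n"


script = make_sbatch_script(
    job_name="dl-train",          # hypothetical job name
    partition="gpu",              # hypothetical partition
    nodes=2,
    gpus_per_node=4,
    time_limit="02:00:00",
    modules=["cuda"],             # hypothetical module name
    command="python train.py",    # hypothetical training script
)
print(script)
```

Saving the output to a file such as `job.sh` and running `sbatch job.sh` would submit it; `squeue -u $USER` then shows the job's state in the queue.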

Prerequisites

To use this repository effectively, you should have:

Basic Python knowledge
Familiarity with PyTorch

Additional Resources

📚 PyTorch Documentation
📚 SLURM Official Guide
📚 Deep Learning Book by Ian Goodfellow, Yoshua Bengio, and Aaron Courville
