This repository contains Jupyter notebooks covering various deep learning and computer vision topics. Each notebook provides hands-on experience with different techniques and architectures.
- 🔰 Introduction to Machine Learning concepts:
PythonNumPyPandas- Data visualization
- Basic
supervisedlearning - Basic
semi-supervisedlearning
- 📌 Introduction to
Convolutional Neural Networks (CNNs)for image classification. - 🛠️ Practical implementation using deep learning frameworks.
- 🔍 Exploration of various
CNN architectures. - 🎛️ Understanding
hyperparameter tuningand architectural choices. - 📦 Utilizing
pre-trained modelsfor improved performance.
- 📖 Implementation of the
Transformer modelfor deep learning tasks. - 🧠 Understanding
self-attention mechanisms.
- 🔍
Training and evaluating object detection models. - 🏷️ Understanding
bounding boxesandclass predictions. - 🛠️ Recommended external resources: Ultralytics for efficient detection models.
- 🎨 Assigning class labels to each pixel in an image.
- 🚗 Applications in medical imaging, autonomous driving, and more.
- 🏗️ Exploring
FixMatch, a method forsemi-supervised learning. - 🖼️ Training on a fraction of
CIFAR-10images.
- 🔄 Implementing
SimCLRfor contrastive learning. - 📸 Pretraining on
CIFAR-10and fine-tuning onSVHN.
To run these notebooks locally, install the necessary dependencies:
pip install torch torchvision timm jupyter matplotlib seaborn tqdmAlternatively, use Google Colab links provided in each section to run them directly in the cloud. ☁️
Some of the research papers referenced in the notebooks are available in the pdfs folder. These papers provide theoretical background and additional insights into the concepts explored in the notebooks. These papers complement the practical implementations in this repository and provide additional theoretical insights into the methods used.
-
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
- 📄 Found in
pdfs/Semi-Supervised Learning FixMatch.pdf
- 📄 Found in
-
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
- 📄 Found in
pdfs/CLR.pdf
- 📄 Found in
-
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
- 📄 Found in
pdfs/BatchNormalization.pdf
- 📄 Found in
-
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
- 📄 Found in
pdfs/MobileNet.pdf
- 📄 Found in
-
MobileNetV3: Searching for MobileNetV3
- 📄 Found in
pdfs/MobileNetV3.pdf
- 📄 Found in
-
Deep Residual Learning for Image Recognition (ResNet)
- 📄 Found in
pdfs/ResNet.pdf
- 📄 Found in
-
Identifying Mislabeled Data using the Area Under the Margin Ranking
- 📄 Found in
pdfs/Mislabeled Data AUC.pdf
- 📄 Found in
-
Mixup: Beyond Empirical Risk Minimization
- 📄 Found in
pdfs/RiskMinimization.pdf
- 📄 Found in
-
MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins
-
SqueezeNet: AlexNet-Level Accuracy with 50x Fewer Parameters
- 📄 Found in
pdfs/SqueezeNet.pdf
- 📄 Found in
-
SSD: Single Shot MultiBox Detector for Object Detection
- 📄 Found in
pdfs/SSD.pdf
- 📄 Found in