Computer Vision

Computer Vision

Image classification

StarterKeras VersionTitleDate CreatedLast Modified
V3Image classification from scratch2020/04/272023/11/09
V3Simple MNIST convnet2015/06/192020/04/21
V3Image classification via fine-tuning with EfficientNet2020/06/302023/07/10
V3Image classification with Vision Transformer2021/01/182021/01/18
V3Classification using Attention-based Deep Multiple Instance Learning (MIL).2021/08/162021/11/25
V3Image classification with modern MLP models2021/05/302023/08/03
V3MobileViT: A mobile-friendly Transformer-based model for image classification2021/10/202024/02/11
V3Pneumonia Classification on TPU2020/07/282024/02/12
V3Compact Convolutional Transformers2021/06/302023/08/07
V3Image classification with ConvMixer2021/10/122021/10/12
V3Image classification with EANet (External Attention Transformer)2021/10/192023/07/18
V3Involutional neural networks2021/07/252021/07/25
V3Image classification with Perceiver2021/04/302023/12/30
V3Few-Shot learning with Reptile2020/05/212023/07/20
V3Semi-supervised image classification using contrastive pretraining with SimCLR2021/04/242024/03/04
V3Image classification with Swin Transformers2021/09/082021/09/08
V2Train a Vision Transformer on small datasets2022/01/072022/01/10
V2A Vision Transformer without Attention2022/02/242022/10/15
V3Image Classification using Global Context Vision Transformer2023/10/302023/10/30
V3Image Classification using BigTransfer (BiT)2021/09/242024/01/03
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Semi-supervised image classification using contrastive pretraining with SimCLR2021/04/242024/03/04
V3Pneumonia Classification on TPU2020/07/282024/02/12
V3MobileViT: A mobile-friendly Transformer-based model for image classification2021/10/202024/02/11
V3Image Classification using BigTransfer (BiT)2021/09/242024/01/03
V3Image classification with Perceiver2021/04/302023/12/30
V3Image classification from scratch2020/04/272023/11/09
V3Image Classification using Global Context Vision Transformer2023/10/302023/10/30
V3Compact Convolutional Transformers2021/06/302023/08/07
V3Image classification with modern MLP models2021/05/302023/08/03
V3Few-Shot learning with Reptile2020/05/212023/07/20
V3Image classification with EANet (External Attention Transformer)2021/10/192023/07/18
V3Image classification via fine-tuning with EfficientNet2020/06/302023/07/10
V2A Vision Transformer without Attention2022/02/242022/10/15
V2Train a Vision Transformer on small datasets2022/01/072022/01/10
V3Classification using Attention-based Deep Multiple Instance Learning (MIL).2021/08/162021/11/25
V3Image classification with ConvMixer2021/10/122021/10/12
V3Image classification with Swin Transformers2021/09/082021/09/08
V3Involutional neural networks2021/07/252021/07/25
V3Image classification with Vision Transformer2021/01/182021/01/18
V3Simple MNIST convnet2015/06/192020/04/21

Image segmentation

StarterKeras VersionTitleDate CreatedLast Modified
V3Image segmentation with a U-Net-like architecture2019/03/202020/04/20
V3Multiclass semantic segmentation using DeepLabV3+2021/08/312024/01/05
V2Highly accurate boundaries segmentation using BASNet2023/05/302023/07/13
V3Image Segmentation using Composable Fully-Convolutional Networks2023/06/162023/12/25
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Multiclass semantic segmentation using DeepLabV3+2021/08/312024/01/05
V3Image Segmentation using Composable Fully-Convolutional Networks2023/06/162023/12/25
V2Highly accurate boundaries segmentation using BASNet2023/05/302023/07/13
V3Image segmentation with a U-Net-like architecture2019/03/202020/04/20

Object detection

StarterKeras VersionTitleDate CreatedLast Modified
V2Object Detection with RetinaNet2020/05/172023/07/10
V3Keypoint Detection with Transfer Learning2021/05/022023/07/19
V3Object detection with Vision Transformers2022/03/272023/11/20
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Object detection with Vision Transformers2022/03/272023/11/20
V3Keypoint Detection with Transfer Learning2021/05/022023/07/19
V2Object Detection with RetinaNet2020/05/172023/07/10

3D

StarterKeras VersionTitleDate CreatedLast Modified
V33D image classification from CT scans2020/09/232024/01/11
V3Monocular depth estimation2021/08/302024/08/13
V33D volumetric rendering with NeRF2021/08/092023/11/13
V3Point cloud segmentation with PointNet2020/10/232020/10/24
V3Point cloud classification with PointNet2020/05/252024/01/09
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Monocular depth estimation2021/08/302024/08/13
V33D image classification from CT scans2020/09/232024/01/11
V3Point cloud classification with PointNet2020/05/252024/01/09
V33D volumetric rendering with NeRF2021/08/092023/11/13
V3Point cloud segmentation with PointNet2020/10/232020/10/24

OCR

StarterKeras VersionTitleDate CreatedLast Modified
V3OCR model for reading Captchas2020/06/142024/03/13
V3Handwriting recognition2021/08/162024/09/01
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Handwriting recognition2021/08/162024/09/01
V3OCR model for reading Captchas2020/06/142024/03/13

Image enhancement

StarterKeras VersionTitleDate CreatedLast Modified
V3Convolutional autoencoder for image denoising2021/03/012021/03/01
V3Low-light image enhancement using MIRNet2021/09/112023/07/15
V3Image Super-Resolution using an Efficient Sub-Pixel CNN2020/07/282020/08/27
V3Enhanced Deep Residual Networks for single-image super-resolution2022/04/072024/08/27
V3Zero-DCE for low-light image enhancement2021/09/182023/07/15
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Enhanced Deep Residual Networks for single-image super-resolution2022/04/072024/08/27
V3Low-light image enhancement using MIRNet2021/09/112023/07/15
V3Zero-DCE for low-light image enhancement2021/09/182023/07/15
V3Convolutional autoencoder for image denoising2021/03/012021/03/01
V3Image Super-Resolution using an Efficient Sub-Pixel CNN2020/07/282020/08/27

Data augmentation

StarterKeras VersionTitleDate CreatedLast Modified
V3CutMix data augmentation for image classification2021/06/082023/11/14
V3MixUp augmentation for image classification2021/03/062023/07/24
V3RandAugment for Image Classification for Improved Robustness2021/03/132023/12/12
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3RandAugment for Image Classification for Improved Robustness2021/03/132023/12/12
V3CutMix data augmentation for image classification2021/06/082023/11/14
V3MixUp augmentation for image classification2021/03/062023/07/24

Image & Text

StarterKeras VersionTitleDate CreatedLast Modified
V3Image Captioning2021/05/292021/10/31
V2Natural language image search with a Dual Encoder2021/01/302021/01/30
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Image Captioning2021/05/292021/10/31
V2Natural language image search with a Dual Encoder2021/01/302021/01/30

Vision models interpretability

StarterKeras VersionTitleDate CreatedLast Modified
V3Visualizing what convnets learn2020/05/292020/05/29
V3Model interpretability with Integrated Gradients2020/06/022020/06/02
V3Investigating Vision Transformer representations2022/04/122023/11/20
V3Grad-CAM class activation visualization2020/04/262021/03/07
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Investigating Vision Transformer representations2022/04/122023/11/20
V3Grad-CAM class activation visualization2020/04/262021/03/07
V3Model interpretability with Integrated Gradients2020/06/022020/06/02
V3Visualizing what convnets learn2020/05/292020/05/29

Image similarity search

StarterKeras VersionTitleDate CreatedLast Modified
V2Near-duplicate image search2021/09/102023/08/30
V3Semantic Image Clustering2021/02/282021/02/28
V3Image similarity estimation using a Siamese Network with a contrastive loss2021/05/062022/09/10
V3Image similarity estimation using a Siamese Network with a triplet loss2021/03/252021/03/25
V3Metric learning for image similarity search2020/06/052020/06/09
V2Metric learning for image similarity search using TensorFlow Similarity2021/09/302022/02/29
V3Self-supervised contrastive learning with NNCLR2021/09/132024/01/22
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Self-supervised contrastive learning with NNCLR2021/09/132024/01/22
V2Near-duplicate image search2021/09/102023/08/30
V3Image similarity estimation using a Siamese Network with a contrastive loss2021/05/062022/09/10
V2Metric learning for image similarity search using TensorFlow Similarity2021/09/302022/02/29
V3Image similarity estimation using a Siamese Network with a triplet loss2021/03/252021/03/25
V3Semantic Image Clustering2021/02/282021/02/28
V3Metric learning for image similarity search2020/06/052020/06/09

Video

StarterKeras VersionTitleDate CreatedLast Modified
V3Video Classification with a CNN-RNN Architecture2021/05/282023/12/08
V3Next-Frame Video Prediction with Convolutional LSTMs2021/06/022023/11/10
V3Video Classification with Transformers2021/08/062023/07/22
V3Video Vision Transformer2022/01/122024/01/15
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Video Vision Transformer2022/01/122024/01/15
V3Video Classification with a CNN-RNN Architecture2021/05/282023/12/08
V3Next-Frame Video Prediction with Convolutional LSTMs2021/06/022023/11/10
V3Video Classification with Transformers2021/08/062023/07/22

Performance recipes

StarterKeras VersionTitleDate CreatedLast Modified
V3Gradient Centralization for Better Training Performance2021/06/182023/07/25
V3Learning to tokenize in Vision Transformers2021/12/102023/08/14
V3Knowledge Distillation2020/09/012020/09/01
V3FixRes: Fixing train-test resolution discrepancy2021/10/082021/10/10
V3Class Attention Image Transformers with LayerScale2022/09/192022/11/21
V3Augmenting convnets with aggregated attention2022/01/222022/01/22
V3Learning to Resize in Computer Vision2021/04/302023/12/18
StarterKeras VersionTitleDate CreatedLast Modified ▼
V3Learning to Resize in Computer Vision2021/04/302023/12/18
V3Learning to tokenize in Vision Transformers2021/12/102023/08/14
V3Gradient Centralization for Better Training Performance2021/06/182023/07/25
V3Class Attention Image Transformers with LayerScale2022/09/192022/11/21
V3Augmenting convnets with aggregated attention2022/01/222022/01/22
V3FixRes: Fixing train-test resolution discrepancy2021/10/082021/10/10
V3Knowledge Distillation2020/09/012020/09/01

Other

StarterKeras VersionTitleDate CreatedLast Modified
V2Semi-supervision and domain adaptation with AdaMatch2021/06/192021/06/19
V2Barlow Twins for Contrastive SSL2021/11/042021/12/20
V2Consistency training with supervision2021/04/132021/04/19
V2Distilling Vision Transformers2022/04/052022/04/08
V2Focal Modulation: A replacement for Self-Attention2023/01/252023/02/15
V2Using the Forward-Forward Algorithm for Image Classification2023/01/082024/09/17
V2Masked image modeling with Autoencoders2021/12/202021/12/21
V2Segment Anything Model with 🤗Transformers2023/07/112023/07/11
V2Semantic segmentation with SegFormer and Hugging Face Transformers2023/01/252023/01/29
V2Self-supervised contrastive learning with SimSiam2021/03/192023/12/29
V2Supervised Contrastive Learning2020/11/302020/11/30
V2When Recurrence meets Transformers2023/03/122024/11/12
V2Efficient Object Detection with YOLOV8 and KerasCV2023/06/262023/06/26
StarterKeras VersionTitleDate CreatedLast Modified ▼
V2When Recurrence meets Transformers2023/03/122024/11/12
V2Using the Forward-Forward Algorithm for Image Classification2023/01/082024/09/17
V2Self-supervised contrastive learning with SimSiam2021/03/192023/12/29
V2Segment Anything Model with 🤗Transformers2023/07/112023/07/11
V2Efficient Object Detection with YOLOV8 and KerasCV2023/06/262023/06/26
V2Focal Modulation: A replacement for Self-Attention2023/01/252023/02/15
V2Semantic segmentation with SegFormer and Hugging Face Transformers2023/01/252023/01/29
V2Distilling Vision Transformers2022/04/052022/04/08
V2Masked image modeling with Autoencoders2021/12/202021/12/21
V2Barlow Twins for Contrastive SSL2021/11/042021/12/20
V2Semi-supervision and domain adaptation with AdaMatch2021/06/192021/06/19
V2Consistency training with supervision2021/04/132021/04/19
V2Supervised Contrastive Learning2020/11/302020/11/30