Aliens School
Cinematic Knowledge Experience
Aliens School · HIEN
Skill Topic · Cinematic

👁️ Vision Transformers (ViT)

Deep Learning Series #50: Transformers for Images! ViT, DeiT, Swin! Drop the CNN, make patches,…

Overview
🌟

👁️ Vision Transformers (ViT): Quick Facts

📌 Feature: CNN (ResNet)

🎯 Input: Full image

⚡ Receptive field: Local → global

🔑 Inductive bias: Translation equivariance

Topic 1
📥 ⚙️ 🔬 💡

💡 What Is a Vision Transformer?

📚
┌────────────────────────────────────────────────────────┐
│ VISION TRANSFORMER (ViT) │…
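
The "make patches" idea from the series title can be sketched in a few lines. This is a minimal illustration and not the lesson's own code: the function name `patchify` is hypothetical, and the 224 x 224 RGB input with 16 x 16 patches is an assumption chosen because it matches the common ViT-Base setup. Plain Python lists are used so the sketch runs without any library.

```python
def patchify(image, patch_size):
    """Split an H x W x C image (nested lists) into flattened, non-overlapping patches.

    Returns a list of patches in row-major order; each patch is a flat list
    of patch_size * patch_size * C values.
    """
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image size must be divisible by patch_size")

    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            patch = []
            for row in range(top, top + patch_size):
                for col in range(left, left + patch_size):
                    patch.extend(image[row][col])  # append the C channel values
            patches.append(patch)
    return patches


# Hypothetical input: a 224 x 224 RGB image filled with zeros.
image = [[[0.0, 0.0, 0.0] for _ in range(224)] for _ in range(224)]
tokens = patchify(image, patch_size=16)
print(len(tokens), len(tokens[0]))  # 196 768
```

The image becomes a sequence of 196 tokens, each with 768 values. In a full ViT, each flattened patch would then pass through a learned linear projection and receive a position embedding before entering the Transformer encoder; those steps are omitted here.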
Topic 2
✨

📊 ViT vs CNN vs Swin

💡 | Feature | CNN (ResNet) | ViT | Swin Transformer |…
Topic 3
✨

💻 Complete Code

💡 self.scale

🔑 (2*window_size - 1),

⚡ coords_flat[:, None, :]

🎯 self.scale
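
The fragments `(2*window_size - 1)` and `coords_flat[:, None, :]` look like pieces of a Swin-style relative position index, though the full code is not shown here. As a hedged sketch of that idea, the function below computes, for every pair of positions in a square window, an index into a bias table assumed to hold (2*window_size - 1)^2 entries. The name `relative_position_index` is hypothetical, and plain loops stand in for the tensor broadcasting that the `[:, None, :]` fragment suggests.

```python
def relative_position_index(window_size):
    """Map each (query, key) position pair in a square window to a bias-table index.

    The table is assumed to hold (2*window_size - 1) ** 2 entries, one per
    possible (row offset, column offset) between two positions.
    """
    # Flattened (row, col) coordinates of every position in the window.
    coords_flat = [(r, c) for r in range(window_size) for c in range(window_size)]
    span = 2 * window_size - 1  # number of distinct offsets along one axis

    index = []
    for qr, qc in coords_flat:
        row = []
        for kr, kc in coords_flat:
            # Shift offsets from [-(w-1), w-1] to [0, 2w-2] so they are non-negative.
            d_row = qr - kr + window_size - 1
            d_col = qc - kc + window_size - 1
            row.append(d_row * span + d_col)
        index.append(row)
    return index


index = relative_position_index(window_size=2)
for row in index:
    print(row)
```

In a full window-attention layer, these indices would select learned bias values that are added to the attention scores before the softmax.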

Topic 4
⭐

❓ Quiz

💡 a) Saves memory

🔑 b) Transformer SEQUENCE process…

⚡ c) Better accuracy

🎯 a) Bigger model


Quick Quiz
🧠 QUIZ TIME

Quiz: Question 1

👁️ What is the most accurate definition of Vision Transformers (ViT)?

Quiz: Question 2

👁️ What is the 'Input' of Vision Transformers (ViT)?

Complete! 🎉

👁️ Vision Transformers (ViT) Complete!
