Aliens School
Cinematic Knowledge Experience
Skill Topic · Cinematic

🔗 Multimodal Models

Deep Learning Series #51: Vision + Language Together! CLIP, DALL-E, Flamingo! Images and text, one…

Overview

🔗 Multimodal Models: Quick Facts

Model: Modalities

CLIP: Image + Text

DALL-E: Text → Image

Flamingo: Image + Text

Topic 1

💡 What Is Multimodal Learning?

[Diagram: MULTIMODAL LEARNING …]
Topic 2

📊 Multimodal Models Comparison

| Model | Modalities | Task | Method | Key Feature |
…
Topic 3

💻 Complete Code

Image encoder → image embedding

Text encoder → text embedding

SAME space! Matching pairs CLOSE,

Row: for each image, correct text…
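
The points above (two encoders, one shared space, matching pairs close, each row picking its correct text) can be sketched as a CLIP-style contrastive loss. This is a minimal illustration in plain Python, not the slide's original code: the function names, the toy 2-D embeddings, and the fixed temperature of 0.07 are all assumptions made for the example, and the encoders themselves are skipped (the embeddings are given directly).

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def similarity_matrix(image_embs, text_embs, temperature=0.07):
    """Entry [i][j] is the scaled cosine similarity of image i and text j."""
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    return [[sum(a * b for a, b in zip(i, t)) / temperature for t in txts]
            for i in imgs]

def cross_entropy_rows(logits):
    """Mean cross-entropy where the correct class for row k is column k."""
    total = 0.0
    for k, row in enumerate(logits):
        m = max(row)  # subtract the max for numerical stability
        log_sum_exp = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_sum_exp - row[k]
    return total / len(logits)

def clip_style_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric loss: rows (image -> text) and columns (text -> image)."""
    logits = similarity_matrix(image_embs, text_embs, temperature)
    transposed = [list(col) for col in zip(*logits)]
    loss_images = cross_entropy_rows(logits)      # each image picks its text
    loss_texts = cross_entropy_rows(transposed)   # each text picks its image
    return (loss_images + loss_texts) / 2

# Toy batch of B = 2 pairs (hypothetical embeddings, already in the shared space).
images = [[1.0, 0.0], [0.0, 1.0]]
texts_matched = [[0.9, 0.1], [0.1, 0.9]]   # text k is close to image k
texts_swapped = [[0.1, 0.9], [0.9, 0.1]]   # pairs deliberately mismatched

aligned = clip_style_loss(images, texts_matched)
misaligned = clip_style_loss(images, texts_swapped)
```

With these toy inputs the loss for the matched batch should come out much smaller than for the swapped batch, which is the behaviour the "matching pairs close" idea asks for. In a real model the temperature is usually learned rather than fixed.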

Topic 4

❓ Quiz

a) MSE loss

b) In a batch, B images + B texts…

c) Binary classification

a) Small model

Comparison

📊 Multimodal Models Comparison

CLIP: Image + Text

DALL-E: Text → Image

Flamingo: Image + Text

Quick Quiz
🧠 QUIZ TIME

Quiz: Question 1

🔗 What is the most accurate definition of Multimodal Models?

Quiz: Question 2

🔗 In Multimodal Models, what is 'CLIP'?

Complete! 🎉

🔗 Multimodal Models Complete!

Aliens School · HIEN · Cinematic Knowledge
