All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
1:03:33
Oscar: Object-Semantics Aligned Pre-training for Vision-Language T
…
May 4, 2020
Microsoft
1:20
Reinforced Cross-Modal Matching and Self-Supervised Imitation Lear
…
Nov 27, 2018
Microsoft
0:50
Vison-language pretraining is pushing AI forward in novel objec
…
169K views
Jan 30, 2021
Facebook
Microsoft Research
0:12
In vision-and-language pretraining (VLP), objects can be used as anc
…
22.5K views
May 15, 2020
Facebook
Microsoft Research
24:50
Research talk: Large-scale, self-supervised pretraining: From lang
…
Nov 16, 2021
Microsoft
13:41
2601.21420 - ConceptMoE: Adaptive Token-to-Concept Compression fo
…
1 month ago
YouTube
AI Paper Cast
1:27:56
NICE Session 80: ICCV 2025 Paper Sharing Session 2
50 views
6 months ago
YouTube
NLP Academic Exchange Platform
4:39
VLAs: Resilience to Catastrophic Forgetting
24 views
1 month ago
YouTube
AI Research Roundup
3:03
Pretraining a Unified PDDL Domain from Real-World Videos!
2 views
1 week ago
YouTube
Panpan CAI
10:10
Agent security bypasses in practice & Governance gaps for enterprise
…
1 week ago
YouTube
The Automated Daily
0:14
TII Releases Falcon Perception for Vision and Language Tasks
26 views
4 weeks ago
YouTube
The AI Opus
0:45
ICL CHARACTERIZATION OF MULTI-MODAL GEO-FOUNDATIO
…
1 month ago
YouTube
Dr. Mosab Hawarey
6:54
Beyond Language Modeling: Multimodal Pretraining & Transfus
…
42 views
1 month ago
YouTube
SciPulse
4:48
TIPSv2: Precise Image Patch to Text Alignment
42 views
2 weeks ago
YouTube
AI Research Roundup
0:57
Top Vision-Language-Action Models | RT-2, Octo, OpenVLA, SmolVLA
130 views
1 month ago
YouTube
Notes from my Life
20:53
What do Language Models Learn and When? The Implicit Curriculu
…
52 views
2 weeks ago
YouTube
AI Paper Slop
0:32
Train robot arms once.Deploy them on drones.That’s the promise here
…
10.9K views
1 month ago
x.com
Ilir Aliu
0:42
𝗙𝗶𝗻𝗲-𝘁𝘂𝗻𝗲 𝗮 𝗳𝗼𝘂𝗻𝗱𝗮𝘁𝗶𝗼𝗻 𝗽𝗼𝗹𝗶𝗰𝘆 𝗳𝗼𝗿 𝘁𝗵𝗿𝗲𝗲 𝗺𝗼𝗻𝘁𝗵𝘀. 𝗠𝗼𝘃𝗲 𝘁𝗵𝗲 𝗯𝗶𝗻 𝘁𝘄𝗼 𝗶𝗻𝗰𝗵𝗲𝘀. 𝗪𝗮𝘁𝗰𝗵 𝗶𝘁 𝗳𝗮𝗶𝗹. 𝗧𝗵𝗮𝘁 𝗹𝗼𝗼𝗽 𝗶𝘀 𝘁𝗵𝗲 𝗿𝗲𝗮𝘀𝗼𝗻 𝗺𝗼𝘀𝘁 𝗽𝗿𝗲𝘁𝗿𝗮𝗶𝗻𝗲𝗱 𝗿𝗼𝗯𝗼𝘁𝗶𝗰𝘀 𝗽𝗼𝗹𝗶𝗰𝗶𝗲𝘀 𝗻𝗲𝘃𝗲𝗿 𝗹𝗲𝗮𝘃𝗲 𝘁𝗵𝗲 𝗹𝗮𝗯, 𝗮𝗻𝗱 𝗮 𝗻𝗲𝘄 𝗽𝗮𝗽𝗲𝗿 𝗽𝗿𝗼𝗽𝗼𝘀𝗲𝘀 𝗮 𝘄𝗮𝘆 𝘁𝗼 𝗯𝗿𝗲𝗮𝗸 𝗶RT-1 learned from 130,000 demonstrati
…
8.4K views
6 days ago
x.com
Stephen James
48:07
OpenAI CLIP: ConnectingText and Images (Paper Explained)
173.7K views
Jan 12, 2021
YouTube
Yannic Kilcher
15:08
Contrastive Language-Image Pretraining (CLIP)
753 views
Apr 10, 2025
YouTube
Antonio Rueda-Toicen
13:44
Vision Transformers explained
70.8K views
Jul 1, 2023
YouTube
Code With Aarohi
1:13:22
Contrastive Language-Image Pre-training (CLIP)
12.5K views
Apr 27, 2022
YouTube
Samuel Albanie
6:05
What is LLM Distillation ?
33.3K views
Feb 2, 2025
YouTube
New Machina
2:29:42
逐篇解析机器人基座模型和VLA经典论文(含投屏版)——“人就是最智
…
4.1K views
Apr 7, 2025
YouTube
Zhang Xiaojun Podcast
2:03
Qwen3-VL is here!
782.9K views
7 months ago
YouTube
Qwen
1:02:41
Python + AI: Vision models
3.4K views
6 months ago
YouTube
Microsoft Reactor
39:51
Multimodal Machine Learning | Introduction | Part 1 | CVPR 2022 T
…
40.9K views
Aug 9, 2022
YouTube
Artificial Intelligence
37:00
Introduction to Vision Language Models (VLM)
14.2K views
5 months ago
YouTube
Vizuara
8:46
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
196 views
7 months ago
YouTube
Xiaol.x
33:33
OpenAI CLIP Explained | Multi-modal ML
27K views
Sep 15, 2022
YouTube
James Briggs
See more videos
More like this
Feedback