File Name: | Complete Computer Vision Bootcamp: YOLO to Multimodal AI |
Content Source: | https://www.udemy.com/course/complete-computer-vision-bootcamp-yolo-to-multimodal-ai |
Genre / Category: | AI Courses |
File Size: | 10.4 GB |
Publisher: | Muhammad Moin |
Updated and Published: | October 1, 2025 |
This course takes you from the basics of YOLO11 to advanced computer vision applications. You’ll explore object detection, segmentation, pose estimation, and image classification, while also learning to create analytical graphs and track object movements. Beyond YOLO11, you’ll build real-world projects with Streamlit, enhance detection with SAHI, estimate distances with Depth Pro, and explore cutting-edge multimodal AI models like Qwen2.5-VL, Florence 2, and Google Gemini 2.5. By the end, you’ll have hands-on experience with modern tools to solve practical computer vision challenges.
What You Will Learn:
- Getting Started with YOLO11: YOLO11 Updates and New Features
- Implementing YOLO11 in Google Colab: YOLO11 for Object Detection, Segmentation, Pose Estimation & Classification
- Creating Analytical Graphs and Visualizing Data with YOLO11: How to Generate Analytical Graphs with YOLO11
- Counting Object Entries and Exits using YOLO11 and DeepSORT: Tracking Objects with YOLO11 and DeepSORT for Entry–Exit Counts
- Streamlit Application: Object Detection, Segmentation & Pose Estimation: Building a Streamlit App for Object Detection, Segmentation, and Pose Estimation
- Using Ultralytics YOLO11 with SAHI for Object Detection in Drone Footage: YOLO11 + SAHI = Better Detection for Small Objects! (Step-by-Step Guide)
- Estimate Real Distance to Objects with ML Depth Pro and YOLO11: How to Estimate Real Distances to Objects Using Depth Pro and YOLO11
- Performing Zero-Shot Object Detection with Qwen2.5-VL: Zero-Shot Object Detection Using Qwen2.5-VL
- Run Vision Tasks: Object Detection, Image Captioning & OCR with Florence 2: How to Use Florence 2 for Object Detection, Image Captioning and OCR
- Google Gemini 2.5 Pro: Detect Objects, Generate Captions & OCR: How to Do Object Detection, Image Captioning, Reasoning and OCR with Gemini 2.5
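As a rough illustration of the entry and exit counting topic in the outline above, here is a minimal sketch of line-crossing logic. It is not taken from the course: it assumes a tracker has already assigned a stable ID and a centroid to each object in every frame, and the function name `count_crossings`, the `frames` structure, and the convention that downward crossings are entries are all hypothetical choices made for this example.

```python
def count_crossings(frames, line_y):
    """Count tracked objects crossing a horizontal line at y = line_y.

    frames: iterable of dicts mapping track_id -> (cx, cy) centroid for one frame.
    Assumption for this sketch: moving downward across the line is an entry,
    moving upward across it is an exit.
    Returns a tuple (entries, exits).
    """
    last_y = {}  # most recent centroid y seen for each track ID
    entries = 0
    exits = 0
    for detections in frames:
        for track_id, (_, cy) in detections.items():
            prev = last_y.get(track_id)
            if prev is not None:
                if prev < line_y <= cy:
                    entries += 1  # was above the line, now on or below it
                elif prev >= line_y > cy:
                    exits += 1  # was on or below the line, now above it
            last_y[track_id] = cy
    return entries, exits


# Hypothetical tracker output: track 1 moves down, track 2 moves up.
frames = [
    {1: (50, 90), 2: (200, 130)},
    {1: (52, 98), 2: (198, 110)},
    {1: (55, 105), 2: (197, 95)},
    {1: (57, 120)},
]
print(count_crossings(frames, line_y=100))
```

In a real pipeline the per-frame dictionaries would come from a detector paired with a tracker such as DeepSORT; the counting step itself only needs the track IDs and centroids.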
Who this course is for:
- Anyone interested in computer vision.
- Students and researchers exploring AI and vision-language models.
- Anyone excited about building AI-powered applications.