Skip to main content

Developing AI Vision Apps Using Microsoft Cognitive Services


Microsoft

About this course

Microsoft Cognitive Services is a set of cloud-based intelligence services and APIs for building richer, smarter, and more sophisticated applications. The Vision APIs available in Microsoft Cognitive Services offer many ready-to-use and easy-to-consume features that help you use Artificial Intelligence (AI) to solve your business problems. In this practical course, take an in-depth look at Vision APIs, work through hands-on exercises to learn how to piece them together, and find out how to put them to work in your organization.

Start with an overview of Microsoft Cognitive Services, and then take a look at the Computer Vision API, which gathers actionable information in images. Learn about optical character recognition (OCR) and image manipulation. Explore the Face API to detect, identify, analyze, organize, and tag photos and to create metadata around them. Get the details on the Emotion API, which helps personalize user experience with emotion recognition. And dig into the Custom Vision API, which enables you to customize computer vision models as required for your own specific use cases.

Leverage the latest best practices and Fluent Design principles, as you learn how to create Windows 10 Universal Windows Platform applications that can run on multiple devices, including desktops, tablets, phones, HoloLens, and Xbox consoles. With a prerequisite of proficiency in a C-based programming language like C, C#, C++, or Java, follow along with the instructor as you work through the labs to replicate and modify code in the examples.

Wrap up the course by creating an application that analyzes and tags images found in Twitter timelines for the presence of human faces and their recognized emotion, based on information returned from the Computer Vision, Face, Emotion, and Custom Vision APIs. The app can optionally even attempt to locate the source and physical location of people, places, or organizations referenced in the content. From a general overview to specific use cases and hands-on practice, this course gives you what you need to create AI apps with off-the-shelf features in Cognitive Services Vision APIs.

What You’lll Learn:

  • Identify, categorize, and tag images
  • Interpret images for characteristics like faces, mood, age, and scene
  • Perform Optical Character Recognition (OCR) within images
  • Intelligently manipulate photos
  • Build an image classifier app

Course Syllabus

  • Modue1: Computer Vision Overview
  • Module2: Computer Vision
  • Module 3: OCR and Image Manipulation
  • Module 4: Custom Vision
  • Module 5: Final Project

Prerequisites

Intermediate Computer Science coding skills in a C based language such as C, C#, C++, Java. Course will primarily use C#, knowledge of C# is recommended, but not a prerequisite.

Course Staff

Scott Peterson

Scott Peterson

Chief Solution Architect @ Global DX | Liquid Daffodil

Liquid Daffodil

Scott J. Peterson is the Chief Solution Architect & Developer behind Liquid Daffodil, a leading creator of corporate and consumer apps for multiple platforms. As a Microsoft Certified Solution Developer, Microsoft Certified Trainer, author, and presenter who has been developing Enterprise solutions for organizations around the world for 29 years, his topics extended across a broad spectrum, including the Universal Windows Platform, Microsoft Cognitive Services, Xamarin Forms, Microsoft Azure, and Microsoft Power BI.

Enroll