Y

YouLibs

Remove Touch Overlay

The quest for high-quality data, Ihab Ilyas (University of Waterloo)

Duration: 02:38Views: 465Likes: 5Date Created: Dec, 2019

Channel: O'Reilly

Category: Science & Technology

Tags: o'reilly media (publisher)o'reillyoreilly mediaoreilly

Description: “AI starts with good data” is a statement that receives wide agreement from data scientists, analysts, and business owners. There has been a significant increase in our ability to build complex AI models for prediction, classification, and various analytics tasks, and there’s an abundance of (fairly easy to use) tools that allow data scientists and analysts to provision complex models within days. However, the lack of data or data-quality issues remains the main bottleneck holding back further adoption of AI technologies. Even with advances in building robust models, the reality is that noisy data and incomplete data remain the biggest hurdles to effective end-to-end solutions. Multiple studies prove that cleaning data is a much more effective investment than enhancing learning robustness. Ihab Ilyas highlights this data quality problem and describes the HoloClean framework, a state-of-the-art prediction engine for structured data with direct applications in detecting and repairing data errors, as well as imputing missing labels and values. The framework uses techniques such as data augmentation and self-supervised learning to build models that describe how data is generated and how errors and anomalies are introduced. Subscribe to O'Reilly on YouTube: goo.gl/n3QSYi Follow O'Reilly on: Twitter: twitter.com/oreillymedia Facebook: facebook.com/OReilly Instagram: instagram.com/oreillymedia LinkedIn: linkedin.com/company-beta/8459

Swipe Gestures On Overlay