Channel: Google Cloud Tech
Category: Science & Technology
Tags: how to process dataptransformcloud consoletype: devbyte (deck from scratch 2-10min)google cloud dataflowbeam data processingapache data processinggds: yesdataflow runnerpcollectiongetting started with apache beamgoogle cloud projectpardopr_pr: google cloudhow to use python with beambatch data pipelineshow to transform datapython codingapache beam
Description: WordCount Examples → goo.gle/3J7jpsf Google Cloud Dataflow → goo.gle/35KiTmE Beam College → goo.gle/3NdzHDm Welcome back to Getting Started with Apache Beam! In this episode, Debi Cabrera demonstrates how to process and transform data using Apache Beam with Python and Google Cloud Dataflow as the runner. Watch to see how you can use Apache Beam to count the words from Shakespeare’s King Lear as a batch data job and then try it out for yourself! Chapters: 0:00 - Intro 0:40 - In this episode 1:06 - The pipeline 1:31 - The input file 1:46 - Direct runner 2:17 - Dataflow runner 2:57 - The pipeline code 4:07 - Dataflow in the Cloud Console 4:45 - The output file 5:15 - Wrap up Watch more episodes of Getting Started with Apache Beam → goo.gle/ApacheBeam Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech product: Cloud - General; fullname: Mark Mirchandani, Debi Cabrera;