Channel: PyData
Category: Science & Technology
Tags: pythonlearn to codeeducationsoftwarepydatalearncodinghow to programjuliaopensourcescientific programmingnumfocuspython 3tutorial
Description: Presenter: Sergio Ferragut Title: Some Like it Hot: Choosing a System for Large-Scale Data Analysis Presentation Overview: Apache Druid is a modern cloud-native, stream-native, analytics database designed for workflows where fast queries and instant ingest are important. Druid excels at instant data visibility, ad-hoc queries, operational analytics, and handling high concurrency. It is a strong candidate for being the workhorse system for hot analytics. There is no shortage of systems that claim to help with the analysis of large amounts of data. Under the hood, today's popular systems have a variety of interesting and unique architectures. In this talk, we'll reflect on why you can never seem to find that single perfect system, and how to evaluate the capabilities of various systems through the prism of a temperature-based spectrum of use cases, from cold to hot analytics. Bio: Sergio Ferragut is a database veteran turned Technical Evangelist at Imply. His experience includes 16 years at Teradata in professional services and engineering roles. He has direct experience in building analytic applications spanning the retail, supply chain, pricing optimization, and IoT spaces. Sergio has worked at multiple technology start-ups including APL and Splice Machine where he helped guide product design and field messaging. Presenter: Dhruv Sakalley Title: Interplay of Data and AI, with the current state of Web3 Presentation Overview: As we move forward with the blockchain revolution, and as companies like facebook summon the metaverse, it is only natural for curious souls like us to explore where and how Data and AI fit into this upcoming landscape. We are very early in this journey and I hope to share a few ideas that I would love to see develop into legitimate use cases. A few things we may cover in the talk are (not limited to): Hybrid Smart Contracts, Data Assets as NFTs and ownership, and monetization of APIs on the blockchain. Bio: Dhruv is Machine Learning Team Lead at Sensibill. Sensibill turns everyday spend into long-term financial wellness with actionable, purchase-level data. Also, Dhruv is one of the PyData Triangle organizers pydata.org PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome! 00:10 Help us add time stamps or captions to this video! See the description for details. Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: github.com/numfocus/YouTubeVideoTimestamps