Data Science At Scale Everywhere for Everyone
Doris Lee
ACM Distinguished Speaker; CEO and co-founder of Ponder
Stevenson Hall 1300
12:00 PM - 12:50 PM
Over the past decade, the democratization of data science tooling, particularly through Python libraries like pandas and NumPy, has empowered practitioners of all levels to work with data efficiently. Yet despite their popularity, these tools present challenges as practitioners look to scale their workflows to production. In this talk, I will explore the limitations of these tools and the pain points that data scientists encounter when dealing with data at scale. I will then share how we are addressing these problems at Ponder, with both our open-source project Modin and our groundbreaking technology that lets anyone run their Python data workflows directly in their databases.