79% of data science time is lost to finding, cleaning, and organizing data. In this talk we’ll explore emerging technologies in data virtualization that simplify data management and free data scientists to focus on analysis. Data virtualization is a family of techniques that make identical data available to multiple users across varied platforms. We’ll provide an overview of key technologies like Parquet, S3, Arrow, Spark, and data packages.
About our Speaker :
Aneesh Karve is co-founder and Chief Technology Officer at Quilt Data. He has worked as a product manager, lead designer, and software engineer at companies like Microsoft, NVIDIA, and Matterport. Aneesh was the general manager and founding member of AdJitsu, the first realtime 3D advertising platform for iOS, acquired by Amobee in 2012.
Aneesh is currently advancing an open source data compiler and package manager that brings the power of source code management to big data. Recently, Aneesh has been applying mathematics to detect and neutralize bias in artificial intelligence and visualization (Strata 2016).
Aneesh holds degrees in chemistry, mathematics, and computer science. His research background spans machine learning, abstract algebra, and information visualization.
Here is a presentation Aneesh did at Strata+Hadoop World.
About Galvanize :
Galvanize is the premiere dynamic learning community for technology. With campuses located in booming technology sectors throughout the country, Galvanize provides a community for each the following:
To learn more about Galvanize, visit galvanize.com.