Fast Data Processing with Apache Arrow

Presented by

Andrei Ionescu, Senior Software Engineer, Adobe

About this talk

With Rust, Apache Arrow, and table formats, data can be processed efficiently, closer to the hardware, and without garbage-collection pauses. This video explains the pros and cons of Apache Arrow for data processing and compares its performance with Apache Spark, the "standard" for distributed processing of big data. We will discuss the advantages of the Rust language, including Rust Arrow and the tools available, the missing pieces, and performance comparisons.