Skip to main content
Engineering

Is Parquet becoming the bottleneck? Why new storage formats are emerging in 2025 (Lance, Vortex, and more)

Parquet gave data lakes a common language: columnar layout, good compression, and fast scans. That still works well for classic analytics. But workloads have changed. We now mix wide scans with point lookups, handle embeddings and images, and run on S3-first stacks. On NVMe you want lots of tiny random reads. On S3 you want fewer, larger range requests. A format tuned for one world can feel chatty or slow in the other.

ZhiHanZ
Sep 15, 2025
Is Parquet becoming the bottleneck? Why new storage formats are emerging in 2025 (Lance, Vortex, and more)
401 RYLAND ST. STE 200-A, Reno, NV 89502, USA
SOC 2 Type IIGDPR
© 2025 Databend Cloud. All Rights Reserved.