Xiangpeng’s blog
Database/storage systems
Build your own S3-Select in 400 lines of Rust
DataFusion is ALL YOU NEED
Mar 23, 2025
Efficient Filter Pushdown in Parquet
How to implement efficient filter pushdown in Parquet readers and why it’s challenging in practice.
Mar 12, 2025
Where are we now, system researchers?
What I cannot create, I do not understand.
Mar 9, 2025
Online Parquet Viewer with Rust
No JS, no server, just Rust
Nov 12, 2024
Xiangpeng Hao
Caching in DataFusion
Don’t read twice.
Oct 27, 2024
Xiangpeng Hao
Parquet pruning in DataFusion
Read no more than you need
Oct 24, 2024
Xiangpeng Hao
Use StringView to make DataFusion faster
Editor note: This blog post is adapted from the InfluxData blog post series, make sure to also check out the original posts 1 2.
Sep 13, 2024
Xiangpeng Hao, Andrew Lamb
My research statement
Research should connect to real people.
Aug 21, 2024
InfluxData > Google and Microsoft
I have interned at Google, Microsoft (Research), and InfluxData. The experience at InfluxData has been the most enjoyable, and this blog post explains why.
Aug 18, 2024
Three types of good academic advisors
Innovators, educators, and entrepreneurs.
May 7, 2024
What happens when you type a SQL in the database
A database can be complex; it involves almost all aspects (research communities) of computer science: PL (programming language), SE (software engineering), OS (operating…
Apr 26, 2024
No matching items