News & Updates
Bringing DuckLake to Apache DataFusion
We've just released a new version of DuckLake + DataFusion and have donated it to the datafusion-contrib repository. We’re delighted to bring a new lakehouse catalog format to the…
How Hotdata accelerates Neon and Supabase
Agents rarely connect to a single database. Transactional databases like Postgres generally store core application data, but agents often need to query multiple systems and…
Building Vector Search Into a SQL Query Engine
We added native ANN search to our DataFusion-based engine using USearch, Parquet, and SQLite — with adaptive filtering and got impressive results.
Why Agents Need Version Control for Data
In the early days, developers could keep knowledge of the systems they worked on in their heads. You changed a file, ran a test, and moved on. That worked when codebases were…
Agents Are Only as Good as the Data They Can JOIN
Now that everyone is building agents on diverse data sources and agents have better reasoning tools, how do you quickly assemble data from many different sources, in the right…
Query engines for Agents
We’re building a query engine, so I’ve been thinking about the difference between query engines built for humans and those specially made for agents. Regardless of the…
Why we are building Hotdata
Over the past decade, data infrastructure prioritized throughput and scale over latency and interactivity, cementing large distributed warehouses as the default.