My preferred way is to just use the DuckDB Java API. I haven't seen anything better in performance/efficiency. Also, a SQL query is often easier to write.
rickette
Congrats on putting this out there. There isn't a de facto pandas-like library in Java like you said. But for Kotlin there is: https://github.com/Kotlin/dataframe
skanga
What about Tablesaw or Apache Arrow? How does this compare ...
Nice!
I’m currently using manifold-sql with duckdb for this.
Always great to see efforts to make working with data frames easier. Here are some similar data frame libraries for Java:
https://github.com/jtablesaw/tablesaw
https://github.com/dflib/dflib