You received this message because you are subscribed to the Google Groups "Numerical Elixir (Nx)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elixir-nx+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elixir-nx/d48af207-3827-46f3-9a1a-a2436e8de9b0n%40googlegroups.com.
I can vouch for the opinion that Pandas is a kludge (though
fast), and that inspiration from R is the way to go (vectorised
from the ground up). Indeed Wes McKinney himself is not happy with
the Pandas API which is why he has rebuilt completely up in the
form of Apache Arrow, a cross language format which is gaining a
huge amount of traction including in the R universe (the first
implementation was the Wickham/McKinney feather
format joint venture).
That said, I would caution against following the Tidyverse model too closely as the base level abstraction. It's very opinionated and loved by same, disliked by others. Personally I think an industry standard in-memory format such as Apache Arrow is the correct (lower) level at which to focus at first, and people can build their own higher level abstractions once this is in place, especially given that Elixir has such wonderful metaprogramming capabilities.
I'm currently investigating how easy it is to parse the Arrow format natively in Erlang.
To view this discussion on the web visit https://groups.google.com/d/msgid/elixir-nx/CAGnRm4J6BSzsOGk6kLwpaHvnDEKvB-VpueXHQ6mYVWKQqJdC0Q%40mail.gmail.com.