--
You received this message because you are subscribed to the Google Groups "Numerical Elixir (Nx)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elixir-nx+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elixir-nx/d48af207-3827-46f3-9a1a-a2436e8de9b0n%40googlegroups.com.
I can vouch for the opinion that Pandas is a kludge (though
fast), and that inspiration from R is the way to go (vectorised
from the ground up). Indeed Wes McKinney himself is not happy with
the Pandas API which is why he has rebuilt completely up in the
form of Apache Arrow, a cross language format which is gaining a
huge amount of traction including in the R universe (the first
implementation was the Wickham/McKinney feather
format joint venture).
That said, I would caution against following the Tidyverse model
too closely as the base level abstraction. It's very opinionated
and loved by same, disliked by others. Personally I think an
industry standard in-memory format such as Apache Arrow is the
correct (lower) level at which to focus at first, and people can
build their own higher level abstractions once this is in place,
especially given that Elixir has such wonderful metaprogramming
capabilities.
I'm currently investigating how easy it is to parse the Arrow
format natively in Erlang.
To view this discussion on the web visit https://groups.google.com/d/msgid/elixir-nx/CAGnRm4J6BSzsOGk6kLwpaHvnDEKvB-VpueXHQ6mYVWKQqJdC0Q%40mail.gmail.com.