That would be my first step. Run a streaming job with option -inputformat whatever.whatever.AccumuloInputFormat and a trivial map e.g. -mapper cat and see what is in the output. If that's something you can parse from R, then you have to write an rmr input format, unless one of the built-ins, typically csv with some options, does the job. Maybe the mapping of the accumulo data model to R data using this approach won't be good enough, then you have to write your own java class, but that's only plan B. Another thing you can do in a second phase, for efficiency, is to use streaming binary representation, it is explained
here. Then in R you would use the typedbytes input format. To recap
data -> java input format -> text or typedbytes -> rmr input format -> map function