Hi Steven,
the syntax you mentioned for FETCH is the one to use to retrieve the latest datapoint of a set of series.
Depending on the number of series and their spread across the key space, the storage layer may have to open quite a number of files thus possibly increasing the retrieval time. There are some caching mechanisms for both the standalone (LevelDB) and distributed (HBase) versions but they cannot do magic. One way to improve retrieval time when fetching from many series is to enable parallel scanners in the configuration.
If that does not lead to sufficient performance, maybe an in-memory instance of Warp 10 could act as a fast cache in front of your current set up. The datalog replication mechanism could be set up so updates done to your cache are forwarded to your current Warp 10 instance. An in-memory instance keeps chunks of series of a given span, so not exactly what you want if you are just interested in the most recent datapoint for each regardless of when that was.
Tell us more, maybe we could extend the existing in-memory version to only retain the most recent datapoint. We would also love to learn more how you use Warp 10, could our marketing manager contact you directly?