Re: Digest for json-stat@googlegroups.com - 1 update in 1 topic

19 views

Skip to first unread message

Jan Bruusgaard

unread,

Apr 16, 2026, 3:56:48 AMApr 16

to json...@googlegroups.com

I do not think this is a direct answer to your question, but PxWebApi v2 use JSON-stat v2. I have been working on a generic Claude skill for PxWebApi. I just uploaded a Beta. Maybe you will find some inspiration here: https://github.com/janbrus/ssb-api-v2-examples/tree/main/pxwebapi-v2-generic-skill . It is based on the skill for Statistics Norway's PxWebApi v2 https://github.com/janbrus/ssb-api-v2-examples/tree/main/ssb-pxwebapi-v2-skill

Jan Bruusgaard
retired Statistics Norway

tor. 16. apr. 2026 kl. 08:15 skrev <json...@googlegroups.com>:

json...@googlegroups.com Google Groups

Topic digest
View all topics

JSON-Stat API and Model Context Protocol (MCP) for AI. - 1 Update

JSON-Stat API and Model Context Protocol (MCP) for AI.

Sergio Nieto <sergio....@gmail.com>: Apr 15 10:07AM -0700

What modifications do I need to make to my JSON-Stat–based API to add
metadata and build a Model Context Protocol (MCP) on top of my API?”

Back to top

You received this digest because you're subscribed to updates for this group. You can change your settings on the group membership page.

To unsubscribe from this group and stop receiving emails from it send an email to json-stat+...@googlegroups.com.

Jan Bruusgaard

unread,

Apr 25, 2026, 3:41:07 AMApr 25

to json...@googlegroups.com

I saw that row-major order should have been described better. I update api-details.md with this, that Claude proposed:

**`value`** — Flat array with all data values, stored in **row-major order** (last dimension in `id` varies fastest, first varies slowest — same convention as C/NumPy). Index is computed from `id`, `size`, and `dimension.{var}.category.index`: for `size = [s₀, s₁, …, sₙ]` and category indices `(i₀, i₁, …, iₙ)`, flat index = `i₀·(s₁·s₂·…·sₙ) + i₁·(s₂·…·sₙ) + … + iₙ`.

tir. 21. apr. 2026 kl. 08:15 skrev <json...@googlegroups.com>:

json...@googlegroups.com Google Groups

Topic digest
View all topics

JSON-Stat API and Model Context Protocol (MCP) for AI. - 1 Update

JSON-Stat API and Model Context Protocol (MCP) for AI.

Xavier Badosa <xba...@gmail.com>: Apr 20 01:46PM -0700

Segio,

Please check Jan's reply:

I do not think this is a direct answer to your question, but PxWebApi v2
use JSON-stat v2. I have been working on a generic Claude skill for
PxWebApi. I just uploaded a Beta. Maybe you will find some inspiration
here:
https://github.com/janbrus/ssb-api-v2-examples/tree/main/pxwebapi-v2-generic-skill .
It is based on the skill for Statistics Norway's PxWebApi v2
https://github.com/janbrus/ssb-api-v2-examples/tree/main/ssb-pxwebapi-v2-skill

Thank, Jan. That's very interesting. I will read it carefully. I think
Statistics Finland has done similar tests (building an MCP based on their
JSON-stat API).

What modifications do I need to make to my JSON-Stat–based API to add
metadata and build a Model Context Protocol (MCP) on top of my API?

Sergio, the richer your API, the more powerful your MCP will
be. Consequently, it is preferable for the API to support optional
JSON-stat features (such as dimension roles), rather than omitting them.
Furthermore, leveraging the built-in extension mechanism provided by the
JSON-stat specification may be advantageous.

While I have not yet implemented the Model Context Protocol (MCP)
specifically, I have developed a data retrieval system enhanced by
tool-calling capabilities—an architecture that requires similar
service-oriented infrastructures. Specifically, I have built a
conversational search assistant powered by a Small Language Model (SLM),
such as Gemma 4. Despite its "reduced" parameter count, Gemma 4 provides
multilingual support and good native function-calling capabilities, which
are needed to interact with the JSON-stat API.

The model accesses lists of statistics and datasets through JSON-stat
'collection' responses from the API, which it has no problem processing.
Additionally, the API uses JSON-stat 'dataset' responses to convey metadata
("empty datasets" or datasets without observation values). Again, the model
effectively parses these metadata structures without any help, enabling it
to independently retrieve information such as update timestamps, the latest
available reference periods, or the age groups (or any other
classification) used.

What about data? In the past, ensuring accurate model interpretation of
JSON-stat data serialization required explicit prompting regarding its
'row-major order' storage convention. Providing this context was essential
for the model to correctly reconstruct the multidimensional data cube from
the JSON-stat flattened array. This is less and less true and more and more
unnecessary. Take for example, Claude. Using Sonnet 4.6 (no need for the
powerful Opus), Claude is able to answer this question:

*Help me analyze the data from this JSON-stat end-point:
https://api.idescat.cat/taules/v2/censph/539/5976/cat/data?LAST=5&lang=en
<https://api.idescat.cat/taules/v2/censph/539/5976/cat/data?LAST=5&lang=en>*

Claude fetches the JSON-stat dataset on its own and, without the need for
explicit architectural hints, natively interprets the JSON-stat structure
and builds several data visualizations to help the user understand the
data. In addition to visual outputs, the model also generates an automated
textual analysis of the data. In case you are interested, here is the
shared chat session with Claude (please note that authentication may be
required to access the interactive visuals):

https://claude.ai/share/bbc3c56d-8b01-4a7c-9333-58fcbde0db7b

I also have successfully dealt with JSON-stat using Claude Artifacts, like
converting a JSON-stat source file into CSV.

Xavier

On Wednesday, April 15, 2026 at 7:07:23 PM UTC+2 sergio....@gmail.com wrote:

What modifications do I need to make to my JSON-Stat–based API to add
metadata and build a Model Context Protocol (MCP) on top of my API?”

Back to top

Xavier Badosa

unread,

May 4, 2026, 7:18:49 AMMay 4

to json-stat

Jan,

I'm developing a conversational search engine based on Small Language Models like Gemma 4 (=with a limited knowledge). Your work and SSB's is so useful! I'm shamelessly stealing most of the contents of section on the key elements of the dataset structure in json-stat2.md. I can't promise I won't be stealing more stuff... Thank you!