Looks good, Brent!
I’ll add some thoughts…
You said:
Find realistic sample PostgreSQL databases
I would imagine it being important to do this work early in the project, maybe first. But it’s a challenging task!
I spent some time pondering and researching this today. Here are some takeaway thoughts:
The Sample Databases page on the PostgreSQL wiki seems to have some useful leads that would be worth chasing down in more detail.
Finding open-source self-hosted applications that use Postgres would be helpful because they could provide a sample database with some interesting structure.
Finding real data sets distributed openly would be helpful because they could provide a sample database with some interesting content.
Ideally we’d find examples with both interesting structure and interesting content, but this seems a bit tricky.
Applications would be especially useful if their installation steps included an option for loading sample data.
We could comb through the awesome-selfhosted list, but it’s enormous and hard to filter for projects that use Postgres. I wonder if there might be similar resources that would get us further toward the sort of filtering we’d need to do to avoid tons of manual research.
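As a rough idea of what a first-pass filter could look like, here is a minimal Python sketch that scans a Markdown list for bullet entries mentioning Postgres. It rests on a big assumption: that an entry's description actually names its database, which may well not be true for most projects on a list like awesome-selfhosted. The function name, the keyword pattern, and the sample entries are all made up for illustration.

```python
import re

# Hypothetical keyword pattern; real list entries may not mention their database at all.
POSTGRES_PATTERN = re.compile(r"postgres(ql)?", re.IGNORECASE)


def find_postgres_entries(markdown_text):
    """Return the bullet lines in a Markdown document that mention Postgres."""
    matches = []
    for line in markdown_text.splitlines():
        stripped = line.strip()
        # Only consider list items, which start with "-" or "*".
        if stripped.startswith(("-", "*")) and POSTGRES_PATTERN.search(stripped):
            matches.append(stripped)
    return matches


# Made-up entries in the general shape of an "awesome" list.
sample = """\
- [AppOne](https://example.com/one) - A wiki backed by PostgreSQL. `MIT` `Python`
- [AppTwo](https://example.com/two) - A photo gallery. `GPL-3.0` `PHP`
* [AppThree](https://example.com/three) - Task tracker using postgres or MySQL. `MIT` `Go`
"""

for entry in find_postgres_entries(sample):
    print(entry)
```

Anything this turns up would still need manual checking, and projects that don't name their database in the description would be missed entirely.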
Data sets would be especially useful if they were distributed as a pg_dump SQL file.
I browsed some open data sites to see if I could find any open data distributed via pg_dump SQL. No luck. This is not really surprising though. If I were using Postgres as the source of truth for my data, I’d still want to distribute it in a format with broader compatibility. I did some digging in awesome-public-datasets, Kaggle, FiveThirtyEight, NYC, and data.gov. I don’t imagine that stress-testing Mathesar with such data would prove particularly useful, because what we need is data that takes advantage of Postgres-specific features.
This seems like it might be a dead end. I’m just sharing this thought in case it saves anyone else some time during research.
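If we do come across a candidate SQL dump, a quick triage check might help decide whether it exercises anything Postgres-specific. Below is a minimal Python sketch under stated assumptions: the feature list is my own illustrative, non-exhaustive pick (extensions, enum types, jsonb, arrays, tsvector), the regexes are simple text matches rather than a real SQL parser, and the names are hypothetical.

```python
import re

# Illustrative, non-exhaustive patterns for Postgres-specific constructs.
# These are plain text matches, not a SQL parser, so false positives are possible.
FEATURE_PATTERNS = {
    "extension": re.compile(r"\bCREATE\s+EXTENSION\b", re.IGNORECASE),
    "enum type": re.compile(r"\bCREATE\s+TYPE\b.*\bAS\s+ENUM\b", re.IGNORECASE),
    "jsonb column": re.compile(r"\bjsonb\b", re.IGNORECASE),
    "array column": re.compile(r"\b\w+\[\]"),
    "full-text search": re.compile(r"\btsvector\b", re.IGNORECASE),
}


def postgres_features(sql_text):
    """Return a sorted list of the feature names whose pattern appears in sql_text."""
    return sorted(
        name for name, pattern in FEATURE_PATTERNS.items() if pattern.search(sql_text)
    )


# Made-up schema fragment for demonstration.
sample_dump = """\
CREATE EXTENSION IF NOT EXISTS pg_trgm;
CREATE TYPE mood AS ENUM ('sad', 'ok', 'happy');
CREATE TABLE person (
    id integer PRIMARY KEY,
    tags text[],
    profile jsonb,
    current_mood mood
);
"""

print(postgres_features(sample_dump))
```

A dump that comes back with an empty list is probably plain tabular data, which matches the kind of data set that seemed less useful for our purposes.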
Perhaps it’s worth asking the team:
Does anyone have any products in mind that use Postgres?