Problem: it splits the 'location' fields into words, and returns a doc count per word.

Solution 2: desired results, but performance worries.

I can do it using this query, pulling out ALL locations and doing the aggregation in jq (the every handy JSON cli-tool),
but this can turn into a performance nightmare when applied to huge volumes of data :