HI JD,
Another user asked a similar question this afternoon. The answer will be essentially the same, except for the formula to use (since you do not want to know the number of different cities rewieved by a user, but which is cited more than half of the times). I think the simplest for you is to use this little Jython code by adding a column based on "business_city". Here is the code (do not forget to put the Open Refine window in Python/Jython mode):
liste = row['record']['cells']['business_cities']['value'] #make sure that the column name "business_cities" is spelled correctly
for cities in liste:
count = liste.count(cities)
if count >= len(liste)/2.0:
return cities
This formula assumes that there is no tie. If a user has cited two cities exactly 50% of the time, the code will just return the first. It can be modified if you want.
Feel free to ask for more details if something doesn't seem clear to you.