--
You received this message because you are subscribed to the Google Groups "The Center for Election Science" group.
To unsubscribe from this group and stop receiving emails from it, send an email to electionscien...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
I did something similar in 1999-2000.
http://rangevoting.org/WarrenSmithPages/homepage/works.html
paper #56.
If you want to do something new & important, do multiwinner voting systems.
Framework for that:
http://www.rangevoting.org/BRmulti.html
I would use a weighted correlation function instead of your weighted sum of squared distances for ‘s’ below. That is s = ∑w_i × v_i × c_i summed over the ‘n’ issues, with v_i and c_i representing a voter’s and a candidate’s position (rating or ranking) on issue i and w_i being the voter’s weight for the issue. Then the voter should vote for the candidate who maximizes this correlation.
I’ve tested both measures of distance and correlation in my clustering algorithm for proportional representation and found that correlation works better.
Dick Burkhart
4802 S Othello St, Seattle, WA 98118
206-721-5672 (home) 206-851-0027 (cell)
--
That’s how correlation works. If you’re in the middle, it wouldn’t make sense to say that you were correlated with any higher rating, or with any lower rating. And in my clustering application I’m only looking for voters with at least a somewhat positive correlation with the central vector of the cluster, even for only partial membership in the cluster. That is for me, these voting blocks are “fuzzy sets”. Also a certain fraction of voters will end up being classified as “independent” voters, since they don’t identify well with any cluster. Your dead-center voter would be one of these.
From: electio...@googlegroups.com [mailto:electio...@googlegroups.com] On Behalf Of Nevin Brackett-Rozinsky
Sent: April 17, 2015 2:30 PM
To: electio...@googlegroups.com
--
You might want to rethink that interpretation. In my experience a middle, or 0 rating, usually means that the person either doesn’t know much about the issue or doesn’t care much about it. So your distance metric may be misleading.
From: electio...@googlegroups.com [mailto:electio...@googlegroups.com] On Behalf Of Nevin Brackett-Rozinsky
Sent: April 18, 2015 12:46 AM
To: electio...@googlegroups.com
--
Voters and candidates each have a position on every issue, and those positions are drawn from a standard normal distribution.
Voters and candidates each have a position on every issue, and those positions are drawn from a standard normal distribution.I would like to request an optional variant in which you draw from two normal distributions. That way you can see how the system reacts under polarization, such as the bimodal distribution of modern US politics.
--
I would like to request an optional variant in which you draw from two normal distributions. That way you can see how the system reacts under polarization, such as the bimodal distribution of modern US politics.
This is definitely interesting work. I've done something similar last year; see https://github.com/The-Center-for-Election-Science/vse-sim . Nevin: would you be interested in a real-time chat some time (skype or hangout or similar) to discuss the common issues? I think it is worthwhile to have different people re-implementing this independently, because there are plenty of judgment calls involved, and it's good to see if results are robust to how those calls are made. But "independently" doesn't mean we shouldn't talk about how we've done it, so as to better understand where and why our results are similar or different.
Another thing you need to watch out for, is tiebreaking.
It is very important to break ties RANDOMLY, and
the way IEVS does this, is by pre-ordering the candidates by a random
permutation, then picking the first of the tiers. There are a lot of
ways to be confused and bias your randomness. That will lead to large
statistical effects which will be totally misleading.
Warren D. Smith
--
There may be applications where distance makes more sense than correlation. But in that case instead of ratings like -3,-2,-1,0,1,2,3, they should be 0,1,2,3,4,5,6,7. It’s just that in my experience people more often use ratings like the former in their head, even if they see the latter on paper. That is, if they give a middle rating for an issue, it doesn’t mean that it’s really important to them that their preferred candidate also rates that issue in the middle. It’s much more likely that the voter simply doesn’t know much about the issue, or care much about it, or has ambiguous feelings about it, and is willing to let the candidate take a stronger position either for or against. Standard correlation (with 0 in the middle) handles this perfectly.
My biggest issue with distance is that I want to focus just on the issues most highly rated by a voter. Those are the ones that determine the voting blocks, or clusters, I am seeking. Distance weights similarity over all issues (unless the voter can artificially restrict the issues to his or her mostly highly rated ones). With correlation I can easily zero out all negative ratings by a voter, to eliminate their effect. This also eliminates strategic voting against partisan opponents, decreasing mudslinging.
In addition note that the rating difference between 1 and 2 is the same as between 2 and 3, whereas in correlation 2 * 3 = 6 carries 3 times the weight of 1 * 2 = 2, putting a much greater emphasis on agreement for the mostly highly rated issues, as most voters would want.
You could easily test both distance and correlation to see how they work in different situations.
--