In theory, I like the hashing idea.
In practice, I don't think it'd work too well. I know (from experience, heh) that it's way too easy to work around that by adding a bit of random data somewhere in the article. Randomly capitalizing things somewhat differently, adding a number to the end, putting in a in a different place... to make the hashing really effective, you'd have to do something like:
(i.e. convert to lowercase, get rid of all &foo; crap, HTML markup, numbers, letters, and whitespace) in order to really catch duplicates, and even then there's probably some simple workaround. That is, the text "The rain in Spain falls mainly on the plain" and "the rain in spain falls mainly on the plain..." would be seen as identical, but adding a simple tyop (such as "Teh rain in Spain falls mainly on the plain") will still negate its usefulness. Fortunately, assuming a perfect pair of hashing functions (md5 and SHA-1 would be my choices), the potential for false duplicate-positives is practically nil.
cat story | tr '[A-Z]' '[a-z]' | sed 's/\&*\;//g' | sed 's/\<*\>/g' | tr -d '(insert regexp which matches numbers, punctuation, and whitespace)' | (something to generate two distinct hashes)
Thing is, Scoop already has post limit thresholds in it. You can't post more than one article every few minutes, and every time you try to, it doubles the time in which you can't post. So if you get barred from posting, say, 24 times in a row (assuming an initial ban time of 1 minute), you'll be barred for around 31 years.
But even then, it's easy for someone to just make a new account. Then you have to get the admins involved; you can't simply ban *@hotmail.com or whatever. (Well, okay, you can, but it'd piss a lot of legit people off.)
Fingerprinting sucks, especially for those of us who use proper style and diction.
Netmask quotas are probably the best idea overall. They'd be hard to fake, wouldn't severely impact legitimate users (given a decent set of thresholds, anyway), and would only be triggered in extreme cases of spamming.
That said, I think the current system is fine. It's not like any spam has ever made it through the queue. :)
"Is not a quine" is not a quine.
I have a master's degree in science!
[ Hug Your Trikuare ]
[ Parent ]