Hello program!
Need advise, I'm looking for service to seek excessive profanity, like http://www.google.com/recaptcha
Thanks!
Loading User Information from Channel 9
Something went wrong getting user information from Channel 9
Loading User Information from MSDN
Something went wrong getting user information from MSDN
Loading Visual Studio Achievements
Something went wrong getting the Visual Studio Achievements
Hello program!
Need advise, I'm looking for service to seek excessive profanity, like http://www.google.com/recaptcha
Thanks!
You want some web-service or library/API that can identify profanity in arbitrary text?
Uh, that's trivial:
if( string.Contains("fúck") ) throw new ProfanityException("you dun goofed");
W3bbo said:You want some web-service or library/API that can identify profanity in arbitrary text?
Uh, that's trivial:
if( string.Contains("fúck") ) throw new ProfanityException("you dun goofed");
I'll see you at that statement and raise you an expression
expletives.Where(expletive => text.Contains(expletive)).Any().Then(() => { throw new ExpletiveException("what the heel"); });
Keep in mind that within English a word is broken by a whitespace character, thus you should never match an exact word per above, instead you should match a word and single whitespace character either side.
ManipUni said:Keep in mind that within English a word is broken by a whitespace character, thus you should never match an exact word per above, instead you should match a word and single whitespace character either side.
Or pop open an AI book and use some fuzzy string comparison algorithm.
W3bbo said:You want some web-service or library/API that can identify profanity in arbitrary text?
Uh, that's trivial:
if( string.Contains("fúck") ) throw new ProfanityException("you dun goofed");
Thanks, but I know how to implement the verification text using dictionaries, but in this case I'm looking ready to work in a network solution as a service or API, such as google translate/
Beetlejuice said:W3bbo said:*snip*Thanks, but I know how to implement the verification text using dictionaries, but in this case I'm looking ready to work in a network solution as a service or API, such as google translate/
That sounds like a useful service -- if you can't find one, perhaps you should write one and sell it ...
If you don't, I might. ![]()
Herbie
Beetlejuice said:W3bbo said:*snip*Thanks, but I know how to implement the verification text using dictionaries, but in this case I'm looking ready to work in a network solution as a service or API, such as google translate/
You might have stumbled on a gap in the market, register a web address and expose your dictionary as a web service.
It may well be that this is unpopular because it is processor intensive, i.e. after I type a single word then it needs to be validated, or you can hold off until the person posts, but more information is required as to how you intend to check for rude words, is this a web application or desktop...etc...?
Dr Herbie said:Beetlejuice said:*snip*That sounds like a useful service -- if you can't find one, perhaps you should write one and sell it ...
If you don't, I might.
Herbie
ok. I've thought about it...
vesuvius said:Beetlejuice said:*snip*You might have stumbled on a gap in the market, register a web address and expose your dictionary as a web service.
It may well be that this is unpopular because it is processor intensive, i.e. after I type a single word then it needs to be validated, or you can hold off until the person posts, but more information is required as to how you intend to check for rude words, is this a web application or desktop...etc...?
I agree that mechanism requires hardware resources, but google has a service for spell checking, send texts, receive text ... You can reduce the load by introducing various restrictions
vesuvius said:Beetlejuice said:*snip*You might have stumbled on a gap in the market, register a web address and expose your dictionary as a web service.
It may well be that this is unpopular because it is processor intensive, i.e. after I type a single word then it needs to be validated, or you can hold off until the person posts, but more information is required as to how you intend to check for rude words, is this a web application or desktop...etc...?
wörd, w*rd, wohrd, w ord, w_ord, w o r d, [ascii art], [image]
There should probably be a whitelist as well. I don't know if wordnet is exposed as a web service but if so, it may be possible to lookup a word and see if it classifies as an expletive, just so you don't censor out something that looks like a modified expletive but is also a whitelisted word.
Beetlejuice said:vesuvius said:*snip*I agree that mechanism requires hardware resources, but google has a service for spell checking, send texts, receive text ... You can reduce the load by introducing various restrictions
The restriction you need is in controlling user input.
If I were to join channel 9 and start fu**ing writing any fu**ing thing I fu**ing want, the admins would use a banhammer, so you will find that very few forums tend to need excessive algorithms that degrade performance in order to moniter user input. On the rare occasion that a plebian decides to use such language, then it is reported and the post hastily deleted.
Seems to be like you need a policy that lists the usual set of critera for decent virtual interlocution. That is more effective than thousands of lines of code.
I've used this in the past. The guy claims he has an API you would need to email him about.
You would need to use regex to pull out the replacement.
MasterPie said:I've used this in the past. The guy claims he has an API you would need to email him about.
You would need to use regex to pull out the replacement.
If I put in, "Wow, that is some f---ed up s---." I get "Wow, that's some had intercourse up poop." It could use a little bit of help in the art of grammar. Or perhaps it should just bracket, like this "Wow, that's some [had intercourse] up [poop]." I don't know about you, but having intercourse up poop doesn't sound like all that much fun.
-Josh
JoshRoss said:MasterPie said:*snip*If I put in, "Wow, that is some f---ed up s---." I get "Wow, that's some had intercourse up poop." It could use a little bit of help in the art of grammar. Or perhaps it should just bracket, like this "Wow, that's some [had intercourse] up [poop]." I don't know about you, but having intercourse up poop doesn't sound like all that much fun.
-Josh
Or just don't censor in the first place?
As a matter of principle, the only times I'd implement a profanity filter is if I were working on something to be used by those under 14 or so, for anything else it's just silly: I like to think we're mature in how we handle with this, and explicit language has its place for emphasis and exclamation. To think that the word "fúck" should be banned "because it's a swear word" is to think like a puritanical authortarian who bans things for the sake of banning.
W3bbo said:JoshRoss said:*snip*Or just don't censor in the first place?
As a matter of principle, the only times I'd implement a profanity filter is if I were working on something to be used by those under 14 or so, for anything else it's just silly: I like to think we're mature in how we handle with this, and explicit language has its place for emphasis and exclamation. To think that the word "fúck" should be banned "because it's a swear word" is to think like a puritanical authortarian who bans things for the sake of banning.
I have set up swear filters to filter out racism. The normal f's and s's were allowed.
This brings up a great point (and an opportunity, as has already been suggested): A cloud service (Internet addressable over HTTP, pay for play or not, perhaps keep it free and send down adds
...) that takes a blob of text, sanitizes it, returns it to
caller. It is essentially what Caja is for JavaScript, only that it targets and srubs words in some supported human-spoken language, with an emphasis on detecting, removing and replacing bad words with "safe" words, but this could be extended as well....
Why don't you write this service and then maybe C9 can start using it, too ![]()
C
Charles said:This brings up a great point (and an opportunity, as has already been suggested): A cloud service (Internet addressable over HTTP, pay for play or not, perhaps keep it free and send down adds
...) that takes a blob of text, sanitizes it, returns it to caller. It is essentially what Caja is for JavaScript, only that it targets and srubs words in some supported human-spoken language, with an emphasis on detecting, removing and replacing bad words with "safe" words, but this could be extended as well....
Why don't you write this service and then maybe C9 can start using it, too
C
I think such a service could only be called one thing and could only be written in one language.
That language has to be... Newspeak ![]()
Thread Closed
This thread is kinda stale and has been closed but if you'd like to continue the conversation, please create a new thread in our Forums,
or Contact Us and let us know.