On Thu, 2009-02-05 at 15:07 -0600, Shawn McKenzie wrote: > Ashley Sheridan wrote: > > On Wed, 2009-02-04 at 18:34 -0200, Jônatas Zechim wrote: > >> Thank you, but i thought one of you had the .sql or .txt, .xls, etc. > >> I had already find that results. > >> > >> But it's ok now.. > >> > >> zechim > >> > >> -----Mensagem original----- > >> De: Andrew Ballard [mailto:aballard@xxxxxxxxx] > >> Enviada em: quarta-feira, 4 de fevereiro de 2009 18:19 > >> Para: Jônatas Zechim > >> Cc: PHP-General List > >> Assunto: Re: Bad words [SQL, database, txt, whatever] > >> > >> On Wed, Feb 4, 2009 at 2:48 PM, Jônatas Zechim <zechim.php@xxxxxxxxx> wrote: > >>> Hi there I don't know how to say 'palavrões'(i mean bad words, like f*** > >>> you, your bi***, as*) in English, but I need that. > >>> > >>> Anyone has or know where I can get a database, txt, whatever of 'bad words > >>> or 'palavrões'' > >>> > >>> zechim > >>> > >> http://www.google.com/search?hl=en&q=bad+word+dictionary&btnG=Google+Search > >> > >> http://www.google.com/search?hl=en&q=bad+word+list&btnG=Search&aq=f&oq= > >> > >> http://www.google.com/search?hl=en&q=bad+word+database&btnG=Search&aq=f&oq= > >> > >> Andrew > >> > >> > > It's not as simple as just blocking the bad words anyway, as people will > > find clever ways of getting round them such as $hit, sh1t, shi+, etc. > > Not only that, but just replacing words can cause it's own problems. I > > remember a lot of early basic filters (Hotmail anyone?) that prevented > > words like this, and any unfortunate with a surname of Hancock was left > > in a right mess! > > > > > > Ash > > www.ashleysheridan.co.uk > > > On my family website, my naive younger sister posted news about her > college graduation and had to type magnacumlaude, because she couldn't > understand why it kept coming out magna *** laude. > > -- > Thanks! > -Shawn > http://www.spidean.com > It's a mess isn't it? Best way I've found for situations where I need to filter content is to moderate it by hand. It's slow, but is less prone to problems. The moderator has to continually make a mess of things to let bad words through, where a programmer only has to mess it up once and the computer will continue to reject/accept whatever it wants! Also, a human moderator can put context to content, something which even the most sophisticated AI is incapable of. Ash www.ashleysheridan.co.uk -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php