Re: Reverse plural forms to singular for search

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I was looking into that a month or so ago.



Here's an article that mentions the Porter Stemming Algorithm
http://www.scit.wlv.ac.uk/seed/docs/mypapers/stemalg.html

and the official home
http://www.tartarus.org/~martin/PorterStemmer/


I don't know if this is of interest.

-Brian Anderson

----- Original Message ----- 
From: "Mark Rees" <MRees@xxxxxxxxxxxxxxx>
To: <php-general@xxxxxxxxxxxxx>
Sent: Friday, April 15, 2005 3:14 AM
Subject: FW:  Reverse plural forms to singular for search


> Eugene, your reply-to address is set
>
> -----Original Message-----
> From: Mark Rees
> Sent: 15 April 2005 09:08
> To: 'Eugene Voznesensky'
> Subject: RE:  Reverse plural forms to singular for search
>
>
> That sounds like a nightmare! Assuming you are talking about English,
> you might be able to make some progress by removing trailing s or es for
> words. This would trim words down to their stem. You could do this with
> a regular expression, which will need to do something like look for es
> or s with no preceding s  at the end of words (to avoid trimming words
> like glass).
>
> It will not catch forms like medium/media and thesis/theses. You could
> think about trying to trim words like that down to their stem (med in
> this case). In fact, would just trimming off the last two letters and
> searching for that do for you? It depends on the range of words you are
> searching for.
>
> Good luck
>
> Mark
>
> -----Original Message-----
> From: Eugene Voznesensky [mailto:voznesenphp@xxxxxxxxxxxxx]
> Sent: 14 April 2005 17:22
> To: php-general@xxxxxxxxxxxxx
> Subject:  Reverse plural forms to singular for search
>
>
> I'm trying to reverse plural forms to singular ones
> and use the result for search [in database].
>
> Is there any third party product to integrate with
> PHP, or some reliable algorithm?
>
> Thank you,
>
> Eu.
>
> -- 
> PHP General Mailing List (http://www.php.net/)
> To unsubscribe, visit: http://www.php.net/unsub.php
>
> Gamma Global : Suppliers of IBM, Compaq, Cisco, Sun Microsystems, 3Com and
HP Products.
>
> GAMMA GLOBAL (UK) LTD IS A RECOGNISED 'INVESTOR IN PEOPLE' AND AN 'ISO
9001 2000' REGISTERED COMPANY
>
> **********************************************************************
>
> CONFIDENTIALITY NOTICE:
>
> This Email is confidential and may also be privileged. If you are not the
> intended recipient, please notify the sender IMMEDIATELY; you should not
> copy the email or use it for any purpose or disclose its contents to any
> other person.
>
> GENERAL STATEMENT:
>
> Any statements made, or intentions expressed in
> cessarily reflect the view of Gamma Global (UK) Ltd. Be advised that no
content
> herein may be held binding upon Gamma Global (UK) Ltd or any associated
company
> unless confirmed by the issuance of a formal contractual document or
> Purchase Order,  subject to our Terms and Conditions available from
http://www.gammaglobal.com
>
> E&OE
>
> **********************************************************************
> **********************************************************************
>
>
> -- 
> PHP General Mailing List (http://www.php.net/)
> To unsubscribe, visit: http://www.php.net/unsub.php
>
>

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


[Index of Archives]     [PHP Home]     [Apache Users]     [PHP on Windows]     [Kernel Newbies]     [PHP Install]     [PHP Classes]     [Pear]     [Postgresql]     [Postgresql PHP]     [PHP on Windows]     [PHP Database Programming]     [PHP SOAP]

  Powered by Linux