Re: something better than pgtrgm?

Andrew Sullivan <ajs@xxxxxxxxxxxxxxx> · Tue, 9 Oct 2012 08:18:09 -0400

On Tue, Oct 09, 2012 at 02:10:26PM +0200, Willy-Bas Loos wrote:
> Hi,
> 
> I need a *language unaware* text comparison algorithm

[. . .]

> (i want to use it for *"did you mean ...?"* for approx 6-10 character codes
> or 8-20 letter words of mixed languages)

I don't think this is going to do what you want, at least from the
user's point of view.

The character codes case probably would work in a language-unaware
way.

But for the mixed languages case, surely it's not _any_ mixed
language?  Are you mixing Arabic, Farsi, Chinese, and Hindi, for
instance?

If not, then you're not really language unaware, but instead
constrained by a subset of languages.  That is a more tractable
problem (for instance, you may not have to worry about direction
changes, which vastly simplifies the problem).

Best,

A

-- 
Andrew Sullivan
ajs@xxxxxxxxxxxxxxx

-- 
Sent via pgsql-general mailing list (pgsql-general@xxxxxxxxxxxxxx)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general