Search indexing.. Re: [PHP] Search engine

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Just out of curiosity.. relating to this subject.. does anyone have any good documentation on creating your own site index so you can create your own search engine?

That is..  do search engines like Google take every word in a web page and if you search for that specific word it has a list of all URLs whose pages contain that word?   I mean, what is the general theory behind creating a searchable index as opposed to just storing all your web pages in a database and using SQL to find the pages?   If you indexed the pages you could then do a sort by relevance (which you really can't do with just doing a 'LIKE' SQL query on your raw content).

I know this isn't strictly PHP related, but if someone hasn't done a basic indexing search engine in PHP, then it's time someone did I think. :)

-TG

= = = Original message = = =

Why not use PHP to create static pages from your dynamic info (say 5 minutes 
after the data is update?) and allow for the system to search the site then?  I 
have noticed that I don't even have to do that for the search spiders from 
Google and yahoo to trawl my site and get everything, so maybe looking for code 
to curl through would be good?

Robert

Rosen is quoted as saying on 1/20/2005 7:48 AM:
> But here the problem is, that the texts in database uses from different
> scripts and on the search engine I should show and link to the sctipt, thath
> shows searched data.
> My idea was for search script, who explore the whole site (as generated from
> PHP scripts - via links ).
> 
> 
> 
> "Ben Edwards" <funkytwig@xxxxxxxxx> wrote in message
> news:32b873ff050120043257d4d552@xxxxxxxxxxxxxxxxx
> 
>>Looks like you will have to write your own.  Have a look at the
>>scripts you have and see how they do it.  One option is to write the
>>'static' pages into the database as well as the file system.  I
>>personaly put all content into a database and do not really have any
>>static pages atall.
>>
>>Ben
>>
>>On Thu, 20 Jan 2005 14:18:49 +0200, Rosen <rosen@xxxxxxxxxxxxxx> wrote:
>>
>>>I try many scripts for searching, but they don't work how I want.
>>>The problem is, that part of site is static text ( not in database ) ,
> 
> other
> 
>>>part ( products ) are in MySQL database - this part is generating from
> 
> PHP
> 
>>>scripts.
>>>
>>>"Ben Edwards" <funkytwig@xxxxxxxxx> wrote in message
>>>news:32b873ff05012004152c1cb9b4@xxxxxxxxxxxxxxxxx
>>>
>>>>This kind of depends on what database you are using (I am asuming you
>>>>mean you have a data driven site you want to search so strictly
>>>>speaking it is the database that you want to search).
>>>>
>>>>Mysl has free text search facilities (i.e. you can pass it a number of
>>>>words and it can search for them in a set of database fields and even
>>>>kreates a 'ranking').  however this only works if you have a few
>>>>hundread records - less than this and the results are unpredictable.
>>>>
>>>>Have you tries googeling for php search scripts?
>>>>
>>>>Ben
>>>>
>>>>On Thu, 20 Jan 2005 14:04:44 +0200, Rosen <rosen@xxxxxxxxxxxxxx>
> 
> wrote:
> 
>>>>>Hi,
>>>>>Can someone recommend me a search engine script in PHP for inside
> 
> one
> 
>>>site?
>>>
>>>>>Thanks in advance!
>>>>>Rosen
>>>>>
>>>>>--
>>>>>PHP General Mailing List (http://www.php.net/)
>>>>>To unsubscribe, visit: http://www.php.net/unsub.php
>>>>>
>>>>>
>>>>
>>>>
>>>>--
>>>>Ben Edwards - Poole, UK, England
>>>>WARNING:This email contained partisan views - dont ever accuse me of
>>>>using the veneer of objectivity
>>>>If you have a problem emailing me use
>>>>http://www.gurtlush.org.uk/profiles.php?uid=4
>>>>(email address this email is sent from may be defunct)
>>>
>>>--
>>>PHP General Mailing List (http://www.php.net/)
>>>To unsubscribe, visit: http://www.php.net/unsub.php
>>>
>>>
>>
>>
>>-- 
>>Ben Edwards - Poole, UK, England
>>WARNING:This email contained partisan views - dont ever accuse me of
>>using the veneer of objectivity
>>If you have a problem emailing me use
>>http://www.gurtlush.org.uk/profiles.php?uid=4
>>(email address this email is sent from may be defunct)
> 
> 

-- 
Robert Sossomon, Business and Technology Application Technician
4-H Youth Development Department
200 Ricks Hall, Campus Box 7606
N.C. State University
Raleigh NC 27695-7606
Phone: 919/515-8474
Fax:   919/515-7812
robert_sossomon@xxxxxxxx

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


___________________________________________________________
Sent by ePrompter, the premier email notification software.
Free download at http://www.ePrompter.com.

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


[Index of Archives]     [PHP Home]     [Apache Users]     [PHP on Windows]     [Kernel Newbies]     [PHP Install]     [PHP Classes]     [Pear]     [Postgresql]     [Postgresql PHP]     [PHP on Windows]     [PHP Database Programming]     [PHP SOAP]

  Powered by Linux