Re: crawler


 



Thanks.

In that case the links will only have the text 'Next', 'Last', etc. Is it possible to give them names? If not, what is the solution?

--- On Mon, 8/25/08, Edward Willekens <edwardwillekens@xxxxxxxxx> wrote:
From: Edward Willekens <edwardwillekens@xxxxxxxxx>
Subject: Re:  crawler
To: php-objects@xxxxxxxxxxxxxxx
Date: Monday, August 25, 2008, 4:04 AM










    
While the crawler is reading the page looking for links (href=), it is also looking for your keyword, i.e. something like:

    if (($keyat = stripos($line, $key)) !== FALSE
        && (($grda = strpos($line, ">", $keyat)) === FALSE
            || strpos(substr($line, $keyat, $grda - $keyat), "<") !== FALSE)) {
        // found key
    }

I.e. find the key value on the line and make sure it is not inside a tag: either no '>' follows it, or a '<' comes before the next '>'.
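The check described above can be wrapped in a small function. This is only a sketch under assumptions: the name `keyword_in_text` is made up for illustration, and the heuristic assumes that '<' and '>' appear only as tag delimiters on the line. It also walks through every occurrence of the key, so a match inside an href does not hide a later match in the visible text.

```php
<?php
// Hypothetical helper: returns true if $key occurs in $line outside of any HTML tag.
// Heuristic only: assumes '<' and '>' are used solely as tag delimiters.
function keyword_in_text(string $line, string $key): bool
{
    if ($key === '') {
        return false;
    }
    $offset = 0;
    // stripos() is already case-insensitive, so no strtolower() is needed.
    while (($keyat = stripos($line, $key, $offset)) !== false) {
        $gt = strpos($line, '>', $keyat); // next tag close after the key
        $lt = strpos($line, '<', $keyat); // next tag open after the key
        // Outside a tag when no '>' follows, or a '<' opens before the next '>'.
        if ($gt === false || ($lt !== false && $lt < $gt)) {
            return true;
        }
        $offset = $keyat + 1; // keep looking for a later occurrence
    }
    return false;
}
```

For example, the key 'members' only appears inside the href of `<a href='members.php?page_id=10'> Next </a>`, so it is rejected, while 'Next' is accepted because it sits between the tags.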



--- On Sun, 8/24/08, shivam0101 <shivam0101@yahoo.com> wrote:
From: shivam0101 <shivam0101@yahoo.com>
Subject: crawler
To: php-objects@yahoogroups.com
Date: Sunday, August 24, 2008, 2:32 PM



Hi,

After searching Google for "crawler", this is what I understand.

A crawler extracts all the links of a site until no new links are found. It keeps the links either in a database or in a file. The search engine then compares the stored links with the search keyword and prints the links that match.

For example, if a link contains 'ABC' and the search key is 'ABC', then that link will be printed.
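The "extract links until no new links are found" loop described above might look roughly like the sketch below. The function names `extract_links` and `crawl` and the `$fetch` callback are assumptions made for illustration; hrefs are treated as opaque keys (no relative URL resolution), and pages are kept in memory rather than in a database or file.

```php
<?php
// Collect the distinct href values of all <a> elements in an HTML string.
function extract_links(string $html): array
{
    if (trim($html) === '') {
        return [];
    }
    $doc = new DOMDocument();
    // '@' hides warnings that loadHTML() emits for not-quite-valid HTML.
    @$doc->loadHTML($html);
    $links = [];
    foreach ($doc->getElementsByTagName('a') as $a) {
        $href = trim($a->getAttribute('href'));
        if ($href !== '') {
            $links[] = $href;
        }
    }
    return array_values(array_unique($links));
}

// Visit pages starting at $start until no unseen link remains.
// $fetch is a hypothetical callback: it takes a URL and returns HTML, or null on failure.
function crawl(string $start, callable $fetch): array
{
    $seen  = [$start => true];
    $queue = [$start];
    $pages = [];
    while ($queue) {
        $url  = array_shift($queue);
        $html = $fetch($url);
        if ($html === null) {
            continue;
        }
        $pages[$url] = $html; // keep the page body, not just the link
        foreach (extract_links($html) as $link) {
            if (!isset($seen[$link])) {
                $seen[$link] = true;
                $queue[]     = $link;
            }
        }
    }
    return $pages;
}
```

Passing the fetcher in as a callback keeps the loop independent of how pages are retrieved, so it can be tried against an in-memory array before pointing it at a real site.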



I have a members page which lists all the members of the site. It contains the name, age, SPAM, etc. of each member. Since there are more than 1000 members, I am listing 10 members per page. So the link will be <a href='members.php?page_id=1'> Next </a>. The page_id will be 2, 3, ... and so on.

Suppose a member named 'ABC' is listed on page 10 of the members page, i.e. <a href='members.php?page_id=10'> leads to the page containing member 'ABC' details. If I give the search keyword 'ABC', how will the crawler find 'ABC' when it crawls the members page?
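One possible way to picture this: if the crawler stores the HTML of each page it visits, not just the link, the search can run over the visible text of every stored page. The sketch below assumes a hypothetical `$pages` array mapping each URL to its HTML; the name `search_pages` is made up for illustration.

```php
<?php
// Return the URLs whose visible text (tags stripped) contains $key.
// $pages is assumed to map URL => HTML, e.g. as collected during a crawl.
function search_pages(array $pages, string $key): array
{
    $hits = [];
    if ($key === '') {
        return $hits;
    }
    foreach ($pages as $url => $html) {
        // strip_tags() drops markup, so a key found only in an href is ignored.
        $text = html_entity_decode(strip_tags($html));
        if (stripos($text, $key) !== false) {
            $hits[] = $url;
        }
    }
    return $hits;
}
```

With this approach 'ABC' is found on members.php?page_id=10 because the name appears in that page's text, even though every link pointing to the page only says 'Next'.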



Thanks



	

	 

	

	



	



	

	



[Non-text portions of this message have been removed]




      

    
    
	
	 
	
	








	


	
	


      


