Jump to content

PHP (code maybe) combining 10 different site search results


ybc

Recommended Posts

hi guys,

 

i just finished highschool starting to do webdesign at uni, and for one of my major project i want to make a search engine as simple as google that searches for example 10 websites and with the keyword given it brings out the results.

 

im doing a website on jetski sales results so if someone want to buy a jetski they come to this website and just choose the choose one from those 10 website without going to them individually. so it brings out all search resuts in a nice results format, and when you click on each results it take you to the website but i wanna be able to show their photo and price so just like brings their results into your site but combing 10 website results.

 

and i need to have an advance search option where they can search year price age of jetski, and all these variables are also in the 10 websites that im getting the results from.

 

i have been doing some searching and i cant get my head around i need some help LOL i dont wanna fail...

 

cheers guys

Link to comment
Share on other sites

Search engines use what are called spiders to crawl websites to obtain information on them. The majority will use the text contained within the title tags of pages, collect the anchor text from links from external websites that point to a particlular page as they contain keywords, read the content on pages and determine its relevence. Search engines use highly complex algorithms to determine what keyphrases websites rank and are returned under. When a spider returns all of this data it will be processed and used in what is called a search index. When you search the index is queried and the results are returned. Indexes can be searched very fast, this is why it takes less than a second for the likes of Google to return millions of results.

 

You are going to have to come up with a method of crawling through each of the 10 websites you choose, extracting the data you require, keywords, etc in order to be able to build up an index that you can search. If this is a small project you may simply use a database. If you want to build a search index then the likes of Sphinx / Lucene can do the job.

Link to comment
Share on other sites

This thread is more than a year old. Please don't revive it unless you have something important to add.

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.