Search Engines Essay, Research Paper
There are currently over a billion pages of information on the Internet about every topic imaginable. The question is how can you possibly find what you want? Computer algorithms can be written to search the Internet but most are not practical because they must sacrifice precision for coverage. However, a few engines have found interesting ways of providing high quality information quickly. Page value ranking, topic-specific searches, and Meta search engines are three of the most popular because they work smarter not harder.
While no commercial search engine will make public their algorithm, the basic structure can be inferred by testing the results. The reason for this is because there would be a thousand imitation sites, meaning little or no profit for the developers. The most primitive of searches is the sequential search, which goes through every item in the list one at a time. Yet the sheer size of the web immediately rules out this possibility. While sequential might return the best results, you would most likely never see any results because of the web?s inflammatory growth rate. Even the fastest computers would take a long time, and in that time, all kinds of new pages will have been created.
Some of the older ?spiders? like Alta Vista are designed to literally roam randomly through the web using links to other pages. This is accomplished with high-speed servers with 300 connections open at one time. These web ?spiders? are content based which means they actually read and categorize the HTML on every page. One flaw of this is the verbal-disagreement problem where you have a particular word that can describe two different concepts. Type a few words in the query and you will be lucky if you can find anything relates to what you are looking for. The query words can be anywhere in a page and they are likely to be taken out of context.
Content-based searches can also be easily manipulates. Some tactics are very deceptive, for example ??some automobile web sites have stooped to writing ?Buy This Car? dozens of times in hidden fonts?a subliminal version of listing AAAA Autos in the Yellow Pages?(1). The truth is that one would never know if a site was doing this unless you looked at the code and most consumers do not look at the code. A less subtle tactic is to pay to get to the top. For example, the engine GoTo accepts payment from those who wish to be at the top of a results list because the sites at the top will get more traffic.
Lawrence Page and Sergey Brin of Google have come up with a different idea for searching called PageRank. They realized that the most popular sites are those linked the most in other pages. Here is the pseudocode algorithm for searching:
1. Parse the query
2. Convert the words into wordIDs.
3. Seek to the start of the doclist in the short barrel for every word.
4. Scan through the doclists until there is a document that matches all the search terms.
5. Compute the rank of that document for the query.
6. If we are in the short barrels and at the end of any doclist, seek to the start of the doclist in the full barrel for every word and go to step 4.
7. If we are not at the end of any doclist go to step 4.
8. Sort the documents that have matched by rank and return the top k.
Each link to a page is like a vote for that page as well as the pages linked to that page. Thus a hierarchy of pages is created and the search results are much more reliable. Lycos and Excite also use the same system, but Google goes further. ?It then looked at the position of the words on the page, the size of the fonts, and the likelihood that the words were related to each other? (1). Going the extra distance gives Google much better precision.
Google and engines like it can still be manipulated to achieve higher rankings. Anyone who creates a set of pages with links between them can fool the system and add value to their page. So the race continues to find yet another search engine. One promising way to search for something is to use a topic-specific search engine. Among the topic-specific engines are VactionSpot, KidsHealth, and epicurious. These engines give you better results because they are often a front-end to a database of information, they are regularly maintained and updated, and they have a narrow focus and smaller size.
It makes sense that if you do a specific search, then you are less likely to end up with irrelevant information. The good news is that you are getting high quality results in a short period of time. The only problem with topic-specific engines is finding the right one. This is where query routing comes into play. You have two types: manual and automatic. Manual routing means you find the best topic matching your query yourself which can be confusing. Automatic routing is designing an algorithm to do it for you.
One of the newer automatic routers is called Q-Pilot. Q-Pilot uses both offline and online areas for quicker access. When a user enters a query, that query is expanded to create multiple topics that are more specific. These topics are taken from a ?neighborhood? of pages and often represent another search engine?s topics. ?Q-Pilot?uses the web as its knowledge base and autonomously learns what it does not know? (2). This almost sounds like artificial intelligence. Certainly the easiest way to index 100 million pages a day would be to get a computer to do it automatically.
The terms query expansion, clustering, and routing are sure to be seen many times in the near future, as they become necessities for good search engines. They can be found in some Meta search engines such as QueryServer. Query expansion, as mentioned before, is like a thesaurus. It gathers all relevant words in neighborhood pages that might mean the same as those entered in the main query. Then it checks to see how many times those words appear in similar ? the more times they appear, the more relevant they are. It may be necessary to re-evaluate some words if there is little co-occurrence overall.
Clustering comes right after query expansion and is relatively faster. The engine sorts the results of the primary search engines and groups them. If you then see a more specific topic among those you can go directly to the matches for it. Q-Pilot will give you three different clusters at most to reduce confusion. A pattern seems to be emerging here. New search engines are actually old engines combined with each other and new ideas. So wouldn?t the best be one that combines all features? In a word, yes. That is the idea behind a Meta search engine.
QueryServer (queryserver) is an example of one of the very latest Meta search engines. It uses ten primary engines like Yahoo and Google, has customizable matching and clustering, and shows you details of the results like number of matches and response time. An aspect that should not be overlooked concerning Meta search engines is the data model. The data model essentially communicates between the primary and secondary engines, converting the query into the correct format. This is because some use words in search strings while others use Boolean. In order to utilize the features of each engine, the data model should be able to adapt to different engines to achieve good precision.
The Authors explain, ?Based on such a data model, a meta search engine can achieve several advantages:
1 It will present to users a more sophisticated interface?
2 Make the translation more accurate
3 Get more complete and precise results
4 Improve source selection and running priority decisions? (3).
Again the idea of optimizing the Internet through intelligent software shows up. It is just a matter of designing a certain algorithm that does not forget what it has learned.
Most people did not foresee the tremendous growth of the Internet in the 1990?s. Computer algorithms have gone from small government programs to every personal computer in the world. You start with the most basic problem solving and end up with the most complex of problem solving. That of course is sorting through a database that grows almost exponentially.
Plain and simple, the Internet has a lot of information on it. A crawler works twenty-four hours a day digging through it all. The search engine pulls out the parts people want and hands it to the Meta search engine. The Meta search engine further discriminates until you get exactly what you are looking for. Yet behind all this are machines performing the instructions they have been given ? an algorithm.
Другие работы по теме:
How To Find A Job Effectively Essay
, Research Paper Everyday someone is looking for a job. Whether that person is a recent graduate, a person laid-off from work, or a person that wants a different job, their diligent search turns into a carefully planned search for employment. It is important that a person knows how to search effectively for a job.
Serach Engines Essay Research Paper Web PortalsWhen
Serach Engines Essay, Research Paper Web Portals When the term “internet” became a household saying, such did words like Excite, Yahoo, and Lycos. These were the so-called portals that directed Internet first-timers to their destinations and virtually walked them through the World Wide Web. They were and still are the fist place visitors go when they start to “surf” the web.
Civil Rights Essay Research Paper One night
Civil Rights Essay, Research Paper One night John Doe is driving down the freeway and is pulled over for a routine traffic violation. After issuing poor Johnny a traffic ticket the
Yahoo Vs Lycos Essay Research Paper Yahoo
Yahoo! Vs. Lycos Essay, Research Paper Yahoo! Vs. Lycos When searching on the Internet, one may find it difficult sometimes to know where to start. With the seemingly limitless amount of information, one should use the resource suitable for the searcher’s needs and tastes. Comparing different factors like databases, directory types, strengths and weaknesses of two search engines, such as Yahoo! and Lycos, can provide an advantage to someone looking for a starting block.
Automotive Enginereing Essay Research Paper As long
Automotive Enginereing Essay, Research Paper As long as there are people there will always be a means of transportation. No matter what kind of mechanical transportation it will fail eventually. Which means there will always be a job that pays good money that is labeled “Automotive Technician.”
An Analytical Definition Of Philosophia Essay Research
Paper Men and women have tried for years to define Philosophy. One particularly interesting attempt was made by Martin Heidegger in the late 8th Century. He is reported as saying to his good friend and lover, Patrick Hurley, “Philosophy is one large search for the being-upon another and one long journey into the being-inside.” In this sense, Philosophy is best understand as a search.
Vdg Essay Research Paper Raintree
Vdg Essay, Research Paper Raintree’s website is dedicated to providing information and education on the important plants of the Amazon Rainforest, therefore this section is the most extensive. The Plant Database section is continuously under construction as we continue to add more rainforest plants which are under research.
Bucky Ball Essay Research Paper There are
Bucky Ball Essay, Research Paper There are many bucky balls in the world . Possible Reasons You are not authorized to access the section you are trying to gain entry to. The permissions are not set correctly on that file/directory. Directory browsing may be turned off for that directory. You mistyped your password. [an error occurred while processing this directive] This page hosted by Hypermart, the world’s fasty if this renovation of the site has caused you any discomfort, but in the end, we hope that it will serve better.
Process Paper How To Get
On The Net Essay, Research Paper Process Paper: How to get on the net The Internet is a very important tool for communicating, learning, and just surfing. To utilize the capabilities of the Net one must have a phone line, a computer with a modem, and an Internet Service Provider (ISP). Computers can usually be found at any electronic store.
Industrial Revolution Essay Research Paper How Global
Industrial Revolution Essay, Research Paper How Global Warming Relates to the Industrial Revolution Global Warming relates back to the Industrial Revolution in many ways. During the Industrial Revolution many new inventions were made that would make life easier for people back then and also in the future.
Search Engine Review Essay Research Paper Web
Search Engine Review Essay, Research Paper Web page design Search Engine review In this paper I am going to review a few web search engines, and hopefully provide some helpful insight for doing searches on the internet. The first step in obtaining the information you want from the web is structuring the words to search with.
Internet Search Essay Research Paper I recently
Internet Search Essay, Research Paper I recently took a workshop on how to use the Internet. I thought that writing an essay on ?how to use the Internet? would help me to remember what I learned
Internet Search Engines Essay Research Paper The
Internet Search Engines Essay, Research Paper The internet was established as a back up for the military in case of a nuclear attack and all normal communication were cut off. By establishing this form of communication the chain of command would have a way to pass down information and important orders. The military abandoned this idea and gave it to the higher education system.
The Internet Vs The Library Essay Research
Paper The Internet vs. the Library Overall, when one compares the Internet vs. the Library, the Library is superior. This is because, though it takes a bit longer to use, the Library has a standard information research tool; the Dewey decimal system. This organized system makes finding books simple, and with the added assistance of a librarian, finding information can be easy.
Criminal Justice Essay Research Paper The two
Criminal Justice Essay, Research Paper The two vehicle stops were made for different reasons. The first vehicle, the white Toyota Camry, was stopped because it fit the description of a vehicle that
Search Engines Essay Research Paper Search EnginesA
Search Engines Essay, Research Paper Search Engines A search engine is an online service that can aid a user in finding a web page that contains particular content the user is looking for. There are many different search engine services on the web. They are primarily distinguished from each other by the way they gather their information.
Richard Corey Essay Research Paper Money can
Richard Corey Essay, Research Paper Money can t buy happiness and “You can’t judge a book by its cover” are two old adages that echo through time; however, they seem to echo so softly that they are quite often disregarded. For men, in their search for contentment and fulfillment, only see money as the main vehicle.
Aliens Essay Research Paper Aliens
Aliens Essay, Research Paper Aliens What are aliens? There are illegal aliens. Their are UFO aliens. You have to ask yourself one question, do you think there could be intelligent life
Environmentally Friendly Alternatives To The Combustion Engine
Essay, Research Paper In the past decade there has been a great deal of worrying about what will happen when the world?s oil supply becomes depleted. The main reason for concern is that almost all of the automobiles in use now require an oil based gasoline to run their internal-combustion engines. In the next few decades it is predicted that all of the world?s oil will have already been mined, and combustion engines will be unable to function.
Analyzing Search Engines Essay Research Paper 1
Analyzing Search Engines Essay, Research Paper 1. Formulate five criteria for the evaluation of search enginesTo effectively evaluate three different search engines from the perspective of an advanced web user, the following criteria were established:
Ford Engines Essay Research Paper Ford V8
Ford Engines Essay, Research Paper Ford V8 Engine Differences There are different types of Ford engine’s including the Windsor, Cleveland, FE, and Big Block types. The 289 is the smallest of the popular Windsor engines. It was produced from ‘63-’68 and is very similar to the 302 except for the stroke. Most all 289’s and 302’s have mechanical camshafts and press in studs.
Internet Search Engines Essay Research Paper Internet
Internet Search Engines Essay, Research Paper Internet Search Engines How Search Engines Work Search engines are programs that crawl the web to compile lists of web sites so that this information can be used by people to find web pages that they are looking for.There are three major components to search engines.
Internet Advertising Essay Research Paper Internet AdvertisingInternet
Internet Advertising Essay, Research Paper Internet Advertising Internet Advertising is the way of the future and it is very evident since many companies and businesses have their own web sites and advertisements are located all over the World Wide Web. The Internet or World Wide Web is quickly becoming the most effective way for a business to advertise their products or services to customers.
2 Stroke Engines Essay Research Paper Paul
2 Stroke Engines Essay, Research Paper Paul Wehunt 2- stroke engines! The power that’s needed in today’s high performance Motor sports. Let us take a closer and more through look into the workings of the 2-stroke engine of today.
Legal Opinion For 8th Social Studies Essay
, Research Paper Legal Opinion A. I picked the side of New Jersey in the New Jersey vs. T.L.O case. The reason I picked that side is because the girl whas smoking on school grounds and she was not allowed to do that. Another reason is the teacher that found the girl smoking had the right to bring her to the Principals office, because she had a reason to.
Rudolph Diesel Essay Research Paper Rudolph Christian
Rudolph Diesel Essay, Research Paper Rudolph Christian Karl Diesel Rudolph Diesel was born on March 18, 1858 in Paris. On September 4, 1870 Rudolph’s family moved to England. In late November they decided it would be better for Rudolph to continue his schooling in Germany so he moved there on his own and stayed with a young professor.
Engines 2 Essay Research Paper Engines
Engines 2 Essay, Research Paper Engines Engines are found all around our very day lives. They are found in our cars, trucks, vans and motorcycles. Almost all vehicles run on the basic combustion engine. An internal- combustion engine is any type of machine that gets mechanical energy from the expenditure of the chemical energy of fuel burned in a combustion chamber.
Databases Essay Research Paper This was my
Databases Essay, Research Paper This was my first experience searching databases . Once again, I was overwhelmed with the onslaught of information I found on my topic. Searching databases definitely seemed faster and more efficient than searching for information online using search engines.
High And Low Displacement Engines Essay Research
Paper High & Low Displacement Engines People often think bigger is better, but that s not always true. Cars that have bigger engines are said to be faster than those with small engine are, but an Acura Integra Type R can out accelerate a Ford Mustang GT easily. This may seem impossible until the differences in the design of the two cars are analyzed.
Poem More Than You
’ll Ever Know Essay, Research Paper More Than You’ll Ever Know Words can’t express My feelings for you. I search for the right words, But none that I find will do.