Advantages and disadvantages of search engines. Course work: A comparative review of modern search engines

However, the situation changed over time. Search engines began to monetize their services, overgrown with serious budgets, which allowed them to recruit good professionals. They helped to develop better ranking algorithms, which all the time approximated the conditions of the struggle for leadership in the issue of traditional methods of promotion. And today it is not the most cunning but the most experienced, the most talented and the one who really wants to help the end user.

The problem is that the algorithms themselves search engines  have never been perfect and now they are not. They are constantly improving, new algorithms appear, old ones are being improved. Therefore, periodically the issue is shaken, which leads to the loss of positions of some sites. This affects many webmasters, because they lose their income. Also, people who are constantly looking for ways to bypass the rules of search engines have not disappeared anywhere. Attempts to go to the top of the issuance bypassing the rules will always manifest themselves, and this will lead to improved algorithms.

Previously, the most successful solution was ranking sites based on the number of links. Indeed, the truth - the more links lead to the site, the more authority it enjoys. However, there is one drawback here. The fact is that it is impossible to understand which link appeared naturally, and which one was openly forged to deceive the search engines. There were attempts to improve the algorithms, but still no one could achieve the best results. In the end, it was necessary to come up with something more substantial.

And it appeared. Now the search engines are trying to use information about whether the visitors are happy with the site or not. This is called behavioral factors. That is, to determine whether a site is popular and whether it deserves high positions it is necessary to spy on visitors. If people actively follow links to the pages of the site, spend a lot of time there, fill out some forms, subscribe to the newsletter, that means the site is popular with people. And this is already an occasion for increasing his position in extradition.

This is how search engines work today. To get out in the top of the issuance will have to do everything possible to make the site like the audience. Of course, you will have to buy links, because they still remain important and are used in ranking. But behavioral factors are still decisive.

In 1994, students at Stanford University, Jerry Yang and David Philo, were preparing to defend their thesis in the field of computer-aided design of integrated circuits. To do this, they had to spend a lot of time on the Internet, searching for necessary information and saving links. Lists with links grew, then Yang and Philo abandoned their thesis and began to exclusively collect links. By the middle of 1994, there were a lot of them, they sorted the links into categories, then there were too many categories of links, subcategories appeared. But the list of Jerry and David was not intended for public viewing - it was made exclusively for friends. Time passed, and attendance grew and grew. The site's address went hand in hand.

The first step to success was a new, memorable name - Yahoo! Following the wishes of users, the creators of www.Yahoo.com, began to transform the site. There are new categories, and sections "What's New" and "What's Cool". By the end of 1994, Young and Philo had abandoned their theses and had given themselves up entirely to the work on search engine Yahu.

At this time on the road appeared the company Netscape, which offered resources for the content of the Yahoo search engine. As a result, Yahoo! appeared his domain - yahoo.com, and the catalog moved to 10 Silicon Graphics Indy stations. Around the same time, Yahoo! received and the first investor - the investment fund "Seqouia Capital". Jerry and Young got their offices and hired an energetic team of web surfers. The growth rate averaged 1000 pages per day.

On April 12, 1996, www.Yahoo.com issued its shares. At a nominal price of $ 12 per piece by the end of the day they were already selling at $ 33. The creators of Yakha were the first to use the advertisement of their resource in print media and on television.

New Yahoo! search engine manager - Tim Cugle decided to develop Yahu from a simple list of ordered links to the real portal. And from the financial news to make auctions, adding to the core Yahoo.com elements of online access.

The following attachments to the Yahoo! search engine amounted to 560 million dollars. Yahoo.com grows up and begins to act on the market as an independent shark of the capitalist world. Competitors, in the face of America Online and Microsoft, are getting into different corporate alliances to strengthen their positions in business. After looking at the "enemies" methods, Yahoo in 1999 buy a free provider email   rocketmail.com. At that time on the open spaces of the Internet there are various sites claiming the title of Portal. With competitors, they crack down in different ways. Webring.com is buying, with the rest are waging a war on the stock exchange.

In 2000, the Yahoo search engine launched a project called Corporate Yahoo! and bought the Taiwanese business site Kimo, than announced their desire to move to the vast Chinese Internet market.

To date, Yahoo.com is one of the major search engines on the Internet. Providing services to 200 million people a month.

The current development of Yakhu can be defined as movement in on-line, interactivity. Yaho quickly develops this area of ​​Internet services, but there is one problem: the Yahoo! kernel it was not calculated. It was not in 1994 that the "on-line" component was incorporated into it, it was "pasted" by Tim Kugl a few years later. Naturally, there is a threat of hacker attacks through this unprotected area.

One of the innovations of the Yahoo search engine is the taskbar for firefox browser,. This tool helps to use the search for Yaho without going to the official site, but only using the functional buttons of the panel.

On September 1, 2005, the Yahoo search engine, which owns more than 200 million e-mail addresses around the world, announced the launch of a new search engine for texts, photographs and other documents contained in the letters.

The need for such an innovation arose after the increase in the amount of stored data, because some users create entire mail archives. Driven by Google's competitor and its Gmail mail service, Mail Storage E-mail offers now 1 gigabyte of free space, or 2 gigabytes per year's subscription. "Once you have the ability to store more information, you also need advanced search capabilities," explains Eric Peterson, an analyst at JupiterResearch. "

Users of the Yahoo search engine, in turn, can now use the capabilities of detailed search for words in the title or directly in the text of the letter, as well as in attached documents without opening them. The result of the search is reflected in three lines with all the attributes. The right pane displays all related documents. The found photos are displayed in a reduced form, which greatly facilitates the search. The system also takes into account spelling errors, allowing you to search for words only by first letters.

To begin with, www.Yahoo.com plans to offer a new system to a small number of US users, and then distribute it around the world. On the client side, this does not require any additional effort. "When the service becomes available, the corresponding banner will appear in the upper left corner of your mailbox", - the company Yahoo promises.

According to comScore Media Metrix for July this year, the Yahoo domain owns 219 million e-mail addresses, which is 31.5% of the world market, behind only Microsoft with 221 million Hotmail users (35.5% of the market).

October 11, 2005, aiming at nothing to keep up with the largest competitor, Yahoo announced its intention to create a digital library. And not too successful experience Google, immersed in legal proceedings with rights holders, the company does not scare, but rather on the contrary - it spurs.

Recall that Google some time ago concluded a contract for the digitization of books in major European and American universities and libraries, but the attempt to create the world's largest library of electronic texts caused fierce resistance to the fears of publishers and authors fearing for their incomes.

In mid-2005, Yahoo, officially announced the achievement of an important advantage in the fight against its main competitor in the market search market - Google. Managers believe that the search engine Yahoo allows you to find twice as many documents than Google. In their view, the Yahoo search database includes 20.5 billion objects - 19 billion text documents and 1.5 billion images. Thus, its search index (the number of objects that a user can find on the Internet with the help of a search engine) is almost twice as high as the search engine of the Internet corporation Google - 11.3 billion objects, of which 8.2 billion text documents   and 3.1 billion images.

Earlier, Yahoo never disclosed the value of the index of its search engine. Industry analysts were not unanimous in assessing its database. According to various experts, the Yahoo search engine covered from 6 to 8 billion objects in the network. According to the official announcement, for the first time in the history of Yahoo, a qualitative breakthrough in the competition with Google provided the latest, very successful upgrade of the search engine. "Internet users have a very good reason to pay attention to us," said Yahoo vice president Eckhart Walter. "Our search engine covers much more information than any other."

"Yahoo made users doubt that Google is the most powerful search engine. This gives the company Yaho an additional advantage in the struggle for leadership in the market. The Google Brand for many has become synonymous with finding information on the Internet. Yahoo's task is to shake the credibility of this brand, "said Forrester Research analyst Carlin Lee.

That's what lives www.Yahoo.com today, when the cost of one share has crept up to $ 200. Now the search engine Yahoo! has reached a critical size - the team of surfers can hardly maintain its relevance and add thousands of new sites every day.

what better Yandex  or Google? - Actual issue for today. But comparing these two search engines does not make sense, because each search engine has both advantages and disadvantages. This is a matter of taste, because one person may like one interface, and the other is completely different. Opinions on this issue diverge, no matter how competitive the two campaigns are.

As the statistics show, the person spends at least two hours on the Internet, and therefore this time should be comfortable for him, and this depends only on what kind of search engine people use. We will try to understand and answer the eternal question that worries many users of the worldwide network: which search engine is better: Google or Yandex?

Which is better: Google Chrome or Yandex Browser?

Before starting to work with the browser, a person first gets to know his interface, but only then looks at the functionality. But if we consider these two browsers - then their interface is almost the same, but there are significant differences.

  • Google Chrome: This browser is minimal and eliminates the "extra" buttons. For someone who does not like the browser interface, you can find a way out of the situation, he can use the settings to add everything he wants to the panel. there is mobile versions  this browser, which are at the top of the list of top 10 most popular browsers in the world.
  • Yandex browser:  The Yandex interface is similar to the Google interface. But it is more understandable, and any Internet user can understand it. To make it more convenient to work with the browser, an application has been added-the management of gestures. Also this company has a mobile application, which is quite easy and understandable to use.

If the browser does not work well and its page speed is low, then it is unlikely that someone will use it. If we compare speed and performance  two browsers, then you can come to the following conclusions:

  • Google.  Browser pages are loaded in a matter of seconds. But if you have a small memory, then it will not pull such a browser.
  • Yandex.  The speed is increased due to the function "Turbo". This function is implemented due to streaming compression of traffic on servers, which is convenient for mobile devices. But in fact the speed of loading pages is approximately the same.

Security  for many users of the network represents one of the important roles on the basis of which they are determined with the choice of the browser. After all, the Internet has a lot of scammers, from which you need to defend yourself and this is important when using the browser.

  • Google  - this the most secure browser, which is updated regularly. It filters lists by adding black pages to pages that can harm the computer. Even if you download a file that contains a virus, the browser warns you about it. But if you, contrary to the advice of the browser, downloaded and launched the virus, then you will not be able to avoid infection.
  • Yandex  also is distinguished for its safety. All files that are downloaded are scanned   Kaspersky Anti-Virus, and the system periodically notifies about "black" sites. The developers of the browser promise a high level of security of your information.


Equally important are additional functions, which are available in browsers.

  • Google - the benefits and additional features of the browser can talk endlessly. For example, Google has an excellent and convenient tool for analyzing the site code, which is useful for web developers.
  • Yandex can not yet boast such great functionality, but they try to copy everything from Chrome, and they do it well. It is worth noting that most of the functions are developed on tablets and mobile devices.

Which is better: Yandex disk or google disk?

Where do people store all the necessary information that is important and important to them? Probably, on a flash drive - before it was the most convenient way  storage of information. But it can be lost at any time, or all information from it can disappear as a result of getting out of standing. Therefore, they were invented cloud storage  in the Internet. About which service is better, let's try to understand further. We will compare two types of storage - this is Yandex.Disk and Google Drive.

Advantages of Yandex.Disks include the following features:

  • The user is provided with 10 gigabytes  memory for free;
  • Menu in Russian;
  • Attractive interface.
  • Any file can be shown to users of the orphanage by giving them a link to it;
  • Simple and straightforward menu;
  • High speed of uploading and uploading files.

The disadvantage of this service is that Yandex.Disk does not have built-in tools for working with documents from Microsoft Office.


Let's talk about Google Drive. Its advantages can be considered the following features:

  • Built-in editors of documents, texts and tables;
  • Teamwork online on the same document;
  • Free of charge 15 gigabytes  Memory, which is 5 more than the competitor;
  • Built-in service "Google Photo", which allows you to conveniently store and synchronize your photos with mobile gadgets.

Both services have a convenient client for PC, for tablets and smartphones. It is impossible to answer the question: which of these systems is better. They work equally well. Both of these services can be called safe and reliable.
  Next on the video compare these two discs:

Which is better: Yandex Maps or Google Maps?

Such an application as "Cards"  recently gained popularity, and today it is impossible to imagine a single tablet or smartphone without this service. After all, this is a convenient way to look at the right path and the distance from one address to another.

First and foremost, what people appreciate is the interface, and as many users have noticed, Google's Internet maps are more user-friendly and understandable. In addition, the possibility of Yandex maps is much more than the capabilities of Google Maps.

Yandex maps also communicate and inform drivers' users with the help of such built-in functions as:

  • Men at work
  • Camera
  • Talking
  • Other

With the help of this you can make a mark on the route where there was some kind of road event and on which lane where the speed control camera is located, where road works are being conducted. And you can just leave a short note that all other users of the application will see.


A convenient function of these two services can be considered what they have voice search. If we talk about the CIS countries, the cards from Yandex are clearly in the lead, since they do not exist in Google, they are only marked by borders. Yandex. Maps provide a more accurate result of where a particular locality is located.

Both services have perfect navigation  - for what they prefer drivers. It can be concluded that Yandex.Maps are much easier and more convenient, they have more functions and capabilities.

What kind of card service is better, let's look at the video:

Which search engine is better: Yandex or Google

The most important thing on the Internet is the search line, which finds all the necessary information. Several years have been debating about which search engine is best. The main competitors are Yandex and Google systems.

If you compare search engines by popularity, it depends on the country where you live. If in the CIS, the majority prefers to use Yandex, as it is looking for information for the most part on Russian-language sites. In other countries, of course, the American search engine is preferred.

If you compare search speed, Google searches for requests a little faster than Yandex.

Advantages of the Google search engine include the following features:

  • A little advertising;
  • Good protection from scammers;
  • Up-to-date delivery of relevant pages.

Advantages of the search engine from Yandex can be called the following:

  • Many additional services;
  • High-quality delivery of the necessary information;
  • Excellent ranking of the Russian Internet.

To answer the question, which search engine is better, you need to decide what information you are interested in. If it's Russian-language sites in the domain. Ru.rf, then Yandex will help you.


The difference between Yandex and Google: 6 main differences

There are several differences between the two systems, and the most significant ones will be described below:

  1. The Yandex interface is more diverse and understandable;
  2. If we talk about web developers, then they choose Google, because it is considered more stable;
  3. The main difference is that Yandex is a Russian system, and Google is American;
  4. Yandex is distinguished by many services that are useful for users;
  5. I have Google's excellent search suggestions that occur when typing;
  6. Relevance of the release of Google's information above.

Search engine Yandex is focused mainly on the Russian-speaking population. This search engine is very common in the Russian-language Internet (the so-called "runet") and takes in it 58% market share. It is oriented for the most part to countries such as Russia and Ukraine.

The remaining 42% of the market is divided by the search engines:

  • Google - 38.6%
  • Mail.ru - 0.8%
  • Rambler - 0.5%
  • Bing - 0.3%
  • Others: Yahoo, Nigma Tut.by, QIP.ru - 1.8%

From this it follows that Yandex is the leading search engine in the Russian segment of the Internet.


Answering the question: what is better Yandex or Google can not get an unambiguous answer. Advantages and disadvantages of both search engines. When choosing a search engine, browser or service you need to build on your preferences. But as practice shows - more often people in Russia use Yandex more often, since it is more accessible and understandable.

Video: compare two search engines

Who is better - a boxer or a tekkvandist, Schwarzenegger or Stallone Coca-Cola Pepsi, BMW and Mercedes? These "insoluble" questions mankind has been suffering for decades, but they still do not have a clear answer. Why is not everything learned in comparison? Probably because the comparative principle constantly misses some trifles, among which the most important is "to the taste and color ...".

Google is better  than Yandex ...

It is possible to acknowledge in a number of positions that this is true, although the Yandex leadership seriously declares the opposite and brings its arguments, which also have the right to exist. While one thing is obvious - for Runet users other search engines are not very relevant: for example, Yahoo is "imprisoned" for the English-speaking Internet community, and Mail is rather weak and "scattered" in its services.

Most Russian-speaking users are familiar with Google and Yandex - they are untwisted, are optimal for solving search problems, interactively expand their service base and quickly improve the relevance of search results. Arguments of their sympathies for these two giants of the World Wide Web different users can bring mass!

Advantages and disadvantages of search giants

For example, Google - the most "adult" and the most advanced search engine, with a claim to work around the world. It is better integrated into the English-language Internet (English-language search), it better performs the delivery of search information, it is more adequate to the requests, and its robots quickly update and rank the indexed up-to-date information on the sites within 1-2 days using a complex filter system. On Google, you can create your own accounts, use convenient free services  well-protected by Gmail.

The main disadvantage of the search service for the Russian-speaking community - Google is rather weak in specific issues of the Runet, and because of its "globality" has long ceased to focus on the individual needs of users, offering a wide, average package of services.

Yandex

Yandex also has its attachments and claims. Service is strong for work in RuNet (but it is rather weak in the English-speaking Internet), it is better looking for technical information and driver, it is unpretentious when searching for music.

An undeniable advantage is the regional search and the system of domestic cartography, the offer of alternative search options, one of the most "hospitable" postal services and informative of the main page. The main priority is the thesis that Yandex is almost the only one in the "struggle" with the monopoly of Google and protects the information space of the Runet from the power of unnecessary commercial information from the West.

At the same time, Yandex is accused of excessive politicization and commercialization, it only indexes "proven" sites, ranks them according to advertising advantages (often ignoring non-commercial ones) and very slowly indexes the latest information. There is a problem with Yandex with the protection from viruses and the search for scientific English-language literature.

As you can see, the statement that Google is better than Yandex (or vice versa) hangs in the air. For all its "perfection" and multifunctionality, both search giants perform common tasks, but they do it somewhat differently. Simply put, the comparison looks incorrect.

So still, Google or Yandex?

Indeed, the principles of indexing, ranking and search results in these search engines are somewhat different. It is believed that when a user requests, Yandex starts looking for all possible superficial matches, and Google plunges into the root structure of the sites. But search algorithms are a task for "experts", simple users are interested in the functionality of working with the service (usability) and the thoroughness of the search output.

And just then the rule "on taste and color ..." is included - each user chooses what is convenient for him.

  • One attracts Yandex's information page with passing information about news, weather, exchange rates, etc.
  • Others are annoyed by a large amount of advertising on the same page. But Yandex's usability is indisputably better, the location of the functional buttons is more visible, and the search line is more convenient and longer.

In Google, which, although more diverse in service options and numbered search queries, it is more difficult to find the location of services.

Search opportunities - clearly for Google: search is more relevant and deep, the delivery of pages is more adequate to the request and less clogged with advertising offers. In addition, Google - a more stable system, which, in contrast to the developing Yandex, is more careful experimenting with service capabilities.

But we should not forget that Yandex is growing rapidly and improving, bringing its services to the mind just for users of the Runet. Therefore, on the Russian market, Yandex has not yet said its last word.

Yes, and statistics Runet is clearly speaking in his favor: if Google in the world occupies 70% of the search market, then in Russia he "mastered" so far only 20%. But Yandex is used by more than 65% of users of the Russian-speaking segment of the Network, and this figure is constantly growing!

Do not forget that Yandex works in an understandable and native market, and therefore has the opportunity to stay closer to its user and better understands its needs. Therefore, Googla's attempts to capture the Runet have not yet been successful ...

Educational institution

"Brest State University named after AS Pushkin"

faculty of Mathematics

department of Informatics and Applied Mathematics

Course work

Comparative Review
  modern search engines

Introduction

The first search engines appeared on the Internet more than ten years ago. Then they performed only one function - searching for links to newly created pages.

At the initial stage of the development of the Internet, the number of users of the network was small and the amount of information is relatively small. In the overwhelming majority of cases, Internet users were employees of various universities or scientific organizations. At the time, searching for the right information on the network was not as relevant as it is now. Today, search engines have become a multifunctional service. They allow users to find the most diverse information on the Internet, which is a great success.

Search engines

One of the first attempts to organize access to the information resources of the network was the creation of thematic catalogs of sites. The first, opened in April 1994, became Yahoo. It was not yet a search engine, in the modern sense. The ability to search for information was limited to resources registered in the Yahoo directory. Link catalogs were previously used quite widely, but now they have practically lost their popularity. The explanation for this is very simple - even modern, containing a huge amount of resources, catalogs, provide information only about a very small part of the network. For comparison - the most complete catalog of the Internet - DMOZ contains information about 12.000.000 resources, while the database of the most complete search engine Google consists of more than 28.000.000.000 documents.

The first full-fledged search engine in 1994. became the project WebCrawler. Then in 1995 appeared the search engines AltaVista and Lycos. In 1997, at Stanford University, as part of a research project, Google was created - the most popular search engine at the moment in the world. In 1997, the search engine appeared - Yandex, the leader in the Russian-speaking part of the Internet. At the moment, the main search engines are three international - Google, Yahoo and MSN Search. The rest, of which not a few, use wholly or partially bases and (or) algorithms of the above systems. In Runet, the main search engine is Yandex, followed by Rambler, Google.ru, Mail.ru and Aport.

A search engine is the sum of the following components:

Web server (Web server) is a search engine server that interacts with the user and the rest of the system.

Spider (spider) - a program written on the principle of the browser, is designed to download web pages. The browser is designed for visual use of the pages, and the spider works with the HTML code directly. To view the "raw" source, click in the browser menu: View - View HTML code.

Crawler ("traveling" spider) - a program that automatically goes through all the external links of the page. Its task is to search for documents that are not known (or changed) and to prioritize, where Spider should go further.

Indexer (indexer) - an analyzer program for spider web pages. It "parses" the downloaded page into parts and analyzes its elements, such as text, service html-tags, headlines, features of stylistics and structural forms.

Database (database) - storage for downloaded and processed pages - the general database of the search engine.

Search engine results engine - retrieves search results from the database of the search engine. It is she who decides which pages are more relevant to the user's request and sorts them in the right order. The module operates according to the ranking algorithms specified by the search engine.

Most Popular Search Engines

Share each search engine in RuNet

Russian-speaking

English-speaking

MSN search

MSN Search

AltaVista

Ask Jeeves

LookSmart



Traffic analysis - June 2008.




Number of referrals from search engines

Comparative review and classification of search engines


The overall ratio of search engines in English-speaking countries

Table 1

The service is based on the work of crawlers, covering the whole Network. Priority is given to the relevance of the results and their relevance to the query. Unlike many search engines, Google's robots index all pages, not just the most important ones. There is a menu where you can easily find on the web images, discussions, going through newsgroups, reading news or searching for goods. You can access information from the Open Directory (Open Directory, DMOZ) . The system for a long time keeps the pages indexed by the robots in its cache, which allows you to "revive" dead, deleted pages from the servers, or see the old versions of the newly updated pages. The system provides excellent spell checking, easy access to dictionary definitions, viewing of exchange rates, road atlases, phone number bases and much more. The AdWords program places ads on Google and its partner sites. Also Google is a free provider of search results for some other search engines .

In the system, a search is carried out taking into account the morphology of the Russian language, a search taking into account the distance, and a carefully developed algorithm for evaluating the relevance.

A natural language query has been implemented: now the search engine can be asked "in Russian", asking long questions. For example: "where to buy a refrigerator."

The search robot allows you to provide search capabilities for different text zones, restrict search to a group of sites, search through links and images. There is a "citation index" - the number of sites that link to this resource. It works by filtering search results from mat and pornography.

Services: mail, news, postcards and bookmarks, automatic consolidation of news into stories and highlighting the main themes of the day. Plus, stylized with Google Toolbar, spartan search engine ya.ru. The search is now conducted not only on web pages, but also on specialized data sets - news and products. Finds documents not only in HTML format.

The system works with a high search speed, the search index is updated several times a day. The search engine always finds the freshest documents and the latest news. Rambler is familiar with the Russian language, understands terms such as "used", "cu" and "a / y". Services: search box, mail and news. Feature - the mechanism of associations. In addition to the standard response page, in which the documents found are located in descending order of relevance, the line "We are also looking for" appears. It contains words and phrases thematically related to the query. The new mechanism also corrects query errors. On the entered gjujlf the user will now get a positive result: weather information (gjujlf is the word "weather" typed in Latin). Rambler Mass Media is the first and the leading information digest on the market. In addition to text materials and photo reports, which became the business card of the project, Rambler Mass Media presents users with audio and video materials.

Msn Search

Yahoo !

Continues to work under Windows. Important features of the first version of the search engine Aport include transfer request and response to english  and back. The second property is the reconstruction of indexed pages from its own database. This makes it possible to view already existing pages.

The system is based on the delivery of results for individual sites. To share resources on sites, Aport uses the information provided by the AtRus directory, or resource owners.

In search engine Aport for the first time the opportunity of search on news ribbons was realized.

Previously, MSN never had its own spider or directory. Since 1997, different databases have been used to produce search results. Only since the beginning of 2005, MSN launched a beta version of its own search algorithm.

The MSN system is especially important, because it is this search engine that is used by default when Internet Explorer users "enter a search query into the address bar.

Localized for about 30 countries, such as: Great Britain, Denmark, Belgium, New Zealand, Japan, etc. Also this system gives users the ability to sort the search results: by date, alphabetically, by relevance.

To date, this is the oldest and most complete catalog of Internet resources. The system originated as a simple collection of bookmarks, which was replenished by only 2 people - David Filo and Jerry Yang. In addition to the relevant search results, Yahoo offers the ability to use shortcuts on the search form on its main page to search for images, products, or go to the Yahoo Yellow Pages. You can search the Yahoo directory. There is a panel for the Firefox browser. For

mail storage system offers 1 gigabyte of free space or 2 for annual subscription. Yahoo sells a site on the site for sponsored links; The sale is conducted through the Overture program. The system takes into account spelling errors, allows you to search the word only by first letters.

AltaVista

LookSmart

The peculiarity of this search engine is the ability to search for sophisticated selection criteria.

Providing support for multiple languages ​​(Chinese, Japanese, Korean). It provides services for indexing large amounts of information and the ability to instantly search in huge databases.

There is a scheme of "sponsor" sites. Develops corporate search engines for internal use. Licenses search technologies for enterprises, including for use in internal networks. She uses the Yahoo directories, and the shortcuts above the search string allow you to find images, music, videos, and thematic sections filled manually. If you want to receive information no less reliable than from Yahoo, but delivered through a simpler interface - AltaVista is your choice.

A search engine based on the work of crawlers (special software robots scanning the Internet in search of information, which is then entered into the database) belonging to Ask Jeeves.Great search relevance, built-in query verification system, dynamic page descriptions, appearing next to the search results.Inquiries can be entered in ten European languages.In database search engine indexed over a billion unique pages.There is the possibility of refining the query in the thematically The "Resources" section, also found on the results page, is in turn unique in that it redirects the user to the sites that served as the source material for the articles cited as the results.

LookSmart is an Internet company that has its own online catalog and Wisenut search engine. LookSmart is, first of all, a manually updated list of sites. He gets lists in two ways. First, commercial sites pay for a place in the shopping tops, likening LookSmart to electronic Yellow Pages. Secondly, there are volunteers who enter sites with non-commercial content in the catalog www.zeal.com, owned by LookSmart. Although this is a separate site, the lists on it are used in the processing of LookSmart search queries. The highlight of LookSmart is the ability to access hundreds of periodicals through one of the labels above the search bar. It contains a service Furl, designed for archiving of Web pages and links to them. For the archive of user pages, 5 GB of free space is provided. I launched a vertical search system from 161 sites.

GigaBlast

Aol Search

Specialized search system of scientific and near-scientific information. Scirus uses about 200 million scientific pages and filters the results of unscientific content, which by relevance to some extent correspond to the query.

start page  Scirus pleases its unpretentiousness - there's only you and the query string.

If you are not very knowledgeable in scientific terms, Scirus helps you enter the correct query for a better search.

This is the best scientific specialized search engine. It provides information about a specific direction, helps in the preparation of queries and the use of terms, and in addition Scirus finds information that may not be available in the case of using other search engines.

Search engine with more than two billion number of indexed pages. GigaBlast has a high relevance to the query and a variety of different search tools. The homepage of GigaBlast.com is simple and accurate. In addition to the usual query line, there is a Catalog, Blog Search, Traveler Search and Government Search.

The system uses the usual direct search -

The correspondence of the results is quite high, in addition, GigaBlast spell check. Supports queries in a literal form ("Why is the grass green?"). Performs spelling check. In the advanced search for GigaBlast.com, you can search for the exact phrase, put an exception on the word, search for a specific address, etc. Everything is standard.

Overall, GigaBlast is a handy search engine with highly relevant results.

A huge portal, where you can find everything from games to forums of a variety of subjects + ease of navigation and a memorable name for America Online. AOLbyPHONE - The service allows you to receive information previously ordered by phone. AOLSchool - service for schoolchildren. DigitalCity is a virtual city project. AOL Anywhere - the service allows you to access from any mobile device to the content of America Online.

AOL Search provides users with a search results page based on the Google index, the "Built-in" version of AOL Search as a result of search queries gives links only to content available within the online AOL system. And using AOL Search, you can simultaneously search both the Internet and the AOL Network. In the "external" search engine AOL Search, there is no such option. If you like Google, you will miss some of its features in AOL Search, for example, caching pages.

Blink TV

Infoseek

Mail . en

A search engine that allows you to find audio and video files on the Internet, using not only keywords and phrases, but also the actual content of audio and video clips. You can make corrections to the search, for example, by date (if you are looking for material for a certain period of time) and by relevance (if you want to get results that are most appropriate to what you are looking for). There are Smart folders - these are the folders, the content of which is automatically updated from the Internet sources without user intervention. They are formed from the search results for your query. Using Blinkx TV is quite simple and convenient (the only requirement is a high-speed connection to the Internet).

In this system, the index creates a robot, but it does not index the entire site, but only the specified page. It is possible to specify key phrases that represent a single whole, up to the order of words. Ranking for issuance is based on the number of query terms in the document, by the number of phrases of the query minus the general words. All these factors are used as nested procedures. Infoseek refers to traditional systems with a term weighting element in the search, and also allows you to conduct what is called a contextual search. This means that using a special request form, you can demand consistent coexistence of words, has a fairly developed information and search language (not just indicates which terms should be found in documents, but also weigh them in a peculiar way).

It is not a search engine, it is a huge portal with a lot of services. But the services of mail.ru includes the search system. He is actively fighting against spam and viruses, cooperating in these areas with the creators of Dr.Web and Kaspersky Anti-Virus. In 2004, joined the initiative of the postal services of the world and began to use the standard SPF in the work of the filters of its mail servers. In the search engine, there is an option for the morphology of the Russian language and the ability to maintain the history of search queries. I got two innovative services: access via IMAP protocol and voice mail - the ability to listen to the contents of the box on the cell phone. To portal Mail.ru new such services as "Hosting" and "Encyclopedias" were added.

NetScape Search

Ask Jeeves

The system provides access to results collected manually from LookSmart based on the results of the most frequent requests, as well as from Yahoo based on site indexes compiled by robots. The Fast Forward system allows you to see search results on one side of the screen, and on the other - the first pages of the found sites. The relevant categories from the catalog collected manually are displayed at the top of the search results page.

The system uses the Google index database, as well as the main search site AOL - AOL Search. The main difference between Netscape Search and Google is that Netscape Search will primarily search on its own database and display these results first. There is a difference in design.

It became famous as a search engine that understands "natural speech" - in search string  The proposal is entered in a questionnaire form, and the system responds adequately to it. Currently, Ask Jeeves is working on the basis of technology crawlers, processing requests coming to the search system Teoma.

Alexa Internet

Created by natives Google. A distinctive feature is the large volume of processed pages. He understands queries only in English.

The largest search engine, which also provides Internet statistics on the traffic of web sites.

Today Alexa is a search engine that uses the databases of two giants of the world's Internet search: Google and the DMOZ directory.

HotBot provides easy access to three major search engines, based on the work of crawlers: Google, MSN and ASK. Do not mix the results obtained from these three sources. Nevertheless, this is a quick and easy way to get several independent "opinions" on the topic of your interest.

The most prominent representatives



Google ( www . google . en )

According to statistics, it is the most popular search engine in the world.

The founders of Google - graduate students of Stanford University Sergey Brin and Larry Page. From the modest venture of two enthusiasts, she grew up in the Internet giant, whose services are used daily by millions of people in 80 countries. In an interview, Bryn, born in Moscow, explained the appearance of Google as follows: "When we started, there were a few search engines, but the quality of their work left much to be desired.

Each request returned you about a thousand results, sorted in random order. "It was the relevance of the results, their matching with the query, that was placed at the heart of the new system.

In addition to just searching, you can compare prices for goods in online stores, read news and much more. There is also a service blocking annoying Internet advertising.

Google was the first to apply a new technology of search and delivery of results - the relevance of the Internet pages - that does not lie on this particular page, but all the same it affects the rank of its delivery. The Google search engine takes into account the hypertext structure of the Internet, and which pages refer to which one. Google itself determines the relevance of Internet pages The page referred to more often is more relevant and therefore more popular.

No one can challenge Google's leadership. The company's shares for 2005 grew almost two and a half times. Search traffic for the same period grew by about 30%.

The commercial use of Google was launched in 2000, and thanks to its software providing excellent relevance of search results, as well as additional services (such as news search embedded in the browser "Google bar" with "popup window killer") quickly gained popularity among "advanced "users who used the Network primarily as a working tool. The effective system of "thematic advertising" was also appreciated by advertisers. It's no accident that the Washington Post called the Google service one of the 10 most important Internet achievements of 2003. Today, Google processes more than 200 million queries daily. In its work, Google currently uses more than 15,000 computers.

The volume of indexed pages is more than 10 billion!

Help: ... Today, thanks to distributor agreements with Yahoo, AOL and Ask Jeeves, Google processes up to 80% of all search requests made on the Internet every day.

For comparison. Yandex currently has 600,000 visitors per day, the number of computers is 40 units, indexed 15 million pages.

Google - the leader in the list of the most popular search engines. As a result of a study conducted by comScore, it turned out that Google is the most popular search engine in the US.

In August, this search engine processed 63% of search requests, which is 1.1% higher than in July. In second place was the search engine Yahoo, which processed 19.6% of requests (20.5% in July), and the third - Microsoft (the share of processed requests - 8.3%). The top five most popular search engines of Ask.com are 4.8% (the share of processed requests increased by 0.3%) and AOL - 4.3% (growth - 0.1%). The first place was taken by Google (41.3 billion searches, 62.4% of the market), the second - Yahoo! (8.5 billion requests, 12.8% of the market) and Chinese search engine Baidu.com (3.4 billion requests, 5.2% of the market).


I ndex (www.yandex.ru)

The search engine Yandex appeared on September 23, 1997. The developer, the company CompTek, introduced a new search engine at the exhibition Softool. Earlier this company was engaged in the production of information retrieval systems. One of its projects has become a system that provides a search based on the morphology of the Russian language. It occupied 300kb, loaded in random access memory  completely and worked very quickly. On the basis of this technology, a "Biblical computer directory" was created (a search engine that provides work with the Old and New Testament).

The main innovations of the search system www.yandex.ru were: verification of the uniqueness of the document - this provided the exclusion of copies in different encodings. And the distinctive properties of the search algorithm Yandex. Search with the morphology of the Russian language, search based on distance, and a carefully developed relevance evaluation algorithm. A few months later, in the search system www.yandex.ru, a natural language query was implemented. Now the search engine can be asked "in Russian", asking long questions. For example: "where to buy a refrigerator" or "genetically modified products".

In 1998-99, along with the growth of Runet, the services of the search engine Yandex also increased. The increase in the number of requests led to the need to optimize the search engine. The new search robot allowed to provide search capabilities for different text zones, restriction of search to a group of sites, search by links and images. Also, for the first time in RuNet, the concept of "citation index" was introduced - the number of sites that link to this resource. Was opened "Family www.yandex.ru", filtering search results from mat and pornography.

In 2000, the search engine Yandex was transformed into a legal entity. Then the active promotion of the search engine in traditional, offline media begins. Aggressive, but effective advertising campaign has borne fruit - the slogan "There is Everything", some time later, becomes a household name. For search-driven users, the search system www.yandex.ru opens new services - mail, news, postcards and bookmarks. Plus, stylized with Google Toolbar, spartan search engine ya.ru. The search is now conducted not only on web pages, but also on specialized data sets - news and products.

2003 - search engine Yandex continues to develop. Yandex has learned to find documents not only in HTML format. The Yandex News service has radically changed: users are offered an original service - automatic consolidation of news into stories and highlighting the main themes of the day.

To date, www.yandex.ru - the largest Russian portal: its share among the search engines Runet is about 45%, while the follow-up Rambler can boast only 22%. Every day more than 500 servers of the company give 2.7 terabytes of traffic to two and a half million visitors.

"Yandex" entered the top 10 search engines. According to a report published by comScore, the search engine Yandex was among the ten most popular search engines in the world by the results of December last year. According to the report, in December, 66.2 billion searches were made. 566 million of them were assigned to Yandex, which allowed the service to take the 9th place in the list of search engines. The share of Yandex in the global search market was 0.9%.


Rambler (www.rambler.ru)

Rambler - the first search engine of the Russian Internet, was born in 1996. Its creators, the Stack group from Pushchino outside Moscow, started with the creation of a local city network, and Internet connection. In 1992, Stack launched its own ftp and mail servers. The next step to a brighter future was the creation of a search engine. Dmitry Kryukov and Sergei Lysakov began with the fact that they gave the project the name "Rambler". They liked the meaning of the word "wanderer, vagabond", expressing the very essence of the search engine. By the autumn of 1996, the final version of the search engine was ready, and on October 8 "Rambler" was launched.

Another year the system was being debugged, to the credit of the creators of the search engine did not work just one day - burned down hard disks. Existing at that time in RuNet two or three search engines could not stand competition and "sunk into oblivion."

In the spring of 1997, the unique rating-classifier Rambler's Top-100 appeared, which not only assessed the popularity of Russian resources, but also allowed one to access them. Rambler's Top-100 quickly became a universal barometer of the network, a common standard for media measurements.

The next step in the development of the Rambler search engine took place in June 2003. The company launched new version  Search engine, which differs from the previous one in two main parameters. First and foremost, the speed of search has significantly increased; second, thanks to the new architecture of the system, the search index is updated several times a day.

Now the search engine always finds the latest documents and the latest news. The Rambler is familiar with the Russian language, understands terms such as "used", "cu" and "a / y". As a tribute to fashion, a laconic version of the search engine was opened at the address "r0.ru". Its content is standard - search string, mail and news.

Another unique feature of the Rambler search engine is the mechanism of associations. In addition to the standard response page, in which the documents found are located in descending order of relevance, the line "We are also looking for" appears. It contains words and phrases thematically related to the query. The new mechanism also corrects query errors. On the entered gjujlf the user will now get a positive result: weather information (gjujlf is the word "weather" typed in Latin).

It should be noted the projects of Rambler-Group and Rambler-Photo. Both these projects provide network users with the opportunity to use the software algorithms incorporated in the projects for their own purposes. The Rambler-Group project gives people the opportunity to start discussions on a variety of topics that exist not only in the form of forums, but also in the form of open lists  mailing. The project Rambler-Photo presents users of the network the opportunity to create photo albums in the amount of up to 20 megabytes, and this disk space  registered users of the Rambler search engine receive in addition to those megabytes of mail that they can claim for the new conditions of e-mail service (15 megabytes).

Rambler Mass Media is the first and the leading information digest on the market. In addition to text materials and photo reports, which became the business card of the project, Rambler Mass Media presents users with audio and video materials. Among the suppliers are VGTRK and BBC, Deutsche Welle, Mayak Radio, etc. Headlines of the most important news are placed on the front page of the Rambler.ru portal and in the "Main News of the Day" section on the main page of Rambler Mass Media. The most current news can read and visitors to other services Rambler - Rambler's Top100 Rambler's Top100 Rambler's Top100



Aport   (www.aport.ru)

The search system Aport was first presented at a press conference of the company "Agama" in February 1996. Then the system worked only on the site russia.agama.com. By the time of the official presentation, November 11, 1997, the search engine Aport had indexed the first million documents located on 10 thousand servers. The father of www.aport.ru was the company "Agama" - developer software  for Windows. This was reflected in the creation of a search engine, in contrast to most competitors, Aport continues to run under Windows.

To the important properties of the first version of the search engine Aport you can refer translation request and response to English and vice versa. The second property is the reconstruction of indexed pages from its own database. This makes it possible to view already existing pages.

November 1998 - Aport sold to Israeli citizen Joseph Avchuk. The deal amounted to 55 thousand dollars. The trade marks "Aport" and "Agama" are preserved, to the catalog of Au! was much less fortunate. In 1999, Avchuk finally buys the catalog and renames it at AtRus, and afterwards completely destroys it when exporting to Omen, Russia Online and Aport.

The end of 1999 - the first million dollars was invested in the search system of Aport, it allowed, some time later, to present at computer exhibitions "Aport-2000". Fully integrated with AtRus; now the Catalog - Aport (Catalog - Aport).

Search engine Aport 2000 was built based on the delivery of results for individual sites. To share resources on sites, Aport uses the information provided by the AtRus directory, or resource owners.

www.aport.ru - the first search engine Runet implemented the core technologies of Google. "Page rank" - a characteristic of the popularity of the resource by the formula of "back links": links from other sites to this resource. And it takes into account not only the quantity, but also the importance of links. The weight of a link from a popular site is greater, and links including query words are valued higher than links with the word "here". In addition, when processing a query, the search engine Aport 2000 is guided by the HTML-code of the page, and the presence of query words in the URL.

Another feature is the priority of sites that have entered the highest and elite leagues of Aport - the Catalog. In search engine Aport for the first time the opportunity of search on news ribbons was realized.

The first www.aport.ru began to use paid zero line in the issue. But on Aport buy is not zero, just a higher place in the response page.

Scalability in the architecture of Aport 2000 is built in such a way that its search database can be divided into several separate fragments, each of which will work on its computer. After searching by fragments for the user, a general answer is calculated.

On July 31, 2000 Golden Telecom bought a family of Internet projects "Agama", including "Aport" and AtRus, for inclusion in "Russia-on-line".

May 2001 - the year on which the transaction for the change of owner "Aport" of the "Golden Telecom" itself was finally completed, Alfa-Bank became the new owner. NASDAQ by then was experiencing a tumultuous decline, and the chances of reselling Internet projects for an acceptable amount was not. This led to the decision of new owners of "Golden Telecom" to minimize the costs of supporting high-cost Internet projects.


MSN Search (www.msn.ru)

Once MSN Search was one of the best search engines, but today it is still worth attention. MSN Search used the LookSmart catalogs and received the best user feedback, because the project was a team of editors that tracked the most frequent searches entered into the search form and selected the sites most relevant to the subject of the request. Such a technique produced good results.

Today MSN Search is looking for new ways of development. The system searches for Yahoo materials, but does not have the same functionality and variety of options that Yahoo itself offers. At the same time, MSN Search develops its own robots and plans other changes that should completely update the system.

The MSN search engine (msn.com) has never had its own spider or directory. Since 1997, the MSN system used different databases for issuing search results, such as: Yahoo !, LookSmart, Altavista, DirectHit, Inktomi and RealNames. Now the search results are taken from Inktomi, LookSmart, Direct Hit.

But, after all, MSN.com is one of the most popular Internet portals. There is also a search engine search.mns.com. The MSN system is especially important, because it is this search engine that is used by default when Internet Explorer users "enter the search query into the address bar.The SIT system is also an information node, and according to MediaMetrix, it is one of the most visited sites.

msn.com uses the Inktomi database, but this database now contains more than 550 million URLs and it's not known which part of the MSN search engine it uses. It is known that about 2 million URLs are used from the LookSmart database.

The MSN search system (msn.com) is localized for about 30 countries, such as: Great Britain, Denmark, Belgium, New Zealand, Japan, etc. This system also allows users to sort search results: by date, alphabetically, by relevance.

Recently, at the end of 2004, MSN.com announced the launch of the official version of its search, two years after the announcement of the beginning of the development of its own search engine. The previously demonstrated beta version of the search was downloaded on all portal sites and lost the "beta" mark. In comparison with the already familiar beta version, the current search has acquired several new features, in particular:

Integration with MSN Direct Answers - a service based on Microsoft's e-Encarta and now expanded to 1.5 million articles.

The output of search results in RSS-format - there is a restriction on the use of these results and prohibition on their use for commercial purposes.

Search by RSS.

While creating the search engine MSN.com used most of the optimization principles, which in other search engines will not be a surprise for most professional participants on the online market.

In connection with the launch of the search, the redesign of the whole site MSN.com was carried out, and the portal published an appeal of Bill Gates to the users. In the near future, MSN also plans to launch a search on the blocks, several specialized searches (it can be assumed that this will be search by video files - it already exists with Yahoo and Google) and its own contextual and search advertising program. Now the bulk of advertising shown on search results is advertising Overture, which belongs to one of the competitors of the new MSN-Yahoo! system Inc.

But, the search engine has already gained considerable influence and, according to rumors, MSN Search still challenged Google and Yahoo in the struggle for superiority in the market.



Yahoo! (www.yahoo.ru)

In 1994, students at Stanford University, Jerry Yang and David Philo, were preparing to defend their thesis in the field of computer-aided design of integrated circuits. To do this, they had to spend a lot of time on the Internet, searching for necessary information and saving links. Lists with links grew, then Yang and Philo abandoned their thesis and began to exclusively collect links. By the middle of 1994, there were a lot of them, they sorted the links into categories, then there were too many categories of links, subcategories appeared. But the list of Jerry and David was not intended for public viewing - it was made exclusively for friends. Time passed, and attendance grew and grew. The site's address went hand in hand.

The first step to success was a new, memorable name - Yahoo! Following the wishes of users, the creators of www.Yahoo.com, began to transform the site. There are new categories, and sections "What's New" and "What's Cool". By the end of 1994, Young and Philo had abandoned their theses and had given themselves up entirely to the work on search engine Yahu.

At this time on the road appeared the company Netscape, which offered resources for the content of the Yahoo search engine. As a result, Yahoo! appeared his domain - yahoo.com, and the catalog moved to 10 Silicon Graphics Indy stations. Around the same time, Yahoo! received and the first investor - the investment fund "Seqouia Capital". Jerry and Young got their offices and hired an energetic team of web surfers. The growth rate averaged 1000 pages per day.

On April 12, 1996, www.Yahoo.com issued its shares. At a nominal price of $ 12 per piece by the end of the day they were already selling at $ 33. The creators of Yakha were the first to use the advertisement of their resource in print media and on television.

New Yahoo! search engine manager - Tim Cugle decided to develop Yahu from a simple list of ordered links to the real portal. And from the financial news to make auctions, adding to the core Yahoo.com elements of online access.

The following attachments to the Yahoo! search engine amounted to 560 million dollars. Yahoo.com grows up and begins to act on the market as an independent shark of the capitalist world. Competitors, in the face of America Online and Microsoft, are getting into different corporate alliances to strengthen their positions in business. After looking at the "enemies" methods, Yahoo in 1999 bought a free email provider rocketmail.com. At that time on the open spaces of the Internet there are various sites claiming the title of Portal. With competitors, they crack down in different ways. Webring.com is buying, with the rest are waging a war on the stock exchange.

In 2000, the Yahoo search engine launched a project called Corporate Yahoo! and bought the Taiwanese business site Kimo, than announced their desire to move to the vast Chinese Internet market.

To date, Yahoo.com is one of the major search engines on the Internet. Providing services to 200 million people a month.

The current development of Yakhu can be defined as movement in on-line, interactivity. Yaho quickly develops this area of ​​Internet services, but there is one problem: the Yahoo! kernel it was not calculated. It was not in 1994 that the "on-line" component was incorporated into it, it was "pasted" by Tim Kugl a few years later. Naturally, there is a threat of hacker attacks through this unprotected area.

One of the innovations of the Yahoo search engine is the taskbar for the Firefox browser. This tool helps to use the search for Yaho without going to the official site, but only using the functional buttons of the panel.

On September 1, 2005, the Yahoo search engine, which owns more than 200 million e-mail addresses around the world, announced the launch of a new search engine for texts, photographs and other documents contained in the letters.

The need for such an innovation arose after the increase in the amount of stored data, because some users create entire mail archives. Driven by Google's competitor and its Gmail mail service, Mail Storage E-mail offers now 1 gigabyte of free space, or 2 gigabytes per year's subscription. "Once you have the opportunity to store more information, you also need advanced search capabilities", explains Erik Peterson, an analyst at JupiterResearch."

Users of the Yahoo search engine, in turn, can now use the capabilities of detailed search for words in the title or directly in the text of the letter, as well as in attached documents without opening them. The result of the search is reflected in three lines with all the attributes. The right pane displays all related documents. The found photos are displayed in a reduced form, which greatly facilitates the search. The system also takes into account spelling errors, allowing you to search for words only by first letters.

To start, www.Yahoo.com plans to offer a new system  a small number of US users, and then distribute it around the world. On the client side, this does not require any additional effort. "When the service becomes available, the corresponding banner will appear in the upper left corner of your mailbox", - the company Yahoo promises.

According to comScore Media Metrix for July this year, the Yahoo domain owns 219 million e-mail addresses, which is 31.5% of the world market, behind only Microsoft with 221 million Hotmail users (35.5% of the market).

October 11, 2005, aiming at nothing to keep up with the largest competitor, Yahoo announced its intention to create a digital library. And not too successful experience Google, immersed in legal proceedings with rights holders, the company does not scare, but rather on the contrary - it spurs.

Recall that Google some time ago concluded a contract for the digitization of books in major European and American universities and libraries, but the attempt to create the world's largest library of electronic texts caused fierce resistance to the fears of publishers and authors fearing for their incomes.

In mid-2005, Yahoo, officially announced the achievement of an important advantage in the fight against its main competitor in the market search market - Google. Managers believe that the search engine Yahoo allows you to find twice as many documents than Google. In their view, the Yahoo search database includes 20.5 billion objects - 19 billion text documents and 1.5 billion images. Thus, its search index (the number of objects that a user can find on the Internet with the help of a search engine) is almost twice as high as the search engine of the Internet corporation Google - 11.3 billion objects, of which 8.2 billion text documents and 3.1 billion images.

Earlier, Yahoo never disclosed the value of the index of its search engine. Industry analysts were not unanimous in assessing its database. According to various experts, the Yahoo search engine covered from 6 to 8 billion objects in the network. According to the official announcement, for the first time in the history of Yahoo, a qualitative breakthrough in the competition with Google provided the latest, very successful upgrade of the search engine. "Internet users have a very good reason to pay attention to us," said Yahoo vice president Eckhart Walter. "Our search engine covers much more information than any other."

"Yahoo made users doubt that Google is the most powerful search engine. This gives the company Yaho an additional advantage in the struggle for leadership in the market. The Google Brand for many has become synonymous with finding information on the Internet. Yahoo's task is to shake the credibility of this brand, "said Forrester Research analyst Carlin Lee.

That's what lives www.Yahoo.com today, when the cost of one share has crept up to $ 200. Now the search engine Yahoo! has reached a critical size - the team of surfers can hardly maintain its relevance and add thousands of new sites every day.

Advantages and disadvantages of search engines

  table 2

Search system

Benefits

disadvantages

1) The system works with a high search speed.

2) The search index is updated several times a day.

3) The searcher always finds the freshest documents and the latest news.

4) It is close to the optimal output of search results.

5) produces the ranking of results depending on the frequency of use and the location of the terms sought.

6) The same document in different encodings is shown only once, and its specific addresses are summarized in the list following the summary.

1) The relevance index value is affected by the time of existence of the site in the network. This feature allows users to find resources that have existed for a long time, are successfully developing, rather than one-day websites. But this approach makes it very difficult to get into the issuance of new sites, the information on which sometimes turns out to be relevant and, perhaps, more important for the user.

2) the impossibility of performing a search for the whole phrase, indicating in the requests the maximum distance of the sought terms from each other.

Yandex

1) Continuous development of the system.

2) The quality of the issuance is growing, more and more convenient services are offered by the company: catalog, maps, news, weather forecast, mail.

3) a deep morphological analysis of the processed terms.

4) has a good mechanism for recognizing one document in several encodings or on mirror servers.

5) an originally designed mechanism for issuing results.

6) a huge index base.

1) The difference in issuance when typing a word with a large (small) letter (sometimes issuing changes, sometimes not).

2) Frequent loss of sectors of the search base - when parts of sites disappear from the issuance and are restored after 2-5 days.

3) Updating the indexes of the search database is not enough often and regularly.

1) Very powerful search engine, which is in constant development.

2) The database of indexes of this system is updated every two days, the quality of issuance is very high, it is quite easy to find the necessary document or information.

4) It is able to produce results for queries on the semantics of the programming language (source code of the search).

3) The search results will differ for the query "movie" and "movies".

4) the lack of possibility to indicate a specific grammatical form of the word, or stress also significantly complicates the process of information retrieval.

1) contains a fairly user-friendly directory.

2) wide possibilities of drawing up a request.

3) automatic translation of the request from Russian into English and vice versa.

4) Reconstruction of indexed pages comes from its own database. This makes it possible to view already existing pages.

1) does not always find what you are asking of him.

2) the directory has not been updated for a very long time.

3) is able to allocate the same document in different encodings and give a reference to it only once, listing specific addresses in the URL list.

4) it is not always correct to process page names, which often results in a "document without a name" in search results, while title tags on most such pages contain important data.

1) Allows users to sort search results: by date, alphabetically, by relevance.

2) When performing a keyword search, the team of specialists of the company keeps track of the sites most relevant to their opinion, manually select and classify them, and contribute to certain categories of the directory.

3) ranking knots by popularity and seasonal changes.

4) Help from the human editor.

1) The search engine is full of spam.

2) Uses external data to process search queries, so the relevancy is affected by: the location of keywords, the popularity of the resource and the text leading to the site, and the site's leading links.

2) There are intelligent means of "cutting off" empty, in development or purely advertising sites, far from the desired topic.

3) it is always easy to determine in which section the necessary information is located.

4) In case there are no results on Yahoo, results are immediately displayed with AltaVista.

1) There may be a problem with missing pages, because webmasters usually forget to delete their sites from search engines, and Yahoo does not have an automatic update mechanism.

2) Pure Russian resources are not added, because there is simply no one to look at and evaluate the content.

2) There is no own search engine.

3) Searches for words that are specified in the search criteria only in the title and description of the page

Comparative analysis of the work of various search engines:

1.   Formulated the topic of search.

2.   Created a search query (the same for all search engines).

3.   The search results (the number of links) are placed in a table.

4.   Based on the search results, I created search queries (taking into account the rules of work in a particular search engine) to reduce the number of irrelevant links. Clarifying queries created as long as the search results will be possible to analyze - 10 links.

5.   The results of the refinement search (the number of links) are placed in a table.

6.   Analyzed the first 10 found references to relevance.

7.   Has drawn a conclusion about quality of search by each search system and has put estimations on a five-point scale.

  Table 3

What is the thesaurus and what are the rules for its creation?

Search engines

Yandex
(page)

Rambler
(docs)

Google
(sites)

Aport
(sites)

Msn
(sites)

Yahoo!
(sites)

Search query

Refinement
  Inquiry

Analysis of the first 10 found links

There is nothing

Number of relevant links

Conclusion-Score on a five-point scale

The model of the "ideal" search engine

Search engines are the tool that most Internet visitors use today. Modern search engines are beginning to gravitate towards portals, expanding the offered possibilities and not limiting users to search only.

Search Engines  - this is a kind of monopoly. They lead a covert policy, dictate their rules, not divulging them at the same time. And sometimes, even just limitless and show striking bias in relation to sites. There is a situation that optimizers can not clearly understand what it should be, a "good" site in the understanding of the search engine. How to make it such that the search engine considers it the most relevant on request. And it is precisely because of this uncertainty that the webmasters have a desire to outwit systems.

The way out of this problem is to create a completely OPEN search engine in which not only the search engine but the webmasters themselves would be ranked .   In order to make it possible to open all the mechanisms and algorithms for ranking and at the same time objectively evaluate the sites among themselves.

The main disadvantage of modern search engines is their centralization. And centralization means that all information is stored in one place, all work and calculations are made in one place, all decisions (results of issuance) are taken in one place.

So, why this is a drawback, here are a few reasons:

1) Complete centralization requires colossal resources - huge databases, many computers, etc. Given the growth of the Internet in the near future will have to use incredible power.

2) Only when running in one center can you achieve complete confidentiality. And because of our concept search system should be open, then the need for centralization is completely eliminated.

3) The search engine does not always correctly evaluate a particular resource. It is more correct for the owner of the site to assign the ranking of documents within the site. And now, the most important thing is how to get away from centralization and eliminate all these disadvantages - this is the introduction into each site of its mini-search system. This mini-search system will index the content of the site according to the rules of the owner of the site. Only the webmaster will decide which pages of his site for which queries are more relevant. And then their indexes will already be sent to the server of the search engine.

Another of the main problems when creating a new search engine is the consideration of users' opinions.

Attempting to directly identify user views about an ideal search engine usually does not lead to the desired result: users list everything that they have ever seen or used in existing systems. Do not expect users to design skills - they are unlikely to be able to quickly describe how an ideal search engine should look.

A more productive approach to solving this problem is analysis ideal model  search engine, which users operate. The ideal model is a set of user's ideas about the goals, functions, structure, control and management methods, possible actions with the system that determine its activity. This approach - from the analysis of user submissions and the construction of an ideal model to the design of product interfaces - reduces the risk that the product will not be liked by users, will not be accepted and claimed by them.

In the ideal model, the following components should be present:

· Primary nouns (  e-mail, goods in the online store, a picture that is available for viewing on the Internet) are the main elements with which the user performs actions or manipulations while working with the system.

· Usage Scenario   This is a description of the user's views on the interaction with the system, broken down into elementary steps. The usage scenario illustrates the user's behavior when solving a specific task using a search engine.

· Task Diagram   is a graphical representation of users' views on the list of tasks solved in the system.

· Navigation Chart   demonstrates users' views on the order of changing screens that they encounter when working with the system, and the content of these screens. The diagram is based on the scenarios of using the system and is used in the design of interfaces.

To obtain all the data necessary to compile an ideal model, an interview was conducted with a small number of users, and then an Internet survey. The interview made it possible to understand what questions and how to ask to get a complete picture: who is the user of the search system, what are his needs and tasks, how he got used to solve problems and how he sees the way of solving new, non-standard tasks.

The analyzed answers allowed to build a model of an ideal search system. As a result of the research, conclusions were reached about what the users lack in existing search engines, as well as what users do not like. This information allowed to allocate prototypes, relying on the portrait of the target audience, which will work with the search system, taking into account the users' preferences and peculiarities of their work. So, what did we get?

Problem 1:   Optimizers can not clearly understand what should be a "good" site in the understanding of the search engine and how to make it such that the search engine finds it most relevant to requests.

The solution to this problem is well implemented in the MSN Search engine. In the ranking system, not only is the search engine engaged, but it also helps the editor. Due to this, when performing a keyword search, the team of specialists of the company keeps track of the most frequent inquiries entered into the search form and selects the sites most relevant to the subject of the request, as well as manually select and classify them, and contribute to certain sections of the directory. That, for example, in comparison with the most popular search engine of the world - Google, which itself determines the relevance of Internet pages (the page referred to more often, is more relevant and therefore more popular) helps to avoid this problem.

Problem 2:   Availability of accessible and understandably stated rules on the special syntax of each individual search engine.

According to my research, the presentation of accessible and understandable rules for the special syntax is present in the following search engines:

Problem 3:   High level of relevance of the information provided.

Using the experience gained during the course work and the experience of using search engines in life as a whole, I present a list of search engines (starting from the one with more relevant results to the queries), search engines that do not correspond, in my opinion, to the criterion " relevance of the information provided will not be included in the list below:

Problem 4:   Demand for search engines, which more closely resemble the Internet portal, where you can create a mailbox, learn currency rates and weather forecasts, read blogs and forums.

This criterion of users is answered by:

Search engines

Mailbox

Exchange Rates

Weather forecast

Aport

We received that none of the search engines examined by us meet all the necessary criteria. The closest thing to the ideal are the search engines Yandex, Rambler, Aport. They are followed by Google and MSN, and concludes the top six search engines - Yahoo.

General conclusion:   by the considered criteria, the search engine Yndex is closest to the model of the "ideal" search engine. And such search engines as Google and Aport alternately divide that 2, then 3 places. We also note that the MSN search engine is leading in the ranking system.

The first question that arises in a novice Internet user: "How do I find the information I need?" First of all, we must note that there are no "right" or "wrong" ways to search the Internet. Undoubtedly, some of them may require more time than others, but ultimately the preference should be given to the method that seems most convenient and effective for a specific Internet user.

Imagine that you need to find a lost thing in your home. You can use several ways to search:

1   . Systematically search all the cabinets and shelves;

2   . Search only where you think this item may be;

3   . Ask relatives: mother, children, wife or husband about the place where the desired object or something similar to it can be.

Of course, there may be other ways ... But in each individual case, only one of the three listed methods will be the most successful. A similar principle applies when searching in any search engine: you can search in news, files or all over the Internet. You can even search only on a certain site (only in one cabinet), if you know that the desired object hid there. But it should be borne in mind that search robots can bypass only those places where they are allowed to go (look only at those closets whose doors are not locked).

Just like at home, in each individual case, one of the ways can be more effective than the other: on the Internet the desired object or site can not be found where it was expected

Start searching - it's easy

Search engines do not require you to know special commands for searching. It is enough to type a question - just as you would ask a librarian or an all-knowing erudite. For example, where do crabs winter? The brightest star in the northern hemisphere?

Morphology is taken into account

Regardless of the form in which you used the word in the query, the search takes into account all its forms according to the rules of the Russian language. This is called accounting for the morphology of words. If necessary, you can always turn it off.

What is the correct query?

The correct query consists of several words, because one word is usually difficult to understand what you want to find. For example, a design request can mean a desire to find information about interior design, or about web design, or maybe about landscape design. On such a request the search engine will find pages about all kinds of design known to him. Usually search engines understand queries from several words well and are able to interpret them independently. They know in which cases it is necessary to search for pages in which the words of the query are located nearby, and when enough words meet in one document, so that the page well matches the query.

Looking for a quote?

If you want to find an exact quote, just ask for the phrase in quotes: "Close your pale legs." By this request, all pages will be found in which there is a quotation you need.

Looking for names?

Words with a large and a small letter are considered different forms of a single word, so it does not matter what register to use in the query. The exception is the exact form operator. At the request of "! Meadows" will be found all the documents containing this wordform in any register, and on request "! Luzhkov" - only documents in which there is a form of "Luzhkov" with a capital letter. This is useful if the desired name is the same as the common name, for example, the village! Cats or! Simple.

Be sure to find the word!

To select documents where a certain word is present, put a plus (without a space) before it. Therefore, if you are interested in a quote from Hamlet, you can ask + to be + or + not to be. A little trick: the same result can be achieved by setting "mandatory" words in quotes: "to be or not to be."

How to exclude a word?

To exclude documents in which a certain word occurs, put two tilde or one minus (without a space) before this word. For example, if you need a description of Paris, and not the offers of numerous travel agencies, ask for a guide to Paris ~~ agency ~~ tour. Or request a guide to Paris-agency -tour.

Conclusion

Search engines usually consist of three components:

An agent (a spider or a crawler) that moves across the Web and collects information;

  • a database that contains all the information collected by spiders;
  • a search mechanism that people use as an interface to interact with the database.

While traveling on the Internet, you will definitely need the help of a search engine. Very often you have to search the information on the network without knowing even the approximate address of the page on which it can be located. In such cases, a search engine comes to the rescue.

Search engines are robotic systems. A special robot program, called a spider or a slider, constantly bypasses the Network in search of new information that it brings to the database. The database contains URLs and indexed information associated with these addresses.

When searching the Internet, two components are important - completeness (nothing is lost) and accuracy (nothing was found to be superfluous). Usually it's all called one word - relevance, that is, the answer to the question. Important indicators are the scope and depth of the search engine (how large the database of documents), the speed of the bypass and the relevance of the links (the speed of updating information in this database), the quality of the search (the closer the document to the top of the list, the better the relevance) .

In addition to the relevance, there are important user characteristics: search speed (slow search engine is ineffective in work), search capabilities (exactly how indexing takes place: only by keywords of a web page or throughout the text, with or without morphology, by tag search HTML - headers, links, captions to images, etc.), as well as additional convenience (user-friendly interface, special functions, for example, search by date and server). Here everything depends on what you prefer.

Among the leading search engines at the moment - Yandex, Google, Rambler, Aport! and etc.

Analysis of Internet search engines

Based on the theme of the course work "Comparative review of modern search engines" .

The search results are presented in the following table.

  Table 4

Yandex

Google

Aport

documents

Rambler

documents

Yahoo

Msn

Modern search engines

Popular Search Engines

Search Engine Overview

Advantages and disadvantages of search engines

Comparative analysis of the work of search engines

"Ideal" search engine

General conclusion on the availability of materials on the Internet on the subject of the course

The degree of relevance of a particular subject of research is determined, mainly, on the basis of the volume of literature existing on the subject. During the conducted search on the Internet, I found a lot of information concerning the classification, review and analysis of modern search engines. Based on the volume of literature presented in both English and Russian, it can be concluded that to date, search engines are in great demand among Internet users.

Analysis of statistics on the table 4

According to the data presented in the search table, it is possible to analyze the effectiveness of each of the seven search engines presented in terms of the amount of information found, namely the sites available in the system for a separately specified query. I carried out the analysis according to the following principle: I took into account the peculiarity of the search systems by language affiliation, the analysis was carried out on 6 systems. During the analysis of statistics, I found the following patterns: the largest number of sites on request in Russian among Russian-language systems was issued by the search engines Yandex and Google, while Yandex significantly exceeds Google, followed by the systems of Aport, Rambler and MSN, respectively Yahoo, as an English-speaking system, showed the worst result when searching for keywords specified in Russian. Thus, the search engines Yandex and Google and Aport occupy the leading positions in terms of the amount of information provided.

Analysis of various search engines in terms of efficiency

The leaders in terms of the quality of the information provided were Yandex, Google and Aport.

Yandex turned out to be one of the most effective systems in terms of its relevance and compliance of the results with the given query. Though there were a lot of pages, but the necessary information was on the very first of them. A little time is spent - the necessary results. At the same time, the relative novelty of the information presented played an important role.

Google produced the results of pages on which not always the first place was occupied by relevant documents. But there were simply no complaints about the variety, because in the submitted amount of material, you could find anything (while the time was spent twice as much compared to searches in other search engines).

The Aport system turned out to be less effective than the above because of its excessive focus on special cases, but the results that it produced were significantly different from the results of other search engines. They were the only one of its kind, not always relevant, but unique.

Rambler, despite being firmly ranked fourth in the quantitative rating, turned out to be much less effective in terms of system relevance. Here, the orientation toward Russian sources of information prevails, which reduces its adequacy in assessing the situation in other countries. To the same category in terms of relevance can be attributed to the Google search engine.

Yahoo's search system can be considered as the most effective on a par with Yandex, but only in English-speaking search. In Russian, in this system there is an insignificant number of sites and their relevance is minimal.

General analysis of the situation with the search and provision of information in the modern Internet

Based on the personal experience of the permanent user, I must say that at the moment, in the conditions of constant qualitative changes in all spheres of life and the constant movement of human thought forward, the Internet is the most important and most frequently used source of information. The greatest utility of the global network is the availability of information to any user and the regular updating of resources, which allows you to constantly be aware of new events.

1. www. google.com

2. www.yandex.ru

3. www.rambler.ru

7. http://company.yandex.ru/articles/article10.xml

8. http  : //www.citforum.ru/internet/search/searchsystems.shtml

9. http://www.intelsib.ru/

10. http://www.seonews.ru/news/.info_news/2385/

11. http://www.seo-gu.ru/im_stat.html

12. http://www.relevantno.ru/news/html/1138782965.html

13. http://www.vadimstepanov.ru/f_texts/column6.htm

14. http://book.itep.ru/4/45/retr4514.htm

15. http://www.seoexp.com/en/history/search_engines/

16. http://smo.me/uncategorized/history-creation-companies-google/

Do you like the article? Share with friends: