Open Access News

News from the open access movement


Tuesday, October 18, 2005

Wish-list for a researcher's search engine

Judit Bar-Ilan, Expectations versus reality – Search engine features needed for Web research at mid 2005, CyberMetrics, 1, 2 (2005).
Abstract: Web research is based on data from or about the Web. Often data is collected using search engines. Here we describe our "wish list" for the ideal search engine, explain the need for the specific features and examine whether the currently existing major search engines can at least partially fulfil the requirements of the ultimate search tool. The major search tools are commercial and are oriented towards the "average" user and not towards the Web researcher, and therefore are unable to meet all the requests. One possible solution is for the research community to recruit the necessary funding, resources and know-how in order to build a research-oriented search tool.

From the body of the article:

Document and word counts are often insufficient for Web research (especially when these numbers are unreliable). In order to study the documents themselves, we have to access them. Thus knowing that there are 11,203,349 pages that the search engine marked as relevant to our search, but being able to access only 1000 is not satisfactory. The ability to retrieve the whole set of results, and not only the first 250 or 1000, is essential for successful Web research.
(PS: Most of the features Bar-Ilan describes would be desirable for researchers investigating any topic, not just researchers investigating the web itself.)